linkedin website changed and can not read basic data #36

cyanide2019 · 2020-01-22T07:43:51Z

inished scraping url: https://www.linkedin.com/in/inmudassar-iqbal-a9a9159b/
scrapedin: 2020-01-22T07:42:56.489Z error: [cleanMessageData] LinkedIn website changed and scrapedin can't read basic data. Please report this issue at https://github.com/linkedtales/scrapedin/issues
2020-01-22T07:42:56.490Z error: error on crawling profile: https://linkedin/in/mudassar-iqbal-a9a9159b/
Error: LinkedIn website changed and scrapedin can't read basic data. Please report this issue at https://github.com/linkedtales/scrapedin/issues
2020-01-22T07:42:56.830Z info: starting scraping: https://linkedin/in/nadeem-aslam-057341102/
scrapedin: 2020-01-22T07:42:56.830Z info: [profile] starting scraping url: https://www.linkedin.com/in/innadeem-aslam-057341102/
scrapedin: 2020-01-22T07:42:58.070Z info: [profile] finished scraping url: https://www.linkedin.com/in/inamjad-khan-a03634b7/
scrapedin: 2020-01-22T07:42:58.070Z error: [cleanMessageData] LinkedIn website changed and scrapedin can't read basic data. Please report this issue at https://github.com/linkedtales/scrapedin/issues
2020-01-22T07:42:58.070Z error: error on crawling profile: https://linkedin/in/amjad-khan-a03634b7/
Error: LinkedIn website changed and scrapedin can't read basic data. Please report this issue at https://github.com/linkedtales/scrapedin/issues
2020-01-22T07:42:58.832Z info: starting scraping: https://linkedin/in/baraa-faisal-0529a5a3/
scrapedin: 2020-01-22T07:42:58.833Z info: [profile] starting scraping url: https://www.linkedin.com/in/inbaraa-faisal-0529a5a3/

cyanide2019 · 2020-01-23T02:24:05Z

020-01-23T02:23:36.378Z error: error on crawling profile: https://linkedin.com/in/ahmad-abdelqader-pmp-osha-iso-70493882/
Error: EACCES: permission denied, open './crawledProfiles/ahmad-abdelqader-pmp-osha-iso-70493882.json'
scrapedin: 2020-01-23T02:23:36.555Z info: [profile] finished scraping url: https://www.linkedin.com/in/ibrahim-saadeddine-1320b8100
2020-01-23T02:23:36.556Z error: error on crawling profile: https://linkedin.com/in/ibrahim-saadeddine-1320b8100/
Error: EACCES: permission denied, open './crawledProfiles/ibrahim-saadeddine-1320b8100.json'
2020-01-23T02:23:36.959Z info: starting scraping: https://linkedin.com/in/usman-mohammed-41332845/
scrapedin: 2020-01-23T02:23:36.959Z info: [profile] starting scraping url: https://www.linkedin.com/in/usman-mohammed-41332845
2020-01-23T02:23:37.960Z info: starting scraping: https://linkedin.com/in/smfaisal29/
scrapedin: 2020-01-23T02:23:37.960Z info: [profile] starting scraping url: https://www.linkedin.com/in/smfaisal29
scrapedin: 2020-01-23T02:23:41.554Z info: [profile] scrolling page to the bottom
scrapedin: 2020-01-23T02:23:42.066Z info: [scrollToPageBottom] scrolling to page bottom (1)
scrapedin: 2020-01-23T02:23:42.624Z info: [scrollToPageBottom] scrolling to page bottom (2)
scrapedin: 2020-01-23T02:23:42.988Z info: [profile] applying 1st delay

Zackhardtoname · 2020-02-11T03:01:13Z

Same problem

leonardiwagner · 2020-02-11T15:18:27Z

@Zackhardtoname Are you using a company/recruiter profile to login or just a regular employee one?

Please set isHeadless to false on config.json , this will open the browser while crawling, please check if it's really logged (looking on the LinkedIn top bar)

And also confirm that's 1.0.11 scrapedin version on your package.json.

@cyanide2019 could you do the same please? I couldn't reproduce this error, it's working here, thanks.

Zackhardtoname · 2020-02-11T15:19:14Z

Regular employee

…

On Tue, Feb 11, 2020, 10:18 AM Wagner Leonardi ***@***.***> wrote: @Zackhardtoname <https://github.com/Zackhardtoname> Are you using a company/recruiter profile to login or just a regular employee one? Please set isHeadless to false on config.json , this will open the browser while crawling, please check if it's really logged (looking on the LinkedIn top bar) And also confirm that's 1.0.11 scrapedin version on your package.json. @cyanide2019 <https://github.com/cyanide2019> could you do the same please? I couldn't reproduce this error, it's working here, thanks. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#36?email_source=notifications&email_token=AGF32XOSGNRGPIKCY6PVASLRCK6UJA5CNFSM4KKBGSU2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELMZVSQ#issuecomment-584686282>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AGF32XMTETLAIOFZK5A4CDLRCK6UJANCNFSM4KKBGSUQ> .

leonardiwagner · 2020-02-11T15:23:35Z

@Zackhardtoname so please do the mentioned configurations and post the results here when you can.

cyanide2019 · 2020-02-11T18:34:02Z

yes, it worked for me , now the issue is , when I trying to gather the profile links from linkedin , they will send me warning and block my account and warning permanent blocking if I continue to send auto query something , how to bypass this mechanism ?

…

On Tue, Feb 11, 2020 at 7:18 AM Wagner Leonardi ***@***.***> wrote: @Zackhardtoname <https://github.com/Zackhardtoname> Are you using a company/recruiter profile to login or just a regular employee one? Please set isHeadless to false on config.json , this will open the browser while crawling, please check if it's really logged (looking on the LinkedIn top bar) And also confirm that's 1.0.11 scrapedin version on your package.json. @cyanide2019 <https://github.com/cyanide2019> could you do the same please? I couldn't reproduce this error, it's working here, thanks. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#36?email_source=notifications&email_token=AM2OZKQMSPOAIQVMFEXTRWDRCK6UJA5CNFSM4KKBGSU2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELMZVSQ#issuecomment-584686282>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AM2OZKUMAOW7LJD7IZKWW3TRCK6UJANCNFSM4KKBGSUQ> .

Aditya94A · 2020-04-08T14:27:31Z

@cyanide2019 Where you able to find a solution for this?

pushparmar · 2020-05-15T11:12:25Z

What is the use of
"rootProfiles": [
"https://www.linkedin.com/in/place/",
"https://www.linkedin.com/in/here/",
"https://www.linkedin.com/in/profiles/",
"https://www.linkedin.com/in/to-start-the-crawler/"
]
in config.json?

Also, I want to search the profiles based on some particular keywords, but
"relatedProfilesKeywords": ["javascript"], does not seems to work.

PriyaJainDev · 2020-05-15T13:16:26Z

@cyanide2019 Is there any way that I can use particular keywords and then the crawler can search through all available profiles based on those keywords only?

ThomasProctor · 2020-05-27T14:42:29Z

It's a little hard to follow what was happening here, but I think I had the same problem. Login from credentials doesn't work with headless, but everything works fine with the "headed" browser. Headless works fine with cookies for me though.

I suspect that they might just be checking the user-agent in the header and refusing to log you in or giving you a captcha if it says that it's headless. I might do some experimentation there if I find I need headless login.

ThomasProctor · 2020-05-27T14:47:58Z

If I get the time, I'll do some more experimentation and open a separate issue if I really have a diagnosable problem.

leonardiwagner added the waiting for response waiting for issue owner response label Feb 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

linkedin website changed and can not read basic data #36

linkedin website changed and can not read basic data #36

cyanide2019 commented Jan 22, 2020

cyanide2019 commented Jan 23, 2020

Zackhardtoname commented Feb 11, 2020

leonardiwagner commented Feb 11, 2020

Zackhardtoname commented Feb 11, 2020 via email

leonardiwagner commented Feb 11, 2020

cyanide2019 commented Feb 11, 2020 via email

Aditya94A commented Apr 8, 2020

pushparmar commented May 15, 2020 •

edited

Loading

PriyaJainDev commented May 15, 2020

ThomasProctor commented May 27, 2020

ThomasProctor commented May 27, 2020

linkedin website changed and can not read basic data #36

linkedin website changed and can not read basic data #36

Comments

cyanide2019 commented Jan 22, 2020

cyanide2019 commented Jan 23, 2020

Zackhardtoname commented Feb 11, 2020

leonardiwagner commented Feb 11, 2020

Zackhardtoname commented Feb 11, 2020 via email

leonardiwagner commented Feb 11, 2020

cyanide2019 commented Feb 11, 2020 via email

Aditya94A commented Apr 8, 2020

pushparmar commented May 15, 2020 • edited Loading

PriyaJainDev commented May 15, 2020

ThomasProctor commented May 27, 2020

ThomasProctor commented May 27, 2020

pushparmar commented May 15, 2020 •

edited

Loading