- Extracts Company Name, Job Title and Postion from data scraped from: https://news.ycombinator.com/jobs
- Node.js
- Sequelize.js
- cheerio
- request
from what i read python and node.js have almost same performance while scraping and i had worked with cheerio (npm package) before so i went ahead with node.js
- Please make sure you have node and npm installed by running the following commands:
node --verison
npm --version
in a terminal run the following commands:
npm install
node scrape.js
- Regex expressions could have been more accurate
- Try to use NLP techniques