Meteor Population with Scraped Content

last updated July 29, 2021 6:01 UTC

UniMeteor

HQ: Remote

more jobs in this category:

  • -> Social Media Work From Home @ HireSociall
  • -> Writers Looking For A Steady Reliable Income @ AmpiFire
  • -> English Content Writer @ Livingston Research
  • -> Marketing / Sales Assistant with Excellent Project Management skills (10 hrs / week) @ The Search Guru
  • -> Chat Assassin (Social Media Sales) @ Made Omni INC

For the first stage, we focus on the following sources, filters and tags. SOURCES Job boards:

  • https://de.indeed.com/?r=us
  • Linkedin jobs
  • Glassdoor jobs
  • Angellist
  • Dribble
  • Behance
  • Github
  • Stackoverflow
  • https://authenticjobs.com

Corporate websites:

  • Uber jobshttps://www.uber.com/jobs/list
  • Airbnbjobshttps://www.airbnb.com/careers/departments
  • Snapchatjobshttps://https://www.snapchat.com/jobs
  • Palantirjobshttps://www.palantir.com/careers/
  • SpaceXjobshttp://www.spacex.com/careers/list
  • Pinterestjobshttps://https://www.pinterestcareers.com
  • Dropboxjobshttps://https://www.dropbox.com/jobs/all-jobs
  • Weworkjobshttps://https://careers.wework.com
  • Theranosjobshttps://www.theranos.com/careers
  • Spotifyjobshttps://www.spotify.com/us/jobs/
  • Squarejobshttp://https://app.jobvite.com/admin/info/404.html. aspx?c=q8Z9VfwV

FILTERS Only post when there a match of BOTH: a title AND a company name. Titles to filter (ignore everything else):

  • “cmo"
  • “marketing"
  • “social media"
  • “brand"
  • “acquisition"
  • “retention"
  • “blogger"
  • “media manager"
  • “internal communications"
  • “business development"
  • “partnership"
  • “corporate development"
  • “corporate strategy"
  • “business strategy"
  • “customer support"
  • “customer experience"
  • “customer success"
  • “product manager"
  • “COO"
  • “operations"
  • “recruiter"
  • “talent"
  • “sourcer"
  • “legal"
  • “paralegal"
  • “counsel"
  • “finance"
  • “treasury"
  • “accounting"
  • “accountant"
  • “tax"
  • “FP&A"
  • “financial analyst"
  • “cfo"

Company names to filter:

  • Uber
  • Airbnb
  • Snapchat
  • Palantir
  • SpaceX
  • Pinterest
  • Dropbox
  • Wework
  • Theranos
  • Spotify
  • Square

Example:

  • Uber – Business Development Manager -> Post
  • Uber – Software Engineer -> Don’t post
  • Procter & Gamble – Business Development Manager -> Don’t post
  • Procter & Gamble – Software Engineer -> Don’t Post

POSTING: The following fields should be identified and properly stored in the database for each post:1. Title2. Company name3. Source name4. URL of original posting (in case of a duplicate use corporate website URL)5. Job content (job description, requirements, etc)6. Location7. Category (full-time, contract, internship)8. Type (remote, local)

TAGGING: All tags / categories:

  • finance
  • legal
  • HR
  • sales
  • marketing
  • customer support
  • business development
  • operations
  • product management
  • strategy
  • data and analytics

Tagging algorithm:Tags such as “marketing & communications” should be applied to posts if any of the keywords are found in job title, not in content:

  • marketing & communications

  • “cmo"

  • “marketing"

  • “social media"

  • “brand"

  • “acquisition"

  • “retention"

  • “blogger"

  • “media manager"

  • “internal communications"

  • business development

  • “business development"

  • “partnership"

  • sales

  • “sales"

  • data and analytics

  • “business analyst"

  • “data analyst"

  • strategy

  • “corporate development"

  • “corporate strategy"

  • “business strategy"

  • customer support

  • “customer support"

  • “customer experience"

  • “customer success"

  • product management

  • “product manager"

  • operations

  • “COO"

  • “operations"

  • HR

  • “recruiter"

  • “talent"

  • “sourcer"

  • legal

  • “legal"

  • “paralegal"

  • “counsel"

  • finance

  • “finance"

  • “treasury"

  • “accounting"

  • “accountant"

  • “tax"

  • “FP&A"

  • “financial analyst"

  • “cfo"

A note on duplicates.Duplicates should be handled adequately. For instance, if a certain Uber job posting is found both on Uber jobs (www.uber.com/jobs/list) AND on https://de.indeed.com/?r=us, the job should only be posted once to Meteor. A note on robustness.The code should properly handle changes in design, layout or structure on any of the sources. In other words, it should work properly when and if any of the websites above completely change their design. For example, if Airbnb chooses not to break down their jobs by departments but display all of them on one page. Or, otherwise, if Snapchat chooses to have a separate URL for each team. Even minor errors are not acceptable. All the fields should be properly identified. A note on scalability. The system should be easily scalable and should handle millions of job postings without any bugs or slowdown in performance. If interested, please, respond with:* Total project cost and following breakdowns:* Project cost if only job boards are used for scraping (corporate job websites are not used)

  • Project cost without tagging

  • Project cost without checking for duplicates (but using both job boards and corporate websites)

  • And other variables that you think are relevant and influence the cost

  • Proposed payment terms

  • Proposed timelines and milestones

  • Any other additional information that is needed on our end

  • Examples of relevant previous projects

    VERY IMPORTANT: To separate you from the spammers, please write I AM REAL as the first line of your bid. We will delete all bids that do not start with this phrase, since most bidders never read the requirements. Thank you for being one who does.

Shopping Cart
There are no products in the cart!
Total
 0.00
0