facebook-pixel

Entity Extraction and Content Curation from Profiles of Experts in Biotechnology

Industry Pharmaceutical and Life Sciences

Specialization Or Business Function

Technical Function Software and Web Development (Information Extraction Web Scraping System)

Technology & Tools Programming Languages and Frameworks (Python)

CLOSED FOR BIDDING

Project Description

We are a startup in the process of building a database of biotechnology experts.  We have identified the following five sources with existing expert profiles:

  1. www.clinicaltrials.gov
  2. researchgate
  3. NIH: https://projectreporter.nih.gov/reporter_summary.cfm
  4. Stanford Med.  https://med.stanford.edu/profiles/browse?affiliations=capFaculty
  5. Harvard Catalyst.  https://connects.catalyst.harvard.edu/Profiles/search/default.aspx?showcolumns=1&searchtype=people&otherfilters=

We would like to extract the following information to import into our database:

  1. Name
  2. Title 
  3. Institution
  4. Specialty/Focus/Interest
  5. Bio/Training
  6. Publications
  7. Photo, if available

Please review the sources of content and provide your approach and ballpark estimate in hours to import 10,000 user profiles into our database. Either SQL server or MySQL are acceptable.

Project Overview

  • Posted
    February 10, 2016
  • Planned Start
    February 15, 2016
  • Delivery Date
    February 22, 2016
  • Preferred Location
    From anywhere

Client Overview


EXPERTISE REQUIRED

Matching Providers