facebook-pixel

Entity Extraction from Websites to MongoDB

Industry

Specialization Or Business Function

Technical Function Analytics (Data Mining)

Technology & Tools

COMPLETED Oct 09, 2014

Project Description

I have an immediate need to extract company, people, and event entities from different websites (including news websites) and populate the results into my MongoDB. Digital Reasoning's tool Synthesys (http://www.digitalreasoning.com/uncommon-technology) seems to do what I need; however, I need someone who can set everything up for me or recommend something better.

1. Recommend an existing open source or commercial tool to extract and resolve company and people entities using only the already built-in fields for those objects (eg. addresses, name, etc.).  No customization required.  Open source preferred (to run on my server) but not required. 
2. Install the tool (if necessary) / turn it on.
3. Add a few destination URLs and RSS feeds
4. Send the output to my MongoDB instance

My developer will help with whatever setup is necessary on our servers.

Project Overview

  • Posted
    September 02, 2014
  • Planned Start
    September 08, 2014
  • Delivery Date
    September 30, 2014
  • Preferred Location
    From anywhere

Client Overview


EXPERTISE REQUIRED

Matching Providers