Experfy Big Data, Analytics, and BI Projects

Browse Projects

381 Projects that match your criteria

Equity Trading Algorithms for Market Opens and Closes

We are looking to develop equities market algorithms for use in trading, with a particular focus on the 5-10 minutes at the market open and close.

Looking at trading situations on micro scale
Analyze, compare and model rapid small movements in price
Detect and formulate repeatable patterns in rapidly changing pace of market opens and closes
What is the deliverable? The deliverables would be algos that could be employed in trading environments
Does the deliverable need to be deployed in the cloud or your infrastructure? Not needed in cloud

Financial Services

Data Mining

Forecasting

$75/hr - $175/hr

12 Proposals Status: CLOSED

Client: S******

Posted: Apr 09, 2015

Implement Statistical Model

We are a small international marketing agency specialising in paid advertising. We wish to implement a statistically sound testing model based upon the work done by Optimizely and referenced here:

http://pages.optimizely.com/rs/optimizely/images/stats_engine_technical_paper.pdf

We want to ensure:

1. that we understand the solution designed by Optimizely and that it is suitable for our needs
2. that we determine the formulae to be implemented so our PHP developers can build them programmatically
3. that we understand any limitations to the suitability of the calculations or improvements that can be made

We have a test data set of over 200 advertisers with millions of ad impressions and clicks which we can validate our tests against.
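
As a point of comparison for validating against the test data set, the following is a minimal sketch of the classical fixed-horizon two-proportion z-test on click-through rates. It is not the method in the Optimizely paper (which is designed for continuous monitoring); it is only a familiar baseline. The function name and argument names are hypothetical.

```python
import math


def two_proportion_z_test(clicks_a, impressions_a, clicks_b, impressions_b):
    """Classical two-sided z-test for a difference in click-through rates.

    Assumes a fixed sample size decided in advance; it is NOT valid if the
    result is checked repeatedly while data is still arriving.
    """
    if impressions_a <= 0 or impressions_b <= 0:
        raise ValueError("impressions must be positive")
    rate_a = clicks_a / impressions_a
    rate_b = clicks_b / impressions_b
    # Pooled click rate under the null hypothesis of no difference.
    pooled = (clicks_a + clicks_b) / (impressions_a + impressions_b)
    std_err = math.sqrt(
        pooled * (1 - pooled) * (1 / impressions_a + 1 / impressions_b)
    )
    if std_err == 0:
        # No variation at all (all clicks or no clicks): nothing to test.
        return {"rate_a": rate_a, "rate_b": rate_b, "z": 0.0, "p_value": 1.0}
    z = (rate_b - rate_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return {"rate_a": rate_a, "rate_b": rate_b, "z": z, "p_value": p_value}


result = two_proportion_z_test(100, 10_000, 150, 10_000)
print(result)
```

Inputs are raw click and impression counts per variant; outputs are the two rates, the z statistic, and the two-sided p-value.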

Our required deliverables are:

written explanations of points 1 and 3 above
formula in the form of an Excel doc or similar that our developers can program against with clear definitions of all inputs and outputs

Once implemented we will seek to continue working with the expert to advance the usable scenarios for the testing methods and then investigate further statistical and predictive models from our data.

Media and Advertising

Clickrate Optimization

Media and Advertising

$50/hr - $150/hr

Starts Apr 16, 2015

8 Proposals Status: IN PROGRESS

Client: I********* ***

Posted: Apr 07, 2015

Revenue Management System Roadmap

Our vision is to construct a revenue management system that will forecast customer move-ins at a segment level and price point and optimize based on property-level constraints (e.g. available inventory & attrition rates). The revenue management system will need to be fully automated, and capable of extracting data from our various systems as well as publishing prices back into those systems. This vision will require a significant amount of data mining and systems architecture.

Our current state is that we work from manual reports and make judgement-based decisions about when to adjust prices at a property. We typically decide to price a property every other month and we use a rules-based Excel model to guide the new rate decisions. When we have selected our new rates, we manually input the rates into a cloud-based property management system so that our stores can view the new prices. The current state lacks the scientific approach needed to optimize inventory and maximize revenue. Additionally, the manual approach is inefficient and leaves room for data input error.

Transportation and Warehousing

Pricing and Actuarial

Strategic Business Planning

$100/hr - $150/hr

Starts Mar 27, 2015

1 Proposal Status: COMPLETED

Client: W******* ****** *****

Posted: Mar 26, 2015

Data-Driven Mobile App Design and Development

We are a preeminent national financial (valuation) services firm that provides a broad range of financial advisory services to clients and to attorneys, CPAs, and other professionals involved in advising their high net worth clients on estate planning. Our services include intricate and highly specialized financial opinions, property appraisals, litigation support, and tax and business valuations. These services are provided to clients for a variety of purposes.

INFORMATION RESOURCES

The firm is well known as an authority on all matters relating to estate and gift tax valuation issues and relevant Tax Court decisions. As such, our professionals frequently author articles, alerts, case studies of valuation projects (Solution), and court case summaries and these documents are typically distributed – either by means of e-blasts or traditional mail – to our large database of clients and contacts. Most of this material is also accessible on our website.

PROJECT OBJECTIVES

Broadly stated, we are searching for someone to work with us on a project that includes the following two elements:

UI Design of a mobile app that reflects and/or upgrades our firm's brand
Creation of a mobile app that would enable searching the above web-based documents in a variety of ways e.g. by topic, by Judge, by category, etc. (It is anticipated that, as other documents are added to our information resources, those also will be included in the search capabilities – e.g. actual court cases.)

PROJECT TIMELINE

Our goal is to get this project started and completed as soon as possible, with exact time lines determined soon after we engage with the selected project contractor.

PROJECT COORDINATION

We are located in Orange County, California. While we understand that potential contractors are located throughout the United States, our preference is to have someone in the general Southern California area work on this project with us, as this would enable face-to-face meetings if and when necessary.

TECHNICAL INFORMATION / SPECIFICATIONS

The mobile app will need to be compatible with all mobile phones and tablets. We can consider developing native apps for iOS and Android, or use an HTML 5 mobile application framework. We would like to understand the pros and cons and cost of each in your proposal.
The data is hosted remotely on a CMS running on a LAMP platform.
The mobile app will need to be available through iTunes or Google Play store for download.
After the search is performed and the desired content is displayed, the user should be able to read those articles and save favorites for future use. Therefore, that functionality will require a user login or some way to hold state.
We would like the static files to be cached locally on the mobile device and just call the website when needed.

iOS

Android

UI Design

$20,000 - $50,000

Starts Apr 01, 2015

8 Proposals Status: CLOSED

Client: F*** ********* ****

Posted: Mar 20, 2015

Algorithm to Identify Inappropriate Content on Photo Sharing Platform

We are building a platform like Instagram where users will upload millions of images. We need the ability to flag inappropriate images for moderator review. From our platform perspective, inappropriate images are those that show nudity below the waist. We are comfortable with the idea of receiving a small percentage of "false positives" since we will have moderators who can review them.

We see this as a two-phase project. In the first phase, we would like an algorithm that would classify these images into two buckets, appropriate and inappropriate, at a 90% confidence level. Down the road, we would have a second phase of development in which moderator intervention (signalling whether the algorithm was right or wrong in its classification) would train the algorithm further based on whether the image was classified correctly.

Please be very specific about your approach. Our short-term goal is to substantially reduce the number of images that our moderators have to review. The algorithm must deliver the desired result for the project to be considered successful. We are unable to provide any training data since the site has not launched yet.

Media and Advertising

Computer Vision

Image Analysis

$5,000 - $10,000

Starts May 29, 2015

14 Proposals Status: IN PROGRESS

Client: M**** *****

Posted: Mar 17, 2015

Data Dashboards in Tableau for National Education Project

The Digital Youth Network supports organizations, educators and researchers in learning best practices to help develop youths’ technical, creative, and analytical skills. We seek a consultant to create data dashboards for various stakeholders in collaboration with the DYN team.

Education

Game Theory

$50/hr - $150/hr

Starts Mar 16, 2015

1 Proposal Status: IN PROGRESS

Client: D****** **********

Posted: Mar 16, 2015

A multidimensional knowledge mapping platform

The 3 main elements in the software are the database, the 3D visualization, and the machine learning. I'm currently outsourcing my software to a team of programmers to build a functional prototype ASAP. They consider using MySQL, three.js, OpenGL, etc. We have finalized the data relations and visualization but they don't have the capabilities to incorporate the requirements for future scaling and machine learning into the design.

Therefore, I want to consult on the architecture of my software, including choices of programming languages, databases, APIs, etc. to prepare for future scaling and minimize potential costs of database migration.

Since I don't have a technical background, I would really appreciate if your consultation can be as descriptive, detailed, and documented as possible.

In order to set out confidently with the final destination in mind, I'm going to describe the basics of the software, the purposes, and my vision for its full-fledged version (though I'm totally aware of the slim chance that I can reach that point).

----

Pichutz is an online "multidimensional knowledge mapping platform" where each unit of knowledge (a "room") is mapped on 4 dimensions of knowledge. Each dimension has a unique meaning and together they guide user towards creating more "important" knowledge rather than random knowledge - conceptually high, conceptually deep, "possibility-complete," resource-efficient, machine-relevant, multidisciplinary, algorithmic, machine-readable knowledge.

The first 3 dimensions (Conceptual, Human, Scopic) correspond with 3 spatial dimensions (x-, y-, z-axes) in 3D visualization, while the 4th ("Language") dimension can be navigated along by picking a level of machine-readability (natural language -> pseudo-code -> code -> even "rawer languages" which are discussed later).

The interface resembles Google Earth the most with the blue globe replaced by a tetrahedron (all edges are equal) filled with rooms. A user can "fly" through the rooms the same way one flies through Wikipedia articles in WikiGalaxy. This is the Pichutz Home with commands arranged on the top and tools at the bottom.

The most important dimension to understand is the 1st (Conceptual) one on the vertical axis. It's best to imagine Pichutz as a pyramid where each room is supported or "propped up" by lower rooms. "B props A" means A is either based on, inspired by, made possible with, invoked in, incorporates or hints at B. The propping room B may contribute either a high probability (a proving prop) or a low probability (a disproving prop) to A. Without a direct relation, a higher room A always implies a higher physical impact in reality over a lower room B, e.g. an international treaty over the behaviors of an isolated subatomic particle at low temperature.

----

The main guideline in room-opening is to open a room containing a knowledge that you want to prove, so that other users can prove or disprove it with their knowledge, according to their credentials, by "propping" that room with rooms containing their knowledge. This practice helps arrive at an overall probability (calculated from all contributed probabilities) for any absolute statement such as "NoSQL databases are more scalable than SQL and provide superior performance" or "climate change will cause human extinction within 15 years." When such statements turn out equally proved and disproved, they suggest more advanced/encompassing ("Higher") technologies/truths. "Backprop" is prop's opposite concept which requires further contemplation.

Two important purposes of Pichutz (among others) are to address existential risks in order to divert investment into businesses or organizations that amass more "important" knowledge such as SpaceX or CERN, and to promote the "theory of weakness" (with weakness abstractly referring to any higher room's failure to incorporate a possible lower room) which addresses insecurities in the age of exponential technological advancement, answers philosophical questions regarding war and peace, ultimately proves humans' "inherent weaknesses" and hopefully spawns a science of "weakness management."

A room can be forked and pull changes to incorporate its propping rooms and improve its probability. This way, sentences of knowledge will evolve into full theories (e.g. root of all existential risks), and eventually into coded programs (e.g. plans to survive by secretly dominating the world) to feed to future AIs. This is where the "Language" dimension bears fruit - a pipeline refining "human knowledge" into AIs' executable knowledge. This is the top-down approach, as opposed to the conventional bottom-up approach in AI development, to be complementary.

There is supposed to be a Cambrian explosion of rooms at first, which are mostly short sentences. Bots must be developed to brutally prune and crunch them and, in the process, guide and assist users in systematically learning and collaboratively creating more "important" knowledge as explained above.

----

Bots must be developed to perform many kinds of operations on the rooms, which require increasingly higher levels of machine learning, including:

1. stitching (putting rooms visualized in 3D together into Wikipedia-styled 2D articles for traditional reading),

2. validating (automatically propping rooms, merging similar rooms, migrating current rooms' probabilities to new rooms),

3. critical thinking (flagging contrasting rooms, pruning low probability rooms, killing spammers),

4. leveling (suggesting higher, unifying rooms based on contrasting lower rooms),

5. questioning (if not able to suggest unifying rooms, generating questions on Quora/StackOverflow),

6. crossing (linking problems with solutions in different fields, identifying similar solutions. See "the idea behind"),

7. translating (translating "human knowledge" into more machine-readable knowledge),

8. analytics tools will possibly be the main revenue stream (besides credentials accreditation, Private Pichutz, planning tool, Pyrofile search, headhunting, "room as a commodity," etc.). This includes summarizing common elements of successful products (e.g. ninja, wizard in games, movies), untapped combinations of product elements (e.g. ninja + wizard in the same movie), or untapped markets (e.g. big data experts in North Korea, baby clothes in Vietnam, or a photo-disappearing app for exceptionally unproductive teens).

To enable such tools, data must be mined to generate rooms, and rooms must be designed to be inputted by users in special ways such as "multidimensional checklist," "graphic room," "programmable rooms," or "meta-room," which require further contemplation.

The "Language" dimension will be developed last. It will allow knowledge to be inputted in more machine-readable language (e.g. programming code), or even "rawer languages." Imagine that molecules "talk" with each other in physical interactions and their knowledge exists in physical forms, or imagine that each AI with a different setup and set of parameters stores knowledge in a different "language" (after learning from datasets) that can "magically," for example, recognize human speech. In my daydream, Pichutz can become a Wikipedia+Github of the future, or more than that.

----

For philosophical details, you can check out this long document bit.ly/ReadTheSP

For how I described Pichutz to programmers, please refer below, which is mainly on data relations and visualization.

Please don't mind minor differences between the document and the description below. The latter is more realistic.

----

Admin Section:

Admin Management
User Management
Taxonomic Tree, Scopic Tree Management
Rooms Management

Front End:

Open Room
Create Prop

Create Scopon
Pyrofiles (user profiles)
Visualization
More functionality to be added

(Visualization)

The full shape of Pichutz is a tetrahedron. All edges are equal to X.

Before any room is opened, it's all black (dark). Only the edges are visible as thin white lines.

Each room when opened looks like a square window with light coming out of it. As more rooms are opened, the SP will light up and its full shape will become gradually visible.

All rooms appear only on the front triangle (the front face of Pichutz).

There are 6 invisible lines on this front face (not to visualize, just to keep in mind):

(1) the perpendicular bisector that divides the face into the left side and the right side

(2) the Floor 0 line that is 0.6X away from the apex, so that the area of the triangle above it ("High knowledge") and the area of the trapezium below it ("Deep knowledge") are equal.

(3-6) two pairs of Soft lines and Hard lines, dividing the face into two Soft shells, two Hard shells, and a Super Hard core.
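
The 0.6X figure for the Floor 0 line can be checked with a short calculation. This sketch assumes the distance is measured along the altitude of the equilateral front face (edge X), which is an interpretation rather than something the description states; the function name is hypothetical.

```python
import math


def floor_zero_distance(edge):
    """Distance from the apex, measured along the altitude, of the horizontal
    line that splits an equilateral triangle into two equal areas."""
    height = edge * math.sqrt(3) / 2
    # A line at distance d from the apex cuts off a similar triangle whose
    # area is (d / height)**2 of the whole, so equal areas need d = height / sqrt(2).
    return height / math.sqrt(2)


edge = 1.0
d = floor_zero_distance(edge)
print(round(d, 4))  # about 0.6124 for edge = 1, i.e. roughly 0.6X
```

Under that assumption the exact value is X·sqrt(3)/(2·sqrt(2)), about 0.612X, which agrees with the rounded 0.6X above.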

(Left or right) If the new room is related to an existing room, it appears on its same side. Otherwise, it appears on the opposite side of the last created room.

(Floor number)

(Floor 0) In theory, rooms on Floor 0 should be the easiest knowledge that humans can acquire. The 1st room we create is not likely to fit this description. So, at any time, the admin can pick a room and set it to Floor 0, and all rooms will be moved accordingly. For example, if a current room on 5th is set to be on 0th, all rooms will go down by 5 floors.

When we "Open room", we will input the room(s) that this new room props. The new room will always be below all existing rooms.

- When a new room is opened, if no prop is inputted, it will be on Floor 0.

- When a new room is opened and one prop (or many) is inputted, it will be 1 floor lower than the lowest propped room, and no existing room will change position. For example, if the new room props rooms on -2, -1, 0, 1, 2, then it will be on -3.
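
The two "Open room" rules above can be sketched as a single function. The function name and the list-of-floors input are hypothetical; only the rule itself comes from the description.

```python
def floor_for_new_room(propped_floors):
    """Return the floor of a newly opened room.

    propped_floors: floors of the existing rooms that the new room props.
    """
    if not propped_floors:
        # No prop inputted: the room starts on Floor 0.
        return 0
    # Otherwise sit one floor below the lowest propped room.
    return min(propped_floors) - 1


print(floor_for_new_room([]))                 # 0
print(floor_for_new_room([-2, -1, 0, 1, 2]))  # -3
```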

When we "Create a Prop" between two existing rooms, we input the probability that the propping room contributes to the propped room. Following this, existing rooms' floors may or may not change.

- If the propping room is lower than the propped room, then no floor changes.

- If the propping room is higher than the propped room, check this example (create a prop from A to B) in which each arrow is a prop. There are 2 cases:

a) If there is a prop from C to D (C props D), then a relational conflict (a loop in this case) can be identified immediately: A --> B --> C --> D --> A. The prop from A to B thus can't be created.

b) If there is NO prop from C to D, then B and every room above it will move up above A. It means B will move up 5 floors from F0 to F5 (A's floor + 1). All rooms that B props will move up 5 floors, and all rooms that each of them props will also move up 5 floors, and so on. This is recursion. Only rooms that need to move move (colored black). Rooms that don't need to move don't move.
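
A minimal sketch of the "Create a Prop" rule, under stated assumptions: props are stored as a dict mapping each propping room to the set of rooms it props; every room reachable from B by following props is shifted by the same number of floors; and two rooms on the same floor are treated like the "higher" case. The floors given to C and D in the example are made up, since the original diagram is not reproduced here.

```python
from collections import deque


def reachable(props, start):
    """Rooms reachable from `start` by following props upward (start included)."""
    seen = {start}
    queue = deque([start])
    while queue:
        room = queue.popleft()
        for propped in props.get(room, ()):
            if propped not in seen:
                seen.add(propped)
                queue.append(propped)
    return seen


def create_prop(floors, props, propping, propped):
    """Try to add the prop `propping` -> `propped`, updating floors in place.

    Returns False (and changes nothing) if the prop would close a loop.
    """
    upward = reachable(props, propped)
    if propping in upward:
        return False  # case (a): relational conflict
    if floors[propping] >= floors[propped]:
        # case (b): lift the propped room, and everything it supports,
        # so that it ends up one floor above the propping room.
        delta = floors[propping] + 1 - floors[propped]
        for room in upward:
            floors[room] += delta
    props.setdefault(propping, set()).add(propped)
    return True


floors = {"A": 4, "B": 0, "C": 1, "D": 3}
props = {"B": {"C"}, "D": {"A"}}
print(create_prop(floors, props, "A", "B"), floors)
```

In this example B moves from F0 to F5 and C follows it to F6, while A and D stay where they are.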

(Taxa/Scopa)

All taxa and scopa will appear inside the bulk of Pichutz. I don't put rooms in there. This space is definitely bigger than the area of the front face where rooms are populated, because eventually there will be more taxa/scopa (groups of rooms) than rooms. This is similar to there being always more synapses than neurons in the brain.

Each taxon/scopon looks like a round circle with light coming out of it.

Trees:

1. Each node on Taxonomic Tree is a taxon (in plural: taxa). Taxa are created by the Admin.

2. Each node on Scopic Tree is a scopon (in plural: scopa). Scopa are created by users.

I'm not sure yet where to download structured data of an accurate Taxonomic Tree but these can be a good start for reference (use whichever you can extract data from, or something else):

https://en.wikipedia.org/wiki/Outline_of_academic_disciplines

http://dewey.info/

http://www.loc.gov/catdir/cpso/lcco/

Authority:

1. Taxonomic Tree is viewed by everyone, and edited by Admins.

2. Scopic Tree is viewed by everyone. Users can only edit the ones they created. Admins can edit anything.

Nesting:

1. A scopon can be nested under any scopon/taxon and/or nest any scopon/taxon under it (as long as there's no hierarchical conflict).

2. There are two special taxa: "Facts" and "Data." They are included in the Taxonomic Tree.

View and management:

1. Taxonomic Tree and Scopic Tree are viewed like label view in Gmail (note: "Facts" and "Data" are separate on top).

2. When a room is opened, selecting a taxon/scopon is like selecting a label in Gmail.

3. Creating a scopon is like creating a label in Gmail.

4. Taxon/Scopon management is like label management (note "Facts" and "Data" on the top and, in this example, 2 taxa on the bottom). When taxa and scopa are nested under a scopon, only the scopa can be edited, the taxa can only be removed.

5. Scopa can be created/edited as per https://www.dropbox.com/s/u5wypu4l8o6sogw/Create-Edit%20Scopon.jpg?dl=0

Visualization of Taxonomic Tree and Scopic Tree:

1. The 2 trees are managed separately but, in visualization, they are "blended" together.

2. In short, the fixed Taxonomic Tree will be created first and serve as the backbone, equally spaced between the front face ABC and farthest vertex O. The lowest ranked taxa are positioned closest to the front face, and the root(s) furthest from the front face (or closest to the furthest vertex).

3. When a scopon is added, it will simply appear above all the taxa/scopa it contains, and below those it is nested under.

4. If the added scopon is on top of the highest taxon (as in the example in the photo above), the spacing will be adjusted and, in effect, all taxa/scopa will move closer to each other. At first, there are only taxa, which are sparsely positioned. Over time, when a lot of scopa have been added, taxa/scopa will be very dense.

(Pyrofile - user profile)

A user can:

- Create a room.

- Create a prop between room A and room B and the probability room A contributes to room B.

- Create a scopon, nest it under other scopon/taxon, and/or nest other taxon/scopon/room under it.

This is when you view your own Pyrofile or any user's Pyrofile: https://www.dropbox.com/s/gsb91wwreujhejq/Pyrofile%20%28user%20profile%29.jpg?dl=0

The "Manage list" hyperlinks link to lists displayed as per https://www.dropbox.com/s/y06oajqdsrlthpd/List%20display.jpg?dl=0

Lists can be sorted in ascending or descending order. When you view another user's Pyrofile, "Manage list" becomes "View list."

Beta users are those whom I absolutely trust and allow to set their own creds for taxa. Refer below.

(Cred)

Each user has a different cred for each taxon (which means a different qualification for each discipline). The default cred for each taxon of each user is 1%. Beta users (whom I trust) can set their own creds up to 50% (no one can be more than 50% sure about almost anything). Normal users in the future are not allowed to set their own creds; we do.

If a room is under different taxa on the same line, for example, the room "Schrodinger's equation indicates that the future is predetermined" is under "Quantum Mechanics" which is under "Theoretical Physics" which is under "Physics", then a user's cred associated with that room is his cred for the closest taxon of that room ("Quantum Mechanics") rather than the remote "Physics" because, statistically speaking, a self-proclaimed physicist hardly knows enough QM to draw such a conclusion.
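
The cred lookup described above can be sketched as follows, assuming the Taxonomic Tree is stored as a child-to-parent map and each user's creds as a dict keyed by taxon. These structures and names are hypothetical; the 1% default and the closest-taxon rule come from the description.

```python
DEFAULT_CRED = 0.01  # default cred for every taxon of every user (1%)


def depth(parent_of, taxon):
    """Number of ancestors of `taxon` in the Taxonomic Tree."""
    steps = 0
    while parent_of.get(taxon) is not None:
        taxon = parent_of[taxon]
        steps += 1
    return steps


def closest_taxon(parent_of, room_taxa):
    """Among taxa on the same line, pick the most specific (deepest) one."""
    return max(room_taxa, key=lambda taxon: depth(parent_of, taxon))


def cred_for_room(user_creds, parent_of, room_taxa):
    """Cred comes from the closest taxon only; ancestor creds are not inherited."""
    return user_creds.get(closest_taxon(parent_of, room_taxa), DEFAULT_CRED)


parent_of = {
    "Physics": None,
    "Theoretical Physics": "Physics",
    "Quantum Mechanics": "Theoretical Physics",
}
room_taxa = ["Physics", "Theoretical Physics", "Quantum Mechanics"]
physicist = {"Physics": 0.40}
print(cred_for_room(physicist, parent_of, room_taxa))  # 0.01
```

A user with a 40% cred in "Physics" but none in "Quantum Mechanics" therefore falls back to the 1% default for this room.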

(Room)

Each room has a unique room ID and a permanent link.

When a user clicks on a room, he can also see all the probabilities contributed by all rooms that prop or backprop that room, presented in form of a probability distribution of probabilities.

(Overall Probability)

For a room, a probability distribution of probabilities can be informative (e.g. divided opinions), but a single Overall Probability, that takes into account the creds of the users who contributed probabilities to this room, would be useful in most cases. To calculate room A’s OP, the probabilities of room A contributed by each of its propping rooms are weighted by [each propping room’s OP] and [the average cred of the props' users associated with both room A and the propping rooms]. This is the simple formula.

It is because a user needs qualifications/credentials in both fields to make a connection between them. For example, for your prop between a room under “Quantum Mechanics” and a room under “Quantum computing” to matter, you need high creds in both fields.

Since a room's OP takes into account its related rooms' OPs, I think we should employ an iterative algorithm similar to PageRank (section 2.1.1), which means iterating the calculation of all OPs until their values converge.
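
One possible reading of the simple formula, iterated until the values settle, is sketched below. Several things here are assumptions rather than part of the description: "weighted by" is taken to mean a weighted average with weight = (propping room's OP) x (average cred); rooms with no props keep a fixed `base_op`; and each prop arrives with its average cred already computed. All names are hypothetical.

```python
def overall_probabilities(rooms, props, base_op=0.5, tol=1e-9, max_iter=1000):
    """Iteratively compute an Overall Probability (OP) for every room.

    props: iterable of (propping_room, propped_room, probability, avg_cred).
    """
    op = {room: base_op for room in rooms}
    incoming = {room: [] for room in rooms}
    for propping, propped, probability, avg_cred in props:
        incoming[propped].append((propping, probability, avg_cred))

    for _ in range(max_iter):
        new_op = {}
        for room in rooms:
            # Weight of each contributed probability: propping room's OP x cred.
            weighted = [
                (op[src] * cred, prob) for src, prob, cred in incoming[room]
            ]
            total = sum(weight for weight, _ in weighted)
            if total > 0:
                new_op[room] = sum(w * p for w, p in weighted) / total
            else:
                new_op[room] = base_op  # nothing props this room yet
        change = max((abs(new_op[r] - op[r]) for r in rooms), default=0.0)
        op = new_op
        if change < tol:
            break
    return op


rooms = ["A", "B", "C"]
props = [("B", "A", 0.9, 0.50), ("C", "A", 0.1, 0.01)]
print(overall_probabilities(rooms, props))
```

Here the high-cred prop (0.9) dominates the low-cred one (0.1), so room A ends up near 0.88. Whether the real formula should normalize the weights this way is left open by the description.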

(Color)

Taxa: light blue

Scopa: light green

Soft rooms: pink

Hard rooms: orange

Super Hard rooms: red

Normally, all rooms glow. However, when we click on a taxon or scopon, only rooms contained by (associated with) that taxon/scopon glow while other rooms dim. When we click outside, all rooms glow again.

Singularity

Big Data

Crowdsourcing

$100/hr - $125/hr

Starts Apr 25, 2015

3 Proposals Status: CLOSED

Client: A*********** ******

Posted: Mar 15, 2015

NLP engine for reviewing and coding medical records (Phase 1: Advisory/Strategy Roadmap)

Develop a strategy and business plan for building a Natural Language Processing engine that utilizes clinical text mining algorithms (trained from medical records previously coded by us) to improve medical coding productivity and lower the time spent per record. We currently employ over 1,000 coders worldwide and the goal is to build a technology tool that can allow us to scale our business without increased headcount and improve efficiencies in our process. The engine should flag clinical diagnosis code opportunities to the medical coder based on text mining algorithms, after which the coder will confirm or reject the suggestion.

Healthcare

Bioinformatics

Biostatistics

$50/hr - $100/hr

Starts Mar 23, 2015

7 Proposals Status: IN PROGRESS

Client: E********* ***

Posted: Mar 12, 2015

MongoDB "Expert" level DBA

Needs hands-on but strategic consulting with scale-up issues:

* Swap storms

* Slow query performance

* Replication issues

for a rapidly growing implementation with an ever increasing user base.

Hi-Tech

Professional Services

Risk and Compliance

$150/hr - $165/hr

Starts Mar 16, 2015

4 Proposals Status: CLOSED

Client: N********

Posted: Mar 11, 2015

Conversion from Excel Based Systems to an Integrated Dashboard

We are a UK based product design and sourcing agent working with retailers and brands around the world. We design in London and make in China. We focus on impulse gifting; for example, beauty, skincare, toys, gifts, bags and fashion accessories. We supply retailers like Target, Urban Outfitters, Disney, Marks and Spencers etc. The majority of the product we sell is private label, although we are creating our own range later this year.

Objectives:

To transition from our current Excel based structure to an integrated dashboard with a user interface that allows for simplified data entry and KPI driven reporting/analytics that will increase profitability and resource productivity.
To enable decision-making based on data driven analytics rather than intuition.

Current process and tools:

See word doc "Process and Tools" and "Current Excel Docs" zip folder

Current data collected:

PRODUCT (category + sub category, IP ownership, testing pass/fail rate, time spent to develop, no of orders won)
PURCHASE ORDER (cost price, selling price, on time delivery, client penalty)
CLIENT (actual sales, pipeline, target margin, contact details per dept)
FACTORY (location, products supplied, QC pass/fail rate, Social and Ethical status)

Milestones:

We would like to break this project into several milestones. Please propose a methodology to perform this work to incrementally build the full solution.

Start date: ASAP

Consumer Goods and Retail

Sales Forecasting

Sales Rep Incentive Scheme Analysis

$2,500 - $70,000

Starts Mar 09, 2015

16 Proposals Status: CLOSED

Client: M****** ***

Posted: Mar 06, 2015
