Industry Hi-Tech
Specialization Or Business Function
Technical Function Software and Web Development (Desktop Applications, Scripts & Utilities)
Technology & Tools Programming Languages and Frameworks (Python)
PROJECT UPDATED MARCH 15th, 2016
We are a health technology startup focused on building machine-learning driven classifiers for medical imaging. We are currently training using the world’s diagnostic image largest (by at least 50x) training set. While an enormous treasure trove, this training set introduces unique challenges as well. We hope resource(s) from Experfy might help us overcome some of these challenges.
Our official runs are conducted on Amazon EC2 instances hitting S3 storage. Most of our experimental and exploratory research agenda is conducted on a local development environment -- an 8-core i7 + 3*TitanX + 64GB RAM hitting a QNAP NAS via NFS.
We are seeking external help on three fronts:
Software Architecture Enhancements
Our training set is 14 terabytes -- approximately 1 million images of 3000x3000 resolution. Extracting images from DICOM medical data files also takes time. We’d like to speed up non-core parts of the training cycle and pipeline.
Neural Network Architecture and General ML Advisory
Our problem is not easily shoehorned into any of the existing problems in ML-driven image classification. Specifically, we have several complexities that preclude out-of-box conv-net approaches:
We’re currently using two approaches: Support Vector Machines and Deep Neural Networks (specifically convolutional networks, variants of AlexNet.) We have a prioritized research path we’re following, but we’re very interested in variations, enhancements, and any out-of-box ideas.
Hardware Setup Validation
We think we’ve set up our hardware pretty well, but there are obviously some bottlenecks. Our guess is that we’re network-constrained currently
We’re in the process of:
We’re open to expert advice on intelligent tweaks.
Matching Providers