Data Science & Machine Learning

OpenAnalytics has in-depth understanding and practical experience at all steps of the data analysis process and organizes its services along 4 pillars

  • 1-Statistical Consulting
  • 2-Scientific Programming
  • 3-ML Application Development and Integration
  • 4-Data Analysis Hardware and Hosting

Statistical Consulting


Open Analytics employs a team of PhD-level statisticians and machine learners that are trained to translate customer questions into appropriate analysis methodology. This can involve the design of experiments or sample surveys, the analysis of data as well as development of novel methods to tackle specific characteristics in the data that require a carefully chosen approach to answer the right questions. In terms of specialization, the team covers a very broad range of techniques including.

Traditional statistical modeling and simulation techniques
Time series analysis and forecasting
Machine learning and artificial intelligence
Data mining and text mining
Digital signal processing
Image processing and analysis
The application domains in our consultancy projects have been very diverse and include analysis of high-dimensional biological data (omics)
Modeling of ecological and environmental data
Health monitoring and IoT data
Six sigma, statistical quality and process control
Analysis of marketing and financial data

A sample of projects are described in our case studies. Based on our large experience we also offer training in data analysis or specific statistical or machine learning methodology.


Scientific Programming


When proper methodologies for analyzing data are identified, software implementations are needed to perform the actual analysis. Often, implementations are readily available, but when methodologies need tuning it may require expert programming expertise to apply the relevant modifications. Open Analytics has a skilled team of software engineers that combine strong methodological skills with solid algorithmic know-how to bring scientific programming problems to a good end. In some circumstances, research papers describe methodologies that are not readily available and Open Analytics has appropriate depth of skills and experience to implement such methods straight from the research paper. Another common scenario where Open Analytics is called in for help is to work on code that does not run efficiently enough for customer purposes. In that case Open Analytics will review and optimize the code or parallelize parts of the code to speed up execution.


When implementation using high-level languages is an option, Open Analytics will develop solutions using R, Python or the Julia language. On certain projects, however, code will be developed in low-level languages (C, C++, Fortran) or languages that are appropriate for the use case (Scala, Java).

Examples of scientific programming tasks are:

Implement very fast methods for non-negative matrix factorization methods
Extend an implementation of the SAEM algorithm to take into account censored data
Port simulation code for optimal design of PK experiments to C++
Fine tune and speed up the implementation of an MRMR based feature selection method
Run advanced dimension reduction methods in-database

A sample of projects are described in our case studies and Open Analytics provides trainings in common data science languages including R, Python and Julia.