Votre navigateur est obsolète !

Pour une expériencenet et une sécurité optimale, mettez à jour votre navigateur. Mettre à jour maintenant

×

Olivier Doubiani

Olivier Doubiani

Data Scientist / Data Engineer consultant

39 years old
Driving License
RUEIL MALMAISON (92500) France
Consultant Open to opportunities
DATA SCIENTIST / ENGINEER
Engineer in Medical Informatics and Big Data.
Multi-skills: Statistics, Computer Science and Biology and Data Sciences / Data Engineering
  • Support of the technical part of the migration of customer data and cars of OPEL / VAUXHALL in the IS of PSA following its purchase for the CRM software SALESFORCES
  • Project management support for the migration of customer data and OPEL / VAUXHALL cars in PSA's IS after its purchase for the SALESFORCES CRM software
  • Mapping database fields for migration
  • Taking part in the meetings of the ComEx Steering Committee
  • Assistance in the management and coordination of the project of opening DSTI offshore diploma "Applied MSc in Data Engineering" at EMPSI Casablanca, Morocco for launch in January 2019
  • Shooting promotional videos and program details
  • Intervention on Moroccan radio LUXE RADIO to talk about the challenges of Big Data in Africa
  • Help in setting up recruitment procedures
  • Feasibility study of a connected object
  • Validation tests of a website
  • Code correction of an Access application
Learn more
  • Analysis, Installation, Deployment, Follow-up at start-up, Training, Validation and Recissioning of a software package for the management of blood donations and transfusion safety for the patient.
  • Creation of stored procedures and statistical queries in SQL and PL / SQL
  • Implementation of SQL Server and Oracle DBD (Creation of Spaces Tables and Data Structure)
  • Installation of interfaces between the Hemobank system and a HL7 laboratory management system
  • Installation of interfaces between the Hemobank system and a patient identity management system (HL7 and HPRIM standard)
  • Installation of interfaces between the Hemobank system and PLCs for Immunohematology, Molecular Virus Detection, and HL7 and ASTM Whole Blood Separator
  • Technical Implementation and Functional Validations of Interfaces with Hospital Management Systems
  • Technical Implementation and Functional Validations of Interfaces with Serology Automaton
  • Technical Implementation and Functional Validations of Interfaces with Patient Identity Data Management Software
  • Client Support
  • Validation of new developments
  • Technical Support
  • Settings adjustments
  • Updating and Maintaining blood bank management software
  • Piloting new developments with two teams of developers, in Nice (5 people in France) and Tunis (25 in Tunisia)
  • Applying Agile methodology with development teams
  • Application of the PMI-PMP methodology with customers and resellers.
  • Help in the definition of test scenarios for functional validation of the settings
  • PERL / PYTHON / LINUX / DOS batch creation for routing files in different folders according to their content
  • Optimization of Algorithms
    Sorting and Searching, Trees and Graph Algorithm, Greedy, Divide-and-Conquer, Back Tracking and Dynamic Programming.
  • IT Project Management PMI-PMP
    PMP – PMI & Agile Approaches.
  • Legal knowledge of the GDPR law
    GDPR, US-EU Data Transfers Regulations.
  • Time series analysis
    Hypothesis Testing, ANOVA Analysis, Survival Analysis, Linear Regression, Cross Validation, Hierarchical Clustering
  • Selections of Predictive Models
    Subset, Shrinkage - Ridge&Lasso, Dimension Reduction with PCA and Paritial Least Square.
  • Classification and Regressions Trees / Random Forest
    CART, Bagging, Random Forest, Boosting Support Vector Machine
  • Artificial Neural Network & Deep Learning
    Data representation, Probabilistic interpretation, backpropagation
  • Data Wrangling backed with MS SQL
    Server
    Advanced SQL queries and .Net for complex ETL, dynamic reporting
  • Semantic Web technologies for Data
    Science
    RDF, SPARQL, VOiD, DCAT
  • Learn to pronounce Dynamic systems
    Causal loop diagrams, Chaos Theory
  • Statistics
  • Machine Learning
  • Statistical Suite and SAS Modeling
  • Data Science Methodology Certification
    https://courses.cognitiveclass.ai/certificates/f557bc0ff31c44488b6f8204b29e9157
  • Python, R,Java, ADA, C/C++, Javascript, SQL, PL/SQL
  • SAS, BO, Talend,Oracle, SQL Serveur, GED, VMWare, Jira, Hematos, Hemobank
  • English: Fluent (TOEIC : 850)
  • Spanish : Notions
  • Arabic : Good Notions
  • French : Native

Machine Learning

OpenClassRooms

January 2019 to 2020
1- Define your learning strategy!
2- Design an application at the service of public health
3- Anticipate the electricity consumption needs of buildings
4- Segment customers of an e-commerce site
5- Categorize questions automatically
6- Classify images using Deep Learning algorithms
7- Develop a proof of concept (internship option)
8- Participate in a Kaggle competition!

MSc in Data Engineering

DSTI Institute

October 2018 to June 2019
Data Science
Applied Mathematics for Data Science (25hrs)
– Calculus – Differentiation – Trigonometry & Complex Numbers
Foundations of Statistical Analysis & Machine Learning (25hrs)
– Probabilities and distribution – Descriptive Statistics – Introduction to Inference
Big Data Processing (25hrs)
Statistical Analysis of Massive and High-dimensional Data (25hrs)
Deep Learning on GPU with pyTorch (25hrs)
Recurrent Neural Networks – LSTM – Residual Networks
IT Fundamentals
Computer Systems (25hrs)
Computer Architecture – Operating Systems & Vistualisation – Networking – Storage
Cloud Computing – Amazon AWS (50 hrs)
Preparation to AWS Certified Solutions Architect – Associate Certification – Comparative overview of Microsoft Azure
Cloud Computing – Microsoft Azure (25 hrs)
Comparative overview with Amazon AWS on core services (Networking, Compute, Storage, Database) & focus on Azure“Data Managed Services” (chiefly Azure Machine Learning Studio, Cognitive Services, Data Lake, Databricks, Stream Analytics)
Semantic Web technologies for Data Science developments (25 hrs)
Representing and querying web-rich data (RDF, SPARQL), Introducing Semantics in Data (RDFS, Ontologies), Tracing and following data history (VOiD, DCAT, PROV-O)
Data management
Advanced SQL for Data Wrangling (25 hrs)
Complex joins & subqueries, stored procedures & triggers

Relational Databases Management Systems (25 hrs)
Using MySQL & Microsoft SQL Server: stand-alone and cluster deployments, integration in software, ETL, persistence frameworks

NoSQL databases (25 hrs)
Key-value store, Document store, Graph database , hybrid approaches with Apache Cassandra

The Hadoop & Spark Ecosystem (50 hrs)
HDFS, scheduling & resources management – Workflow management & ETL, Dataflow management, Scalable Enterprise Serial Bus – Realtime processing, Machine Learning, Data Exploration & Visualisation

Data Pipeline (25 hrs)
XML dataflow, DTD & Schemas, XLS Transformation, JSON & Transformations – Cloud-based solutions with Glue in AWS & AWS Kinesis – Open-source solutions with Apache Kafka & Beam

MSc in Applied Data Science & Big Data

DSTI Institute

October 2017 to October 2018
Core Data Science & Artificial Intelligence – 150hrs (9 ECTS)
Applied Mathematics (1 ECTS)
Calculus – Linear Algebra – Trigonometry & Complex Numbers
Continuous Optimisation (1 ECTS)
Critical points – multiple variables function optimisation – gradient methods – constraint-based optimisation with Lagrange Multipliers
Foundations of Statistical Analysis & Machine Learning Part I & II (2 ECTS)
Probabilities & distribution – tests – inference – regression – clustering
Artificial Neural Networks (1 ECTS)
Data representation & distributed representations – universal interpretation theorem – probabilistic interpretation – backpropagation & stochastic gradient descent
“The SAS Ecosystem DSTI Chair” (3 ECTS)
Preparation for SAS Certified Predictive Modeler using SAS Enterprise Miner 14: SAS/Base & SAS/STAT
Time-Series Analysis using SAS (1 ECTS)
Forecasting using SAS Software: a programming approach (SAS/ETS)
Core Data Engineering – 250hrs (9 ECTS)
Software Engineering (2 ECTS)
Classical design & programming – object-oriented design & programming

Data Wrangling with SQL (1 ECTS)
Advanced SQL queries – dynamic SQL – stored procedures & triggers

Amazon AWS “Cloud-Computing DSTI Chair” (3 ECTS)
Preparation for AWS Certified Solutions Architect – Associate

The Hadoop & Spark Ecosystem (3 ECTS)
HDFS – scheduling & resources management – workflow management & ETL – dataflow management – scalable enterprise serial bus – real time processing – machine learning – data exploration & visualization

Applied Data Science & Artificial Intelligence - 150hrs (10 ECTS)
Deep Learning with Python (2 ECTS)
Introduction to PyTorch – deep learning – neural architectures & their applications – neural network training on a GPU (practice)

Advanced Statistical Analysis & Machine Learning (2 ECTS)
CART & random forests & applications to MapReduce – features selection & engineering – models comparison & competition

Survival Analysis using R (2 ECTS)
Probabilistic description of survival data – parametric / non-parametric / semi-parametric (Cox model) statistical methods – Applications to Big Data with penalised Cox regression

Discrete Optimisation (2 ECTS)
Graph-based modelling & algorithms

Agent-Based Modeling for Population Behaviour (1 ECTS)
Modelling objectives – model types – matching modelling approaches to studies objectives – ODD protocol – ABM objectives & components

Semantic Web for Data Science (1 ECTS)
Representing & querying web-rich data (RDF, SPARQL) – introducing semantics in data (RDFS, ontologies) – tracing & following data history (VOiD, DCAT, PROV-O)

Management, Ethics & Law – 50hrs (2 ECTS)
Data Regulations MEL1 (1 ECTS)
Data ownership and protection laws and regulation: Private Data – Corporate Data – EU Data Protection Act, GDPR, US-EU Data Transfers regulations
Project Management MEL2 (1 ECTS)
Project Management: PMP-PMI and Agile Approaches: PMBOK (PMI) – Agile Approaches – Kanban
Learn more

IUP Physiology and Informatics

Poitiers University

September 2005 to September 2008
MASTER II (DESS) Double Skills Informatics and Biotechnology

Cellular and Molecular Biology Bachelor

Orléans University

September 2001 to 2005

Associate's Degree

Orléans University

June 2003 to September 2004
DUT Informatics

Associate's Degree

Perpignan University (Carcassonne Antenna)

June 1999 to September 2001
IUT Statistics and Computer treatment of Data
  • I gave a lecture on BIG DATA and Artificial Intelligence on October 30, 2018 for the general public as a speaker
  • Member of Rotary Club Sophia Antipolis
  • Cinema Science Fiction, Comedy, Video Games, VR
  • Scientific readings