<center> <h1 style=" font-family: Avenir; font-weight: 300; margin-top: 0px; margin-bottom: 0px ">Nathaniel Dake</h1> </center> <center> <h3 style="font-family: Avenir; font-weight: 200; margin-top: 5px; margin-bottom: 0px">Data Scientist / ML Engineer</h3> </center> <center><p style="margin-top: 10px; margin-bottom: 0px"> [email protected] | www.nathanieldake.com | (716)-913-3440 </p></center> <h2 style="font-family: Avenir; font-weight: 100; margin-top: 5px; margin-bottom: 0px">Experience</h2> <hr style="margin: 0px"> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-top: 10px; margin-bottom: 0px"> Lead Data Scientist / ML Engineer <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;Dec 2021 - Present</span> <span style="float:right;"> Seel Inc. </span> </p> <blockquote style="background-color: #e9deee; margin-bottom: 0px; "> Proved core product had path to profitability via architecting and implementing an end to end underwriting system that reduced loss ratio from 200% to 70%. </blockquote> <ul style="padding-left: 35px; padding-right: 35px; font-weight:300"> <li > Owned all phases of ML life cycle: Problem definition, research & experimentation, design & implementation of end to end ML pipeline (ETL, feature engineering, training, evaluation, model serving & decision making, real time monitoring, AB testing) </li> <li > Implemented state of the art technique to perform product and session embeddings. This enabled increasing the availability rate for high risk customers by over 20% </li> <li > Designed novel KNN classifier that improved calibration error by 3%. This lead to a 10% increase in low price offers, increasing attach rate by 7% </li> <li > Lead team by creating a culture of data, critical thinking, experimentation, iteration and learning, and prioritization in a fast paced, dynamic startup environment </li> </ul> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-top: 10px; margin-bottom: 0px"> AI Engineer <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;Jan 2021 - Dec 2021</span> <span style="float:right;"> Unsupervised Inc. </span> </p> <ul style="padding-left: 35px; padding-right: 35px; margin-top: 0px; font-weight:300"> <li > Key contributor to (core IP) novel weighted graph search algorithm </li> <li > Collaborated with team to implement novel metadata management system that tied together Data/Feature Engineering, compute, typing, metadata, in a rich graph structure. </li> <li > Bridged gap between Engineering, Data Science and Product. Created tutorials, documentation and workshops outlining how to use our core APIs and demonstrating their power </li> </ul> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-top: 10px; margin-bottom: 0px"> Senior Data Scientist <span style="float:right; color:grey; font-weight:normal"> &nbsp;Oct 2019 - Mar 2021</span> </p> <ul style="padding-left: 35px; margin-right: 35px; margin-top: 0px; font-weight:300"> <li > Researched and developed creative notions of distance, geometry and topology to define a space (representation) that enabled finding interesting, human understandable insights </li> <li > Designed a Mutual Information Finder that would determine how similar two samples from a dataset were and generate an embedding space that could be used in insight selection </li> <li > Creatively used core product on customer projects (working with verticals such as health care, shipping & transportation, and ecommerce). Presented results and findings to customers </li> </ul> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-bottom: 0px; "> Data Engineer <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;Jan 2019 - Oct 2019</span> <span style="float:right;"> Uplight Inc. </span> </p> <ul style="padding-left: 35px; padding-right: 10px; margin-top: 0px; font-weight:300"> <li > Architected serverless ETL processes that handled the ingestion of +50 GB of data per day from multiple energy utilities into postgresql data stores </li> <li > Implemented neural network models that provided customers with increased insight into energy usage, while AB testing models performance </li> <li > Optimized compute and memory heavy processes (reducing server costs and increasing developer productivity) </li> </ul> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-bottom: 0px;"> ML Engineer <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;May 2017 - Jan 2019</span> <span style="float:right;"> Carimus Inc. </span> </p> <ul style="padding-left: 35px; margin-top: 0px; font-weight:300"> <li > Led effort to utilize machine learning and mathematical modeling to automate manual decision processes </li> <li > Built machine learning pipeline that consisted of a succession of unique models, finally offering recommendations on litigation to pursue. This system decreased the need for manual labor by three-fold </li> <li > Architected and implemented an automated data ingestion processes via a parallelized serverless architecture, with storage in MySQL relational database </li> </ul> <h2 style="font-family: Avenir; font-weight: 100; margin-top: 5px; margin-bottom: 0px">Writing</h2> <hr style="margin: 0px"> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-top: 10px; margin-bottom: 0px"> Data Science and Mathematics Communicator <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;Aug 2017 - Present</span> <span style="float:right;"> nathanieldake.com </span> </p> <ul style="padding-left: 35px; padding-top: 0px; margin-top: 0px; font-weight:300"> <li > Authored over 50 articles on complex technical subjects, such as: <a href="https://www.nathanieldake.com/Machine_Learning/08-Bayesian_Machine_Learning-03-Bayes-Classifiers.html"> Bayesian Classifiers</a>, <a href="https://www.nathanieldake.com/Deep_Learning/03-Recurrent_Neural_Networks-01-The-Simple-Recurrent-Unit.html"> Recurrent Neural Nets</a>, <a href="https://www.nathanieldake.com/Mathematics/04-Statistics-04-non-parametric-tests-kolmogorov-smirnov-test.html"> Kolmogorov-Smirnov Test</a>, <a href="https://www.nathanieldake.com/Machine_Learning/05-Hidden_Markov_Models-04-Hidden-Markov-Models-Hidden-Markov-Models-Discrete-Observations.html"> Hidden Markov Models</a>, <a href="https://www.nathanieldake.com/Machine_Learning/07-Dimensionality_Reduction-01-PCA.html"> Principal Components Analysis</a>, <a href="https://www.nathanieldake.com/Machine_Learning/08-Bayesian_Machine_Learning-01-Bayesian-Inference.html"> Bayesian Inference</a>, <a href="https://www.nathanieldake.com/Machine_Learning/08-Bayesian_Machine_Learning-02-Bayesian-AB-Testing.html"> Bayesian AB Testing</a>, <a href="https://www.nathanieldake.com/Mathematics/04-Statistics-03-statistical-inference.html"> Frequentist Hypothesis Testing</a>, <a href="https://www.nathanieldake.com/Mathematics/04-Statistics-02-History-of-Normal-Distribution.html"> Mathematical History of the Normal Distribution</a>, <a href="https://www.nathanieldake.com/Mathematics/06-Functions-02-Inverse-functions-exponentials-and-logarithms.html"> Exploring Inverse Functions, Exponentials and Logarithms</a> </li> </ul> <h2 style="font-family: Avenir; font-weight: 100; margin-top: 5px; margin-bottom: 0px">Education</h2> <hr style="margin: 0px"> <p style="text-align:left; font-weight:500; font-size: 17.5; font-family: Avenir; margin-top: 10px; margin-bottom: 0px"> Northeastern University <span style="float:right; color:grey; font-weight:normal"> &nbsp; | &nbsp;2011-2016</span> <span style="float:right;"> Bachelor of Science: Mechanical Engineering</span> </p> <p style="text-align:left; font-size: 16; font-family: Avenir; margin-bottom: 0px; margin-top: 0px; font-weight:100"> Boston, MA <span style="float:right;"> Minors: Physics, Mathematics, Biomechanical Engineering</span> </p> <h2 style="font-family: Avenir; font-weight: 100; margin-top: 5px; margin-bottom: 0px">Technical Skills</h2> <hr> Python • Pandas • Numpy • Scikit Learn • Dask • SciPy • SQL • AWS Stack • DVC • Git • Docker • Kubernetes • HTML/CSS • Micro Services