Summary
Data engineer with expertise working on the entire analytics pipeline from data extraction to automation.
Goal
To engineer big data and current state of the art predictive analytic solutions that improve and automate business processes.
Skills
- Programming Languages: Python, R, shell, SQL, Spark
- Operating Systems: GNU/Linux, Windows
- Containerization Technologies: Docker
- Developing automated data pipelines from a variety of source systems (REST API, Databases, SFTP Servers, Message Queues)
- Event streaming and stream processing (Kafka, Kafka Connect, ksqlDB)
- Apache NiFi
- Relational databases (Hive/Impala and PostgreSQL)
- Hadoop
- Cloud Computing - AWS
- Git and version control
- Descriptive analytics and data visualization
- Predictive analytics (machine learning and deep learning)
- Web scraping and text mining
Employment
IT Specialist, Federal Aviation Administration, AJR-G200
November 2021 – Present
- Development team lead for the Wilbur data product. Wilbur is AJR-G System Data and Infrastructure Group’s (AJR-G200) newest iteration for ingesting and sharing authoritative operational flight data and associated metrics.
Data Analytics Engineer, Accenture Federal Services
June 2019 – October 2021
- Developed dozens of ETL jobs in Python, Hive (SQL), and Bash extracting data from various source systems (REST API, SFTP server, databases via ODBC).
- Used distributed SQL engines (Hive and Impala) to extract business insights from processes that generate large amounts of data such as web server logs.
- Developed and deployed into production a NLP model to conduct automated expense review on behalf of the Facilities Division at USDA’s Agricultural Research Service. FD leadership estimates roughly 4000 hours saved a year.
- Contributed to the team’s utility library by developing modules and functions that are common across ingest processes.
- As a team lead, provided technical support and guidance to other data engineers including design, code reviews, and deployments into production.
Education
University of Virginia – College of Arts and Sciences BA Statistics; concentration in Mathematical Statistics GPA: 3.96 Graduation Date: Spring 2019
Northern Virginia Community College Associate of Science Degree, Business Administration GPA: 3.951 Graduation Date: Spring 2016
References
References available upon request.