José R. Armenteros

Friendly neighborhood data aficionado

Learn More Download Resume

About Me

Hi, my name is José, but you can also call me Joe, Joey, Pepe, Cheo, or Raúl. I am based out of Northern Virginia and I specialize in data platform, data science, and BI solutions.

My family consists of my lovely wife, my daughter, and our calico cat. In my spare time I work on learning new coding skills and data tools. I unwind by dancing salsa, playing board games, and reading manga.

Please see below for more on my professional interests.


Data Engineering & ETL Design

Most of my projects require the creation of new data layers to plug into reporting and advanced analytic infrastructures. Hence, I often design and implement ETL applications that result in structured relational databases, typically following a star or snowflake schema model. Through my work I have learned several T-SQL and PL/SQL flavors, working with platforms in cluding Spark/Hive, Databricks, Microsoft SQL Server, MySQL, Oracle ADW, and Snowflake SQL. I have also developed full ETL pipeline applications using Docker, Kubernetes, and Python.

Data Science & Machine Learning

I am experienced with regression models, including linear, polynomial, support vectors, and decision trees/random forests. I also have experience with classification and clustering algorithms, primarily on K-Nearest Neighbor and Naive Bayes. I have delved into the NLP/LLM space, specializing on LDA and HDP topic clustering models. I can implement machine learning code in both python and R, with more expertise on the former.

Business Intelligence

I have generated numerous reports, metrics, and key insights across diverse industries, including retail, healthcare, and cloud computing. I quickly adapt to most reporting infrastructures, having strong expertise in Power BI, AWS QuickSight, Excel PowerPivot, Oracle Analytics Cloud (OAC, formerly known as OBIEE), and Tableau.

Data & Machine Learning Operations

I am familiar with novel practices of machine learning and data operations. From testing of code and CI/CD pipelines to implementing robust Infrastucture as Code (IAC) frameworks in Terraform, I am confident in building and modernizing platforms for scalability, effectiveness, and high availability.