Summary

Expertise

Project Highlights

Education

Agency

AC

English:

Upper Intermediate

Arturo C.

vetted by Youteam

Vetted by YouTeam

Venezuela

UTC -04:30

America/Caracas

English:

Upper Intermediate

I am a data engineer with a master's degree in Random Models from Universidad Central de Venezuela.

I am a data engineer with a master's degree in Random Models from Universidad Central de Venezuela. My passion lies in designing and developing scalable and efficient data solutions using a variety of platforms and tools, including Google Cloud Platform, Amazon Web Services, Snowflake, Spark, Airflow, Scala, Python, and R. Currently, I hold the position of Data Engineer at Wizeline, where I have successfully designed the architecture of data pipelines (AWS and GCP) and formulated data migration strategies using various APIs and Airflow. Additionally, I have developed pipelines capable of processing millions of daily transactions using Spark and Scala. My responsibilities also include data ingestion from diverse sources using SQL, specific APIs, and Scala to generate valuable data views for utilization in BI tools like Tableau. Furthermore, I closely monitor the efficiency of data pipelines in terms of performance and time. Prior to joining Wizeline, I served as a Data Engineer at Fractal Software and ACREDITA, where I gained experience in large-scale data processing projects, social network sentiment analysis, and predictive modeling using XGBoost. My ultimate objective is to perpetually enhance my knowledge and expertise in data engineering, applying best practices to leverage the value and impact of data across different domains and industries.

Want to hire this engineer?

Check if Arturo is available

Expertise

Years of commercial development experience

7 years of experience

Core technologies

Python 7 years
AWS 5 years
SQL 7 years
Docker 4 years
ETL 4 years

Other technologies

Apache
AWS
Azure
R
Scala
Spark
Hadoop

Project Highlights

icon
Data Engineer

Data Engineer

Mar `22 - May `24

2 years

Wizeline

QUASH is a No-Code GenAI Credit Risk Decisioning Platform that unifies Alternative Data, ML models and workflows to power your Financial Institution.

Responsibilities & achievements

-Designed and implemented a robust data pipeline architecture on Google Cloud, leveraging BigQuery as the data warehouse, Composer for deploying and managing Airflow as the orchestrator, Dataproc for Apache Spark and Hadoop services, and Cloud SQL for deploying a Postgres instance. This architecture optimized data consumption and improved overall data processing efficiency. - Successfully designed and deployed a data migration pipeline on AWS, enabling seamless data transfer between multiple sources. Utilized EMR tool to create a cluster and leverage Apache Spark for efficient data processing. Implemented MWAA (Managed Workflows for Apache Airflow) as a data orchestrator, ensuring streamlined workflow management. Leveraged Athena to perform complex queries on S3, enabling quick and flexible data analysis. Utilized Redshift as a scalable and high-performance data warehouse.

Apache
API
AWS
Scala
Spark
icon
Data Engineer

Data Engineer

Aug `21 - Sep `22

1 year

Fractal Software

Company that partners with ambitious entrepreneurs to launch the next generation of vertical SaaS (Software as a Service) companies. They provide support, insights, and capital to help build generational businesses, aiming to de-risk startups and increase the likelihood of venture-scale outcomes.

Responsibilities & achievements

-Utilized BigQuery for complex queries on various file formats (JSON, PARQUET, and CSV) stored in Google Cloud Storage (GCS). Developed data synchronization workflows between GCS and BigQuery using Airflow DAGs. - Leveraged PySpark to distribute processing across extensive datasets. Ingested streaming and transactional data from primary sources such as Spark, Redshift, S3, and Python. - Constructed efficient streaming and batch data pipelines on Snowflake.

Apache
Python
Spark
JSON
Snowflake
icon
Data Engineer

Data Engineer

Mar `20 - Aug `21

1 year

ACREDITA

company that specializes in providing decision-making solutions for its clients. They offer products for credit risk management and commercial references. With a focus on banking and credit institutions, they aim to optimize the evaluation and monitoring processes of customer credit risk.

Responsibilities & achievements

Responsible for designing and implementing adaptable information transfer processes using Spark on AWS to meet the varying speed demands of different areas efficiently. I was involved the development of the data pipeline architecture for a cutting-edge product, ensuring seamless data flow and efficient processing. Use of tableau for reporting, using the data available in Redshift.

Python
R
SQL Server
TensorFlow
icon
Machine Learning Engineer

Machine Learning Engineer

Feb `20 - Aug `21

1 year

QUASH.ai

QUASH is a No-Code GenAI Credit Risk Decisioning Platform that unifies Alternative Data, ML models and workflows to power your Financial Institution.

Responsibilities & achievements

Created credit models for various financial institutions. Designed the data pipeline architecture in AWS. Representing model results by creating reports with tableau

API
AWS
TensorFlow
icon
Financial Data Scientist

Financial Data Scientist

Jul `18 - Mar `20

2 years

Synergy.Vision

Technology for Trading, Investment, Economy, and Finance. We develop technological solutions for modern companies in the financial world and provide the best training to enter this field.

Responsibilities & achievements

Creation of a Web application for the monitoring of risk metrics and application of artificial intelligence models.

Python
R
icon
Credit Risk Specialist

Credit Risk Specialist

Sep `17 - Mar `18

6 months

Bancrecer

We are a highly competent team that creates productive and valuable relationships with our clients, who grow with us and demonstrate that our country continues to offer an extraordinary range of opportunities for the development of entrepreneurial initiatives.

Responsibilities & achievements

responsible for quantifying the different risk metrics of the bank

Python
SQL Server
TensorFlow

Education

Higher education in Computer Science

Agency

Software development agency #3757

10-50

GMT-5

Lima, Peru

Core Expertise

Agile
Amazon EC2
Amazon S3
AngularJS
AWS
Azure
C#
Django
Elixir
ETL
Express.js
Flask
Google Analytics
Groovy
Hibernate
HTML5
Ionic
Java
JavaScript
jQuery
Kotlin
Kubernetes
Microsoft
Microsoft Dynamics CRM
MongoDB
.NET
Node.js
PHP
PostgreSQL
Python
QlikView
React.js
React Native
Ruby on Rails
Scala
Selenium
Spark
Spring
SQL
SQL Server
SSIS
Tableau
TypeScript
WordPress
Xamarin
Apache Tomcat
Bootstrap
CSS3
Git
Go
Golang
HTML
iOS
Mocha
Oracle database
Pentaho
Project Scheduling
Scrum
SQL Azure
SQL Programming
Unit Testing
Web Services
Sketch
User Experience Design
Angular 2x
Postman
Project management
Docker
DynamoDB
MariaDB
SQL query
InVision
Redux
Project Manager
Scrum Master
Maven
Spring Boot
Illustrator
Photoshop
Jest
Enzyme
Hadoop
Flutter
.NET Core
Figma
AWS Lambda
Firebase
Next.js
SEO
Power BI
AWS Glue
Pyspark
.NET Framework
Snowflake
SAP HANA

Industries

Architecture & Design, E-Commerce & Retail, Information services & Technologies, Construction & Real estate, Data Science & Machine Learning, Branding, design, web development

Want to hire this engineer?

Check if Arturo is available