dataqbs
Cover
Carlos Carrillo

Carlos Carrillo

(He/Him)

AI-Driven Engineer | Data · Developer · DBA | Snowflake · Azure SQL · ADX/KQL · Python | Remote (EN/ES)

Mexico · Remote (Worldwide) · Contact info

500+ connections

dataqbs

Open to work

SQL Developer, ETL Developer, Data Engineer, Integration Lead, AI Engineer roles

About

I'm a Senior Data Engineer and Cloud Data Consultant with 20+ years of experience modernizing analytics ecosystems with Snowflake, Microsoft Fabric, Azure SQL, and SQL Server. I build automated, scalable pipelines and resilient data models that turn raw data into reliable, actionable insight — especially in high-volume, mission-critical environments where performance, cost efficiency, and long-term maintainability are survival. My toolkit is deep SQL + Python, paired with AI-assisted development (GitHub Copilot, ChatGPT, Claude) to deliver solutions that are cloud-native, operationally practical, and designed to evolve beyond prototypes.

"To live in peace, free from rigid structures — building projects that flow naturally through intelligence and awareness. Technology should serve life, not the other way around."

Experience

Data Integration Lead

Hexaware Technologies contract

Mar 2025 — Present

Mexico · Remote

Led Snowflake → Azure SQL integration for Freeport-McMoRan mining operations. Deployed incremental sync pipelines, built regression testing CLI, optimized Snowflake views, and developed production dashboards and AI chatbots for 7 mining sites.

SnowflakeAzure SQLADX/KQLAzure FunctionsApp ServicePythonStreamlitDockerGitHub EnterpriseCopilotMERGE/UpsertCDC/DeltaCTE RefactoringETL/ELTIoT Sensor DataMining AnalyticsEntra ID/KerberosCI/CD

Senior Database Engineer

FussionHit contract

Jan 2025 — Present

Remote

Database engineer for VCA Animal Hospitals on Azure Database for PostgreSQL. Built a full auditing and DDL export framework, performed schema performance reviews, and delivered ticket-based database remediation with TDD-quality documentation across multiple production databases.

PostgreSQLAzure Database for PostgreSQLNode.jsJavaScriptNunjuckspg_stat_statementsEXPLAINJiraHarvest APIGitHub Copilot

Data Engineer & AI Developer

dataqbs self-employed

Jan 2011 — Present

Guadalajara, Mexico · Remote

Independent consultancy providing BI, data engineering, and database solutions for US and LATAM clients. Also building internal R&D projects: crypto arbitrage scanner, grid trading bots, LLM evaluation engine, email classification system, and this portfolio site with RAG chatbot.

PythonSQL ServerPostgreSQLSnowflakeSSIS/SSRS/SSASTableauPower BIDataiku DSSAzure Data FactoryNode.jsccxtpandasAstroSvelte

ETL Engineer

SVAM International Inc. contract

Nov 2022 — Sep 2024

Mexico · Remote

Led migration from on-prem SQL Server and SSIS to Snowflake for student certification analytics.

SnowflakeSQL ServerSSISPythonSalesforce APISharePoint

Senior ETL Developer

Svitla Systems, Inc. contract

May 2021 — Oct 2023

Mexico · Remote

Designed and deployed the company's first Azure SQL data warehouse, enabling cloud-based sales analytics.

Azure SQLSSISAzure Data FactoryPower BISQL Server

Snowflake Administrator

Epikso Mexico contract

Jan 2022 — Jan 2023

Mexico · Remote

Managed Snowflake security, performance tuning, and Infrastructure-as-Code for automated environment setup.

SnowflakeBitbucketCI/CDInfrastructure-as-Code

Data Technical Lead

Jabil full-time

Jan 2018 — Mar 2022

Guadalajara, Mexico

Directed migration from Hadoop + Impala + SQL Server PDW to Snowflake on AWS for manufacturing analytics.

SnowflakeAWSHadoopImpalaSQL Server PDWPython

Software Engineer Lead

3Pillar Global full-time

Jun 2016 — Jan 2018

Guadalajara, Mexico

Developed EDI data integrations and reporting layers for enterprise clients.

SQL ServerSSISSSRSEDI

SQL SSRS Consultant

HCL Technologies full-time

Aug 2014 — Jun 2016

Guadalajara, Mexico

Migrated and optimized Actuate Reports into SSRS and SharePoint for enterprise reporting.

SQL ServerSSRSSharePointActuate

Database Analyst II

Jabil full-time

Aug 2011 — Aug 2014

Guadalajara, Mexico

Created and maintained ETL workflows integrating Oracle, SAP, and MySQL systems with 24/7 database reliability.

SQL ServerSSISOracleSAPMySQL

BI Developer

C&A México full-time

Sep 2005 — Aug 2011

Guadalajara, Mexico

Designed OLAP cubes and interactive reports for retail analytics across business units.

SQL ServerSSASSSRSSSISOLAP

Developer

FIRMEPLUS full-time

Apr 2004 — May 2005

Guadalajara, Mexico

Software and database development with PHP, SQL Server, and MySQL.

PHPSQL ServerMySQL

Developer Trainee

Jabil Circuit de México full-time

Aug 2003 — May 2004

Guadalajara, Mexico

Supported database and web application development.

SQL ServerWeb Development

Featured Projects

Crypto Arbitrage Scanner

FinTech

Scans 9 exchanges (Binance, Bitget, Bybit, Coinbase, OKX, KuCoin, Kraken, Gate.io, MEXC) for price inefficiencies. Uses Bellman-Ford shortest-path algorithm and triangular arbitrage detection. Includes a Swapper module for executing trades, WebSocket L2 order-book feeds, SDK bootstrapping for native exchange integrations, and a real-time balance monitor.

  • 4,000+ LOC scanner with graph-based arbitrage detection
  • 9 exchange integrations with 4 balance provider backends
  • Live swap executor with dry-run and production modes
PythonccxtpandasWebSocketPyYAMLBinance SDKujson

OAI Code Evaluator

AI / ML

YAML-driven evaluation pipeline with rule-based scoring across Instructions, Accuracy, Optimality, Presentation, and Freshness dimensions. Supports regex/substring matching, threshold conditions, ranking normalization, rewrite post-processing, and structured audit metadata output.

  • 6-stage evaluation pipeline (adjust → rules → rank → rewrite → validate → summary)
  • Declarative YAML rules with regex, substring, and threshold conditions
  • 5-dimension scoring with configurable ideals and tolerances
PythonRichPyYAMLjsonschemaJinja2

Email Collector & Classifier

Automation

Multi-account IMAP collector supporting Gmail, Hotmail (MSAL OAuth device-flow), and Exchange. Classifies emails into Scam/Suspicious/Spam/Clean/Unknown using a weighted scoring engine with 200+ domain rules, URL-shortener detection, phone-pattern matching, and fuzzy deduplication.

  • 5-label classifier with weighted scoring and hard rules
  • 200+ domain classification rules
  • OAuth device-flow for Hotmail/Outlook
Pythonimap-toolsMSALlangdetectPyYAML

dataqbs.com Portfolio

AI / ML

This very website — a LinkedIn-style portfolio with RAG-powered AI chatbot, built with Astro + Svelte + Tailwind on Cloudflare Pages.

  • RAG chatbot with vector embeddings + Groq LLM streaming
  • Knowledge pipeline: markdown → 58 chunks with 768-dim embeddings
  • i18n (EN/ES/DE), dark mode, LinkedIn-style layout
AstroSvelteTailwind CSSCloudflare Workers AIGroqTypeScript

MEMO-GRID

FinTech

Production grid trading microservice using ccxt with Binance Spot. Features Optuna hyperparameter optimization (50K trials), backtest engine with real fee modeling, attribution analysis (alpha vs beta decomposition), Monte Carlo projections, and 22 analysis tools. Includes FIFO inventory tracking, adaptive step sizing, and systemd deployment support.

  • HPO with 50,000 Optuna trials (TPE sampler) for ETH/BTC grid parameters
  • Backtest engine spanning 2017–2026 with maker fee modeling
  • Attribution analysis: alpha vs beta return decomposition
PythonccxtOptunapandasNumPyPyYAMLpytest

VCA PostgreSQL Audits

Data Eng.

Full audit and schema management framework for Azure Database for PostgreSQL. Includes per-object DDL export with Nunjucks templates, automated schema discovery, LLM-friendly schema_knowledge.json generation, and 20+ ticket-based database improvements across index optimization, FK remediation, timestamp normalization, and stored procedure reviews.

  • 20+ tickets: index optimization, FK remediation, schema renames, timestamp fixes
  • Templated per-object DDL exporter (Nunjucks) for CI/CD-friendly snapshots
  • Technical Design Documents for 5+ database systems
PostgreSQLNode.jsJavaScriptNunjucksAzure PostgreSQL

IROC Video Wall Dashboard

Data Eng.

Streamlit-based production monitoring dashboard for IROC operations across 7 Freeport-McMoRan mining sites. Features real-time metrics from Snowflake and Azure Data Explorer (ADX), 34 KPIs covering dig compliance, crusher rates, cycle times, and ROM tonnage. Includes RAG-powered AI chatbot with GitHub Copilot SDK, semantic model with 16 business outcomes per site, and auto-refresh every 60 seconds.

  • 34 KPIs across 7 mining sites with real-time auto-refresh
  • AI chatbot with RAG + GitHub Copilot SDK (zero-cost for enterprise)
  • Semantic model: 16 business outcomes × 7 sites with ADX + Snowflake queries
PythonStreamlitSnowflakeAzure Data ExplorerKQLGitHub Copilot SDK

Skills

💻 Languages

Python Expert
SQL Expert
JavaScript / TypeScript Advanced
KQL (Kusto) Advanced
Bash Advanced
Node.js Advanced
PowerShell Intermediate

☁️ Data & Cloud

Snowflake Expert
Azure (SQL, ADF, Functions) Advanced
Azure Data Explorer (ADX) Advanced
Microsoft Fabric Intermediate
Cloudflare (Pages, Workers, AI) Intermediate

🤖 AI & ML

LLM Evaluation & Prompt Eng. Advanced
RAG (Retrieval-Augmented Gen.) Advanced
Snowflake Cortex AI Advanced
Vector Embeddings & Search Advanced
GitHub Copilot SDK Advanced
Optuna (HPO) Advanced
Fine-Tuning (PEFT / LoRA) Intermediate

📦 Libraries & Frameworks

pandas / NumPy Expert
ccxt (crypto exchanges) Expert
Streamlit Advanced
Playwright Advanced
Astro / Svelte Intermediate
Nunjucks / Jinja2 Advanced
Rich / rapidfuzz Advanced

🔧 DevOps & Tools

GitHub Actions CI/CD Advanced
Poetry / pip Expert
ruff / pre-commit / pytest Advanced
Docker Intermediate
QEMU / KVM Intermediate
Linux (Pop!_OS) Advanced

🗄️ Databases

SQL Server Expert
Snowflake Expert
PostgreSQL Expert
Azure SQL / Azure PostgreSQL Advanced
SQLite Advanced

Contact