Saint De La Rosa

Founder — Pony Alto

Data operations and analytics professional based in New York. I build the infrastructure, reporting, and analytics that turn raw business data into clear decisions.

Data Pipeline · Live
AI & Tech News Intelligence

Daily news pipeline pulling AI and tech headlines, scoring sentiment with TextBlob, and extracting named entities with spaCy. Data stored in Supabase PostgreSQL and visualized in a live interactive dashboard.

Python · PostgreSQL · Supabase · spaCy · TextBlob · Streamlit
Reference Data · Flagship
Reference Data Quality Engine

Financial securities validation system built on PostgreSQL. SQL quality rules across five data dimensions, Pandera schema enforcement, SLA breach detection, vendor scorecard, and a live Streamlit dashboard.

Python · PostgreSQL · Pandas · Pandera · Streamlit · SQLAlchemy
Data Architect
Data Warehouse Schema Design

Star schema data warehouse for a simulated hedge fund tracking funds, securities, positions, and transactions in PostgreSQL. Five analytical queries proving the schema supports real business questions.

Python · PostgreSQL · SQLAlchemy · Pandas
Analytics Engineer
dbt Transformation Pipeline

dbt project modeling raw food delivery data through staging and mart layers with automated tests and lineage documentation. Built on DuckDB with full dependency graph visualization.

dbt · DuckDB · SQL
Technical Privacy Officer
PII Detection Pipeline

Python pipeline using Microsoft Presidio and spaCy to detect, mask, and report on personally identifiable information across structured and free-text fields. Produces a structured compliance report.

Python · Presidio · spaCy · Pandas
AI Data Operations
Prompt Labeling QA Simulation

Simulation of the human data labeling workflow behind AI model training. Flags invalid labels and low confidence scores across synthetic annotator data with a per-annotator performance scorecard.

Python · Pandas · NumPy · Streamlit
Data Ops Analyst
Titanic Quality Dashboard

Streamlit dashboard visualizing Pandera schema validation results with summary metrics, failure counts, a drill-down table of bad records, and column-level filtering.

Python · Pandas · Pandera · Streamlit
Data Engineer
Titanic Schema Validation

Automated schema validation pipeline using Pandera to enforce data type, nullability, and value rules across 12 columns. Deliberately injects bad records to verify that failure detection works correctly.

Python · Pandas · Pandera
Data Analyst
Titanic SQL Data Quality Checks

SQL validation pipeline running 10 automated data quality checks against the Titanic dataset using SQLite and Python. Results compiled into a single pass/fail table across five quality dimensions.

Python · SQL · SQLite · Pandas
Data Analyst
CSV Data Audit

Foundational data quality audit of the Titanic dataset using Python and Pandas covering completeness, uniqueness, validity, consistency, and shape across 891 records and 12 columns.

Python · Pandas
Business Analytics
Pirate Plants Sales Dashboard

Power BI dashboard analyzing sales and profitability across countries and products for a fictitious company. Compares year-to-date vs. prior year to surface underperforming regions.

Power BI · DAX · Data Cleaning
Business Analytics
British Airways Customer Satisfaction Dashboard

Analyzed 1,000+ customer reviews from 2016–2023 in Tableau. Surfaced patterns in service quality, aircraft performance, and geographic satisfaction distribution.

Tableau · Data Analysis · Data Cleaning
Operations Dashboard
NYC Real Estate Cost Optimization Dashboards

Centralized cost data across 34 NYC residential buildings, replacing manual spreadsheets with automated reporting to support budgeting and decision-making.

Google Sheets · Excel · Data Cleaning
Web Design · Vanguarde
J. Alexander Martin

Personal website for J. Alexander Martin, co-founder of FUBU. Built during my time at Vanguarde, the site covers his work across fashion, tech, media, and business — with sections for consulting, ventures, press, and speaking.

Web Design · Sanity CMS · Multi-section
Web App
Starsystem

A personal knowledge system with audio memo capture and an interactive knowledge map. Take notes, record voice memos, and watch your ideas connect into a visual graph you can actually navigate.

Web App · Knowledge Graph · Audio
Desktop App
Daily Driver

A minimalist daily task manager built with PyQt5. Designed around focus and intentionality — set one Most Important Task (MIT) for the day, maintain a flexible might-do list, and cut through the noise. Dark-mode interface with automatic date tracking and local storage.

Python · PyQt5 · Desktop
Interactive Map · Thrive Collective
Thrive Mural Map

An interactive map cataloging murals across New York City for Thrive Collective. Lets users explore the city's public art scene geographically — find murals by neighborhood, artist, or location. Demo available on request.

Interactive Map · Geolocation · NYC
01

A breakdown of ISIN, CUSIP, and SEDOL identifiers — why they matter, where they break, and how data quality failures cost firms millions.

Finance · Reference Data · 2026 · Read →
02
Building Data Quality Systems Without a Data Engineering Team

How analysts can implement Great Expectations and Pandera to catch data issues before they become business problems — no dedicated engineering team required.

Data Ops · Python · Coming soon
03
RLHF From the Ground Up: What Human Data Operations Actually Looks Like

Behind every aligned LLM is an operations layer most people never see. This maps the systems, roles, and quality mechanisms that make human feedback work at scale.

AI · LLM Ops · Coming soon

Data operations and analytics professional based in New York. I build the infrastructure, reporting, and analytics that growing businesses need — without the complexity or the wait.


Work History
Pony Alto
Founder
New York · 2026 – Present
  • Founded a data consultancy helping small and mid-size businesses (10–250 people) build data infrastructure, reporting, and analytics
  • Services span data audits, stack implementation (Snowflake, BigQuery, dbt, Fivetran), dashboard builds, and managed analytics retainers
  • Focused on businesses that have outgrown spreadsheets but aren't ready to hire an internal data team
Simply Redefining
Digital Operations Coordinator
New York · 2025 – Present
  • Conducted a HIPAA-aligned GA4 audit for a telehealth group practice, configuring data retention policies, disabling Google Signals, and implementing URL query parameter redaction to eliminate PHI exposure risk
  • Built a clinician compliance tracker in Excel and a treatment plan tracker in Google Sheets with automated daily email alerts via Google Apps Script, reducing manual follow-up across a 9-clinician team
  • Audited 18 therapy directory listings across three tiers, identified critical data inconsistencies, and built a structured citation tracker to centralize remediation tracking
  • Diagnosed email deliverability failures traced to misconfigured SPF, DKIM, and DMARC DNS records; documented root cause and remediation path for implementation
MVP Accelerator
Technical Consultant Intern
New York · 2025
  • Designed and built the personal website for J. Alexander Martin, co-founder of FUBU
  • Site covers his work across fashion, tech, media, consulting, and ventures
  • Conducted digital audits across client tech stacks, identifying efficiency gaps and rebuild priorities
  • Delivered UX and branding analysis reports for early-stage companies
  • Worked directly with founders to translate strategy into actionable execution plans
Ops Made Simple
Operations Analyst
New York · 2023 – 2025
  • Analyzed and tracked financial metrics across 34 properties, providing insights that streamlined vendor decision-making
  • Improved financial data accuracy by 25% through systematic validation and reporting workflows
  • Reduced operating costs by 12% via structured vendor trend analysis
  • Built Power BI dashboards for real-time portfolio financial monitoring
  • Designed SOPs that drove a 15% increase in team productivity

Certifications
CompTIA Security+
CompTIA · Active through Oct 2026
CompTIA Network+
CompTIA · Active through Oct 2026
Asana Workflow Specialist
Asana · December 2024
Generative AI in Business
Continuing Education · 2025

Education
B.S. Data Analytics
Western Governors University · In Progress
Cybersecurity Bootcamp
Blaizing Academy · 2023 · Security+, Network+ earned

Skills
SQL · Python · Pandas · Power BI · Tableau · Excel · dbt · Snowflake · BigQuery · Fivetran · Great Expectations · Airflow · Prompt Engineering · Reference Data · Data Quality · RLHF