A collection of specialized data processing pipelines designed to handle various types of financial and academic data.
Extracts and enriches company information from news articles using Refinitiv APIs. Identifies company mentions and enriches them with market data, ticker symbols, and other key information.
Analyzes academic paper metadata to track COVID-19 research funding patterns across different countries. Processes institutional affiliations and paper content to map research activity distribution.
Processes stock loan data from ZIP archives, implementing comprehensive validation, cleaning, and analysis pipelines. Handles daily financial data feeds with robust quality checks and audit trails.
- Python
- SQL (PostgreSQL, SQLite)
- REST APIs
- Pandas
- Cloud Infrastructure Discussions
Each project directory contains its own README with detailed documentation and setup instructions.