Skip to content

A collection of specialized data processing pipelines designed to handle various types of financial and academic data.

Notifications You must be signed in to change notification settings

gbourniq/data-ingestion-projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Data Pipeline Projects

A collection of specialized data processing pipelines designed to handle various types of financial and academic data.

Projects

📈 Tickers from Articles

Extracts and enriches company information from news articles using Refinitiv APIs. Identifies company mentions and enriches them with market data, ticker symbols, and other key information.

📚 Text File Ingestion

Analyzes academic paper metadata to track COVID-19 research funding patterns across different countries. Processes institutional affiliations and paper content to map research activity distribution.

📦 ZIP File Ingestion

Processes stock loan data from ZIP archives, implementing comprehensive validation, cleaning, and analysis pipelines. Handles daily financial data feeds with robust quality checks and audit trails.

Tech Stack

  • Python
  • SQL (PostgreSQL, SQLite)
  • REST APIs
  • Pandas
  • Cloud Infrastructure Discussions

Getting Started

Each project directory contains its own README with detailed documentation and setup instructions.

About

A collection of specialized data processing pipelines designed to handle various types of financial and academic data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages