CMPT-733-Project

Description

This project is aimed at showcasing the end-to-end process of a typical data science project, including data fetching, exploratory data analysis (EDA), machine learning (ML) pipelines, and a web application for visualization of solutions.

Project Structure

The project is organized into the following directories:

EDA: Contains notebooks/scripts for exploratory data analysis.
ML Pipelines: Contains scripts for building and deploying machine learning models.
Data Fetching: Contains scripts for fetching and preprocessing data.
Web App: Contains the backend and frontend code for the visualization web application.

Repository Structure

.
├── README.md
├── data_fetching
│   ├── get_data_for_prediction/
│   ├── new_data_fetch_on_ec2/
│   ├── stock_data_fetch/
│   └── wsb_data_fetch/
├── eda/
├── ml_pipeline
│   ├── ml_pipeline.ipynb
│   └── model_and_scaler/
└── web_app
    ├── back_end/
    └── front_end/

Usage

EDA Explore the notebooks in the EDA directory to understand the data and gain insights.

ML Pipelines Refer to the ML Pipelines directory for scripts related to training, evaluation, and deployment of machine learning models.

Data Fetching Use the scripts in the Data Fetching directory to fetch and preprocess data for analysis.

Web App The Web App directory contains the backend and frontend code for the visualization web application. Follow the instructions in the respective directories to set up and run the web application.

Installation and Launch Instructions

Prerequisites

Before you begin, ensure you have Docker and Docker Compose installed on your system. These tools are required to containerize the application and its services, making it easier to deploy and run on any system without worrying about dependencies.

Getting Started

Clone the Repository

Start by cloning the application repository to your local machine. Open a terminal and run the following command:
```
git clone https://github.sfu.ca/bla175/cmpt-733-project.git
```
or
```
[email protected]:bla175/cmpt-733-project.git
```
Navigate to the Application Directory

Change your current directory to the cloned repository where the docker-compose.yml file is located:
```
cd web_app
```
Launch the Application with Docker Compose

Use the following Docker Compose command to build and start the application in detached mode (running in the background):
```
docker-compose up -d
```
This command reads the docker-compose.yml file in the current directory, builds the necessary Docker images, and starts the services defined in the file.

Accessing the Application

After launching the application with Docker Compose, you can access the various components of the application using the URLs provided below:

Front-End Application: Navigate to http://127.0.0.1:3000/ in your web browser to access the front-end interface of the application.
Additional Services: If your application includes other services that can be accessed via a web interface, you can access them at http://127.0.0.1:8080/.

Logging In

Upon accessing the front-end application, if prompted for a login, the application is designed to automatically fill in the default credentials. However, if the username and password are not pre-loaded, you can manually enter the following default credentials:

Username: admin
Password: 123456

These credentials should grant you access to the application. If you encounter any issues or need further assistance, please refer to the application documentation or contact the support team.

Tools/Technologies

Data Fetch Automation: Amazon EC2
ETL operations: Apache Spark, Pandas, PostgreSQL
Deployment: Docker
Visualization: Apache Echarts
Backend: Django
Frontend: Vue.js, Elment UI

Contributors

Binming Li
Chengkun He
Haimo Xu
Dexin Yang

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CMPT-733-Project

Description

Project Structure

Repository Structure

Usage

Installation and Launch Instructions

Prerequisites

Getting Started

Accessing the Application

Logging In

Tools/Technologies

Contributors

Data source

Reference

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data_fetching		data_fetching
eda		eda
ml_pipeline		ml_pipeline
web_app		web_app
README.md		README.md

augustmood/SentimentDrivenStockPredictor

Folders and files

Latest commit

History

Repository files navigation

CMPT-733-Project

Description

Project Structure

Repository Structure

Usage

Installation and Launch Instructions

Prerequisites

Getting Started

Accessing the Application

Logging In

Tools/Technologies

Contributors

Data source

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages