Skip to content

Latest commit

 

History

History
51 lines (31 loc) · 3.76 KB

README.md

File metadata and controls

51 lines (31 loc) · 3.76 KB

HIV EVOLVING SITES TOOL

Team members:

  1. Liusu Wang [email protected]
  2. Kuangyou Yao [email protected]

initial discussion

Links:

Final Paper
Poster

Dataset:

HIV protein sequence from infected individuals

Project:

Introduction:

This project is to create a visualization tool in aid of studying one of the deadliest viruses known to mankind, the Human immunodeficiency virus, commonly known as HIV.HIV is a retrovirus, which has certain features making it difficult to treat and eradicate including a high mutation rate.

Understanding how the virus evolves in the human body is an important part in defeating this disease. Studies have shown that some individuals possess certain immune genotypes that naturally control virus and significantly delay the progression to AIDS. Hypotheses suggest that it is because their immune system recognize virus through a specific region of the virus shell protein that cannot tolerate mutations very well. Most likely, that part of the protein is crucial to the virus. Without it, the virus cannot survive or maintain its ability to replicate.

Therefore, it is of interest to discover parts in the virus, which are crucial to virus replication. This information could be critical to informing effective vaccine strategies, and to save more people in the future. The project is also projected to support other kinds of virus evolution which might shows similar pattern as HIV.

Previous Related Work: HIV envelope regions were sequenced from time of infection until AIDS in nine adult men infected with HIV type 1 In order to help visualize the location and types of mutations that had arisen, large tables were made using Microsoft Excel. Mutations relative to the original founder virus sequence were color-coded based on specific features.

This is a relatively simple and static visualization of the virus. In order for researchers to highlight a certain cell with color, they would have to click on every single cell for the color. Also because of the long protein sequence highlighting the positions with significant variation, researchers would have to collapse the columns without mutation, neglecting a lot of neighboring positions.

Our project provides a basis for additional features like providing meta-data about the mutations e.g. are they in T cell epitopes? Are they immune escapes? Or providing a more integrated comparison of multiple patients.

Work Responsiblities:

Kuangyou Yao: Research, backend implementation, bar chart implementation, paper
Liusu Wang: Research, detail implementation, design, layout, usability test, refinement, paper

Research and development process:

We primarily discuss and learn about the topic with our collaborators. They helped us understand the basic concept and recommended one paper to read. We also conducted research on similiar product and found that the only similiar work that has been done was by excel, which is very time-consuming and lack of flexibility and data analysis. After the initial research we designed and implemented this tool. Development process:

  1. Implement “Context + Focus via brushing” interactive visualization tool using bioinformatics data.
  2. Set up a PHP-powered AJAX server
  3. Implement backend python scripts to generate and send back query results.
  4. Implement detailed table part using JavaScript and JQuery UI
  5. Final CSS polish for DOM Element positioning, color, font, etc.

During our development process, we kept doing evaluation and gether feedback from our user to improve the piece. At the end we conduct survey and observation analysis to evaluate our work.