Biography
As a bioinformatician, I am working at the critical interface between bioinformatics, functional genomics and wet lab biology to decipher “coded message” in our genome. Drawing on my background in computational sciences, I specialize in large-scale data mining, using a combination of high-performance computing, machine learning, and data visualization.
Honors and Awards
Instructor of outstanding higher-education academic courses, Beijing, 2022
Highly Cited Researcher by Clarivate/SCI, 2021-2022
Highly Cited Chinese Researcher by Elsevier, 2020
Best Teacher, 2018
First Prize Teaching Achievement Award, Beijing (Leading Recipient), 2017
First Prize Teaching Achievement Award, Peking University (Leading Recipient), 2017
Outstanding Teaching Award, Peking University, 2014
National Top-Notch Young Professionals, 2013
Lvye Distinguished Young Biomedical Scholar, 2011 and 2016
Service Award, APBioNET, 2010
Professional Society Affiliations
2008-present, Member of Executive Committee & China Liaison, Asia Pacific Bioinformatics Network (APBioNET)
2016-present, Vice President on Education, Asia Pacific Bioinformatics Network (APBioNET)
2008-present, Member of National Technical Committee for Biological Information Standardization
2014-present, Founding Member of Expert Committee for Computational Biology and Bioinformatics, Chinese Society of Biotechnology
2015-present, Founding Member of Expert Committee for Big Data and Biocuration, Genetics Society of China
2015-present, Member of Expert Committee for Bioinformatics and Artificial Life, Chinese Association for Artificial Intelligence
Research Interests
As biology turns increasingly into a data-rich science, the massive amount of data generated by high-throughput technologies present both new opportunities and serious challenges. We are interested in developing novel computational technologies to analyze, integrate and visualize high-throughput biological data effectively and efficiently, with applications to decipher the function and evolution of gene regulatory systems.
Representative Peer-Reviewed Publications
as Essential Author (*: (co-)Corresponding Author)
Handle Biological “BIG DATA” Effectively and Efficiently
1. Cheng S. J., Shi F. Y., Liu H., Ding Y., Jiang S., Liang N., Gao G. 2017. Accurately annotate compound effects of genetic variants using a context-sensitive framework. Nucleic Acids Res. (in press)
2. Hou M., Tian F., Jiang S., Kong L., Yang D., Gao G.* 2016a. LocExpress: a web server for efficiently estimating expression of novel transcripts. BMC Genomics 17(13): 175-179. (Featured as Best Paper at InCoB’16)
3. Hou M., Tang X., Tian F., Shi F., Liu F., Gao G.* 2016b. AnnoLnc: a web server for systematically annotating novel human lncRNAs. BMC Genomics 17(1): 931.
4. Hu B., Jin J., Guo A. Y., Zhang H., Luo J., Gao G.* 2015. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 31(8): 1296-1297. (Featured as ESI Highly Cited (Top 1%) Paper)
5. Xiao A., Cheng Z., Kong L., Zhu Z., Lin S., Gao G.*, Zhang B.* 2014. CasOT: a genome-wide Cas9/gRNA off-target searching tool. Bioinformatics 30(8): 1180-1182. (Featured as ESI Highly Cited (Top 1%) Paper)
6. Kong L., Wang J., Zhao S., Gu X., Luo J.*, Gao G.* 2012. ABrowse - a customizable next-generation genome browser framework. BMC Bioinformatics 13: 2. (Featured as “Highly Accessed”)
7. Wang J., Kong L., Zhao S., Zhang H., Tang L., Li Z., Gu X., Luo J.*, Gao G.* 2011. Rice-Map: a new-generation rice genome browser. BMC Genomics 12: 165. (Featured as “Highly Accessed”)
Decipher the Function and Evolution of Gene Regulatory Network
8. Jin J., Tian F., Yang D. C., Meng Y. Q., Kong L., Luo J., Gao G.* 2017. PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants. Nucleic Acids Res 45(D1): D1040-D1045. (Featured as ESI Highly Cited (Top 1%) Paper)
9. Jin J., He K., Tang X., Li Z., Lv L., Zhao Y., Luo J., Gao G.* 2015. An Arabidopsis Transcriptional Regulatory Map Reveals Distinct Functional and Evolutionary Features of Novel Transcription Factors. Mol Biol Evol 32(7): 1767-1773.
10. Zhao Y., Tang L., Li Z., Jin J., Luo J., Gao G.* 2015. Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics. BMC Evol Biol 15: 66. (Featured as “Very Good (being of special significance in its field)” by Faculty of 1000)
11. Gao G., Vibranovski M. D., Zhang L., Li Z., Liu M., Zhang Y. E., Li X., Zhang W., Fan Q., VanKuren N. W., Long M., Wei L. 2014. A long-term demasculinization of X-linked intergenic noncoding RNAs in Drosophila melanogaster. Genome Res 24(4): 629-638.
12. Jin J., Zhang H., Kong L., Gao G.*, Luo J.* 2014. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res 42(1): D1182-1187. (Featured as ESI Highly Cited (Top 1%) Paper)
13. Tang X., Hou M., Ding Y., Li Z., Ren L., Gao G.* 2013. Systematically profiling and annotating long intergenic non-coding RNAs in human embryonic stem cell. BMC Genomics 14(Suppl 5): S3. (Featured as “Highly Accessed”)
14. Chen Z. X., Zhang Y. E., Vibranovski M., Luo J., Gao G.*, Long M.* 2011. Deficiency of X-linked inverted duplicates with male-biased expression and the underlying evolutionary mechanisms in the Drosophila genome. Mol Biol Evol 28(10): 2823-2832.
Education
15. Ding Y., Wang M., He Y., Ye A. Y., Yang X., Liu F., Meng Y., Gao G.*, Wei L.* 2014. `Bioinformatics: Introduction and Methods,` a Bilingual Massive Open Online Course (MOOC) as a New Example for Global Bioinformatics Education. PLoS Computational Biology 10(12): e1003955.
as Co-Author
1. Zhou Y., Wang P., Tian F., Gao G., Huang L., Wei W., Xie X. S. 2017. Painting a specific chromosome with CRISPR/Cas9 for live-cell imaging. Cell Res 27(2): 298-301.
2. Feng S., Zhao Y., Xu Y., Ning S., Huo W., Hou M., Gao G., Ji J., Guo R.*, Xu D.* 2016. Ewing Tumor-associated Antigen 1 Interacts with Replication Protein A to Promote Restart of Stalled Replication Forks. J Biol Chem 291(42): 21956-21962.
3. Xing M., Yang M., Huo W., Feng F., Wei L., Jiang W., Ning S., Yan Z., Li W., Wang Q., Hou M., Dong C., Guo R., Gao G., Ji J., Zha S., Lan L., Liang H., Xu D. 2015. Interactome analysis identifies a new paralogue of XRCC4 in non-homologous end joining DNA repair pathway. Nat Commun 6: 6233.
4. Shu J., Wu C., Wu Y., Li Z., Shao S., Zhao W., Tang X., Yang H., Shen L., Zuo X., Yang W., Shi Y., Chi X., Zhang H., Gao G., Shu Y., Yuan K., He W., Tang C.*., Zhao Y., Deng HK.*. 2013. Induction of pluripotency in mouse somatic cells with lineage specifiers. Cell 153(5): 963-975.
5. Huang Y., Xie C., Ye A. Y., Li C. Y., Gao G., Wei L.* 2013. Recent adaptive events in human brain revealed by meta-analysis of positively selected genes. PloS ONE 8(4): e61280.
6. Wang J., Kong L., Gao G., Luo J.* 2013. A brief introduction to web-based genome browsers. Brief Bioinform 14(2): 131-143.
7. Xiao A., Wu Y., Yang Z., Hu Y., Wang W., Zhang Y., Kong L., Gao G., Zhu Z., Lin S., Zhang B. 2013. EENdb: a database and knowledge base of ZFNs and TALENs for endonuclease engineering. Nucleic Acids Res 41(Database issue): D415-422.
8. Yue R., Li H., Liu H., Li Y., Wei B., Gao G., Jin Y., Liu T., Wei L., Du J., Pei G.* 2012. Thrombin receptor regulates hematopoiesis and endothelial-to-hematopoietic transition. Dev Cell 22(5): 1092-1100.
9. Zhang J., Gao G., Chen J. J., Taylor G., Cui KM.*, He XQ.* 2011. Molecular features of secondary vascular tissue regeneration after bark girdling in Populus. New Phytol 192(4): 869-884.
10. Du P., Wu J., Zhang J., Zhao S., Zheng H., Gao G., Wei L., Li Y.* 2011. Viral Infection Induces Expression of Novel Phased MicroRNAs from Conserved Cellular MicroRNA Precursors. PLoS Pathog 7(8): e1002176.
11. Xie C., Mao X., Huang J., Ding Y., Wu J., Dong S., Kong L., Gao G., Li C. Y., Wei L.* 2011. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39(Web Server issue): W316-322.
12. Liu M., Liu P. P., Zhang L., Cai Q. C., Gao G., Zhang W. X., Zhu Z. Y., Liu D.*, Fan QC.* 2011. mir-35 is involved in intestine cell G1/S transition and germ cell proliferation in C. elegans. Cell Research 21(11): 1605-1618.
Teaching
Regular Courses
Methods in Bioinformatics, for 100+ graduate and undergraduate students every Fall semester; co-created the course with Dr. Liping Wei in 2006, ran the course as co-lecturer (with Dr. Liping Wei) in 2007, and as lead lecturer from 2008.
Sequencing-oriented Human Genetics, for 20+ selected graduate students every Fall semester, co-created the course with Dr. Ruiqiang Li in 2012.
Current Topics on Genetics, for 15+ selected undergraduate students every Spring semester, co-founded the course with Dr. Bo Zhang and other nine lecturers in 2013.
“Genomics and Molecular Evolution” module in core graduate course “Essential Topics in Modern Life Sciences”, for all (60+) first-year graduate students in the School of Life Sciences every Spring semester, co-founded the course with Dr. Xiaofeng Zheng and other five colleagues in 2015.
“Computational Genomics” module, PTN-BBS Program, for 20+ selected graduate students every year, since 2011.
“Genomic and Bioinformatics” module, Bio2000 Program, for 300+ graduate students in Shanghai and Beijing every Fall semester since 2013, co-taught with Dr. Liping Wei.
Massive Open Online Course (MOOC)
Bioinformatics: Introduction and Methods, the first bilingual Bioinformatics MOOC and one of the first seven PKU MOOC courses, for 70,000+ registered world-wide students every autumn semester; co-created with Dr. Liping Wei in Fall 2013. The MOOC has been highlighted as the first-batch “National High-Quality MOOC” and Drs. Ge Gao and Liping Wei won the First Prize of 2014-2017 Peking University Teaching Achievement Award. The MOOC is currently available through both Coursera (https://www.coursera.org/course/pkubioinfo) and Chinese MOOC (http://www.chinesemooc.org/mooc/4393)
Laboratory Introduction