Wei Wang, Ph. D.
Professor
Computer Science Department
University of California, Los Angeles
Los Angeles, CA 90095-1596
Voice: (310) 794-0009
E-mail: weiwang@cs.ucla.edu
URL: http://www.cs.ucla.edu/~weiwang/


RESEARCH INTEREST       Big Data, Data Mining, Bioinformatics, Database Systems.
   
EDUCATION      
Jul. 1999Ph.D., Department of Computer Science, UCLA.
May 1995 M.S., Department of Systems Science and Industrial Engineering, SUNY at Binghamton.
   
WORK EXPERIENCE      
2014 - present Director of Scalable Analytics Institute, University of California at Los Angeles
2012 - present Professor at Department of Computer Science, University of California at Los Angeles
2010 - 2012 Professor at Department of Computer Science, University of North Carolina at Chapel Hill
2006 - 2010 Associate Professor at Department of Computer Science, University of North Carolina at Chapel Hill
2002 - 2006 Assistant Professor at Department of Computer Science, University of North Carolina at Chapel Hill
1999 - 2002 Research Staff Member at IBM T.J. Watson Research Centers
   
HONORS AND AWARDS    Okawa Foundation Research Award, 2013.
IEEE ICDM Outstanding Service Award, 2012.
Best Research Paper Award, SIGKDD 2008 for the paper "FastANOVA: an efficient algorithm for genome-wide association study".
Best Student Paper Award, ICDE 2008 for the paper "CARE: finding local linear correlations in high dimensional data".
Phillip and Ruth Hettleman Prize for Artistic and Scholarly Achievement, UNC, 2007.
Microsoft Research New Faculty Fellow, Microsoft, 2005.
Faculty Early Career Development (CAREER) Award, NSF, 2005.
Junior Faculty Development Award, UNC, 2003.
Invention Achievement Award, IBM, 2001.
Invention Achievement Award, IBM, 2000.
Dean's Graduate Fellowship, UCLA, 1999.
Adjudged one of the best papers of ICDE 1999 for the paper "STING+: an approach to active spatial data mining".
NCR Graduate Fellowship, 1997 - 1998.
Outstanding Academic Achievement awarded by SUNY at Binghamton, May 1995.
Distinguished Student awarded by Nankai University, 1993.
Fellowships, Nankai University, 1991 - 1993.
   
PROFESSIONAL ACTIVITIES    Associate Editor of the ACM Transactions on Knowledge Discovery in Data (2005 - 2009, 2015 - present)
Board of Directors of the ACM Special Interest Group on Bioinformatics, Computational Biology, and Biomedical Informatics (SIGBio) (2015 - present)
Action Editor for Data Mining and Knowledge Discovery (2014 - present)
Associate Editor of the IEEE Transactions on Big Data (2014 - present)
Guest Editor of the ACM Transactions of Knowledge Discovery in Data Special Issue on Best Papers in 2014 KDD (2015)
Guest Editor of the ACM Transactions on Knowledge Discovery in Data Special Issue on Bioinformatics (2007)
Associate Editor of the International Journal of Knowledge Discovery in Bioinformatics (2009 - present)
Associate Editor of the Knowledge and Information Systems (2007 - 2014)
Review Board Member of the Proceedings of the VLDB Endowment (2008 - 2010)
Editorial Board Member of the Open Artificial Intelligence Journal (2007 - present)
Editorial Board Member of the International Journal of Data Mining and Bioinformatics (2005 - present)
Associate Editor of the IEEE Transactions on Knowledge and Data Engineering (2003 - 2007)
Editorial Board Member of the Journal of Database Management (2000 - 2005)
Guest Editor of the IEEE Transactions on Knowledge and Data Engineering Special Issue on Mining Biological Data vol. 17 no. 8 (2005)
Intensive Working Group Member of the ACM SIGKDD Curriculum Committee (2003 - present)
Panelist of the NSF IIS program (2013)
Panelist of the NSF BIGDATA program (2012, 2014)
Panelist of the NIH BDMA program (2010 - 2014)
Panelist of the NSF IIS program (2011)
Panelist of the NSF IIS program (2010)
Panelist of the NSF IIS program (2009)
Panelist of the NSF IIS program (2008)
Panelist of the NIH BRIN-CC program (2008)
Panelist of the NIH NCBC program (2008)
Panelist of the NIH BDMA program (2007)
Panelist of the NIH CSR program (2007)
Panelist of the NIH System Biology program (2007)
Panelist of the NSF IIS program (2007)
Panelist of the NIH BDMA program (2006)
Panelist of the NIH CSR program (2006)
Panelist of the NSF EMT program (2006)
Panelist of the EPA SBIR program on Computational Toxicology (2005)
Panelist of the NSF SEIII program (2005)
Panelist of the NSF BDI program (2005)
Panelist of the NSF BDI program (2004)
Panelist of the NSF ITR Medium Award (2003)
Senior Program Committee Member of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2016)
Program Committee Member of the 32nd IEEE International Conference on Data Engineering (2016)
Panel Chair and Senior Program Committee Member of the SIAM International Conference on Data Mining (2016)
Senior Program Committee Member of the 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2016)
Program Committee Member of the 20th Annual International Conference on Research in Computational Molecular Biology (2016)
Program Committee Co-Chair of the 10th IEEE International Conference on Semantic Computing (2016)
Area Chair of the 14th IEEE International Conference on Data Mining (2015)
Conference Co-Chair of the IEEE International Conference on Data Science and Advanced Analytics (2015)
Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2015)
Committee Member of the ACM SIGKDD Doctoral Dissertation Award (2015)
Committee Member of the ACM SIGKDD Test of Time Award (2015)
Student Travel Award Co-Chair and Senior Program Committee Member of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2015)
Senior Program Committee Member of the International Joint Conferences on Artificial Intelligence (2015)
Senior Program Committee Member of the SIAM International Conference on Data Mining (2015)
Senior Program Committee Member of the 19th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2015)
Program Committee Co-Chair of the 9th IEEE International Conference on Semantic Computing (2015)
Workshop Co-Chair of the 14th IEEE International Conference on Data Mining (2014)
General Co-Chair of the 5th ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics (2014)
Program Committee Co-Chair of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2014)
Senior Program Committee Member of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2014)
Program Committee Member of the 13th IEEE International Conference on Data Mining (2013)
Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2013)
Program Committee Member of the IEEE International Conference on Big Data (2013)
Senior Program Committee Member of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2013)
Senior Program Committee Member of the SIAM International Conference on Data Mining (2013)
Senior Program Committee Member of the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2013)
Program Committee Member of the 29th International Conference on Data Engineering (2013)
Vice Chair of the 12th IEEE International Conference on Data Mining (2012)
Track Co-chair of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2012)
Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2012)
Asia Pacific Track Co-chair and Senior Program Committee Member of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2012)
Program Committee member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2012)
Program Committee Member of the International Conference on Machine Learning (2012)
Senior Program Committee Member of the 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2012)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2012)
Senior Program Committee Member of the SIAM International Conference on Data Mining (2012)
Program Committee Member of the 28th International Conference on Data Engineering (2012)
General Co-chair of the 11th IEEE International Conference on Data Mining (2011)
Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2011)
Program Committee Member of the 20th ACM Conference on Information and Knowledge Management (2011)
Program Committee Member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2011)
Senior Program Committee Member of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2011)
Program Committee Member of the 10th International Workshop on Data Mining in Bioinformatics (2011)
Program Committee Co-chair of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2011)
Senior Program Committee Member of the SIAM International Conference on Data Mining (2011)
Program Committee Member of the 14th International Conference on Extending Database Technology (2011)
Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2010)
Vice Chair and Awards Chair of the 10th IEEE International Conference on Data Mining (2010)
Program Committee Member of the 19th ACM Conference on Information and Knowledge Management (2010)
Program Committee Member of the 8th International Conference on Computational Systems Bioinformatics (2010)
Program Committee Member and Best Paper Awards Chair of the ACM International Conference on Bioinformatics and Computational Biology (2010)
Program Committee Member of the 9th International Workshop on Data Mining in Bioinformatics (2010)
Program Committee Member of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2010)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2010)
Vice Chair of the 26th International Conference on Data Engineering (2010)
Program Committee Co-Chair of the 8th IEEE International Conference on Data Mining (2009)
Program Committee Member of the 18th ACM Conference on Information and Knowledge Management (2009)
Program Committee Member of the 35th International Conference on Very Large Data Bases (2009)
Program Committee Member of the 8th International Conference on Computational Systems Bioinformatics (2009)
Awards Chair of the 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2009)
Publicity Co-Chair of the SIAM International Conference on Data Mining (2009)
Program Committee Member of the 14th International Conference on database Systems for Advanced Applications (2009)
Vice Chair of the 25th International Conference on Data Engineering (2009)
Program Committee Member of the 8th IEEE International Conference on Data Mining (2008)
Program Committee Member of the 17th ACM Conference on Information and Knowledge Management (2008)
Program Committee Member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2008)
Program Committee Member of the 34th International Conference on Very Large Data Bases (2008)
Program Committee Member of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008)
Program Committee Member of the 8th International Workshop on Data Mining in Bioinformatics (2008)
Area Chair of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2008)
Proceedings Chair and Program Committee Member of the SIAM International Conference on Data Mining (2008)
Program Committee Member of the 24th IEEE International Conference on Data Engineering (2008)
Program Committee Member of the 13th International Conference on Database Systems for Advanced Applications (2008)
Program Committee member of the 16th ACM Conference on Information and Knowledge Management (2007)
General Co-chair of the 2nd International Workshop on Data and Text Mining in Bioinformatics in Conjunction with the 16th ACM Conference on Information and Knowledge Management (2007)
Vice Chair of the 7th IEEE International Conference on Data Mining (2007)
Program Committee Co-chair of the Workshop on Mining and Management of Biological Data, in Conjunction with the 7th IEEE International Conference on Data Mining (2007)
Program Committee member of the 2nd Workshop on Data Mining ihn Bioinformatics in Conjunction with the 33rd International Conference on Very Large Data Bases (2007)
Program Committee Member of the 9th International Conference on Data Warehousing and Knowledge Discovery (2007)
Program Committee Member of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2007)
Area Chair of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2007)
Program Committee Member of the SIAM International Conference on Data Mining (2007)
Program Committee Member of the 12th International Conference on Database Systems for Advanced Applications (2007)
Program Committee Member of the 6th IEEE International Conference on Data Mining (2006)
Program Committee Member of the 13th International Conference on Management of Data (2006)
Program Committee Member of the 15th ACM Conference on Information and Knowledge Management (2006)
Program Committee Member of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (2006)
Program Committee Member of the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the Ph.D. Workshop in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the Workshop on Data Mining in Bioinformatics in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the 8th International Conference on Data Warehousing and Knowledge Discovery (2006)
Senior Program Committee Member of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Program Committee Member of the 6th International Workshop on Data Mining in Bioinformatics in Conjunction with 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Program Committee Member of the 2nd International Conference on Advanced Data Mining and Applications (2006)
Program Committee Member of the 11th International Conference on Database Systems for Advanced Applications (2006)
Program Committee Member of the 22nd IEEE International Conference on Data Engineering (2006)
Program Committee Member of the International Conference on Semantics of a Networked World (2006)
Program Committee Member of the 10th International Conference on Extending DataBase Technology (2006)
Program Committee Member of the 4th Asia-Pacific Bioinformatics Conference (2006)
Program Committee Member of the 5th IEEE International Conference on Data Mining (2005)
Program Committee Member of the 5th IEEE Symposium on Bioinformatics and Bioengineering (2005)
Program Committee Member of the 6th International Conference on Web-Age Information Management (2005)
Program Committee Member of the 31st International Conference on Very Large Data Bases (2005)
Program Committee Member of the Ph.D. Workshop at the 31st International Conference on Very Large Data Bases (2005)
Program Committee Member of the 3rd International Workshop on Biological Data Management in Conjunction with the 16th International Conference on Database and Expert Systems Applications (2005)
Program Committee Member of 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Program Committee Co-chair of the 5th Workshop on Data Mining in Bioinformatics in Conjunction with the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Program Committee Member of the 1st International Conference on Advanced Data Mining and Applications (2005)
Program Committee Member of the IEEE Workshop on Computer Vision methods for Bioinformatics in Conjunction with IEEE International Conference on Computer Vision and Pattern Recognition (2005)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
Corporate Sponsor Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
Program Committee Member of the 7th Asia Pacific Web Conference (2005)
Program Committee Member of the ACM Symposium on Applied Computing (2005)
Scientific Committee Member of the International Conference on Computational and Information Sciences (2004)
Program Committee Member of the 13th ACM Conference on Information and Knowledge Management (2004)
Program Committee Member of the 4th IEEE International Conference on Data Mining (2004)
Program Committee Member of the ICDM'04 Workshop on Life Sciences Data Mining (2004)
Program Committee Member of the 1st International Workshop on Knowledge Discovery in Data Streams in conjunction with the 15th European Conference on Machine Learning (2004)
Program Committee Member of the 2nd International Workshop on Biological Data Management in conjunction with the 15th International Conference on Database and Expert Systems Applications (2004)
Program Committee Member of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Program Committee Member of the 4th Workshop on Bioinformatics in Data Mining in conjunction with the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Program Committee Member of the 5th International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (2004)
Program Committee Member of the 2nd International Conference on Software Engineering Research, Management & Applications (2004)
Program Committee Member of the 6th Asia Pacific Web Conference (2004)
Scientific Committee Member of the IADIS International Conference on Applied Computing (2004)
Program Committee Member of the ACM Symposium on Applied Computing (2004)
Proceedings Chair of the 4th International Conference on Web-Age Information Management (2003)
Program Committee Member of the 4th International Conference on Web-Age Information Management (2003)
Program Committee Member of the 15th International Conference on Scientific and Statistical Database Management (2003)
Program Committee Member of the International Workshop on Mining Spatial and Temporal Data (2001)
Session Chair of the 24th IEEE International Conference on Data Engineering (2008)
Session Chair of the 7th IEEE International Conference on Data Mining (2007)
Session Chair of the SIAM International Conference on Data Mining (2007)
Session Chair of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Session Chair of the 22nd IEEE International Conference on Data Engineering (2006)
Session Chair of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Session Chair of the ACM SIGMOD International Conference on Management of Data (2005)
Session Chair of the 4th IEEE International Conference on Data Mining (2004)
Session Chair of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Session Chair of the 3rd SIAM International Conference on Data Mining (2002)
Session Chair of the 1st IEEE International Conference on Data Mining (2001)
Referee for ACM SIGMOD, ACM SIGMETTRICS, VLDB, ACM SIGKDD, ICDE, FODO conferences (1997-present)
   
SERVICES  Undergraduate Review Committee Member, UNC (2004)
Graduate Admission Committee Member, UNC (2003-present)
Mentor for Student Summer Intern, IBM (2000-2001)
Mentor for Graduate Students, UCLA (1998-1999)
   
PUBLICATIONS

ARTICLES IN REFEREED CONFERENCES
  1. Robust Multi-Network Clustering via Joint Cross-Domain Cluster Alignment, by Rui Liu, Wei Cheng, Hanghang Tong, Wei Wang, and Xiang Zhang, Proceedings of the 15th IEEE International Conference on Data Mining (ICDM), 2015.
  2. Max-Intensity: Detecting Competitive Advertiser Communities in Sponsored Search Market, by Wenchao Yu, Ariyam Das, Justin Wood, Wei Wang, Carlo Zaniolo, and Ping Luo. Proceedings of the 15th IEEE International Conference on Data Mining (ICDM), 2015.
  3. HapColor: A Graph Coloring Framework for Polyploidy Phasing, by Sepideh Mazrouee and Wei Wang, Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2015.
  4. REAFUM: Representative approximate frequent subgraph mining, by Ruirui Li and Wei Wang, Proceedings of the 15th SIAM International Conference on Data Mining (SDM), pp. 757-765, 2015.
  5. FastHap: fast and accurate single individual haplotype reconstruction using fuzzy conflict graphs, by Sepideh Mazrouee and Wei Wang, Proceedings of the 13th European Conference on Computational Biology (ECCB), Special Issue of Bioinformatics, vol. 30, no. 17, pp. i371-378, 2014.
  6. PseudoLasso: leveraging read alignment in homologous region to correct pseudogene expression estimates via RNASeq, by Chelsea Ju, Zhuangtian Zhao, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 569-578, 2014.
  7. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level, by Zhaojun Zhang and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 30, no. 12, pp. i283-292, 2014.
  8. Graph Regularized Dual Lasso for Robust eQTL Mapping, by Wei Cheng, Xiang Zhang, Zhishan Guo, Yu Shi, and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 30, no. 12, pp. i139-148, 2014.
  9. Transforming genomes using MOD files with applications, by Shunping Huang, Chia-Yu Kao, Leonard McMillan, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 595-604, 2013.
  10. Read annotation pipeline for high-throughput sequencing data, by James Holt, Shunping Huang, Leonard McMillan, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 605-613, 2013.
  11. Flexible and robust co-regularized multi-domain graph clustering, by Wei Cheng, Xiang Zhang, Zhishan Guo, Yubao Wu, Patrick Sullivan, and Wei Wang, Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 320-328, 2013.
  12. GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference due to RNAseq reads misalignment, by Zhaojun Zhang, Shunping Huang, Jack Wang, Xiang Zhang, Fernando Pardo Manuel de Villena, Leonard McMillan, and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 29, no. 13, pp. i291-299, 2013.
  13. Metric learning from relative comparisons by minimizing squared residual, by Eric Yi Liu, Zhishan Guo, Xiang Zhang, Vladimir Jojic, and Wei Wang, Proceedings of the 12th IEEE International Conference on Data Mining (ICDM), pp. 978-983, 2012.
  14. Hierarchical co-clustering based on entropy splitting, by Wei Cheng, Xiang Zhang, Feng Pan, and Wei Wang. Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), pp. 1472-1476, 2012.
  15. Inferring novel associations between SNP sets and gene sets in eQTL study using sparse graphical model, by Wei Cheng, Xiang Zhang, Wei Wang, Yubao Wu, Xiaolin Yin, Jing Li and David Heckerman. Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 466-472, 2012.
  16. Dual transfer learning, by Mingsheng Long, Jianmin Wang, Guiguang Ding, Wei Cheng, Xiang Zhang, and Wei Wang. Proceedings of the 12th SIAM International Conference on Data Mining (SDM), pp. 540-551, 2012.
  17. Measuring opinion relevance in latent topic space, by Wei Cheng, Xiaochuan Ni, Jian-Tao Sun, Xiaoming Jin, Hye-Chung Kum, Xiang Zhang, and Wei Wang, Proceedings of the IEEE International Conference on Social Computing (SocialCom), pp. 323-330, 2011.
  18. Clustering with relative constraints, by Eric Yi Liu, Zhaojun Zhang, and Wei Wang, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 947-955, 2011.
  19. LTS: Discriminative subgraph mining by learning from search history, by Ning Jin and Wei Wang, Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE), pp. 207-218, 2011.
  20. Genome-wide compatible SNP intervals and their properties, by Wang, Jeremy, Fernando Pardo-Manuel de Villena, Wei Wang, and Leonard McMillan, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 43-52, 2010.
  21. Gene set analysis using principal components, by Pakatci, Isa, Wei Wang, and Leonard McMillan, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 330-333, 2010.
  22. Efficient genome ancestry inference in complex pedigrees with inbreeding,by Eric Yi Liu, Qi Zhang, Leonard McMillan, Fernando Pardo-Manuel de Villena, and Wei Wang, Proceedings of the 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 26, no. 12, pp. 199-207, 2010.
  23. TEAM: Efficient two-locus epistasis tests in human genome-wide association study,by Xiang Zhang, Shunping Huang, Fei Zou, and Wei Wang,Proceedings of the 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 26, no. 12, pp. 217-227, 2010.
  24. GAIA: Graph classification using evolutionary computation,by Ning Jin, Calvin Young, and Wei Wang. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 879-890, 2010.
  25. Graph classification based on pattern co-occurrence, by Ning Jin, Calvin Young, and Wei Wang. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 573-582, 2009.
  26. Split-order distance for clustering and classification hierarchies, by Qi Zhang, Eric Yi Liu, Abhishek Sarkar, and Wei Wang. Proceedings of the 21st International Conference on Scientific and Statistical Database Management (SSDBM), pp. 517-534, 2009.
  27. COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study, by Xiang Zhang, Feng Pan, Yuying Xie, Fei Zou, and Wei Wang. Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology (RECOMB), pp. 253-269, 2009.
  28. TreeQA: quantitative genome wide association mapping using local perfect phylogeny trees, by Feng Pan, Leonard McMillan, Fernando Pardo-Manuel de Villena, David Threadgill and Wei Wang. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 415-426, 2009.
  29. Inferring genome-wide mosaic structure, by Qi Zhang, Wei Wang, Leonard McMillan, Fernando Pardo-Manuel de Villena, and David Threadgill. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 150-161, 2009.
  30. FastChi: an efficient algorithm for analyzing gene-gene interactions, by Xiang Zhang, Fei Zou, and Wei Wang. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 528-539, 2009.
  31. Quantitative association analysis using tree hierarchies, by Feng Pan, Lynda Yang, Leonard McMillan, Fernando Pardo-Manuel de Villena, David Threadgill and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 971-976, 2008.
  32. Functional neighbors: relationships between non-homologous protein families inferred using family-specific fingerprints,by Deepak Bandyopadhyay, Luke Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha.Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 199-206, 2008.
  33. REDUS: finding reducible subspaces in high dimensional data,by Xiang Zhang, Feng Pan, and Wei Wang.Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM), pp. 961-970, 2008.
  34. Genotype sequence segmentation: handling constraints and noise, by Qi Zhang, Wei Wang, Leonard McMillan, Jan Prins, Fernando Pardo-Manuel de Villena, and David Threadgill.Proceedings of the 8th Workshop on Algorithms in Bioinformatics (WABI), pp. 271-283, 2008.
  35. Mining non-redundant high order correlations in binary data, by Xiang Zhang, Feng Pan, Wei Wang, and Andrew Nobel.Proceedings of the 34th International Conference on Very Large Data Bases (VLDB), pp. 1178-1188, 2008.
  36. FastANOVA: an efficient algorithm for genome-wide association study, by Xiang Zhang, Fei Zou, and Wei Wang.Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 821-829, 2008. (Best Research Paper)
  37. CRD: a general framework for fast co-clustering on large datasets utilizing sample-based matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang.Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 173-184, 2008.
  38. CARE: finding local linear correlations in high dimensional data, by Xiang Zhang, Feng Pan, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 130-139, 2008. (Best Student Paper)
  39. Mining approximate order preserving clusters in the presence of noise, by Mengsheng Zhang, Wei Wang, and Jinze Liu.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 160-168, 2008.
  40. Approximate clustering on distributed data streams, by Qi Zhang, Jinze Liu, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1131-1139, 2008.
  41. A general framework for fast co-clustering on large datasets using matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1337-1339, 2008.
  42. Sample selection for maximal diversity, by Feng Pan, Adam Roberts, Leonard McMillan, Fernando Pardo Manuel de Villena,David Threadgill, and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 262-271, 2007.
  43. Incremental subspace clustering over multiple data streams, by Qi Zhang, Jinze Liu,and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 727-732, 2007.
  44. Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows, by Adam Roberts, Leonard McMillan, Wei Wang, Joel Parker, Ivan Rusyn, and David Threadgill, Proceedings of the 15th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Bioinformatics, vol. 23, no. 13, pp. i401-i407, 2007.
  45. An efficient algorithm for mining coherent patterns from heterogeneous Microarrays, by Xiang Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 32, 2007.
  46. A fast algorithm for approximate quantiles in high speed data streams, by Qi Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 29, 2007.
  47. Mining RNA tertiary motifs with structure graphs, by Xueyi Wang, Jun Huan, Jack Snoeyink, and Wei Wang, Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 31, 2007.
  48. Intelligent sequential pattern mining via alignment --- optimization techniques for very large databases, by Hye-Chung Kum, Joong Hyuk Chang, and Wei Wang.Proceedings of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 587-597, 2007.
  49. On demand phenotype ranking through subspace clustering, by Xiang Zhang, Wei Wang, and Jun Huan.Proceedings of the 7th SIAM Conference on Data Mining (SDM), pp. 623-628, 2007.
  50. Poclustering: lossless clustering of dissimilarity data, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins.Proceedings of the 7th SIAM Conference on Data Mining (SDM), pp. 557-562, 2007.
  51. Graph database indexing using structured graph decomposition, by David Williams, Jun Huan, and Wei Wang.Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 976-985, 2007.
  52. Accelerating profile queries in elevation maps, by Feng Pan, Wei Wang, and Leonard McMillan.Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 76-85, 2007.
  53. Mining coherent patterns from heterogeneous microarray data, by Xiang Zhang, and Wei Wang.Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), pp. 838-839, 2006.
  54. Clustering pair-wise dissimilarity data intopartially ordered sets, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins.Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 637-642, 2006.
  55. Distance-based identification of spatial motifs in proteins using constrained frequent subgraph mining, by Jun Huan, Deepak Bandyopadhyay, Jan Prins,Jack Snoeyink, Alexander Tropsha, and Wei Wang.Proceedings of the LSS Computational Systems Bioinformatics Conference (CSB), pp. 227-238, 2006.
  56. A fast approximation to multidimensional scaling, by Tynia Yang, Jinze Liu, Leonard McMillan, and Wei Wang.Proceedings of the ECCV Workshop on Computation Intensive Methods for Computer Vision (CIMCV), 2006.
  57. Mining Approximate frequent itemset in the presence of noise: algorithm and analysis, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 6th SIAM Conference on Data Mining (SDM), pp. 405-416, 2006.
  58. Mining shifting-and-scaling co-regulation patterns on gene expression profiles, by Xin Xu, Anthony K. H. Tung, Ying Lu, and Wei Wang.Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE), pp. 89 (10 pages), 2006.
  59. Human motion estimation from a reduced marker set, by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan.Proceedings of the Symposium on Interactive 3D Graphics and Games (SI3D), pp. 35-42, 2006.
  60. Finding representative set from massive data, by Feng Pan, Wei Wang, Anthony K. H. Tung, and Jiong Yang.Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 338-345, 2005.
  61. Mining approximate frequent itemset from noisy data, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 721-724, 2005.
  62. Rapid determination of local structural features common to a set of proteins (demo), by Jun Huan, Deepak Bandyopadhyay, Jinze Liu, Jan Prins, Jack Snoeyink, Alexander Tropsha, and Wei Wang. Proceedings of the 13th International Conference on Intelligent Systems for Molecular Biology (ISMB), 2005.
  63. A system for analyzing and indexing human motion databases (demo), by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 924-926, 2005.
  64. Revealing true subspace clusters in high dimensions, by Jinze Liu, Karl Strohmaier, and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 463-466, 2004.
  65. AGILE: a general approach to detect transitions in evolving data streams, by Jiong Yang and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 559-562, 2004.
  66. A framework for ontology-driven subspace clustering,by Jinze Liu, Wei Wang, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 623-628, 2004.
  67. SPIN: Mining maximal frequent subgraphs from graph databases,by Jun Huan, Wei Wang, Jan Prins, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 581-586, 2004.
  68. Gene ontology friendly biclustering of expression profiles,by Jinze Liu, Jiong Yang, and Wei Wang.Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 436-447, 2004.
  69. Biclustering of gene expression data by tendency,by Jinze Liu, Jiong Yang, and Wei Wang.Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 182-193, 2004.
  70. BASS: approximate search on large string databases,by Jiong Yang, Wei Wang, and Philip Yu.Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 181-192, 2004.
  71. Fast computation of database operations using graphics processors,by Naga Govindaraju, Brandon Lloyd, Wei Wang, Ming Lin, and Dinesh Manocha.Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 215-226, 2004.
  72. Understanding social welfare service patterns using sequential analysis, by Hye-Chung Kum, Dean Duncan, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  73. Successfully adopting IT for social welfare program management, by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  74. Successfully adopting IT for social welfare program management (demo), by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  75. Mining spatial motifs from protein structure graphs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alex Tropsha. Proceedings of the 8th Annual International Conference on Research in Computational Molecular Biology (RECOMB), pp. 308-315, 2004.
  76. Accurate classification of protein structural families using coherent subgraph analysis, by Jun Huan, Wei Wang, Anglina Washington, Jan Prins, Ruchir Shah, and Alex Tropsha. Proceedings of the Pacific Symposium on Biocomputing (PSB), pp. 411-422, 2004.
  77. OP-Cluster: clustering by tendency in high dimensional space, by Jinze Liu and Wei Wang. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 187-194, 2003.
  78. Efficient mining of frequent subgraph in the presence of isomorphism, by Jun Huan, Wei Wang, and Jan Prins. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 549-552, 2003.
  79. Discovering compact and highly discriminative features or feature combinations of drug activities using support vector machines, by Hwanjo Yu, Jiong Yang, Wei Wang, and Jiawei Han. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 220-228, 2003.
  80. Reconstructing of ancestral gene order after segmental duplication and gene loss, by Jun Huan, Jan Prins, Wei Wang, and Todd Vision. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 484-485, 2003.
  81. Social welfare program administration and evaluation and policy analysis using knowledge discovery and data mining (KDD) on administrative data, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 39-44, 2003.
  82. Management assistance for Work First via a dynamic website, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 296, 2003.
  83. STAMP: discovery of statistically important pattern repeats in a long sequence, by Jiong Yang, Wei Wang, and Philip Yu. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 224-238, 2003.
  84. ApproxMAP: approximate mining of consensus sequential patterns, by Hye-Chung Kum, Jian Pei, Wei Wang, and Dean Duncan. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 311-315, 2003.
  85. Enhanced biclustering on gene expression data,by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu. Proceedings of the 3rd IEEE Conference on Bioinformatics and Bioengineering (BIBE), pp. 321-327, 2003.
  86. CLUSEQ: efficient and effective sequence clustering, by Jiong Yang and Wei Wang, Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE), pp. 101-112, 2003.
  87. InfoMiner+: mining partial periodic patterns with gap penalties,by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM), pp. 725-728, 2002.
  88. Comparative study of sequential pattern mining frameworks --- support framework vs. multiple alignment framework,by Hye-Chung Kum, Susan Paulsen, and Wei Wang, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM) Workshopon the Foundation of Data Mining and Discovery, pp. 43-70, 2002.
  89. Towards automatic clustering of protein sequences,by Jiong Yang and Wei Wang, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 175-186, 2002.
  90. Accelerating approximate subsequence search on large protein sequence databases, by Jiong Yang, Wei Wang, Yi Xia, and Philip Yu, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 207-218, 2002.
  91. Mining long sequential patterns in a noisy environment, by Jiong Yang, Wei Wang, Philip Yu, and Jiawei Han, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 406-417, 2002.
  92. Clustering by pattern similarity in large data sets, by Haixun Wang, Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 394-405, 2002.
  93. Accelerating search of approximate match on large protein sequence databases, by Wei Wang, Jiong Yang, Yi Xia, and Philip Yu.Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster)
  94. Improving performance of bicluster discoveryin a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu. Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster)
  95. A framework towards efficient and effective proteinclustering, by Wei Wang and Jiong Yang. Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster)
  96. Efficient filtering of large data sets --- a user-centric paradigm, by Yi Xia, Wei Wang, Jiong Yang, Philip Yu, and Richard Muntz. Proceedings of the 2nd SIAM International Conference on Data Mining (SDM), pp. 112-127, 2002.
  97. Delta-cluster: capturing subspace correlation in a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 517-528, 2002.
  98. A framework towards efficient and effective sequence clustering, by Wei Wang and Jiong Yang, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 282, 2002.
  99. Meta-patterns: revealing hidden periodical patterns, by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 1st IEEE International Conference on Data Mining (ICDM), pp. 550-557, 2001.
  100. Info-miner: mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 395-400, 2001.
  101. TAR: temporal association rules on evolving numerical attributes, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 17th IEEE International Conference on Data Engineering (ICDE), pp. 283-292, 2001.
  102. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 275-279, 2000.
  103. Efficient mining weighted association rules (WAR), by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 270-274, 2000.
  104. Collaborative web caching based on proxy affinities, by Jiong Yang, Wei Wang, and Richard Muntz, Proceedings of the 19th ACM SIGMETRICS Conference on the Measurement and Modeling of Computer Systems (SIGMETRICS), pp. 78-89, 2000.
  105. Dynamic adaptive file management in a local area network, by Jiong Yang, Wei Wang, Richard Muntz, and Silvia Nittel, Proceedings of the 20th IEEE International Conference on Distributed Computer Systems (ICDCS), pp. 368-375, 2000.
  106. STING+: an approach to active spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 15th IEEE International Conference on Data Engineering (ICDE), pp. 116-125, 1999. (Invited to the "Best Papers of ICDE 1999" Special Issue of IEEE Transactions on Knowledge and Data Engineering)
  107. PK-tree: a spatial index structure for high dimensional point data, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 5th International Conference on Foundations of Data Organization (FODO), pp. 27-36, 1998.
  108. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, Proceedings of the 8th International Workshop on Persistent Object Systems (POS8) , 1998.
  109. STING: a statistical information grid approach to spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB), pp. 186-195, 1997.
  110. Performance analysis of several algorithms for processing joins between textual attributes, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, Proceedings of the 12th IEEE International Conference on Data Engineering (ICDE), pp. 636-644, 1996.
  111. On fuzzy database systems, by Wei Wang and George Klir, Proceedings of the 5th IEEE Annual Dual-use Technologies & Applications Conference, pp. 330-335, 1995.
  112. The absolute continuity of fuzzy measures, Proceedings of International Joint Conference of the Fourth IEEE International Conference of Fuzzy Systems and Second International Fuzzy Engineering Symposium (FUZZ-IEEE/IFES'95), Japan, pp. 131-136, 1995.
  113. Determining fuzzy measures by Choquet integral, Proceedings of ISUMA-NAFIPS'95, pp. 724-727, 1995.
ARTICLES IN REFEREED JOURNALS
  1. Sparse regression models for unraveling group and individual associations in eQTL mapping, by Wei Cheng, Yu Shi, Xiang Zhang, and Wei Wang. BMC Bioinformatics, 2015.
  2. CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering, by Wei Cheng, Zhishan Guo, Xiang Zhang, and Wei Wang. ACM Transactions on Knowledge Discovery from Data (TKDD), 2015.
  3. HICC: An Entropy Splitting Based Framework for Hierarchical Co-Clustering, by Wei Cheng, Xiang Zhang, Feng Pan, and Wei Wang. Knowledge and Information Systems (KAIS), 2015.
  4. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing, by WeiBo Wang, Wei Wang, Wei Sun, James J. Crowley and Jin P. Szatkiewicz, Nucleic Acids Research, 2015. doi: 10.1093/nar/gkv319
  5. Fast and Robust Group-Wise eQTL Mapping Using Sparse Graphical Models, by Wei Cheng, Yu Shi, Xiang Zhang, and Wei Wang. BMC Bioinformatics, vol. 16, no. 2, 2015. DOI 10.1186/s12859-014-0421-z
  6. Analyses of Allele-Specific Gene Expression in Highly Divergent Mouse Crosses, by James J Crowley, Vasyl Zhabotynsky, Wei Sun, Shunping Huang, Isa Kemal Pakatci, Yunjung Kim, Jeremy R Wang, Andrew P Morgan, John D Calaway, David L Aylor, Zaining Yun, Timothy A Bell, Ryan J Buus, Mark E Calaway, John P Didion, Terry J Gooch, Stephanie D Hansen, Nashiya N Robinson, Ginger D Shaw, Jason S Spence, Corey R Quackenbush, Cordelia J Barrick, Randal J. Nonneman, Yuying Xie, William Valdar, Alan B Lenarcic, Wei Wang, Catherine E Welsh, Chen-Ping Fu, Zhaojun Zhang, James Holt, Zhishan Guo, David W Threadgill, Lisa M Tarantino, Darla R Miller, Fei Zou, Leonard McMillan, Patrick F Sullivan, and Fernando Pardo-Manuel de Villena. Nature Genetics, vol. 47, no. 4, pp. 353-360, 2015.
  7. Spotlite: An Improved Algorithm and Web Application for Predicting Co-Complexed Proteins from Affinity Purification - Mass Spectrometry Data, by Dennis Goldfarb, Bridgid Hast, Wei Wang, and Michael B. Major. Journal of Proteomics Research, vol. 13, no. 12, pp. 5944-5955, 2014.
  8. A Novel Multi-Alignment Pipeline for High-Throughput Sequencing Data, by Shunping Huang, James Holt, Chia-Yu Kao, Leonard McMillan, and Wei Wang. Database, 2014.
  9. Starting at the ends: high-resolution sex-specific linkage maps of the mouse reveal polarized distribution of recombination in male germline, by Eric Yi Liu, Andrew P Morgan, Elissa J Chesler, Wei Wang, Gary A Churchill, Fernando Pardo-Manuel de Villena, Genetics, vol. 197, no. 1, pp. 91-106, 2014.
  10. Bayesian modeling of haplotype effects in multiparent populations, by Zhaojun Zhang, Wei Wang, and William Valdar. Genetics, vol. 198. No. 1, pp. 139-156, 2014.
  11. Total Orderings Defined on the Set of All Fuzzy Numbers, by Wei Wang and Zhenyuan Wang, Fuzzy Sets and Systems, vol. 243, 131-141, 2014.
  12. Heritability and genomics of gene expression in peripheral blood, by Fred A Wright, Patrick F Sullivan, Andrew I Brooks, Fei Zou, Wei Sun, Kai Xia, Vered Madar, Rick Jansen, Wonil Chung, Yi-Hui Zhou, Abdel Abdellaoui, Sandra Batista, Casey Butler, Guanhua Chen, Ting-Huei Chen, David D'Ambrosio, Paul Gallins, Min Jin Ha, Jouke Jan Hottenga, Shunping Huang, Mathijs Kattenberg, Jaspreet Kochar, Christel M Middeldorp, Ani Qu, Andrey Shabalin, Jay Tischfield, Laura Todd, Jung-Ying Tzeng, Gerard van Grootheest, Jacqueline M Vink, Qi Wang, Wei Wang, Weibo Wang, Gonneke Willemsen, Johannes H Smit, Eco J de Geus, Zhaoyu Yin, Brenda W J H Penninx, and Dorret I Boomsma. Nature Genetics, vol. 46, pp. 430-437, 2014.
  13. Using the Emerging Collaborative Cross to Probe the Immune System, by Jason Phillippi, Yuying Xie, Darla R Miller, Timothy A Bell, Zhaojun Zhang, Alan B Lenarcic, David L Aylor, S Harsha Krovi, David W Threadgill, Fernando Pardo-Manuel de Villena, Wei Wang, William Valdar, and Jeffrey A Frelinger, Genes and Immunology, vol. 15, no. 1, pp. 38-46, 2014.
  14. Searching Dimension Incomplete Databases, by Wei Cheng, Xiaoming Jin, Jian-Tao Sun, Xuemin Lin, Xiang Zhang, and Wei Wang, IEEE Transactions on Data Engineering (TKDE), vol. 26, no. 3, pp. 725-738, 2014.
  15. Improving detection of copy number variation by simultaneous bias correction and read-depth segmentation, by Jin Szatkiewicz, Weibo Wang, Patrick Sullivan, Wei Wang, and Wei Sun, Nucleic Acids Research, vol. 41, no. 3, pp. 1519-1532, 2013.
  16. MaCH-Admix: genotype imputation for admixed populations, by Eric Yi Liu, Mingyao Li, Wei Wang, and Yun Li. Genetic Epidemiology, vol. 37, no. 1, pp. 25-37, 2013.
  17. Mining Genome-Wide Genetic Markers, by Xiang Zhang, Shunping Huang, Zhaojun Zhang, Wei Wang. PLOS Computation Biology, vol. 8, no. 12, e1002828, 2012.
  18. Learning transcriptional regulatory relationships using sparse graphical models, by Xiang Zhang, Wei Cheng, Jennifer Listgarten, Carl Kadie, Shunping Huang, Wei Wang, and David Heckerman, PLoS ONE, vol. 7, no. 5, e357622012, 2012.
  19. Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women′s Health Initiative, by Eric Yi Liu, Steven Buyske, Aaron K. Aragaki, Ulrike Peters, Eric Boerwinkle, Chris Carlson, Cara Carty, Dana C. Crawford, Jeff Haessler, Lucia A. Hindorff, Loic Le Marchand, Teri A. Manolio, Tara Matise, Wei Wang, Charles Kooperberg, Kari E. North, and Yun Li, Genetic Epidemiology, vol. 36, no. 2, pp. 107-117, 2012.
  20. Rapid and robust resampling-based multiple testing correction with application in genome-wide eQTL study, by Xiang Zhang, Shunping Huang, Wei Sun, and Wei Wang. Genetics, vol. 190, no. 4, pp. 1511-1520, 2012.
  21. Genome-wide association mapping of loci for antipsychotic-induced extrapyramidal symptoms in mice, by James J Crowley, Yunjung Kim, Jin Peng Szatkiewicz, Amanda L Pratt, Corey R Quackenbush, Daniel E Adkins, Edwin van den Oord, Molly A Bogue, Hyuna Yang, Wei Wang, David W Threadgill, Fernando Pardo-Manuel de Villena, Howard L McLeod, and Patrick F Sullivan. Mammalian Genome, vol. 23, no. 5-6, pp. 322-335, 2012.
  22. The genome architecture of the Collaborative Cross mouse genetic reference population, by Collaborative Cross Consortium, Genetics, vol. 190, no. 2, pp. 389-401, 2012.
  23. HTreeQA: using semi-perfect phylogeny trees in quantitative trait loci study on genotype data, by Zhaojun Zhang, Xiang Zhang, and Wei Wang. G3: Genes, Genomes, Genetics, vol. 2, no. 2, pp. 175-189, 2012.
  24. seeQTL: a searchable database for human eQTLs, by Kai Xia, Andrey A Shabalin, Shunping Huang, Vered Madar, Yi-Hui Zhou, Wei Wang, Fei Zou, Wei Sun, Patrick F Sullivan, Fred A Wright. Bioinformatics, vol. 28, no. 3, pp. 451-452, 2012.
  25. Classification of mouse sperm motility patterns using an automated multiclass support vector machines mode, by Summer G Goodson, Zhaojun Zhang, James K Tsuruta, Wei Wang, and Deborah A O'Brien. Biology of Reproduction, vol. 84, no. 6, pp. 1207-1215, 2011.
  26. Genetic analysis of complex traits in the emerging collaborative cross, by Aylor DL, Valdar W, Foulds-Mathes W, Buus RJ, Verdugo RA, Baric RS, Ferris MT, Frelinger JA, Heise M, Frieman MB, Gralinski LE, Bell TA, Didion JD, Hua K, Nehrenberg DL, Powell CL, Steigerwalt J, Xie Y, Kelada SN, Collins FS, Yang IV, Schwartz DA, Branstetter LA, Chesler EJ, Miller DR, Spence J, Liu EY, McMillan L, Sarkar A, Wang J, Wang W, Zhang Q, Broman KW, Korstanje R, Durrant C, Mott R, Iraqi FA, Pomp D, Threadgill D, Pardo-Manuel de Villena F, Churchill GA. Genome Research, vol. 21, pp. 1213-1222, 2011.
  27. Tools for efficient epistasis detection in genome-wide association study, by Xiang Zhang, Shunping Huang, Fei Zou, and Wei Wang. Source Code for Biology and Medicine, vol. 6, no. 1, pp. 1-3, 2011.
  28. COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study,by Xiang Zhang, Feng Pan, Yuying Xie, Fei Zou, and Wei Wang.Journal of Computational Biology (JCB), vol. 17, no. 3, pp. 401-415, 2010.
  29. Functional neighbors: Relationships between non-homologous protein families inferred using family-Specific fingerprints, by Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, IEEE Transaction on Information Technology in Biomedicine, vol. 14, no. 5, pp. 1137-1143, 2010.
  30. Discriminative subgraph mining for protein classification, by Ning Jin, Calvin Young, Wei Wang, International Journal of Knowledge Discovery in Bioinformatics (IJKDB), vol. 1, no. 3, pp. 36-52, 2010.
  31. Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development, by Deepak Bandyopadhyay, Jun Huan, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, Journal of Computer-Aided Molecular Design, vol. 23, no. 11, pp. 773-784, 2009.
  32. Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications, by Deepak Bandyopadhyay, Jun Huan, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, Journal of Computer-Aided Molecular Design, vol. 23, no. 11, pp. 785-797, 2009.
  33. Efficient algorithms for genome-wide association study,by Xiang Zhang, Fei Zou, and Wei Wang.ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 3, issue. 4, no. 19, 2009.
  34. The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics, by Adam Roberts, Fernando Pardo-Manuel de Villena, Wei Wang, Leonard McMillan, and David Threadgill, Mammalian Genome, vol. 18, no. 6, pp. 473-481, 2007.
  35. Benchmarking the effectiveness of sequential pattern mining methods, by Hye-Chung Kum, J. H. Chang, and Wei Wang, Data and Knowledge Engineering, vol. 60, no. 1, pp. 30-50, 2007.
  36. Structure-based function inference using protein family-specific fingerprints, by Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha. Protein Science, vol. 15, pp. 1537-1543, 2006.
  37. Sequential pattern mining in multi-databases via multiple alignment, by Hye-Chung Kum, Joong-Hyuk Chang, and Wei Wang, Data Mining and Knowledge Discovery (DMKD), vol. 12, no. 2-3, pp. 151-180, 2006.
  38. Comparing graph representations of protein structure for mining family-specific residue-based packing motifs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alexander Tropsha. Journal of Computational Biology (JCB), vol. 12, no. 6, pp. 657-671, 2005.
  39. Guest editors' introduction: special issue on mining biological data, by Wei Wang and Jiong Yang, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 17, no. 8, pp. 1019-1020, 2005.
  40. An improved biclustering method for analyzing gene expression profiles, by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu, International Journal on Artificial Intelligence Tools (IJAIT), vol. 14, no. 5, pp. 771-789, 2005.
  41. Mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Data Mining and Knowledge Discovery (DMKD), vol. 9, no. 2, pp. 189-216, 2004.
  42. Discovering high order periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 3, pp. 243-268, 2004.
  43. WAR: weighted association rules for item intensities, by Wei Wang, Jiong Yang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 2, pp. 203-229, 2004.
  44. Recent progress on selected topics in database research: a report from nine young Chinese researchers working in the United States (invited paper), by Zhiyuan Chen, Chen Li, Jian Pei, Yufei Tao, Haixun Wang, Wei Wang, Jiong Yang, Jun Yang, and Donghui Zhang. Journal of Computer Science and Technology, vol. 18, no. 5, pp. 538 552, 2003.
  45. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, no. 3, pp. 613-628, 2003.
  46. Mining patterns in long sequential data with noise, by Wei Wang, Jiong Yang, and Philip Yu, ACM SIGKDD Explorations, vol. 2, no. 2, pp. 28-33, 2000.
  47. An approach to active spatial data mining based on statistical information, by Wei Wang, Jiong Yang, and Richard Muntz, IEEE Transactions on Knowledge and Data Engineering, Special Issue on Best Papers in the 15th IEEE International Conference on Data Engineering, vol. 12, no. 5, pp. 715-728, 2000.
  48. Dynamo: design, implementation, and evaluation of cooperative persistent object management in a local area network, by Jiong Yang, Wei Wang, Silvia Nittel, Richard Muntz, and Vince Busam, Software - Practice and Experience, vol. 30, no. 4, pp. 419-448, 2000.
  49. Performance analysis of three text-join algorithms, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, IEEE Transactions on Knowledge and Data Engineering, vol. 10, no. 3, pp. 477-492, 1998.
  50. Genetic algorithms for determining fuzzy measures from data, by Wei Wang, Zhenyuan Wang, and George J. Klir,Journal of Intelligent & Fuzzy Systems, vol. 6, no. 2, pp. 171-183, 1998.
  51. Monotone set functions defined by Choquet integral, by Zhenyuan Wang, George J. Klir, and Wei Wang, Fuzzy Sets and Systems, vol. 81, pp. 241-252, 1996.
  52. Fuzzy measures defined by fuzzy integral and their absolute continuity, by Zhenyuan Wang, George J. Klir, and Wei Wang, Journal of Mathematical Analysis and Application, vol. 203, pp. 150-165, 1996.
  53. Constructing fuzzy measures by transformations, by George J. Klir, Zhenyuan Wang, and Wei Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 1, pp. 207-215, 1996.
  54. Constructing fuzzy measures by rational transformations, by Wei Wang, George J. Klir, and Zhenyuan Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 3, pp. 665-675, 1996.
  55. Pan-integrals with respect to imprecise probabilities, by Zhenyuan Wang, Wei Wang, and George J. Klir, International Journal of General Systems, vol. 25, no. 3, pp. 229-243, 1996.
BOOK CHAPTERS
  1. Mining Discriminative Subgraph Patterns from Structural Data, by Ning Jin and Wei Wang, Data Mining and Knowledge Discovery for Big Data: Methodologies, Challenges, and Opportunities Chapter 4, by Wesley W. Chu (ed.) Springer, 2013.
  2. Grid-based Clustering, by Wei Cheng, Wei Wang, and Sandra Batista, Data Clustering: Algorithms and Applications Chapter 6, by Charu C. Aggarwal and Chandan K. Reddy (eds.), CRC Press, 2013.
  3. Finding high-order correlations in high-dimensional biological data, by Xiang Zhang, Feng Pan, and Wei Wang, Link Mining: Models, Algorithms and Applications, by Yu, Han, and Faloutsos (eds.), Springer, 2010.
  4. Protein local structure comparison: methods and future directions, by Jun Huan, Wei Wang, and Jan Prins, Advances in Computers by Chau-Wen Tseng (eds.), Elsevier, 2006.
  5. Models for sequential pattern mining, by Hye-Chung Kum, Susan Paulsen, and Wei Wang, A Book on FDM --- Lecture Notes in Computer Science, Springer-Verlag, 2006.
  6. Discovering evolutionary classifier over high speed non-static stream, by Jiong Yang, Xifeng Yan, Jiawei Han, and Wei Wang, Advanced Methods for Knowledge Discovery from Complex Data, pp. 337-364, 2005.
  7. Mining high dimensional data, by Wei Wang and Jiong Yang, Data Mining and Knowledge Discovery Handbook: A Complete Guide for Practitioners and Researchers, Kluwer Academic Publishers, 2005.
  8. PK-tree: a spatial index structure for high dimensional point data (extended version), by Wei Wang, Jiong Yang, and Richard Muntz, Information Organization and Databases, Kluwer Academic Publishers, 2000.
  9. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, in Advances in Persistent Object Systems, pp. 199-214, Morgan Kauffmann, 1999.
  10. Extension of lower probabilities and coherence of belief measures, Advances in Intelligent Computing, edited by B. Bouchon - Meunier, R. R. Yager, and L. A. Zadeh, Springer Verlag, pp. 62-69, 1995.
BOOKS
  1. Mining Sequential Patterns from Large Data Sets, by Wei Wang and Jiong Yang, in Series of Advances in Database Systems, edited by Ahmed Elmagarmid, Kluwer, 2005.
  2. Advances in Web-Age Information Management --- Lecture Notes in Computer Science No. 2762, edited by Guozhu Dong, Changjie Tang, and Wei Wang, Springer-Verlag, 2003.
SOFTWARES
  1. Spotlite: Web Application and Augmented Algorithms for Predicting Co-Complexed Proteins from Affinity Purification - Mass Spectrometry Data
  2. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level
  3. GAIN: Efficient genome ancestry inference in complex pedigrees with inbreeding
  4. Multi-Alignment and Read Annotation Pipeline: Lapels, Suspenders
  5. ASGENSENG: A software to detect allele specific CNV from both WGS and WES data
  6. GENSENG: Improving detection of copy number variation by simultaneous bias correction and read-depth segmentation
  7. MaCH-Admix: genotype imputation for admixed populations
  8. GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference due to RNAseq reads misalignment
  9. HTreeQA: using semi-perfect phylogeny trees in quantitative trait loci study on genotype data
  10. LTS: Discriminative subgraph mining by learning from search history
  11. GAIA: Graph classification using evolutionary computation
  12. REM: Rapid and Robust Resampling-Based Multiple-Testing Correction with Application in a Genome-Wide Expression Quantitative Trait Loci Study
  13. FastANOVA: an Efficient Algorithm for Genome-Wide Association Study
  14. Inferring Genome-wide Mosaic Structure
  15. Genotype Sequence Segmentation
  16. TreeQA: Tree-based Genome-wide Association Mapping
  17. NPUTE: Fast Algorithm for Imputing Missing Genotypes in SNPs
  18. FFSM: Fast Frequent Subgraph Mining
PATENTS
  1. System and method for identifying coherent objects with application to E-commerce, 2003.
  2. System and probabilistic method for mining long patterns, 2001.
  3. System and method for mining patterns with noise, 2001.
  4. System and method for meta pattern discovery, 2001. (US Patent 6785663, issued on August 31st, 2004)
  5. Methods for identifying partial periodic patterns and corresponding event subsequences in an event sequence, 2000. (US Patent 6718317, issued on April 6th, 2004)
  6. Methods for identifying partial periodic patterns of infrequent events in an event sequence, 2000.
  7. Methods for mining weighted association rule, 2000 (US Patent 6415287, issued on July 2nd, 2002).