Wei Wang's Resume

Wei Wang, Ph. D.
Leonard Kleinrock Professor
Computer Science Department
University of California, Los Angeles
Los Angeles, CA 90095-1596
Voice: (310) 794-0009
E-mail: weiwang@cs.ucla.edu
URL: http://www.cs.ucla.edu/~weiwang/

RESEARCH INTEREST	Big Data Analytics, Data Mining, Machine Learning, Natural Language Processing, Bioinformatics and Computational Biology, Computational Medicine, AI for Science

EDUCATION
Jul. 1999	Ph.D., Department of Computer Science, UCLA.
May 1995	M.S., Department of Systems Science and Industrial Engineering, SUNY at Binghamton.

WORK EXPERIENCE
2018 - present	Professor at Department of Computational Medicine, University of California at Los Angeles
2014 - present	Director of Scalable Analytics Institute, University of California at Los Angeles
2012 - present	Professor at Department of Computer Science, University of California at Los Angeles
2010 - 2012	Professor at Department of Computer Science, University of North Carolina at Chapel Hill
2006 - 2010	Associate Professor at Department of Computer Science, University of North Carolina at Chapel Hill
2002 - 2006	Assistant Professor at Department of Computer Science, University of North Carolina at Chapel Hill
1999 - 2002	Research Staff Member at IBM T.J. Watson Research Centers

HONORS AND AWARDS	IEEE Fellow, 2022.
	ACM Fellow, 2020.
	Best Student Paper Award, ACM BCB 2020 for the paper "Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases".
	Grand Prize Award of the Yelp Dataset Challenge (Round 9), 2017.
	ACM SIGKDD Service Award, 2016.
	Best Research Paper Runner Up Award, SIGKDD 2016 for the paper "Ranking causal anomalies via temporal and dynamical analysis on vanishing correlations".
	Okawa Foundation Research Award, 2013.
	IEEE ICDM Outstanding Service Award, 2012.
	Best Research Paper Award, SIGKDD 2008 for the paper "FastANOVA: an efficient algorithm for genome-wide association study".
	Best Student Paper Award, ICDE 2008 for the paper "CARE: finding local linear correlations in high dimensional data".
	Phillip and Ruth Hettleman Prize for Artistic and Scholarly Achievement, UNC, 2007.
	Microsoft Research New Faculty Fellow, Microsoft, 2005.
	Faculty Early Career Development (CAREER) Award, NSF, 2005.
	Junior Faculty Development Award, UNC, 2003.
	Invention Achievement Award, IBM, 2001.
	Invention Achievement Award, IBM, 2000.
	Dean's Graduate Fellowship, UCLA, 1999.
	Adjudged one of the best papers of ICDE 1999 for the paper "STING+: an approach to active spatial data mining".
	NCR Graduate Fellowship, 1997 - 1998.
	Outstanding Academic Achievement awarded by SUNY at Binghamton, May 1995.
	Distinguished Student awarded by Nankai University, 1993.
	Fellowships, Nankai University, 1991 - 1993.

PROFESSIONAL ACTIVITIES	Steering Committee Member of the ACM WSDM Conference (2022 - present)
	Member of ACM SIG Governing Board (SGB) (2025 - 2027)
	Chair of ACM Special Interest Group on Knowledge Discovery in Data (SIGKDD) (2021 - 2025)
	Guest Editor of the NPJ Artificial Intelligence Special Issue on AI for Biology, 2025
	IEEE Computer Society Fellow Committee (2023)
	Hong Kong Research Impact Fund Committee (2023 Ð 2026)
	ACM SGB Task Force on SIG overhead (2022)
	Editorial Board Member of the Journal of Computational Biology (2017 - present)
	Steering Committee Member of the IEEE Big Data Conference (2017 - present)
	Associated Editor of the IEEE/ACM Transactions on Computational Biology and Bioinformatics (2015 - 2024)
	Associate Editor of the ACM Transactions on Knowledge Discovery in Data (2005 - 2009, 2015 - 2017)
	Board of Directors of the ACM Special Interest Group on Bioinformatics, Computational Biology, and Biomedical Informatics (SIGBio) (2015 - 2019)
	Action Editor for Data Mining and Knowledge Discovery (2014 - present)
	Associate Editor of the IEEE Transactions on Big Data (2014 - 2018)
	Guest Editor of the ACM Transactions of Knowledge Discovery in Data Special Issue on Best Papers in 2014 KDD (2015)
	Guest Editor of the ACM Transactions on Knowledge Discovery in Data Special Issue on Bioinformatics (2007)
	Associate Editor of the International Journal of Knowledge Discovery in Bioinformatics (2009 - present)
	Associate Editor of the Knowledge and Information Systems (2007 - 2014)
	Review Board Member of the Proceedings of the VLDB Endowment (2008 - 2010)
	Editorial Board Member of the Open Artificial Intelligence Journal (2007 - present)
	Editorial Board Member of the International Journal of Data Mining and Bioinformatics (2005 - present)
	Associate Editor of the IEEE Transactions on Knowledge and Data Engineering (2003 - 2007)
	Editorial Board Member of the Journal of Database Management (2000 - 2005)
	Guest Editor of the IEEE Transactions on Knowledge and Data Engineering Special Issue on Mining Biological Data vol. 17 no. 8 (2005)
	Intensive Working Group Member of the ACM SIGKDD Curriculum Committee (2003 - 2004)
	Standing Member of the NIH BDMA program (2010 - 2014)

	Senior Area Chair of the 42nd International Conference on Machine Learning (2025)
	Co-organizer of the ICML Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences (2025)
	Area Chair of the 39th AAAI Conference on Artificial Intelligence (2025)
	Senior Area Chair of the NeurIPS Datasets and Benchmarks Track (2024)
	Senior Program Committee Member of the 33rd ACM International Conference on Information and Knowledge Management (2024)
	Best Paper Award Committee Member of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2024)
	Award Committee Member of the SIAM International Conference on Data Mining (2024)
	Senior Area Chair of the 41st International Conference on Machine Learning (2024)
	Track co-chair of the Web Mining and Content Analysis track of The ACM Web Conference (2024)
	Area Chair of the 38th AAAI Conference on Artificial Intelligence (2024)
	Senior Area Chair of the NeurIPS Datasets and Benchmarks Track (2023)
	Senior Program Committee Member of the 32nd ACM International Conference on Information and Knowledge Management (2023)
	Member of the ACM SIGKDD Award Committee (2023)
	Program Committee Member of the Highlights track of the 27th Annual International Conference on Research in Computational Molecular Biology (2023)
	Program Committee Member of the 27th Annual International Conference on Research in Computational Molecular Biology (2023)
	Area Chair of the 37th AAAI Conference on Artificial Intelligence (2023)
	Program Committee Member of the 5th International Workshop on Health Natural Language Processing (2022)
	Senior Program Committee Member of the 31st ACM International Conference on Information and Knowledge Management (2022)
	Senior Program Committee Member and Organizer of the Trustworthy AI Day of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2022)
	Member of the ACM SIGKDD Award Committee (2022)
	Program Committee Member of the 26th International Conference on Research in Computational Molecular Biology (2022)
	Associate Chair of the 38th IEEE International Conference on Data Engineering (2022)
	Area Chair of the 36th AAAI Conference on Artificial Intelligence (2022)
	Program committee of the 25th International Conference on Research in Computational Molecular Biology (2021)
	Senior Program Committee Member and Best Paper Award Committee Member of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2021)
	Program Committee Member of the 4th International Workshop on Health Natural Language Processing (2021)
	Area Chair of the 35th AAAI Conference on Artificial Intelligence (2021)
	Program Committee Co-Chair of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2020)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2020)
	Area Chair of the 34th AAAI Conference on Artificial Intelligence (2020)
	Program Committee Co-Chair of the 13th ACM International Conference on Wed Search and Data Mining (2020)
	Program Committee Member of the 19th IEEE International Conference on Data Mining (2019)
	Program Committee of RECOMB/ISCB Conference on Regulatory and Systems Genomics with DREAM Challenges (2019)
	Senior Program Committee Member of the 28th ACM International Conference on Information and Knowledge Management (2019)
	Senior Program Committee Member of the 6th IEEE International Conference on Data Science and Advanced Analytics (2019)
	Program Committee Member of the Annual International Conference on Intelligent Systems for Molecular Biology (2019)
	Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2019)
	Program Committee Member of the second International Workshop on Health Natural Language Processing, in conjunction with the IEEE International Conference on Health Informatics (2019)
	Tutorial Co-Chair of the Web Conference (2019)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2019)
	Senior Program Committee Member of the 23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining (2019)
	Senior Program Committee Member of the IEEE International Conference on Big Data (2018)
	Program Committee Member of the IEEE International Conference on Big Knowledge (2018)
	Program Committee member of the 27th ACM International Conference on Information and Knowledge Management (2018)
	Program Committee Co-Chair of the 5th IEEE International Conference on Data Science and Advanced Analytics (2018)
	Student Travel Award Chair and Senior Program Committee Member of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2018)
	Program Committee Member of the Annual International Conference on Intelligent Systems for Molecular Biology (2018)
	Program Committee Member of the 1st International Workshop on Health Natural Language Processing, in conjunction with the IEEE International Conference on Health Informatics (2018)
	Senior Program Committee Member of the 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (2018)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2018)
	Senior Program Committee Member of the 11th ACM International Conference on Web Search and Data Mining (2018)
	Senior Program Committee Member of the IEEE International Conference on Big Data (2017)
	Program Committee Member of the 16th IEEE International Conference on Data Mining (2017)
	Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2017)
	Organization Committee Member of the 7th Annual Translational Bioinformatics Conference (2017)
	Program Committee Member of the International Joint Conference on Artificial Intelligence (2017)
	System Biology Track Co-Chair of the 8th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (2017)
	Award Committee Chair and Program Committee Member of the 23rd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2017)
	Senior Program Committee Member of the 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining (2017)
	Program Committee Co-Chair of the SIAM International Conference on Data Mining (2017)
	Program Committee Member of the 10th ACM International Conference on Web Search and Data Mining (2017)
	Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2016)
	Chair of the Test of Time Award Committee and Senior Program Committee Member of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2016)
	Program Committee Member of the Annual International Conference on Intelligent Systems for Molecular Biology (2016)
	Program Committee Member of the 32nd IEEE International Conference on Data Engineering (2016)
	Panel Chair and Senior Program Committee Member of the SIAM International Conference on Data Mining (2016)
	Senior Program Committee Member of the 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2016)
	Program Committee Member of the 20th Annual International Conference on Research in Computational Molecular Biology (2016)
	Program Committee Co-Chair of the 10th IEEE International Conference on Semantic Computing (2016)
	Area Chair of the 14th IEEE International Conference on Data Mining (2015)
	Conference Co-Chair of the IEEE International Conference on Data Science and Advanced Analytics (2015)
	Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2015)
	Committee Member of the ACM SIGKDD Doctoral Dissertation Award (2015)
	Committee Member of the ACM SIGKDD Test of Time Award (2015)
	Student Travel Award Co-Chair and Senior Program Committee Member of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2015)
	Senior Program Committee Member of the International Joint Conferences on Artificial Intelligence (2015)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2015)
	Senior Program Committee Member of the 19th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2015)
	Program Committee Co-Chair of the 9th IEEE International Conference on Semantic Computing (2015)
	Workshop Co-Chair of the 14th IEEE International Conference on Data Mining (2014)
	General Co-Chair of the 5th ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics (2014)
	Program Committee Co-Chair of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2014)
	Senior Program Committee Member of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2014)
	Program Committee Member of the 13th IEEE International Conference on Data Mining (2013)
	Senior Program Committee Member of the ACM International Conference on Information and Knowledge Management (2013)
	Program Committee Member of the IEEE International Conference on Big Data (2013)
	Senior Program Committee Member of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2013)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2013)
	Senior Program Committee Member of the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2013)
	Program Committee Member of the 29th International Conference on Data Engineering (2013)
	Vice Chair of the 12th IEEE International Conference on Data Mining (2012)
	Track Co-chair of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2012)
	Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2012)
	Asia Pacific Track Co-chair and Senior Program Committee Member of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2012)
	Program Committee member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2012)
	Program Committee Member of the International Conference on Machine Learning (2012)
	Senior Program Committee Member of the 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2012)
	Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2012)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2012)
	Program Committee Member of the 28th International Conference on Data Engineering (2012)
	General Co-chair of the 11th IEEE International Conference on Data Mining (2011)
	Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2011)
	Program Committee Member of the 20th ACM Conference on Information and Knowledge Management (2011)
	Program Committee Member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2011)
	Senior Program Committee Member of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2011)
	Program Committee Member of the 10th International Workshop on Data Mining in Bioinformatics (2011)
	Program Committee Co-chair of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2011)
	Senior Program Committee Member of the SIAM International Conference on Data Mining (2011)
	Program Committee Member of the 14th International Conference on Extending Database Technology (2011)
	Program Committee Member of the IEEE International Conference on Bioinformatics and Biomedicine (2010)
	Vice Chair and Awards Chair of the 10th IEEE International Conference on Data Mining (2010)
	Program Committee Member of the 19th ACM Conference on Information and Knowledge Management (2010)
	Program Committee Member of the 8th International Conference on Computational Systems Bioinformatics (2010)
	Program Committee Member and Best Paper Awards Chair of the ACM International Conference on Bioinformatics and Computational Biology (2010)
	Program Committee Member of the 9th International Workshop on Data Mining in Bioinformatics (2010)
	Program Committee Member of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2010)
	Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2010)
	Vice Chair of the 26th International Conference on Data Engineering (2010)
	Program Committee Co-Chair of the 8th IEEE International Conference on Data Mining (2009)
	Program Committee Member of the 18th ACM Conference on Information and Knowledge Management (2009)
	Program Committee Member of the 35th International Conference on Very Large Data Bases (2009)
	Program Committee Member of the 8th International Conference on Computational Systems Bioinformatics (2009)
	Awards Chair of the 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2009)
	Publicity Co-Chair of the SIAM International Conference on Data Mining (2009)
	Program Committee Member of the 14th International Conference on database Systems for Advanced Applications (2009)
	Vice Chair of the 25th International Conference on Data Engineering (2009)
	Program Committee Member of the 8th IEEE International Conference on Data Mining (2008)
	Program Committee Member of the 17th ACM Conference on Information and Knowledge Management (2008)
	Program Committee Member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2008)
	Program Committee Member of the 34th International Conference on Very Large Data Bases (2008)
	Program Committee Member of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008)
	Program Committee Member of the 8th International Workshop on Data Mining in Bioinformatics (2008)
	Area Chair of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2008)
	Proceedings Chair and Program Committee Member of the SIAM International Conference on Data Mining (2008)
	Program Committee Member of the 24th IEEE International Conference on Data Engineering (2008)
	Program Committee Member of the 13th International Conference on Database Systems for Advanced Applications (2008)
	Program Committee member of the 16th ACM Conference on Information and Knowledge Management (2007)
	General Co-chair of the 2nd International Workshop on Data and Text Mining in Bioinformatics in Conjunction with the 16th ACM Conference on Information and Knowledge Management (2007)
	Vice Chair of the 7th IEEE International Conference on Data Mining (2007)
	Program Committee Co-chair of the Workshop on Mining and Management of Biological Data, in Conjunction with the 7th IEEE International Conference on Data Mining (2007)
	Program Committee member of the 2nd Workshop on Data Mining in Bioinformatics in Conjunction with the 33rd International Conference on Very Large Data Bases (2007)
	Program Committee Member of the 9th International Conference on Data Warehousing and Knowledge Discovery (2007)
	Program Committee Member of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007)
	Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2007)
	Area Chair of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2007)
	Program Committee Member of the SIAM International Conference on Data Mining (2007)
	Program Committee Member of the 12th International Conference on Database Systems for Advanced Applications (2007)
	Program Committee Member of the 6th IEEE International Conference on Data Mining (2006)
	Program Committee Member of the 13th International Conference on Management of Data (2006)
	Program Committee Member of the 15th ACM Conference on Information and Knowledge Management (2006)
	Program Committee Member of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (2006)
	Program Committee Member of the 32nd International Conference on Very Large Data Bases (2006)
	Program Committee Member of the Ph.D. Workshop in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
	Program Committee Member of the Workshop on Data Mining in Bioinformatics in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
	Program Committee Member of the 8th International Conference on Data Warehousing and Knowledge Discovery (2006)
	Senior Program Committee Member of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
	Program Committee Member of the 6th International Workshop on Data Mining in Bioinformatics in Conjunction with 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
	Program Committee Member of the 2nd International Conference on Advanced Data Mining and Applications (2006)
	Program Committee Member of the 11th International Conference on Database Systems for Advanced Applications (2006)
	Program Committee Member of the 22nd IEEE International Conference on Data Engineering (2006)
	Program Committee Member of the International Conference on Semantics of a Networked World (2006)
	Program Committee Member of the 10th International Conference on Extending DataBase Technology (2006)
	Program Committee Member of the 4th Asia-Pacific Bioinformatics Conference (2006)
	Program Committee Member of the 5th IEEE International Conference on Data Mining (2005)
	Program Committee Member of the 5th IEEE Symposium on Bioinformatics and Bioengineering (2005)
	Program Committee Member of the 6th International Conference on Web-Age Information Management (2005)
	Program Committee Member of the 31st International Conference on Very Large Data Bases (2005)
	Program Committee Member of the Ph.D. Workshop at the 31st International Conference on Very Large Data Bases (2005)
	Program Committee Member of the 3rd International Workshop on Biological Data Management in Conjunction with the 16th International Conference on Database and Expert Systems Applications (2005)
	Program Committee Member of 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
	Program Committee Co-chair of the 5th Workshop on Data Mining in Bioinformatics in Conjunction with the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
	Program Committee Member of the 1st International Conference on Advanced Data Mining and Applications (2005)
	Program Committee Member of the IEEE Workshop on Computer Vision methods for Bioinformatics in Conjunction with IEEE International Conference on Computer Vision and Pattern Recognition (2005)
	Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
	Corporate Sponsor Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
	Program Committee Member of the 7th Asia Pacific Web Conference (2005)
	Program Committee Member of the ACM Symposium on Applied Computing (2005)
	Scientific Committee Member of the International Conference on Computational and Information Sciences (2004)
	Program Committee Member of the 13th ACM Conference on Information and Knowledge Management (2004)
	Program Committee Member of the 4th IEEE International Conference on Data Mining (2004)
	Program Committee Member of the ICDM'04 Workshop on Life Sciences Data Mining (2004)
	Program Committee Member of the 1st International Workshop on Knowledge Discovery in Data Streams in conjunction with the 15th European Conference on Machine Learning (2004)
	Program Committee Member of the 2nd International Workshop on Biological Data Management in conjunction with the 15th International Conference on Database and Expert Systems Applications (2004)
	Program Committee Member of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
	Program Committee Member of the 4th Workshop on Bioinformatics in Data Mining in conjunction with the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
	Program Committee Member of the 5th International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (2004)
	Program Committee Member of the 2nd International Conference on Software Engineering Research, Management & Applications (2004)
	Program Committee Member of the 6th Asia Pacific Web Conference (2004)
	Scientific Committee Member of the IADIS International Conference on Applied Computing (2004)
	Program Committee Member of the ACM Symposium on Applied Computing (2004)
	Proceedings Chair of the 4th International Conference on Web-Age Information Management (2003)
	Program Committee Member of the 4th International Conference on Web-Age Information Management (2003)
	Program Committee Member of the 15th International Conference on Scientific and Statistical Database Management (2003)
	Program Committee Member of the International Workshop on Mining Spatial and Temporal Data (2001)
	Session Chair of the 24th IEEE International Conference on Data Engineering (2008)
	Session Chair of the 7th IEEE International Conference on Data Mining (2007)
	Session Chair of the SIAM International Conference on Data Mining (2007)
	Session Chair of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
	Session Chair of the 22nd IEEE International Conference on Data Engineering (2006)
	Session Chair of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
	Session Chair of the ACM SIGMOD International Conference on Management of Data (2005)
	Session Chair of the 4th IEEE International Conference on Data Mining (2004)
	Session Chair of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
	Session Chair of the 3rd SIAM International Conference on Data Mining (2002)
	Session Chair of the 1st IEEE International Conference on Data Mining (2001)
	Referee for ACM SIGMOD, ACM SIGMETTRICS, VLDB, ACM SIGKDD, ICDE, FODO conferences (1997-present)

PUBLICATIONS
ARTICLES IN REFEREED CONFERENCES
	MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery, by Jianpeng Chen, Wangzhi Zhan, Haohui Wang, Zian Jia, Jingru Gan, Junkai Zhang, Jingyuan Qi, Tingwei Chen, Lifu Huang, Muhao Chen, Ling Li, Wei Wang, and Dawei Zhou, Proceedings of the 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2025. Neural Network Pruning for Invariance Learning, by Derek Xu, Yuanzhou Chen, Yizhou Sun, and Wei Wang, Proceedings of the 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2025. Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection, by Mingyu Derek Ma, Yanna Ding, Zijie Huang, Jianxi Gao, Yizhou Sun, and Wei Wang, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025. GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation, by Jiashu He, Mingyu Derek Ma, Jinxuan Fan, Dan Roth, Wei Wang, and Alejandro Ribeiro, Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025. Rethink GraphODE Generalization within Coupled Dynamical System, by Guancheng Wan, Zijie Huang, Wanjia Zhao, Xiao Luo, Yizhou Sun, and Wei Wang, Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025. ShuttleSHAP: A Turn-Based Feature Attribution Approach for Analyzing Forecasting Models in Badminton, by Wei-Yao Wang, Wen-Chih Peng, Wei Wang, and Philip S. Yu, Proceedings of the 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2025. GSR-Bench: Benchmarking Deployment in GitHub Science Repositories with LLMs, by Yijia Xiao, Runhui Wang, Luyang Kong, Davor Golac, and Wei Wang, Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2025. MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design, by Jingyuan Qi, Zian Jia, Minqian Liu, Wangzhi Zhan, Junkai Zhang, Xiaofei Wen, Jingru Gan, Jianpeng Chen, Qin Liu, Mingyu Derek Ma, Bangzheng Li, Haohui Wang, Adithya Kulkarni, Muhao Chen, Dawei Zhou, Ling Li, Wei Wang, Lifu Huang, Demo Track, Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2025. Large Language Models Are Innate Crystal Structure Generators, by Jingru Gan, Peichen Zhong, Yuanqi Du, Yanqiao Zhu, Chenru Duan, Haorui Wang, Daniel Schwalbe-Koda, Carla P Gomes, Kristin Persson, and Wei Wang, Proceedings of the ICLR-AI4MAT Workshop, 2025. Large Language Models Are Innate Crystal Structure Generators, by Jingru Gan, Peichen Zhong, Yuanqi Du, Yanqiao Zhu, Chenru Duan, Haorui Wang, Daniel Schwalbe-Koda, Carla P Gomes, Kristin Persson, and Wei Wang, Proceedings of the ICLR Workshop on Agentic AI, 2025. ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding, by Yijia Xiao, Edward Sun, Yiqiao Jin, Qifan Wang, and Wei Wang, Proceedings of ICLR-MLGenX Workshop, 2025. DoMiNO: Down-scaling Molecular Dynamics with Neural Graph Ordinary Differential Equations, by Fang Sun, Zijie Huang, Yadi Cao, Xiao Luo, Wei Wang, and Yizhou Sun, Proceedings of ICLR-MLMP Workshop, 2025. Mixture of In-Context Prompters for Tabular PFNs, by Derek Xu, F Olcay Cirit, Reza Asadi, Yizhou Sun, and Wei Wang, Proceedings of the 13th International Conference on Learning Representations (ICLR), 2025. Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation, by Yijun Tian, Yikun Han, Xiusi Chen, Wei Wang, and Nitesh Chawla, Proceedings of the 18th ACM International Conference on Web Search and Data Mining (WSDM), pp. 251-260, 2025. Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction, by Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao, Anthony Cuturrufo, Vijay S Nori, Eran Halperin, and Wei Wang, Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI), 2025. TradingAgents: Multi-agent LLM financial trading framework, by Yijia Xiao, Edward Sun, Di Luo and Wei Wang. Multi-Agent AI in the Real World Track, Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI), 2025. Non-Euclidean Mixture Model for Social Network Embedding by Roshni Iyer, Yewen Wang, Wei Wang, and Yizhou Sun, Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. Enhancing Large Vision Language Models with Self-Training on Image Comprehension, by Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, Quanquan Gu, James Zou, Kai-Wei Chang, and Wei Wang, Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling, by Zijie Huang, Wanjia Zhao, Jingdong Gao, Ziniu Hu, Xiao Luo, Yadi Cao, Yuanzhou Chen, Yizhou Sun, and Wei Wang, Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. GraphVis: Boosting LLMs with Visual Knowledge Graph Integration, by Yihe Deng, Chenchen Ye, Zijie Huang, Mingyu Derek Ma, Yiwen Kou, and Wei Wang, Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. BrainODE: Dynamic Brain Signal Analysis via Graph-Aided Neural Ordinary Differential Equations, Kaiqiao Han, Yi Yang, Zijie Huang, Xuan Kan, Ying Guo, Yang Yang, Lifang He, Liang Zhan, Yizhou Sun, Wei Wang, Carl Yang, Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics (BHI), 2024. A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery, by Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness, by Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, and Nanyun Peng, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. Large Language Models Can Be Contextual Privacy Protection Learners, by Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach, by Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, and Diyi Yang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. BioinformaticsBench: A collaboratively built large language model benchmark for Bioinformatics reasoning, by Varuni Sarwal, Seungmo Lee, Rosemary He, Aingela Kattapuram, Xiaoxuan Wang, Eleazar Eskin, Wei Wang, and Serghei Mangul, ICML Workshop on Accessible and Efficient Foundation Models for Biological Discovery (AccMLBio), 2024. BioinformaticsBench: A collaboratively built large language model benchmark for Bioinformatics reasoning, by Varuni Sarwal, Seungmo Lee, Rosemary He, Aingela Kattapuram, Xiaoxuan Wang, Yijia Xiao, Serghei Mangul, and Wei Wang, ICML Workshop on Data-Centric Machine Learning Research (DMLR), 2024. PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion, by Xiusi Chen, Wei-Yao Wang, Ziniu Hu, David Reynoso, Kun Jin, Mingyan Liu, Jeffrey Brantingham, and Wei Wang, Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM), 2024. MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering, by Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, and Wei Wang, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), pp.254-266, 2024. Improving Event Definition Following for Zero-Shot Event Detection, by Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, and Nanyun Peng, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), pp. 2842-2863, 2024. The PGNSC Benchmark: How Do We Predict Where Information Spreads?, by Alexander K Taylor and Wei Wang, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), pp. 15787-15803, 2024. SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models, by Xiaoxuan Wang, Ziniu Hu, Pan Lu, Yanqiao Zhu, Jieyu Zhang, Satyen Subramaniam, Arjun R Loomba, Shichang Zhang, Yizhou Sun, and Wei Wang, Proceedings of the 41st International Conference on Machine Learning (ICML), 2024. Mitigating Bias for Question Answering Models by Tracking Bias Influence, by Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, and Nanyun Peng, Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024. IterAlign: Iterative Constitutional Alignment of Large Language Models, by Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, and Wei Wang, Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 1423-1433, 2024. Event Detection from Social Media for Epidemic Prediction, by Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric J Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, and Kai-Wei Chang, Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 5758-5783, 2024. Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems, by Zijie Huang, Jeehyun Hwang, Junkai Zhang, Jinwoo Baik, Weitong Zhang, Dominik Wodarz, Yizhou Sun, Quanquan Gu, and Wei Wang, Proceedings of the ACM Web Conference (WWW), pp. 4607-4617, 2024. Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks, by Yanqiao Zhu, Jeehyun Hwang, Keir Adams, Zhen Liu, Bozhao Nan, Brock Stenfors, Yuanqi Du, Jatin Chauhan, Olaf Wiest, Olexandr Isayev, Connor W. Coley, Yizhou Sun, and Wei Wang, Proceedings of the 12th International Conference on Learning Representations (ICLR), 2024. STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models, by Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung, P. Jeffrey Brantingham, Nanyun Peng, Wei Wang, Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI), 2024. Universality and Limitations of Prompt Tuning, by Yihan Wang, Jatin Chauhan, Wei Wang, and Cho-Rui Hsieh, Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS), 2023. Learning under Label Proportions for Text Classification, by Jatin Chauhan, Xiaoxuan Wang, and Wei Wang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 12210-12223, 2023. Graph-based Molecular Representation Learning, by Zhichun Guo, Kehan Guo, Bozhao Nan, Yijun Tian, Roshni G. Iyer, Yihong Ma, Olaf Wiest, Xiangliang Zhang, Wei Wang, Chuxu Zhang, and Nitesh V. Chawla, Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI), pp. 6638-6646, 2023. Generalizing Graph ODE for Learning Complex System Dynamics across Environments, by Zijie Huang, Yizhou Sun, and Wei Wang, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 798-809, 2023. DICE: Data-Efficient Clinical Event Extraction with Generative Models, by Mingyu Derek Ma, Alexander K. Taylor, Wei Wang, and Nanyun Peng, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), pp. 15898-15917, 2023. Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs, by Zijie Huang, Daheng Wang, Binxuan Huang, Chenwei Zhang, Jingbo Shang, Yan Liang, Zhengyang Wang, Xian Li, Christos Faloutsos, Yizhou Sun, and Wei Wang, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), pp. 10105-10118, 2023. Introducing Semantics into Speech Encoders, by Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Bing Liu, Hung-yi Lee, Yizhou Sun, and Wei Wang, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), pp. 11413-11429, 2023. InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks, by Seungbae Kim, Jyun-Yu Jiang, Jinyoung Han, and Wei Wang. Proceedings of the 17th International AAAI Conference on Web and Social Media (ICWSM), 17(1), pp. 482-493, 2023. Where Does Your News Come From? Predicting Information Pathways in Social Media, by Alexander Taylor, Nuan Wen, Po-Nien Kung, Jiaao Chen, Violet Peng, and Wei Wang, Proceedings of the 45th ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR), pp. 2511-2515, 2023. Code Recommendation for Open-source Software Developers, by Yiqiao Jin, Yunsheng Bai, Yanqiao Zhu, Yizhou Sun, and Wei Wang, Proceedings of the ACM Web Conference (WWW), pp. 1324-1333, 2023. Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation, by Xiusi Chen, Yu Zhang, Jinliang Deng, Jyun-Yu Jiang, and Wei Wang, Proceedings of the 23rd SIAM International Conference on Data Mining (SDM), 2023. Scalable Graph Representation Learning via Locality Sensitive Hashing, by Xiusi Chen, Jyun-Yu Jiang, and Wei Wang, Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), 2022. ReLiable: Offline Reinforcement Learning for Tactical Strategies in Professional Basketball Games, by Xiusi Chen, Jyun-Yu Jiang, Kun Jin, Yichao Zhou, Mingyan Liu, Paul Jeffrey Brantingham, and Wei Wang, Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), 2022. Multi-source Inductive Knowledge Graph Transfer, by Junheng Hao, Lu-An Tang, Yizhou Sun, Zhengzhang Chen, Haifeng Chen, Junghwan Rhee, Chichun Li, and Wei Wang, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2022. RLogic: Recurrent Logical Rule Learning from Knowledge Graphs, by Kewei Cheng, Jiahao Liu, Wei Wang, and Yizhou Sun, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2022. Dual-Geometric Space Embedding Model for Two-View Knowledge Graphs, by Roshni Iyer, Yunsheng Bai, Wei Wang, and Yizhou Sun, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2022. OpBerg: Discovering causal sentences using optimal alignments, by Justin Wood, Nicholas J. Matiasz, Alcino Silva, William Hsu, Alexej Abyzov, and Wei Wang, Proceedings of the 24th International Conference on Big Data Analytics and Knowledge Discovery (DaWaK), 2022. A Bayesian Topic Model for Human-Evaluated Interpretability, by Justin Wood, Corey Arnold and Wei Wang, Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC), pp. 6271-6279, 2022. Multilingual Knowledge Graph Completion with Self-Supervised Adaptive Graph Alignment, by Zijie Huang, Zheng Li, Haoming Jiang, Tianyu Cao, Hanqing Lu, Bing Yin, Karthik Subbian, Yizhou Sun, and Wei Wang, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), pp.474-485, 2022. Towards Fine-grained Reasoning for Fake News Detection, by Yiqiao Jin, Xiting Wang, Ruichao Yang, Yizhou Sun, Wei Wang, Hao Liao, and Xing Xie, Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI), vol 36, no. 5, pp. 5746-5754, 2022. Bi-Level Attention Graph Neural Networks, by Roshni Iyer, Wei Wang, and Yizhou Sun, Proceedings of the 21st IEEE International Conference on Data Mining (ICDM), pp. 1126-1131, 2021. Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction, by Zeyu Li, Wei Cheng, Reema Kshetramade, John Houser, Haifeng Chen, and Wei Wang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 763-778, 2021. Powering Comparative Classification with Sentiment Analysis via Domain Adaptive Knowledge Transfer, by Zeyu Li, Yilong Qin, Zihan Liu, and Wei Wang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6818-6830, 2021. You Are What and Where You Are: Graph Enhanced Attention Network for Explainable POI Recommendation, by Zeyu Li, Wei Cheng, Haiqi Xiao, Wenchao Yu, Haifeng Chen, and Wei Wang, Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM), pp. 3945-3954, 2021. #StayHome or #Marathon? Social Media Enhanced Pandemic Surveillance on Spatial-temporal Dynamic Graphs, by Yichao Zhou, Jyun-Yu Jiang, Xiusi Chen, and Wei Wang, Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2738-2748, 2021. Towards Robustness of Deep Neural Networks via Regularization, by Yao Li, Martin Renqiang Min, Thomas Lee, Wenchao Yu, Erik Kruus, Wei Wang, and Cho-Jui Hsieh, Proceedings of the International Conference on Computer Vision (ICCV), pp. 7496-7505, 2021. MEDTO: Medical Data to Ontology Matching using Hybrid Graph Neural Networks, by Junheng Hao, Chuan Lei, Vasilis Efthymiou, Abdul Quamar, Fatma Ozcan, Yizhou Sun, and Wei Wang, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 2946-2954, 2021. Coupled Graph ODE for Learning Interacting System Dynamics, by Zijie Huang, Yizhou Sun, and Wei Wang, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 705-715, 2021. GLSearch: Maximum Common Subgraph Detection via Learning to Search, by Yunsheng Bai, Derek Xu, Yizhou Sun, and Wei Wang, Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 588-598, 2021. The Biased Coin Flip Process for Nonparametric Topic Modeling, by Justin Wood, Wei Wang, Corey Arnold, Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR), pp. 68-83, 2021. Evaluating Audience Loyalty and Authenticity in Influencer Marketing via Multi-task Multi-relational Learning, by Seungbae Kim, Xiusi Chen, Jyun-Yu Jiang, Jinyoung Han, and Wei Wang, Proceedings of the International AAAI Conference on Web and Social Media (ICWSM), pp. 278-289, 2021. JEDI: Circular RNA Prediction based on Junction Encoders and Deep Interaction among Splice Sites, by Jyun-Yu Jiang, Chelsea J.-T. Ju, Junheng Hao, Muhao Chen, and Wei Wang, Proceedings of the 29th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 37, pp. i289-i298, 2021. On the Utility of Combining Topic Models and Recurrent Neural Networks, Justin Wood, Bohan Li, Jae Lee, Corey Arnold, and Wei Wang, Proceedings of the 17th International Conference on Computing and Information, Technology (IC2IT), pp. 66-76, 2021. CREATe: Clinical Report Extraction and Annotation Technology (demo), Yichao Zhou, Wei-Ting Chen, Bowen Zhang, David Lee, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, and Wei Wang, Proceedings of the 37th IEEE International Conference on Data Engineering (ICDE), pp. 2677-2680, 2021. Discovering Undisclosed Paid Partnership on Social Media via Aspect-Attentive Sponsored Post Learning, by Seungbae Kim, Jyun-Yu Jiang and Wei Wang, Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM), pp. 319-327, 2021. Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference, by Yichao Zhou, Yu Yan, Rujun Han, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, and Wei Wang, Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI), pp. 14647-14655, 2021. Learning Continuous System Dynamics from Irregularly-Sampled Partial Observations, by Zijie Huang, Yizhou Sun, and Wei Wang, Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS), pp. 16177-16187, 2020. Long Document Ranking with Query-Directed Sparse Transformer, by Jyun-Yu Jiang, Chenyan Xiong, Chia-Jung Lee, and Wei Wang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4594-4605, 2020. Fast Adaptation for Cold-start Collaborative Filtering with Meta-learning, by Tianxin Wei, Ziwei Wu, Ruirui Li, Ziniu Hu, Fuli Feng, Xiangnan He, Yizhou Sun, and Wei Wang, Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), pp. 661-670, 2020. P-Companion: A Principled Framework for Diversified Complementary Product Recommendation, by Junheng Hao, Tong Zhao, Jin Li, Xin Luna Dong, Christos Faloutsos, Yizhou Sun, and Wei Wang, Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2517-2524, 2020. MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning, by Jyun-Yu Jiang, Zeyu Li, Chelsea J.-T. Ju, and Wei Wang, Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), pp. 575-584, 2020. On-demand Influencer Discovery on Social Media, by Cheng Zheng, Qin Zhang, Sean D Young, and Wei Wang, Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2337-2340, 2020. Learning to Create Better Ads: Generation and Ranking Approaches for Ad Creative Refinement, by Shaunak Mishra, Manisha Verma, Yichao Zhou, Kapil Thadani and Wei Wang, Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2653-2660, 2020. SpEC: Sparse Embedding-based Community Detection in Attributed Graphs, by Huidi Chen, Yun Xiong, Changdong Wang, Yangyong Zhu and Wei Wang, Proceedings of the 25th International Conference on Database Systems for Advanced Applications (DASFAA), pp. 53-69, 2020. Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases, by Junheng Hao, Chelsea J.-T. Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, and Wei Wang, Proceedings of the 11th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB), pp. 42:1-10, 2020. (Best Student Paper) Node Classification in Temporal Graphs through Stochastic Sparsification and Temporal Structural Convolution, by Cheng Zheng, Bo Zong, Wei Cheng, Dongjin Song, Jingchao Ni, Wenchao Yu, Haifeng Chen, and Wei Wang, Proceedings of the European Conference on Machine Learning & Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), pp. 330-346, 2020. Bi-Level Attention Neural Architectures for Relational Data, by Roshni Iyer, Yizhou Sun, and Wei Wang, Proceedings of KDD Workshop on Deep Learning on Graphs: Methods and Applications (DLG-KDD), 2020. Social Media User Geolocation via Hybrid Attention, by Cheng Zheng, Jyun-Yu Jiang, Yichao Zhou, Sean Young, and Wei Wang, Proceedings of the 43rd ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR), pp. 1641-1644, 2020. "The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition, by Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, and Wei Wang, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 813-822, 2020. Robust Graph Representation Learning via Neural Sparsification, by Cheng Zheng, Bo Zong, Wei Cheng, Dongjin Song, Jingchao Ni, Wenchao Yu, Haifeng Chen, and Wei Wang, Proceedings of the 37th International Conference on Machine Learning (ICML), pp. 4470-4480, 2020. Bi-Level Graph Neural Networks for Drug-Drug Interaction Prediction. Yunsheng Bai, Ken Gu, Yizhou Sun, and Wei Wang, Proceedings of the ICML Workshop on Graph Representation Learning and Beyond (GRL+), 2020. Bi-Level Attention Neural Architectures for Relational Data. Roshni G. Iyer, Wei Wang, and Yizhou Sun, Proceedings of the ICML Workshop on Graph Representation Learning and Beyond (GRL+), 2020. Bridging Mixture Density Networks with Meta-Learning for Automatic Speaker Identification, by Ruirui Li, Jyun-Yu Jiang, Xian Wu, Hongda Mao, Chu-Cheng Hsieh, Wei Wang, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3522-3526, 2020. Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes, by Lu Wang, Wenchao Yu, Xiaofeng He, Wei Cheng, Martin Renqiang Ren, Wei Wang, Bo Zong, Haifeng Chen, Hongyuan Zha, Proceedings of the Web Conference, pp. 1785-1795, 2020. Multimodal Post Attentive Profiling for Influencer Marketing, by Seungbae Kim, Jyun-Yu Jiang, Masaki Nakada, Jinyoung Han and Wei Wang, Proceedings of the Web Conference, pp. 2878-2884, 2020. End-to-End Deep Attentive Personalized Item Retrieval for Online Content-sharing Platforms, by Jyun-Yu Jiang, Tao Wu, Georgios Roumpos, Heng-Tze Cheng, Xinyang Yi, Ed Chi, Harish Ganapathy, Nitin Jindal, Pei Cao and Wei Wang, Proceedings of the Web Conference, pp. 2870-2877, 2020. Few-Shot Learning for New User Recommendation in Location-based Social Networks, by Ruirui Li, Xian Wu, Xiusi Chen and Wei Wang, Proceedings of the Web Conference, pp. 2472-2478, 2020. Clustering and Constructing User Coresets to Accelerate Large-scale Top-K Recommender Systems, by Jyun-Yu Jiang, Patrick H. Chen, Cho-Jui Hsieh and Wei Wang, Proceedings of the Web Conference, pp. 2177-2187, 2020. Recommending Themes for Ad Creative Design via Visual-Linguistic Representations, by Yichao Zhou, Shaunak Mishra, Manisha Verma, Narayan Bhamidipati, Wei Wang, Proceedings of the Web Conference, pp. 2521-2527, 2020. Learning-based Efficient Graph Similarity Computation via Multi-Scale Convolutional Set Matching, by Yunsheng Bai, Hao Ding, Ken Gu, Yizhou Sun, and Wei Wang, Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI), pp. 3219-3226, 2020. Automatic Speaker Recognition with Limited Data, by Ruirui Li, Jyun-Yu Jiang, Jiahao Liu, Chu-Cheng Hsieh and Wei Wang, Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM), pp. 340-348, 2020. Adversarial Learning to Compare: Self-Attentive Prospective Customer Recommendation in Location based Social Networks, by Ruirui Li, Xian Wu and Wei Wang, Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM), pp. 349-357, 2020. Interpretable Click-through Rate Prediction through Hierarchical Attention, by Zeyu Li, Wei Cheng, Yang Chen, Haifeng Chen, and Wei Wang, Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM), pp. 313-321, 2020. On Generating Dominators of Customer Preferences, by Jiang Bian, Weibo Wang, Xiang Zhang, Wei Wang, Arthur Huang, and Zhishan Guo, Proceedings of the IEEE International Conference on Big Data (BigData), pp. 2177-2186, 2019. Self-Attentive Attributed Network Embedding Through Adversarial Learning, by Wenchao Yu, Wei Cheng, Charu Aggarwal, Bo Zong, Haifeng Chen, and Wei Wang, Proceedings of the 19th IEEE International Conference on Data Mining (ICDM), pp. 758-767, 2019. Learning Robust Representations with Graph Denoising Policy Network, by Lu Wang, Wenchao Yu, Wei Wang, Wei Cheng, Hongyuan Zha, Wei Zhang, Xiaofeng He, and Haifeng Chen, Proceedings of the 19th IEEE International Conference on Data Mining (ICDM), pp. 1378-1383, 2019. Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification, by Yichao Zhou, Jyun-Yu Jiang, Kai-Wei Chang, and Wei Wang, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4903-4912, 2019. Learning to Predict Human Stress Level with Incomplete Sensor Data from Wearable Devices, by Jyun-Yu Jiang, Zehan Chao, Andrea L. Bertozzi, Wei Wang, Sean Young and Deanna Needell, Proceedings of the 28th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2773-2781, 2019. Unsupervised Inductive Graph-Level Representation Learning via Graph-Graph Proximity, by Yunsheng Bai, Hao Ding, Yang Qiao, Agustin Marinovic, Ken Gu, Brian Lee, Yizhou Sun, and Wei Wang, Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), pp. 1988-1994, 2019. Learn smart with less: building better online decision trees with fewer training examples, by Ariyam Das, Jin Wang, Sahil Gandhi, Lae Lee, Wei Wang, and Carlo Zaniolo, Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), pp. 2209-2215, 2019. Universal Representation Learning of Knowledge Bases by Jointly Embedding Instances and Ontological Concepts, by Junheng Hao, Muhao Chen, Wenchao Yu, Yizhou Sun, and Wei Wang, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 1709-1719, 2019. Unsupervised Inductive Whole-Graph Embedding by Preserving Graph Proximity, by Yunsheng Bai, Hao Ding, Yang Qiao, Agustin Marinovic, Ken Gu, Ting Chen, Yizhou Sun, and Wei Wang, Proceedings of the KDD Workshop on Deep Learning on Graphs: Methods and Applications (DLG), 2019. Enhancing Air Quality Prediction with Social Media and Natural Language Processing, by Jyun-Yu Jiang, Xue Sun, Wei Wang, and Sean Young, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 2627-2632, 2019. Multifaceted Protein-Protein Interaction Prediction Based on Siamese Residual RCNN, by Muhao Chen, Chelsea J.-T. Ju, Guangyu Zhou, Xuelu Chen, Tianran Zhang, Kai-Wei Chang, Carlo Zaniolo, and Wei Wang. Proceedings of the 27th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 35, no. 14, pp. i305-i314, 2019. Unsupervised Inductive Whole-Graph Embedding by Preserving Graph Proximity, by Yunsheng Bai, Hao Ding, Yang Qiao, Agustin Marinovic, Ken Gu, Ting Chen, Yizhou Sun, and Wei Wang, Proceedings of the ICLR Workshop on Representation Learning on Graphs and Manifolds, 2019. Click Feedback-Aware Query Recommendation Using Adversarial Examples, by Ruirui Li, Liangda Li, Xian Wu, Yunhong Zhou, and Wei Wang, Proceedings of the World Wide Web Conference (WWW), pp. 2978-2984, 2019. DynGraphGAN: Dynamic Graph Embedding via Generative Adversarial Networks, by Yun Xiong, Yao Zhang, Hanjie Fu, Wei Wang, Yangyong Zhu, and Philip S. Yu, Proceedings of the International Conference on Database Systems for Advanced Applications (DASFAA), pp. 536-552, 2019. Personalized question routing via heterogeneous network embedding, by Zeyu Li, Jyun-Yu Jiang, Yizhou Sun, and Wei Wang, Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI), pp. 192-199, 2019. CORALS: Who are My Potential New Customers? Tapping into the Wisdom of Customers' Decisions, by Ruirui Li, Jyun-Yu Jiang, Chelsea Ju, and Wei Wang, Proceedings of the 12th ACM International Conference on Web Search and Data Mining (WSDM), pp. 69-71, 2019. SimGNN: A Neural Network Approach to Fast Graph Similarity Computation, by Yunsheng Bai, Hao Ding, Song Bian, Yizhou Sun, and Wei Wang, Proceedings of the 12th ACM International Conference on Web Search and Data Mining (WSDM), pp. 384-392, 2019. Convolutional Set Matching for Graph Similarity, by Yunsheng Bai, Hao Ding, Yizhou Sun, and Wei Wang. Proceedings of the NeurIPS Workshop on Relational Representation Learning, 2018. Inferring microbial communities for city scale metagenomics using neural networks, by Guangyu Zhou, Jyun-Yu Jiang, Chelsea Ju, and Wei Wang, Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 603-608, 2018. Predicting disease-related associations by heterogeneous network embedding, by Yun Xiong, Lu Ruan, Mengjie Guo, Xiangnan Kong, Yangyong Zhu, and Wei Wang, Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 548-555, 2018. On Multi-Query Local Community Detection, by Yuchen Bian, Yaowei Yan, Wei Cheng, Wei Wang, Dongsheng Luo, and Xiang Zhang, Proceedings of the 18th IEEE International Conference on Data Mining (ICDM), pp. 9-18, 2018. Learning Gender-Neutral Word Embeddings, by Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, and Kai-Wei Chang, Proceedings of the 23rd Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4847-4853, 2018. RIN: Reformulation Inference Network for Context-Aware Query Suggestion, by Jyun-Yu Jiang and Wei Wang, Proceedings of the 27th ACM Conference on Information and Knowledge Management (CIKM), pp. 197-206, 2018. NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks, by Wenchao Yu, Wei Cheng, Charu Aggarwal, Kai Zhang, Haifeng Chen, and Wei Wang, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(SIGKDD), pp. 2672-2681, 2018. Learning Deep Network Representations with Adversarially Regularized Autoencoders, by Wenchao Yu, Cheng Zheng, Wei Cheng, Charu Aggarwal, Dongjin Song, Bo Zong, Haifeng Chen, and Wei Wang, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(SIGKDD), pp. 2663-2671, 2018. Enhancing Response Generation Using Chat Flow Identification, by Ruirui Li, Jyun-Yu Jiang, Chelsea J.-T. Ju, Cheryl Flynn, Wen-Ling Hsu, Jia Wang, Wei Wang, and Tan Xu. Proceedings of the Conversational AI and its Applications Workshop at KDD 2018 (KDD CAI), 2018. Identifying Users behind Shared Accounts in Online Streaming Services, by Jyun-Yu Jiang, Cheng-Te Li, Yian Chen and Wei Wang, Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pp. 65-74, 2018. Learning to Disentangle Interleaved Conversational Threads with a Siamese Hierarchical Network and Similarity Ranking, by Jyun-Yu Jiang, Francine Chen, Yan-Ying Chen, and Wei Wang, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies (NAACL HLT), pp. 1812-1822, 2018. Modeling Co-Evolution Across Multiple Networks, Wenchao Yu, Charu Aggarwal, and Wei Wang, Proceedings of the 18th SIAM International Conference on Data Mining (SDM), pp. 675-683, 2018. Translating literature into causal graphs: toward automated experiment selection, by Nicholas Matiasz, Justin Wood, Wei Wang, Alcino Silva, William Hsu, Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 573-576, 2017. Event Detection and Summarization using Phrase Networks: PhraseNet, by Sara Melvin, Wenchao Yu, Peng Ju, Sean Young, and Wei Wang, Proceedings of the European Conference on Machine Learning & Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), pp. 89-101, 2017. Fleximer: Accurate Qantification of RNA-Seq via Variable-Length k-mers, by Chelsea J.-T. Ju, Ruirui Li, Zhengliang Wu, Jyun-Yu Jiang, Zhao Yang and Wei Wang, Proceedings of the 8th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB), pp. 263-272, 2017. Link Prediction with Spatial and Temporal Consistency in Dynamic Networks, by Wenchao Yu, Wei Cheng, Wei Wang, Charu Agarwal, and Haifeng Chen, Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pp. 3343-3349, 2017. Open Source Repository Recommendation in Social Coding, by Jyun-Yu Jiang, Pu-Jen Cheng and Wei Wang, Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pp. 1173-1176, 2017. Source-LDA: Enhancing probabilistic topic models using prior knowledge sources, by Justin Wood, Patrick Tan, Ariyam Das, Wei Wang, and Corey Arnold, Proceedings of the 33rd IEEE International Conference on Data Engineering (ICDE), pp. 411-422, 2017. Aztec: A Cloud-based Computational Platform to Integrate Biomedical Resources (demo), by Patrick Tan, Yichao Zhou, Xinxin Huang, Giuseppe M. Mazzeo, Chelsea Ju, Vinvent Kyi, Brian Bleakley, Peipei Ping, and Wei Wang, Proceedings of the 33rd IEEE International Conference on Data Engineering (ICDE), 2017. Temporally factorized network modeling for evolutionary network analysis, by Wenchao Yu, Charu Aggarwal and Wei Wang, Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM), pp. 455-464, 2017. ACEMAN: Automated Customer Driven Cellular Service Management (demo), by Ruirui Li, Xinxin Huang, Shuo Song, Jia Wang, and Wei Wang, Proceedings of the 22nd Annual International Conference on Mobile Computing and Networking (MobiCom), 2016. Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations, by Wei Cheng, Kai Zhang, Haifeng Chen, Guofei Jiang, Zhengzhang Chen, and Wei Wang, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(SIGKDD), pp. 805-814, 2016. (Best Research Paper Runner-Up) Robust Multi-Network Clustering via Joint Cross-Domain Cluster Alignment, by Rui Liu, Wei Cheng, Hanghang Tong, Wei Wang, and Xiang Zhang, Proceedings of the 15th IEEE International Conference on Data Mining (ICDM), pp. 291-300, 2015. Max-Intensity: Detecting Competitive Advertiser Communities in Sponsored Search Market, by Wenchao Yu, Ariyam Das, Justin Wood, Wei Wang, Carlo Zaniolo, and Ping Luo. Proceedings of the 15th IEEE International Conference on Data Mining (ICDM), pp. 569-578, 2015. HapColor: A Graph Coloring Framework for Polyploidy Phasing, by Sepideh Mazrouee and Wei Wang, Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 105-108, 2015. REAFUM: Representative approximate frequent subgraph mining, by Ruirui Li and Wei Wang, Proceedings of the 15th SIAM International Conference on Data Mining (SDM), pp. 757-765, 2015. FastHap: fast and accurate single individual haplotype reconstruction using fuzzy conflict graphs, by Sepideh Mazrouee and Wei Wang, Proceedings of the 13th European Conference on Computational Biology (ECCB), Special Issue of Bioinformatics, vol. 30, no. 17, pp. i371-378, 2014. PseudoLasso: leveraging read alignment in homologous region to correct pseudogene expression estimates via RNASeq, by Chelsea Ju, Zhuangtian Zhao, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 569-578, 2014. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level, by Zhaojun Zhang and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 30, no. 12, pp. i283-292, 2014. Graph Regularized Dual Lasso for Robust eQTL Mapping, by Wei Cheng, Xiang Zhang, Zhishan Guo, Yu Shi, and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 30, no. 12, pp. i139-148, 2014. Transforming genomes using MOD files with applications, by Shunping Huang, Chia-Yu Kao, Leonard McMillan, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 595-604, 2013. Read annotation pipeline for high-throughput sequencing data, by James Holt, Shunping Huang, Leonard McMillan, and Wei Wang, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 605-613, 2013. Flexible and robust co-regularized multi-domain graph clustering, by Wei Cheng, Xiang Zhang, Zhishan Guo, Yubao Wu, Patrick Sullivan, and Wei Wang, Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 320-328, 2013. GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference due to RNAseq reads misalignment, by Zhaojun Zhang, Shunping Huang, Jack Wang, Xiang Zhang, Fernando Pardo Manuel de Villena, Leonard McMillan, and Wei Wang, Proceedings of the 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 29, no. 13, pp. i291-299, 2013. Metric learning from relative comparisons by minimizing squared residual, by Eric Yi Liu, Zhishan Guo, Xiang Zhang, Vladimir Jojic, and Wei Wang, Proceedings of the 12th IEEE International Conference on Data Mining (ICDM), pp. 978-983, 2012. Hierarchical co-clustering based on entropy splitting, by Wei Cheng, Xiang Zhang, Feng Pan, and Wei Wang. Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), pp. 1472-1476, 2012. Inferring novel associations between SNP sets and gene sets in eQTL study using sparse graphical model, by Wei Cheng, Xiang Zhang, Wei Wang, Yubao Wu, Xiaolin Yin, Jing Li and David Heckerman. Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 466-472, 2012. Dual transfer learning, by Mingsheng Long, Jianmin Wang, Guiguang Ding, Wei Cheng, Xiang Zhang, and Wei Wang. Proceedings of the 12th SIAM International Conference on Data Mining (SDM), pp. 540-551, 2012. Measuring opinion relevance in latent topic space, by Wei Cheng, Xiaochuan Ni, Jian-Tao Sun, Xiaoming Jin, Hye-Chung Kum, Xiang Zhang, and Wei Wang, Proceedings of the IEEE International Conference on Social Computing (SocialCom), pp. 323-330, 2011. Clustering with relative constraints, by Eric Yi Liu, Zhaojun Zhang, and Wei Wang, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 947-955, 2011. LTS: Discriminative subgraph mining by learning from search history, by Ning Jin and Wei Wang, Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE), pp. 207-218, 2011. Genome-wide compatible SNP intervals and their properties, by Wang, Jeremy, Fernando Pardo-Manuel de Villena, Wei Wang, and Leonard McMillan, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 43-52, 2010. Gene set analysis using principal components, by Pakatci, Isa, Wei Wang, and Leonard McMillan, Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACMBCB), pp. 330-333, 2010. Efficient genome ancestry inference in complex pedigrees with inbreeding,by Eric Yi Liu, Qi Zhang, Leonard McMillan, Fernando Pardo-Manuel de Villena, and Wei Wang, Proceedings of the 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 26, no. 12, pp. 199-207, 2010. TEAM: Efficient two-locus epistasis tests in human genome-wide association study,by Xiang Zhang, Shunping Huang, Fei Zou, and Wei Wang,Proceedings of the 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Special Issue of Bioinformatics, vol. 26, no. 12, pp. 217-227, 2010. GAIA: Graph classification using evolutionary computation,by Ning Jin, Calvin Young, and Wei Wang. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 879-890, 2010. Graph classification based on pattern co-occurrence, by Ning Jin, Calvin Young, and Wei Wang. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 573-582, 2009. Split-order distance for clustering and classification hierarchies, by Qi Zhang, Eric Yi Liu, Abhishek Sarkar, and Wei Wang. Proceedings of the 21st International Conference on Scientific and Statistical Database Management (SSDBM), pp. 517-534, 2009. COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study, by Xiang Zhang, Feng Pan, Yuying Xie, Fei Zou, and Wei Wang. Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology (RECOMB), pp. 253-269, 2009. TreeQA: quantitative genome wide association mapping using local perfect phylogeny trees, by Feng Pan, Leonard McMillan, Fernando Pardo-Manuel de Villena, David Threadgill and Wei Wang. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 415-426, 2009. Inferring genome-wide mosaic structure, by Qi Zhang, Wei Wang, Leonard McMillan, Fernando Pardo-Manuel de Villena, and David Threadgill. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 150-161, 2009. FastChi: an efficient algorithm for analyzing gene-gene interactions, by Xiang Zhang, Fei Zou, and Wei Wang. Proceedings of the 14th Pacific Symposium on Biocomputing (PSB), pp. 528-539, 2009. Quantitative association analysis using tree hierarchies, by Feng Pan, Lynda Yang, Leonard McMillan, Fernando Pardo-Manuel de Villena, David Threadgill and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 971-976, 2008. Functional neighbors: relationships between non-homologous protein families inferred using family-specific fingerprints,by Deepak Bandyopadhyay, Luke Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha.Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 199-206, 2008. REDUS: finding reducible subspaces in high dimensional data,by Xiang Zhang, Feng Pan, and Wei Wang.Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM), pp. 961-970, 2008. Genotype sequence segmentation: handling constraints and noise, by Qi Zhang, Wei Wang, Leonard McMillan, Jan Prins, Fernando Pardo-Manuel de Villena, and David Threadgill.Proceedings of the 8th Workshop on Algorithms in Bioinformatics (WABI), pp. 271-283, 2008. Mining non-redundant high order correlations in binary data, by Xiang Zhang, Feng Pan, Wei Wang, and Andrew Nobel.Proceedings of the 34th International Conference on Very Large Data Bases (VLDB), pp. 1178-1188, 2008. FastANOVA: an efficient algorithm for genome-wide association study, by Xiang Zhang, Fei Zou, and Wei Wang.Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 821-829, 2008. (Best Research Paper) CRD: a general framework for fast co-clustering on large datasets utilizing sample-based matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang.Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 173-184, 2008. CARE: finding local linear correlations in high dimensional data, by Xiang Zhang, Feng Pan, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 130-139, 2008. (Best Student Paper) Mining approximate order preserving clusters in the presence of noise, by Mengsheng Zhang, Wei Wang, and Jinze Liu.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 160-168, 2008. Approximate clustering on distributed data streams, by Qi Zhang, Jinze Liu, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1131-1139, 2008. A general framework for fast co-clustering on large datasets using matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang.Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1337-1339, 2008. Sample selection for maximal diversity, by Feng Pan, Adam Roberts, Leonard McMillan, Fernando Pardo Manuel de Villena,David Threadgill, and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 262-271, 2007. Incremental subspace clustering over multiple data streams, by Qi Zhang, Jinze Liu,and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 727-732, 2007. Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows, by Adam Roberts, Leonard McMillan, Wei Wang, Joel Parker, Ivan Rusyn, and David Threadgill, Proceedings of the 15th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Bioinformatics, vol. 23, no. 13, pp. i401-i407, 2007. An efficient algorithm for mining coherent patterns from heterogeneous Microarrays, by Xiang Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 32, 2007. A fast algorithm for approximate quantiles in high speed data streams, by Qi Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 29, 2007. Mining RNA tertiary motifs with structure graphs, by Xueyi Wang, Jun Huan, Jack Snoeyink, and Wei Wang, Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 31, 2007. Intelligent sequential pattern mining via alignment --- optimization techniques for very large databases, by Hye-Chung Kum, Joong Hyuk Chang, and Wei Wang.Proceedings of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 587-597, 2007. On demand phenotype ranking through subspace clustering, by Xiang Zhang, Wei Wang, and Jun Huan.Proceedings of the 7th SIAM Conference on Data Mining (SDM), pp. 623-628, 2007. Poclustering: lossless clustering of dissimilarity data, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins.Proceedings of the 7th SIAM Conference on Data Mining (SDM), pp. 557-562, 2007. Graph database indexing using structured graph decomposition, by David Williams, Jun Huan, and Wei Wang.Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 976-985, 2007. Accelerating profile queries in elevation maps, by Feng Pan, Wei Wang, and Leonard McMillan.Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 76-85, 2007. Mining coherent patterns from heterogeneous microarray data, by Xiang Zhang, and Wei Wang.Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), pp. 838-839, 2006. Clustering pair-wise dissimilarity data intopartially ordered sets, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins.Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 637-642, 2006. Distance-based identification of spatial motifs in proteins using constrained frequent subgraph mining, by Jun Huan, Deepak Bandyopadhyay, Jan Prins,Jack Snoeyink, Alexander Tropsha, and Wei Wang.Proceedings of the LSS Computational Systems Bioinformatics Conference (CSB), pp. 227-238, 2006. A fast approximation to multidimensional scaling, by Tynia Yang, Jinze Liu, Leonard McMillan, and Wei Wang.Proceedings of the ECCV Workshop on Computation Intensive Methods for Computer Vision (CIMCV), 2006. Mining Approximate frequent itemset in the presence of noise: algorithm and analysis, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 6th SIAM Conference on Data Mining (SDM), pp. 405-416, 2006. Mining shifting-and-scaling co-regulation patterns on gene expression profiles, by Xin Xu, Anthony K. H. Tung, Ying Lu, and Wei Wang.Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE), pp. 89 (10 pages), 2006. Human motion estimation from a reduced marker set, by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan.Proceedings of the Symposium on Interactive 3D Graphics and Games (SI3D), pp. 35-42, 2006. Finding representative set from massive data, by Feng Pan, Wei Wang, Anthony K. H. Tung, and Jiong Yang.Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 338-345, 2005. Mining approximate frequent itemset from noisy data, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 721-724, 2005. Rapid determination of local structural features common to a set of proteins (demo), by Jun Huan, Deepak Bandyopadhyay, Jinze Liu, Jan Prins, Jack Snoeyink, Alexander Tropsha, and Wei Wang. Proceedings of the 13th International Conference on Intelligent Systems for Molecular Biology (ISMB), 2005. A system for analyzing and indexing human motion databases (demo), by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 924-926, 2005. Revealing true subspace clusters in high dimensions, by Jinze Liu, Karl Strohmaier, and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 463-466, 2004. AGILE: a general approach to detect transitions in evolving data streams, by Jiong Yang and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 559-562, 2004. A framework for ontology-driven subspace clustering,by Jinze Liu, Wei Wang, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 623-628, 2004. SPIN: Mining maximal frequent subgraphs from graph databases,by Jun Huan, Wei Wang, Jan Prins, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 581-586, 2004. Gene ontology friendly biclustering of expression profiles,by Jinze Liu, Jiong Yang, and Wei Wang.Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 436-447, 2004. Biclustering of gene expression data by tendency,by Jinze Liu, Jiong Yang, and Wei Wang.Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 182-193, 2004. BASS: approximate search on large string databases,by Jiong Yang, Wei Wang, and Philip Yu.Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 181-192, 2004. Fast computation of database operations using graphics processors,by Naga Govindaraju, Brandon Lloyd, Wei Wang, Ming Lin, and Dinesh Manocha.Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 215-226, 2004. Understanding social welfare service patterns using sequential analysis, by Hye-Chung Kum, Dean Duncan, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004. Successfully adopting IT for social welfare program management, by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004. Successfully adopting IT for social welfare program management (demo), by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004. Mining spatial motifs from protein structure graphs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alex Tropsha. Proceedings of the 8th Annual International Conference on Research in Computational Molecular Biology (RECOMB), pp. 308-315, 2004. Accurate classification of protein structural families using coherent subgraph analysis, by Jun Huan, Wei Wang, Anglina Washington, Jan Prins, Ruchir Shah, and Alex Tropsha. Proceedings of the Pacific Symposium on Biocomputing (PSB), pp. 411-422, 2004. OP-Cluster: clustering by tendency in high dimensional space, by Jinze Liu and Wei Wang. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 187-194, 2003. Efficient mining of frequent subgraph in the presence of isomorphism, by Jun Huan, Wei Wang, and Jan Prins. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 549-552, 2003. Discovering compact and highly discriminative features or feature combinations of drug activities using support vector machines, by Hwanjo Yu, Jiong Yang, Wei Wang, and Jiawei Han. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 220-228, 2003. Reconstructing of ancestral gene order after segmental duplication and gene loss, by Jun Huan, Jan Prins, Wei Wang, and Todd Vision. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 484-485, 2003. Social welfare program administration and evaluation and policy analysis using knowledge discovery and data mining (KDD) on administrative data, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 39-44, 2003. Management assistance for Work First via a dynamic website, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 296, 2003. STAMP: discovery of statistically important pattern repeats in a long sequence, by Jiong Yang, Wei Wang, and Philip Yu. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 224-238, 2003. ApproxMAP: approximate mining of consensus sequential patterns, by Hye-Chung Kum, Jian Pei, Wei Wang, and Dean Duncan. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 311-315, 2003. Enhanced biclustering on gene expression data,by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu. Proceedings of the 3rd IEEE Conference on Bioinformatics and Bioengineering (BIBE), pp. 321-327, 2003. CLUSEQ: efficient and effective sequence clustering, by Jiong Yang and Wei Wang, Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE), pp. 101-112, 2003. InfoMiner+: mining partial periodic patterns with gap penalties,by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM), pp. 725-728, 2002. Comparative study of sequential pattern mining frameworks --- support framework vs. multiple alignment framework,by Hye-Chung Kum, Susan Paulsen, and Wei Wang, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM) Workshopon the Foundation of Data Mining and Discovery, pp. 43-70, 2002. Towards automatic clustering of protein sequences,by Jiong Yang and Wei Wang, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 175-186, 2002. Accelerating approximate subsequence search on large protein sequence databases, by Jiong Yang, Wei Wang, Yi Xia, and Philip Yu, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 207-218, 2002. Mining long sequential patterns in a noisy environment, by Jiong Yang, Wei Wang, Philip Yu, and Jiawei Han, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 406-417, 2002. Clustering by pattern similarity in large data sets, by Haixun Wang, Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 394-405, 2002. Accelerating search of approximate match on large protein sequence databases, by Wei Wang, Jiong Yang, Yi Xia, and Philip Yu.Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster) Improving performance of bicluster discoveryin a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu. Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster) A framework towards efficient and effective proteinclustering, by Wei Wang and Jiong Yang. Proceedings of the 6th ACM International Conference on Research inComputational Molecular Biology (RECOMB), 2002. (poster) Efficient filtering of large data sets --- a user-centric paradigm, by Yi Xia, Wei Wang, Jiong Yang, Philip Yu, and Richard Muntz. Proceedings of the 2nd SIAM International Conference on Data Mining (SDM), pp. 112-127, 2002. Delta-cluster: capturing subspace correlation in a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 517-528, 2002. A framework towards efficient and effective sequence clustering, by Wei Wang and Jiong Yang, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 282, 2002. Meta-patterns: revealing hidden periodical patterns, by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 1st IEEE International Conference on Data Mining (ICDM), pp. 550-557, 2001. Info-miner: mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 395-400, 2001. TAR: temporal association rules on evolving numerical attributes, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 17th IEEE International Conference on Data Engineering (ICDE), pp. 283-292, 2001. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 275-279, 2000. Efficient mining weighted association rules (WAR), by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 270-274, 2000. Collaborative web caching based on proxy affinities, by Jiong Yang, Wei Wang, and Richard Muntz, Proceedings of the 19th ACM SIGMETRICS Conference on the Measurement and Modeling of Computer Systems (SIGMETRICS), pp. 78-89, 2000. Dynamic adaptive file management in a local area network, by Jiong Yang, Wei Wang, Richard Muntz, and Silvia Nittel, Proceedings of the 20th IEEE International Conference on Distributed Computer Systems (ICDCS), pp. 368-375, 2000. STING+: an approach to active spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 15th IEEE International Conference on Data Engineering (ICDE), pp. 116-125, 1999. (Invited to the "Best Papers of ICDE 1999" Special Issue of IEEE Transactions on Knowledge and Data Engineering) PK-tree: a spatial index structure for high dimensional point data, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 5th International Conference on Foundations of Data Organization (FODO), pp. 27-36, 1998. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, Proceedings of the 8th International Workshop on Persistent Object Systems (POS8) , 1998. STING: a statistical information grid approach to spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB), pp. 186-195, 1997. Performance analysis of several algorithms for processing joins between textual attributes, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, Proceedings of the 12th IEEE International Conference on Data Engineering (ICDE), pp. 636-644, 1996. On fuzzy database systems, by Wei Wang and George Klir, Proceedings of the 5th IEEE Annual Dual-use Technologies & Applications Conference, pp. 330-335, 1995. The absolute continuity of fuzzy measures, Proceedings of International Joint Conference of the Fourth IEEE International Conference of Fuzzy Systems and Second International Fuzzy Engineering Symposium (FUZZ-IEEE/IFES'95), Japan, pp. 131-136, 1995. Determining fuzzy measures by Choquet integral, Proceedings of ISUMA-NAFIPS'95, pp. 724-727, 1995.
ARTICLES IN REFEREED JOURNALS
	Designing metamaterials with programmable nonlinear responses and geometric constraints in graph space, by Marco Maurizi, Derek Xu, Yu-Tong Wang, Desheng Yao, David Hahn, Mourad Oudich, Anish Satpati, Mathieu Bauchy, Wei Wang, Yizhou Sun, Yun Jing, and Xiaoyu Rayne Zheng. Nature Machine Intelligence, 2025. Neural Network-Assisted Personalized Handwriting Analysis for ParkinsonÕs Disease Diagnostics, by Guorui Chen, Trinny Tat, Yihao Zhou, Zhaoqi Duan, Junkai Zhang, Kamryn Scott, Xun Zhao, Zeyang Liu, Wei Wang, Song Li, Katy A. Cross, and Jun Chen, Nature Chemical Engineering, 2025. Towards LifeSpan Cognitive Systems, by Yu Wang, Chi Han, Tongtong Wu, Xiaoxin He, Wangchunshu Zhou, Nafis Sadeq, Xiusi Chen, Zexue He, Wei Wang, Gholamreza Haffari, Heng Ji, Julian McAuley, Transactions on Machine Learning Research, 2025. How the experience of California wildfires shape Twitter climate change framings, by Jessie W. Y. Ko, Shengquan Ni, Alexander Taylor, Xiusi Chen, Yicong Huang, Avinash Kumar, Sadeem Alsudais, Zuozhi Wang, Xiaozhen Liu, Wei Wang, Chen Li, and Suellen Hopfer, Climatic Change, Vol. 177, no. 17, 2024. Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models, by Baradwaj Simha Sankar, Destiny Gilliland, Jack Rincon, Henning Hermjakob, Yu Yan, Irsyad Adam, Gwyneth Lemaster, Dean Wang, Karol Watson, Alex Bui, Wei Wang, and Peipei Ping, Bioengineering, vol. 11, no. 10, pp. 984, 2024. Missing Values in Longitudinal Proteome Dynamics Studies: Making a Case for Data Multiple Imputation, by Yu Yan, Baradwaj S Sankar, Bilal Mirza, Dominic Ng, Alexander Pelletier, Sarah Huang, Wei Wang, Karol Watson, Ding Wang, Peipei Ping, Journal of Proteome Research, 2024. A Survey on Self-Supervised Learning for Non-Sequential Tabular Data, by Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, and Wen-Chih Peng, Asian Conference on Machine Learning (ACML) Journal Track, 2024. Learning Molecular Dynamics: Predicting the Dynamics of Glasses by a Machine Learning Simulator, by Han Liu, Zijie Huang, Samuel S. Schoenholz, Ekin D. Cubuk, Morten M. Smedskjaer, Yizhou Sun, Wei Wang, and Mathieu Bauchy, Materials Horizons, vol. 10, no. 9, 2023. MIND-S: A Novel Deep Learning Prediction Model for Elucidating Protein PTMs in Human Diseases, by Yu Yan Jyun-Yu Jiang, Mingzhou Fu, Ding Wang, Alexander Pelletier, Dibakar Sigdel, Dominic Ng, Wei Wang, and Peipei Ping, Cell Reports Methods, Vol.3, no. 3, pp. 100430, 2023. Multivariate time-series classification with hierarchical variational graph pooling, by Ziheng Duan, Haoyan Xu, Yueyang Wang, Yida Huang, Anni Ren, Zhongbin Xu, Yizhou Sun, and Wei Wang, Neural Networks, Vol. 154, pp. 481-490, 2022. Knowledge Source Rankings for Semi-supervised Topic Modeling, by Justin Wood, Corey Arnold, and Wei Wang, Information, vol. 13, no. 2, pp. 57, 2022. TahcoRoll: Fast Genomic Signature Profiling via Thinned Automaton and Rolling Hash, by Chelsea J.-T. Ju, Jyun-Yu Jiang, Ruirui Li, Zeyu Li, and Wei Wang, Medical Review, vol.1, no. 2, pp. 114-125, 2022. COVID-19 Surveiller: Toward a Robust and Effective Pandemic Surveillance System based on Social Media Mining, by Jyun-Yu Jiang, Yichao Zhou, Xiusi Chen, Yan-Ru Jhou, Liqi Zhao, Sabrina Liu, Po-Chun Yang, Jule Ahmar, and Wei Wang, Philosophical Transactions A, vol. 380, issue 2214, 2021. Experiment Selection in Meta-Analytic Piecemeal Causal Discovery, by Nicholas J. Matiasz, Justin Wood, Wei Wang, Alcino J. Silva, and William Hsu, IEEE Access, vol. 9, pp. 97929-97941, 2021. A second look at FAIR in Proteomic Investigations, by J. Harry Caufield, John Fu, Ding Wang, Vladimir Guevara-Gonzalez, Wei Wang, Peipei Ping, Journal of Proteome Research, vol. 20, no. 5, pp. 2182-2186, 2021. SEIZE: Runtime Inspection for Parallel Dataflow Systems, by Youfu Li, Matteo Interlandi, Fotis Psallidas, Wei Wang, and Carlo Zaniolo, IEEE Transactions on Parallel and Distributed Systems, vol. 32, no. 4, pp. 842-851, 2021. Identifying temporal molecular signatures underlying cardiovascular diseases: A data science platform, by Neo Christopher Chung, Howard Choi, Ding Wang, Bilal Mirza, Alexander R Pelletier, Dibakar Sigdel, Wei Wang, Peipei Ping, Journal of Molecular and Cellular Cardiology, vol. 145, pp. 54-58, 2020. Random forest machine learning algorithm predicts virologic treatment adherence, by Susan Kamal, John Urata, Matthias Cavassini, Honghu Liu, Roger Kouyos, Oliver Bugnon, Wei Wang, and Marie-Paule Schneider, AIDS Care, vol. 33, no. 4, pp. 530-536, 2020. Mutation effect estimation on proteinÐprotein interactions using deep contextualized representation learning, by Guangyu Zhou, Muhao Chen, Chelsea JT Ju, Zheng Wang, Jyun-Yu Jiang, Wei Wang, NAR Genomics and Bioinformatics, vol. 2, no. 2, 2020. Measuring Time-Sensitive and Topic-Specific Influence in Social Networks with LSTM and Self-Attention, by Cheng Zheng, Qin Zhang, Guodong Long, Chengqi Zhang, Sean D Young, Wei Wang, IEEE Access, vol. 8, pp. 82481-82492, 2020. Integrated Machine Learning Approach to Identify Metabolome Fingerprints of Pathological Stages Following Heart Failure Treatment, by Howard Choi, Bilal Mirza, David A Liem, Ding Wang, Mario C Deng, Wei Wang, Peipei Ping, The Federation of American Societies for Experimental Biology Journal, vol. 34, no. S1, 2020. Memory-Based Random Walk for Multi-Query Local Community Detection, by Yuchen Bian, Dongsheng Luo, Yaowei Yan, Wei Cheng, Wei Wang, and Xiang Zhang. Knowledge and Information Systems (KAIS), vol. 62, pp. 2067-2101, 2020. PolyCluster: Minimum Fragment Disagreement Clustering for Polyploid Phasing, by Sepideh Mazrouee and Wei Wang, IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 17, no. 1, pp. 264-277, 2020. Heterogeneous network embedding enabling accurate disease association predictions, by Yun Xiong, Mengjie Guo, Lu Ruan, Xiangnan Kong, Chunlei Tang, Yangyong Zhu, Wei Wang, BMC Medical Genomics, vol. 12, no. 10, pp. 186, 2019. De novo Nanopore read quality improvement using deep learning, by Nathan LaPierre, Rob Egan, Wei Wang, Zhong Wang, BMC Bioinformatics, vol. 20, no. 1, pp. 552, 2019. Prediction of microbial communities for urban metagenomics using neural network approach, by Guangyu Zhou, Jyun-Yu Jiang, Chelsea Ju, and Wei Wang, BMC Human Genomics, vol. 13, no. 1, pp. 47, 2019. MetaPheno: A Critical Evaluation of Deep Learning and Machine Learning in Metagenome-Based Disease Prediction, by Nathan LaPierre, Chelsea J.-T. Ju, Guangyu Zhou, and Wei Wang. Special Issue on Deep Learning in Bioinformatics, METHODS, vol. 166, pp. 74-82, 2019. Unsupervised Classification of Multi-Omics Data during Cardiac Remodeling using Deep Learning, by Neo Chung, Bilal Mirza, Howard Choi, Jie Wang, Ding Wang, Peipei Ping, and Wei Wang. Special Issue on Deep Learning in Bioinformatics, METHODS, vol. 166, pp. 66-73, 2019. Crowdsourced Traffic Data as an Emerging Tool to Monitor Car Crashes, by Sean D. Young, Wei Wang, and Bharath Chakravarthy. JAMA Surgery, vol. 154, no. 8, pp. 777-778, 2019. Modeling Smart Wristband Sleep Data to Classify Undergraduates' Sleep Deprived Tweets, by Sara Melvin, Amanda Jamal, Kaitlyn Hill, Wei Wang, and Sean Young. JMIR Mental Health, vol. 6, no. 12, pp. e13076, 2019. Machine Learning and Integrative Analysis of Biomedical Big Data, by Bilal Mirza, Wei Wang, Jie Wang, Howard Choi, Neo Christopher Chung, and Peipei Ping. Genes, vol. 10, no. 2, pii. E87, doi:10.3390/genes10020087, 2019. Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications, by Dibakar Sigdel, Vincent Kyi, Aiden Zhang, Shaun P. Setty, David A. Liem, Yu Shi, Xuan Wang, Jiaming Shen, Wei Wang, Jiawei Han, and Peipei Ping. Journal of Visualized Experiments, vol. 144, e59108, doi: 10.3791/59108, 2019. A reference set of curated biomedical data and metadata from clinical case reports, by J. Harry Caufield, Yijiang Zhou, Anders Garlid, Shaun Setty, David Liem, Quan Cao, Jessica Lee, Sanjana Murali, Sarah Spendlove, Wei Wang, Li Zhang, Yizhou Sun, Alex Bui, Henning Hermjakob, Karol Watson, and Peipei Ping, Nature Scientific Data, vol. 5, article 180258, 2018. A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts, by J Harry Caufield, David A Liem, Anders O Garlid, Yijiang Zhou, Karol Watson, Alex A T Bui, Wei Wang, and Peipei Ping. Journal of Visualized Experiments, vol. 139, e58392, doi:10.3791/58392, 2018. Approximating isotope distributions of biomolecule fragments, by Dennis Goldfarb, Michael Lafferty, Laura Herring, Wei Wang, and Michael Major, ACS Omega, vol. 3, no. 9, pp. 11383-11391, 2018. ResearchMaps.org for integrating and planning research, by Nicholas J. Matiasz, Justin Wood, Pranay Doshi, William Speier, Barry Beckemeyer, Wei Wang, William Hsu, and Alcino J. Silva, PLOS ONE, vol. 13, no. 5, 2018. PMC5933701 Phrase Mining of Textual Data to Analyze Extracellular Matrix Protein Patterns Across Cardiovascular Disease, by David Liem, Sanjana Murali, Dibakar Sigdel, Yu Shi, Xuan Wang, Jiaming Shen, Howard Choi, J Caufield, Wei Wang, Peipei Ping, and Jiawei Han, AJP-Heart and Circulatory Physiology, vol. 315, no. 4, pp. H910-H924, 2018. A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection, by Weibo Wang, Wei Sun, Wei Wang, and Jin Szatkiewicz, BMC Bioinformatics, vol. 19, no. 74, doi.org/10.1186/s12859-018-2077-6, 2018. Biomedical Informatics on the Cloud: A Treasure Hunt for Advancing Cardiovascular Medicine, by Peipei Ping, Henning Hermjakob, Jennifer Polson, Panagiotis Benos, and Wei Wang. Circulation Research, vol. 122, no. 9, pp. 1290-1301, 2018. Robust Framework on Multi-Network Clustering via Joint Cross-Domain Cluster Alignment, by Rui Liu, Wei Cheng, Hanghang Tong, Wei Wang, and Xiang Zhang. Knowledge and Information Systems (KAIS), 2017. Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data, by Majed Alzahrani, Hiroyuki Kuwahara, Wei Wang, and Xin Gao, Bioinformatics, vol. 33, no. 16, pp. 2523-2531, 2017. Computer-aided experiment planning toward causal discovery in neuroscience, by Nicholas J. Matiasz, Justin Wood, Wei Wang, Alcino J. Silva, William Hsu, Frontiers in Neuroinformatics, vol. 11, no. 12, 2017. doi: 10.3389/fninf.2017.00012 Ranking causal anomalies for system fault diagnosis via temporal and dynamical analysis on vanishing correlations, by Wei Cheng, Jingchao Ni, Kai Zhang, Haifeng Chen, Guofei Jiang, Yu Shi, Xiang Zhang, and Wei Wang, ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 11, no. 4, pp. 40:1-40:28, 2017. (Best Papers of KDD 2016) Efficient approach to correct read alignment for pseudogene abundance estimates, by Chelsea Ju, Zhuangtian Zhao, and Wei Wang. IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 14, no. 3, pp. 522-533, 2017. Toward Automating HIV Identification: Machine Learning for Rapid Identification of HIV-Related Social Media Data, by Sean Young, Wenchao Yu, and Wei Wang. Journal of Acquired Immune Deficiency Syndromes (1999). 2017 February 1;74 Suppl 2:S128-S131. PubMed PMID:28079723; PubMed Central PMCID: PMC5234853. Introduction to the Special Issue of Best Papers in ACM SIGKDD 2014, by Wei Wang and Jure Leskovec. ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 10, no. 4, pp. 33:1-33:2, 2016. Sparse regression models for unraveling group and individual associations in eQTL mapping, by Wei Cheng, Yu Shi, Xiang Zhang, and Wei Wang. BMC Bioinformatics, vol. 17, pp. 136 (11 pages), 2016. CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering, by Wei Cheng, Zhishan Guo, Xiang Zhang, and Wei Wang. ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 10, no. 4, pp. 46:1-46:27, 2016. HICC: An Entropy Splitting Based Framework for Hierarchical Co-Clustering, by Wei Cheng, Xiang Zhang, Feng Pan, and Wei Wang. Knowledge and Information Systems (KAIS), vol. 46, no. 2, pp. 343-367, 2016. MSAcquisitionSimulator: data-dependent acquisition simulator for LC-MS shotgun proteomics, by Dennis Goldfarb, Wei Wang, and Michael B. Major, Bioinformatics, vol. 32, no. 8, pp. 1269-1271, 2016. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing, by WeiBo Wang, Wei Wang, Wei Sun, James J. Crowley and Jin P. Szatkiewicz, Nucleic Acids Research, vol. 43, no. 14, pp. e90 (18 pages), 2015. doi: 10.1093/nar/gkv319 Fast and Robust Group-Wise eQTL Mapping Using Sparse Graphical Models, by Wei Cheng, Yu Shi, Xiang Zhang, and Wei Wang. BMC Bioinformatics, vol. 16, no. 2, pp. 1-16, 2015. DOI 10.1186/s12859-014-0421-z Analyses of Allele-Specific Gene Expression in Highly Divergent Mouse Crosses, by James J Crowley, Vasyl Zhabotynsky, Wei Sun, Shunping Huang, Isa Kemal Pakatci, Yunjung Kim, Jeremy R Wang, Andrew P Morgan, John D Calaway, David L Aylor, Zaining Yun, Timothy A Bell, Ryan J Buus, Mark E Calaway, John P Didion, Terry J Gooch, Stephanie D Hansen, Nashiya N Robinson, Ginger D Shaw, Jason S Spence, Corey R Quackenbush, Cordelia J Barrick, Randal J. Nonneman, Yuying Xie, William Valdar, Alan B Lenarcic, Wei Wang, Catherine E Welsh, Chen-Ping Fu, Zhaojun Zhang, James Holt, Zhishan Guo, David W Threadgill, Lisa M Tarantino, Darla R Miller, Fei Zou, Leonard McMillan, Patrick F Sullivan, and Fernando Pardo-Manuel de Villena. Nature Genetics, vol. 47, no. 4, pp. 353-360, 2015. Spotlite: An Improved Algorithm and Web Application for Predicting Co-Complexed Proteins from Affinity Purification - Mass Spectrometry Data, by Dennis Goldfarb, Bridgid Hast, Wei Wang, and Michael B. Major. Journal of Proteomics Research, vol. 13, no. 12, pp. 5944-5955, 2014. A Novel Multi-Alignment Pipeline for High-Throughput Sequencing Data, by Shunping Huang, James Holt, Chia-Yu Kao, Leonard McMillan, and Wei Wang. Database, 2014. Starting at the ends: high-resolution sex-specific linkage maps of the mouse reveal polarized distribution of recombination in male germline, by Eric Yi Liu, Andrew P Morgan, Elissa J Chesler, Wei Wang, Gary A Churchill, Fernando Pardo-Manuel de Villena, Genetics, vol. 197, no. 1, pp. 91-106, 2014. Bayesian modeling of haplotype effects in multiparent populations, by Zhaojun Zhang, Wei Wang, and William Valdar. Genetics, vol. 198. No. 1, pp. 139-156, 2014. Total Orderings Defined on the Set of All Fuzzy Numbers, by Wei Wang and Zhenyuan Wang, Fuzzy Sets and Systems, vol. 243, 131-141, 2014. Heritability and genomics of gene expression in peripheral blood, by Fred A Wright, Patrick F Sullivan, Andrew I Brooks, Fei Zou, Wei Sun, Kai Xia, Vered Madar, Rick Jansen, Wonil Chung, Yi-Hui Zhou, Abdel Abdellaoui, Sandra Batista, Casey Butler, Guanhua Chen, Ting-Huei Chen, David D'Ambrosio, Paul Gallins, Min Jin Ha, Jouke Jan Hottenga, Shunping Huang, Mathijs Kattenberg, Jaspreet Kochar, Christel M Middeldorp, Ani Qu, Andrey Shabalin, Jay Tischfield, Laura Todd, Jung-Ying Tzeng, Gerard van Grootheest, Jacqueline M Vink, Qi Wang, Wei Wang, Weibo Wang, Gonneke Willemsen, Johannes H Smit, Eco J de Geus, Zhaoyu Yin, Brenda W J H Penninx, and Dorret I Boomsma. Nature Genetics, vol. 46, pp. 430-437, 2014. Using the Emerging Collaborative Cross to Probe the Immune System, by Jason Phillippi, Yuying Xie, Darla R Miller, Timothy A Bell, Zhaojun Zhang, Alan B Lenarcic, David L Aylor, S Harsha Krovi, David W Threadgill, Fernando Pardo-Manuel de Villena, Wei Wang, William Valdar, and Jeffrey A Frelinger, Genes and Immunology, vol. 15, no. 1, pp. 38-46, 2014. Searching Dimension Incomplete Databases, by Wei Cheng, Xiaoming Jin, Jian-Tao Sun, Xuemin Lin, Xiang Zhang, and Wei Wang, IEEE Transactions on Data Engineering (TKDE), vol. 26, no. 3, pp. 725-738, 2014. Improving detection of copy number variation by simultaneous bias correction and read-depth segmentation, by Jin Szatkiewicz, Weibo Wang, Patrick Sullivan, Wei Wang, and Wei Sun, Nucleic Acids Research, vol. 41, no. 3, pp. 1519-1532, 2013. MaCH-Admix: genotype imputation for admixed populations, by Eric Yi Liu, Mingyao Li, Wei Wang, and Yun Li. Genetic Epidemiology, vol. 37, no. 1, pp. 25-37, 2013. Mining Genome-Wide Genetic Markers, by Xiang Zhang, Shunping Huang, Zhaojun Zhang, Wei Wang. PLOS Computation Biology, vol. 8, no. 12, e1002828, 2012. Learning transcriptional regulatory relationships using sparse graphical models, by Xiang Zhang, Wei Cheng, Jennifer Listgarten, Carl Kadie, Shunping Huang, Wei Wang, and David Heckerman, PLoS ONE, vol. 7, no. 5, e357622012, 2012. Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women′s Health Initiative, by Eric Yi Liu, Steven Buyske, Aaron K. Aragaki, Ulrike Peters, Eric Boerwinkle, Chris Carlson, Cara Carty, Dana C. Crawford, Jeff Haessler, Lucia A. Hindorff, Loic Le Marchand, Teri A. Manolio, Tara Matise, Wei Wang, Charles Kooperberg, Kari E. North, and Yun Li, Genetic Epidemiology, vol. 36, no. 2, pp. 107-117, 2012. Rapid and robust resampling-based multiple testing correction with application in genome-wide eQTL study, by Xiang Zhang, Shunping Huang, Wei Sun, and Wei Wang. Genetics, vol. 190, no. 4, pp. 1511-1520, 2012. Genome-wide association mapping of loci for antipsychotic-induced extrapyramidal symptoms in mice, by James J Crowley, Yunjung Kim, Jin Peng Szatkiewicz, Amanda L Pratt, Corey R Quackenbush, Daniel E Adkins, Edwin van den Oord, Molly A Bogue, Hyuna Yang, Wei Wang, David W Threadgill, Fernando Pardo-Manuel de Villena, Howard L McLeod, and Patrick F Sullivan. Mammalian Genome, vol. 23, no. 5-6, pp. 322-335, 2012. The genome architecture of the Collaborative Cross mouse genetic reference population, by Collaborative Cross Consortium, Genetics, vol. 190, no. 2, pp. 389-401, 2012. HTreeQA: using semi-perfect phylogeny trees in quantitative trait loci study on genotype data, by Zhaojun Zhang, Xiang Zhang, and Wei Wang. G3: Genes, Genomes, Genetics, vol. 2, no. 2, pp. 175-189, 2012. seeQTL: a searchable database for human eQTLs, by Kai Xia, Andrey A Shabalin, Shunping Huang, Vered Madar, Yi-Hui Zhou, Wei Wang, Fei Zou, Wei Sun, Patrick F Sullivan, Fred A Wright. Bioinformatics, vol. 28, no. 3, pp. 451-452, 2012. Classification of mouse sperm motility patterns using an automated multiclass support vector machines mode, by Summer G Goodson, Zhaojun Zhang, James K Tsuruta, Wei Wang, and Deborah A O'Brien. Biology of Reproduction, vol. 84, no. 6, pp. 1207-1215, 2011. Genetic analysis of complex traits in the emerging collaborative cross, by Aylor DL, Valdar W, Foulds-Mathes W, Buus RJ, Verdugo RA, Baric RS, Ferris MT, Frelinger JA, Heise M, Frieman MB, Gralinski LE, Bell TA, Didion JD, Hua K, Nehrenberg DL, Powell CL, Steigerwalt J, Xie Y, Kelada SN, Collins FS, Yang IV, Schwartz DA, Branstetter LA, Chesler EJ, Miller DR, Spence J, Liu EY, McMillan L, Sarkar A, Wang J, Wang W, Zhang Q, Broman KW, Korstanje R, Durrant C, Mott R, Iraqi FA, Pomp D, Threadgill D, Pardo-Manuel de Villena F, Churchill GA. Genome Research, vol. 21, pp. 1213-1222, 2011. Tools for efficient epistasis detection in genome-wide association study, by Xiang Zhang, Shunping Huang, Fei Zou, and Wei Wang. Source Code for Biology and Medicine, vol. 6, no. 1, pp. 1-3, 2011. COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study,by Xiang Zhang, Feng Pan, Yuying Xie, Fei Zou, and Wei Wang.Journal of Computational Biology (JCB), vol. 17, no. 3, pp. 401-415, 2010. Functional neighbors: Relationships between non-homologous protein families inferred using family-Specific fingerprints, by Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, IEEE Transaction on Information Technology in Biomedicine, vol. 14, no. 5, pp. 1137-1143, 2010. Discriminative subgraph mining for protein classification, by Ning Jin, Calvin Young, Wei Wang, International Journal of Knowledge Discovery in Bioinformatics (IJKDB), vol. 1, no. 3, pp. 36-52, 2010. Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development, by Deepak Bandyopadhyay, Jun Huan, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, Journal of Computer-Aided Molecular Design, vol. 23, no. 11, pp. 773-784, 2009. Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications, by Deepak Bandyopadhyay, Jun Huan, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha, Journal of Computer-Aided Molecular Design, vol. 23, no. 11, pp. 785-797, 2009. Efficient algorithms for genome-wide association study,by Xiang Zhang, Fei Zou, and Wei Wang.ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 3, issue. 4, no. 19, 2009. The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics, by Adam Roberts, Fernando Pardo-Manuel de Villena, Wei Wang, Leonard McMillan, and David Threadgill, Mammalian Genome, vol. 18, no. 6, pp. 473-481, 2007. Benchmarking the effectiveness of sequential pattern mining methods, by Hye-Chung Kum, J. H. Chang, and Wei Wang, Data and Knowledge Engineering, vol. 60, no. 1, pp. 30-50, 2007. Structure-based function inference using protein family-specific fingerprints, by Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha. Protein Science, vol. 15, pp. 1537-1543, 2006. Sequential pattern mining in multi-databases via multiple alignment, by Hye-Chung Kum, Joong-Hyuk Chang, and Wei Wang, Data Mining and Knowledge Discovery (DMKD), vol. 12, no. 2-3, pp. 151-180, 2006. Comparing graph representations of protein structure for mining family-specific residue-based packing motifs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alexander Tropsha. Journal of Computational Biology (JCB), vol. 12, no. 6, pp. 657-671, 2005. Guest editors' introduction: special issue on mining biological data, by Wei Wang and Jiong Yang, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 17, no. 8, pp. 1019-1020, 2005. An improved biclustering method for analyzing gene expression profiles, by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu, International Journal on Artificial Intelligence Tools (IJAIT), vol. 14, no. 5, pp. 771-789, 2005. Mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Data Mining and Knowledge Discovery (DMKD), vol. 9, no. 2, pp. 189-216, 2004. Discovering high order periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 3, pp. 243-268, 2004. WAR: weighted association rules for item intensities, by Wei Wang, Jiong Yang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 2, pp. 203-229, 2004. Recent progress on selected topics in database research: a report from nine young Chinese researchers working in the United States (invited paper), by Zhiyuan Chen, Chen Li, Jian Pei, Yufei Tao, Haixun Wang, Wei Wang, Jiong Yang, Jun Yang, and Donghui Zhang. Journal of Computer Science and Technology, vol. 18, no. 5, pp. 538 – 552, 2003. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, no. 3, pp. 613-628, 2003. Mining patterns in long sequential data with noise, by Wei Wang, Jiong Yang, and Philip Yu, ACM SIGKDD Explorations, vol. 2, no. 2, pp. 28-33, 2000. An approach to active spatial data mining based on statistical information, by Wei Wang, Jiong Yang, and Richard Muntz, IEEE Transactions on Knowledge and Data Engineering, Special Issue on Best Papers in the 15th IEEE International Conference on Data Engineering, vol. 12, no. 5, pp. 715-728, 2000. Dynamo: design, implementation, and evaluation of cooperative persistent object management in a local area network, by Jiong Yang, Wei Wang, Silvia Nittel, Richard Muntz, and Vince Busam, Software - Practice and Experience, vol. 30, no. 4, pp. 419-448, 2000. Performance analysis of three text-join algorithms, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, IEEE Transactions on Knowledge and Data Engineering, vol. 10, no. 3, pp. 477-492, 1998. Genetic algorithms for determining fuzzy measures from data, by Wei Wang, Zhenyuan Wang, and George J. Klir,Journal of Intelligent & Fuzzy Systems, vol. 6, no. 2, pp. 171-183, 1998. Monotone set functions defined by Choquet integral, by Zhenyuan Wang, George J. Klir, and Wei Wang, Fuzzy Sets and Systems, vol. 81, pp. 241-252, 1996. Fuzzy measures defined by fuzzy integral and their absolute continuity, by Zhenyuan Wang, George J. Klir, and Wei Wang, Journal of Mathematical Analysis and Application, vol. 203, pp. 150-165, 1996. Constructing fuzzy measures by transformations, by George J. Klir, Zhenyuan Wang, and Wei Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 1, pp. 207-215, 1996. Constructing fuzzy measures by rational transformations, by Wei Wang, George J. Klir, and Zhenyuan Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 3, pp. 665-675, 1996. Pan-integrals with respect to imprecise probabilities, by Zhenyuan Wang, Wei Wang, and George J. Klir, International Journal of General Systems, vol. 25, no. 3, pp. 229-243, 1996.
BOOK CHAPTERS
	Sparse regression models for unraveling group and individual associations in eQTL mapping, by Wei Cheng, Xiang Zhang, Wei Wang, eQTL Analysis, pp. 105-121, 2020. Robust Methods for Expression Quantitative Trait Loci Mapping, by Wei Cheng, Xiang Zhang, and Wei Wang, Big Data Analytics in Genomics. Springer, pp. 25-88, 2016. Mining Discriminative Subgraph Patterns from Structural Data, by Ning Jin and Wei Wang, Data Mining and Knowledge Discovery for Big Data: Methodologies, Challenges, and Opportunities Chapter 4, by Wesley W. Chu (ed.) Springer, 2013. Grid-based Clustering, by Wei Cheng, Wei Wang, and Sandra Batista, Data Clustering: Algorithms and Applications Chapter 6, by Charu C. Aggarwal and Chandan K. Reddy (eds.), CRC Press, 2013. Finding high-order correlations in high-dimensional biological data, by Xiang Zhang, Feng Pan, and Wei Wang, Link Mining: Models, Algorithms and Applications, by Yu, Han, and Faloutsos (eds.), Springer, 2010. Protein local structure comparison: methods and future directions, by Jun Huan, Wei Wang, and Jan Prins, Advances in Computers by Chau-Wen Tseng (eds.), Elsevier, 2006. Models for sequential pattern mining, by Hye-Chung Kum, Susan Paulsen, and Wei Wang, A Book on FDM --- Lecture Notes in Computer Science, Springer-Verlag, 2006. Discovering evolutionary classifier over high speed non-static stream, by Jiong Yang, Xifeng Yan, Jiawei Han, and Wei Wang, Advanced Methods for Knowledge Discovery from Complex Data, pp. 337-364, 2005. Mining high dimensional data, by Wei Wang and Jiong Yang, Data Mining and Knowledge Discovery Handbook: A Complete Guide for Practitioners and Researchers, Kluwer Academic Publishers, 2005. PK-tree: a spatial index structure for high dimensional point data (extended version), by Wei Wang, Jiong Yang, and Richard Muntz, Information Organization and Databases, Kluwer Academic Publishers, 2000. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, in Advances in Persistent Object Systems, pp. 199-214, Morgan Kauffmann, 1999. Extension of lower probabilities and coherence of belief measures, Advances in Intelligent Computing, edited by B. Bouchon - Meunier, R. R. Yager, and L. A. Zadeh, Springer Verlag, pp. 62-69, 1995.
BOOKS
	Mining Sequential Patterns from Large Data Sets, by Wei Wang and Jiong Yang, in Series of Advances in Database Systems, edited by Ahmed Elmagarmid, Kluwer, 2005. Advances in Web-Age Information Management --- Lecture Notes in Computer Science No. 2762, edited by Guozhu Dong, Changjie Tang, and Wei Wang, Springer-Verlag, 2003.
SOFTWARES
	CREATe: Clinical Report Extraction and Annotation Technology CTRL-PG: Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference JEDI: Circular RNA Prediction based on Junction Encoders and Deep Interaction among Splice Sites SPoD: Discovering Undisclosed Paid Partnership on Social Media via Aspect-Attentive Sponsored Post Learning Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases InterHAt: Interpretable Click-Through Rate Prediction through Hierarchical Attention LG-ODE: Learning Continuous System Dynamics from Irregularly-Sampled Partial Observations MuPIPR: Mutation effect estimation on protein-protein interactions using deep contextualized representation learning PCPR: "The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition QDS-Transformer: Long Document Ranking with Query-Directed Sparse Transformer DISP: Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification JOIE: Universal Representation Learning of Knowledge Bases by Jointly Embedding Instances and Ontological Concepts MetaMLAnn: Inferring Microbial Communities for City Scale Metagenomics Using Neural Networks MetaPheno: A critical evaluation of deep learning and machine learning in metagenome-based disease prediction MiniScrub: de novo long read scrubbing using approximate alignment and deep learning NeRank: Personalized Question Routing via Heterogeneous Network Embedding PIPR: Multifaceted proteinÐprotein interaction prediction based on Siamese residual RCNN SimGNN: A neural network approach to fast graph similarity computation UGRAPHEMB: Unsupervised Inductive Graph-Level Representation Learning via Graph-Graph Proximity GN-GloVe: Learning Gender-Neutral Word Embedding NETRA: Learning Deep Network Representations with Adversarially Regularized Autoencoders NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks RIN: Reformulation Inference Network for Context-Aware Query Suggestion AZTEC: A Cloud-based Computational Platform to Integrate Biomedical Resources Fleximer: Accurate Quantification of RNA-Seq via Variable-Length k-mers TahcoRoll: An Efficient Approach for Signature Profiling in Genomic Data through Variable-Length k-mers CausalRanking: Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations R-GENSENG: A randomized approach to speed up the analysis of large scale read-count data in the application of CNV detection Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data Fleximer: Accurate Quantification of RNA-Seq via Variable-Length k-mers PseudoLasso: Efficient approach to correct read alignment for pseudogene abundance estimates Source-LDA: Enhancing probabilistic topic models using prior knowledge sources Spotlite: Web Application and Augmented Algorithms for Predicting Co-Complexed Proteins from Affinity Purification - Mass Spectrometry Data HICC: An Entropy Splitting Based Framework for Hierarchical Co-Clustering RNA-Skim: a rapid method for RNA-Seq quantification at transcript level Graph Regularized Dual Lasso for Robust eQTL Mapping Flexible and Robust Co-Regularized Multi-Domain Graph Clustering GAIN: Efficient genome ancestry inference in complex pedigrees with inbreeding Multi-Alignment and Read Annotation Pipeline: Lapels, Suspenders ASGENSENG: A software to detect allele specific CNV from both WGS and WES data GENSENG: Improving detection of copy number variation by simultaneous bias correction and read-depth segmentation MaCH-Admix: genotype imputation for admixed populations GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference due to RNAseq reads misalignment HTreeQA: using semi-perfect phylogeny trees in quantitative trait loci study on genotype data LTS: Discriminative subgraph mining by learning from search history GAIA: Graph classification using evolutionary computation REM: Rapid and Robust Resampling-Based Multiple-Testing Correction with Application in a Genome-Wide Expression Quantitative Trait Loci Study FastANOVA: an Efficient Algorithm for Genome-Wide Association Study Inferring Genome-wide Mosaic Structure Genotype Sequence Segmentation TreeQA: Tree-based Genome-wide Association Mapping NPUTE: Fast Algorithm for Imputing Missing Genotypes in SNPs FFSM: Fast Frequent Subgraph Mining
PATENTS
	System and method for identifying coherent objects with application to E-commerce, 2003. System and probabilistic method for mining long patterns, 2001. System and method for mining patterns with noise, 2001. System and method for meta pattern discovery, 2001. (US Patent 6785663, issued on August 31st, 2004) Methods for identifying partial periodic patterns and corresponding event subsequences in an event sequence, 2000. (US Patent 6718317, issued on April 6th, 2004) Methods for identifying partial periodic patterns of infrequent events in an event sequence, 2000. Methods for mining weighted association rule, 2000 (US Patent 6415287, issued on July 2nd, 2002).