Fuzzing MLIR Compilers with Custom Mutation Synthesis, Ben
Limpanukorn, Jiyuan Wang, Hong Jin Kang, Eric Zitong Zhou, Miryung
Kim, 47th International Conference on Software Engineering (ICSE '25) 12 pages (an earlier
version at arXiv)
Dagstuhl on Code Search: A Journey through Searching Similar
Code, Invited Keynote (pdf)
Natural Symbolic Execution-based Testing for Big Data Analytics,
Yaoxuan Wu, Ahmad Humayun, Muhammad Ali Gulzar, and Miryung Kim,
ACM International Conference on the Foundations of Software
Engineering 2024 (pdf) (NaturalSym github)
(FSE '24)
Scaling Code Pattern Inference with Interactive What-If Analysis,
46th International Conference on Software Engineering (ICSE '24) 12 pages (pdf)
(demo) (SURF
github)
Human-in-the-Loop Synthetic Text Data Inspection with Provenance
Tracking, Hong Jin Kang*, Fabrice Harel-Canada*, Muhammad Ali
Gulzar, Violet Peng, and Miryung Kim, NAACL
'24 (pdf) (Provenance
Inspector github)
DRust: Language-Guided Distributed Shared Memory with Fine
Granularity, Full Transparency, and Ultra Efficiency, Haoran Ma,
Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng
Wu, Yiying Zhang, Miryung Kim, Harry Xu (pdf)
(OSDI '24)
Harvesting Idle Memory for Application-managed Soft State with Midas, Yifan Qiao, Zhenyuan Ruan, Haoran Ma, Adam Belay, Miryung Kim, Harry Xu (pdf) (Midas github) (NSDI '24)
Software Engineering for Data Intensive Scalable Computing and
Heterogeneous Computing, ICSE 2023 Future of Software Engineering,
Invited paper (16 pages) (pdf)
NaturalFuzz: Natural Input Generation for Big Data Analytics,
Ahmad Humayun, Yaoxuan Wu, Miryung Kim, Muhammad Ali Gulzar, 38th
IEEE/ACM International Conference on Automated Software
Engineering (ASE 2023), 12
pages (pdf)
Co-Dependence Aware Fuzzing for Dataflow-based Big Data Analytics,
Ahmad Humayun, Miryung Kim, Muhammad Ali Gulzar, ACM Joint
European Software Engineering Conference and Symposium on the
Foundations of Software Engineering (ESEC/FSE
2023) 12 pages (pdf)
Leveraging Hardware Probes and Optimizations for Accelerating Fuzz
Testing of Heterogeneous Applications, Jiyuan Wang, Qian Zhang,
Hongbo Rong, Harry Xu, Miryung Kim, ACM Joint European Software
Engineering Conference and Symposium on the Foundations of
Software Engineering (ESEC/FSE 2023)
12 pages (pdf)
Hermit: Low-Latency, High-Throughput, and Transparent Remote
Memory via Feedback-Directed Asynchrony, Yifan Qiao and Chenxi
Wang, Zhenyuan Ruan and Adam Belay, Qingda Lu, Yiying Zhang,
Miryung Kim and Guoqing Harry Xu (NSDI
'23) (pdf)
Canvas: Isolated and Adaptive Swapping for Multi-Applications on
Remote Memory, Chenxi Wang, Yifan Qiao, Haoran Ma, Shi Liu, Yiying
Zhang, Wenguang Chen, Ravi Netravali, Miryung Kim, Guoqing Harry
Xu. 18 pages, NSDI '23 (pdf)
A Declarative Metamorphic Testing Framework for Autonomous
Driving, Yao Deng, Xi Zheng, Guannan Lou, Tianyi Zhang, Huai Liu,
Miryung Kim, Tsong Chen, 18 pages, IEEE
TSE 2023 (doi)
HeteroGen: Transpiling C to Heterogeneous HLS Code with Automated
Test Generation and Program Repair, Qian Zhang, Jiyuan Wang, Harry
Xu, Miryung Kim, 27th International Conference on Architectural
Support for Programming Languages and Operating Systems, 12 pages,
ASPLOS '22 (local pdf, slides,
Github HeteroGen)
Mako: A Low-Pause, High-Throughput Evacuating Collector for
Memory-Disaggregated Datacenters, H. Ma, S. Liu, C. Wang, Y. Qiao,
M. Bond, S. Blackburn, M. Kim, G. Xu PLDI
'22, (local_pdf,
slides, Github_Mako)
Concept-Annotated Examples for Library Comparison, Litao Yan,
Miryung Kim, Björn Hartmann, Tianyi Zhang, Elena L. Glassman, ACM
Symposium on User Interface Software and Technology, UIST '22, (pdf)
Sibylvariant Transformations for Robust Text Classification,
Fabrice Y Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung
Kim. Findings of ACL '22 (local_pdf,
slides,
Github_Sibyl,
video,
doi)
Is Neuron Coverage a Meaningful Measure for Testing Deep Neural
Networks? Fabrice Harel-Canada, Lingxiao Wang, Muhammad Ali
Gulzar, Quanquan Gu, Miryung Kim, ACM SIGSOFT International
Symposium on the Foundations of Software Engineering, ESEC/FSE '20 (local_pdf,
replication_package,
DOI,
video, GitHub-DeepLearningTest)
JShrink: In-depth Investigation into Debloating Modern Java
Applications, Bobby Bruce*, Tianyi Zhang*, Jaspreet Arora, Guoqing
Harry Xu, Miryung Kim, ACM SIGSOFT International Symposium on the
Foundations of Software Engineering, 12 pages, ESEC/FSE '20
(local_pdf, replication
package, web
tutorial, jdebloat
ONR software security summer school, GitHub-JShrink)
* denotes equal co-first authors
HeteroRefactor: Refactoring for Heterogeneous Computing with
FPGA, Jason Lau*, Aishwarya Sivaraman*, Qian Zhang*, Muhammad Ali
Gulzar, Jason Cong, Miryung Kim [* denotes equal co-first authors,
ordered alphabetically by their last names.] Proceedings of 42nd
IEEE/ACM International Conference on Software Engineering, 13
pages, ICSE ’20 (local pdf, slides,
video, replication
package, GitHub-HeteroRefactor)
Enabling Data-Driven API Design with Community Usage Data: A
Need-Finding Study, Tianyi Zhang, Björn Hartmann, Miryung Kim,
Elena Glassman, 2020 ACM Conference on Human Factors in Computing
Systems, 12 pages, CHI ’20 (local
pdf, slides, video)
An Analysis of Adversarial Attacks and Defenses on Autonomous
Driving Models, 18th Annual IEEE International Conference on
Pervasive Computing and Communications 2020, Yao Deng, Xi Zheng,
Tianyi Zhang, Chen Chen, Guannan Lou, Miryung Kim, 10 pages, PerCom '20 (local pdf)
Characterizing and Identifying Composite Refactorings: Concepts, Heuristics and Patterns, Leonardo Da Silva Sousa, Diego Cedrim, Alessandro Garcia, Willian Oizumi, Ana Carla Bibiano, Daniel Oliveira, Miryung Kim, Anderson Oliveira, Mining Software Repositories 2020, 12 pages, MSR '20 (local pdf)
SE4DA: Software Engineering for Data Analytics, Pre-print for Special Issue on IEEE Software, The AI Effect: Working at the Intersection of AI and Software Engineering, Miryung Kim, IEEE Software (doi, local_pdf)
WebJShrink: A Web Service for Debloating Java Bytecode, Konner
Macias, Mihir Mathur, Bobby R. Bruce, Tianyi Zhang, Miryung Kim,
ACM SIGSOFT International Symposium on the Foundations of Software
Engineering, 12
pages, ESEC/FSE '20 Tool Demonstration (video,
pdf,
DOI,
GitHub-WebJShrink)
BigTest: Symbolic Execution Based Systematic Test Generation Tool
for Apache Spark, Muhammad Ali Gulzar, Madan Musuvathi, Miryung
Kim, Proceedings of 42nd IEEE/ACM International Conference on
Software Engineering, 4 pages, ICSE ’20 Tool
Demonstration (local_pdf, presentation
video, code,
demo)
PerfDebug: Performance Debugging of Computation Skew in Dataflow Systems, Jason Teoh, Muhammad Ali Gulzar, Guoqing Harry Xu, and Miryung Kim, SoCC '19: ACM Symposium on Cloud Computing 2019. 14 pages (local_pdf, slides, GitHub-PerfDebug)
White-Box Testing of Big Data Analytics with Complex User-Defined Functions, Muhammad Ali Gulzar, Shaghayegh Mardani, Madanlal Musuvathi, Miryung Kim, ESEC/FSE '19: The 27th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (local_pdf, DOI, data set and tool, GitHub-BigTest) 12 pages
Analyzing and Supporting Adaptation of Online Code Examples,
Tianyi Zhang, Di Yang, Cristina Lopes, Miryung Kim, ICSE '19: Proceedings of
41st IEEE/ACM International Conference on Software Engineering, 12
pages (local_pdf,
slides,
DOI,
data
set and tool, GitHub-ExampleStack)
Active Inductive Logic Programming for Code Search, Aishwarya
Sivaraman, Tianyi Zhang, Guy Van den Broeck, Miryung Kim, ICSE '19: Proceedings of
41st IEEE/ACM International Conference on Software Engineering, 12
pages (local_pdf, slides, DOI, data
set and tool, GitHub-ALICE)
An Empirical Study of Common Challenges in Developing Deep
Learning Applications, Tianyi Zhang, Cuiyun Gao, Lei Ma, Michael
R. Lyu and Miryung Kim, ISSRE '19: Proceedings of
30th International Symposium on Software Reliability Engineering,
12 pages. (local
pdf)
Gerenuk: Thin Computation over Big Native Data Using Speculative
Program Transformation, Cheng Cai, Christian Navasca, Khanh
Nguyen, Brian Demsky, Shan Lu, Miryung Kim, Guoqing Harry Xu, SOSP '19: The 27th ACM
Symposium on Operating Systems Principles, 12 pages. (local pdf, DOI)
Book Chapter on Software Evolution, Miryung Kim, Na Meng, and Tianyi Zhang, Handbook of Software Engineering, Editors: Cha, Taylor, Kang, Springer, pages 223-284 (local_pdf, DOI)
Tianyi Zhang's PhD Dissertation: Leveraging Program Commonalities
and Variations for Systematic Software Development and Maintenance
(local_pdf, UCLA)
Are Code Examples on an Online Q&A Forum Reliable? A Study of
API Misuse on Stack Overflow, Tianyi Zhang, Ganesha Upadhyaya,
Anastasia Reinhardt, Hridesh Rajan and Miryung Kim, ICSE '18: Proceedings of
40th IEEE/ACM International Conference on Software Engineering,
pages 886-896, (local_pdf, slides,
DOI, GitHub-ExampleCheck)
Visualizing API Usage Examples at Scale, Elena L. Glassman*, Tianyi Zhang*, Björn Hartmann, Miryung Kim, CHI '18: Proceedings of ACM CHI Conference on Human Factors in Computing Systems, 12 pages, Paper No. 580, (local_pdf, demo, slides, DOI, demo video). *The two lead authors contributed equally to the work, as part of an equal collaboration between the two institutions.
BigSift: Automated Debugging of Big Data Analytics in
Data-Intensive Scalable Computing, Muhammad Ali Gulzar, Siman
Wang, Miryung Kim, ESEC/FSE '18 Research
Demonstrations, pages 863-866 (local_pdf, demo video, slides, DOI, GitHub-BigSift)
Augmenting Stack Overflow with API Usage Patterns Mined from
GitHub, Anastasia Reinhardt, Tianyi Zhang, Mihir Mathur, Miryung
Kim, ESEC/FSE '18 Research Demonstrations,
pages 880-883 (local_pdf,
demo video, slides,
DOI, GitHub-ExampleCheck)
Data Scientists in Software Teams: State of the Art and
Challenges, Miryung Kim, Thomas Zimmermann, Robert DeLine, and
Andrew Begel, TSE:
IEEE Transactions on Software Engineering, 16 pages (Journal
First, Accepted, August 2017) (local_pdf, slides).
A journal first presentation at ICSE 2018
Automated Transplantation and Differential Testing for Clones,
Tianyi Zhang, Miryung Kim, ICSE '17:
Proceedings of 39th IEEE/ACM International Conference on Software
Engineering, pages 665-676 (DOI)
(local_pdf)
(slides)
BigDebug: Debugging Primitives for Interactive Big Data
Processing in Spark, Muhammad Ali Gulzar, Matteo
Interlandi, Seunghyun Yoo, Sai Deep Tetali, Tyson Condie, Todd
Millstein, Miryung Kim, ICSE
'16: Proceedings of 38th IEEE/ACM
International Conference on Software Engineering, pages 784-795 (DOI)
(local_pdf)
(slides) (GitHub-BigDebug)
The Emerging Role of Data Scientists on Software Development
Teams, Miryung Kim, Thomas Zimmermann, Robert DeLine, Andrew
Begel, ICSE
'16: Proceedings of 38th IEEE/ACM
International Conference on Software Engineering, pages 96-107 (DOI)
(local_pdf)
(slides)
Appendix to The Emerging Role of Data Scientists on Software
Development Teams (local_pdf)
(link)
Titian: Data Provenance Support in Spark, Matteo Interlandi,
Kshitij Shah, Sai Tetali, Muhammad Gulzar, Seunghyun Yoo,
Miryung Kim, Todd Millstein, Tyson Condie, VLDB
'16 (PVLDB Volume 9 Issue 3):
Proceedings of the 42nd Conference on Very Large Data Bases, pages
216-227 (DOI)
(local_pdf)
(GitHub-Titian)
BigDebug: Interactive Debugger for Big Data Analytics in Apache
Spark, Muhammad Ali Gulzar, Matteo Interlandi, Tyson
Condie, Miryung Kim, FSE '16: The 24th
ACM SIGSOFT International Symposium on the Foundations of Software
Engineering, Demonstration Track, pages 1033-1037 (DOI)
(local_pdf) (extended
version with screenshots) (demo video)
(poster)
(slides)
(GitHub-BigDebug)
Optimizing Interactive Development of Data-Intensive
Applications, Matteo Interlandi, Sai Deep Tetali, Muhammad Ali
Gulzar, Joseph Noor, Tyson Condie, Miryung Kim, Todd D. Millstein,
SoCC '16: ACM
Symposium on Cloud Computing 2016, pages 510-522 (DOI) (local_pdf)
Interactive Debugging for Big Data Analytics, Muhammad Ali
Gulzar, Xueyuan Han, Matteo Interlandi, Shaghayegh Mardani, Sai
Deep Tetali, Tyson Condie, Todd Millstein, Miryung Kim, HotCloud
2016, The 8th USENIX Workshop on Hot Topics in Cloud Computing, 5
pages (DOI) (local_pdf)
Prioritizing Test Cases for Early Detection of Refactoring Faults, Everton L.G. Alves, Patricia D. L. Machado, Tiago Massoni, Miryung Kim, Journal of Software Testing, Verification, and Reliability, 2016, Preprint DOI: 10.1002/stvr.1603 (Date of Publication: 21 March 2016), Volume 26, Issue 5, pages 402-426, August 2016 (DOI) (local_pdf)
Perceptions on the State of the Art in Verification and Validation in Cyber-Physical Systems, Xi Zheng, Christine Julien, Miryung Kim, Sarfraz Khurshid, IEEE Systems Journal, Preprint DOI: 10.1109/JSYST.2015.2496293 (Date of Publication: 20 November 2015), 14 pages (DOI) (local_pdf)
Does Automated Refactoring Obviate Systematic Editing? Na Meng, Lisa Hua, Miryung Kim, and Kathryn McKinley, ICSE '15: Proceedings of 37th IEEE/ACM International Conference on Software Engineering, pages 392-402 (DOI) (local pdf) (slides)
Interactive Code Review for Systematic Changes, Tianyi Zhang,
Myoungkyu Song, Joseph Pinedo, Miryung Kim, ICSE
'15: Proceedings of 37th IEEE/ACM
International Conference on Software Engineering, pages 111-122 (DOI) (local pdf) (slides)
An Empirical Study of Refactoring Challenges and Benefits at Microsoft, Miryung Kim, Thomas Zimmermann, and Nachiappan Nagappan, TSE: IEEE Transactions on Software Engineering, Volume 40 No. 7: 633-649 (2014) (DOI) (local_pdf)
Cookbook: In Situ Code Completion Using Edit Recipes Learned from Examples, John Jacobellis, Na Meng, Miryung Kim, ICSE '14: Proceedings of 36th IEEE/ACM International Conference on Software Engineering, Formal Demonstration Track, pages 584-587 (link) (local_pdf) (slides) (demo)
An Empirical Study on Reducing Omission Errors in Practice, Jihun
Park, Miryung Kim, Doo-Hwan Bae, ASE
'14: Proceedings of the 29th ACM/IEEE
International Conference on Automated Software Engineering, pages
121-126 (DOI)
(local_pdf) (slides)
Recommending Program Transformations to Automate Repetitive Software Changes, Book Chapter on Recommendation Systems in Software Engineering. Miryung Kim, Na Meng, Editors: Martin Robillard, Walid Maalej, Robert Walker and Thomas Zimmermann, Springer (link) (local_pdf)
RefDistiller: Detecting Anomalies in Manual Refactoring Edits, E.
L. Alves, M. Song, M. Kim, P. D. Machado, and T. Massoni.
Technical report, University of Texas at Austin, TR-ECE-2014-3,
March, 2014. (local_pdf)
Experimental Artifacts for "RefDistiller: Detecting Anomalies in
Manual Refactoring Edits," Everton L. G. Alves, Myoungkyu Song,
Miryung Kim, Patricia Machado, Tiago Massoni, Technical Report (local pdf)
LASE: Locating and Applying Systematic Edits
by Learning from Examples, Na Meng, Miryung Kim, Kathryn McKinley,
ICSE '13:
Proceedings of 35th IEEE/ACM International Conference on Software
Engineering, pages 502-511 (DOI) (local pdf) (slides)
LASE: An Example-based Program Transformation Tool for Locating
and Applying Systematic Edits, John Jacobellis, Na Meng, Miryung
Kim, ICSE '13: Proceedings of 35th IEEE/ACM
International Conference on Software Engineering, 4 pages, Formal
Demonstrations Track, pages 1319-1322 (DOI) (local pdf) (demo video)
Detecting and Characterizing Semantic
Inconsistencies in Ported Code, Baishakhi Ray, Miryung Kim,
Suzette Person, Neha Rungta, ASE
'13: Proceedings of The 28th IEEE/ACM
International Conference on Automated Software Engineering, pages
367-377, (DOI)
(local pdf) (slides)
An Empirical Study of API Stability and Adoption in the Android Ecosystem, Tyler McDonnell, Baishakhi Ray, Miryung Kim, ICSM '13: Proceedings of 29th IEEE International Conference on Software Maintenance, pages 70-79, (DOI) (local pdf) (slides)