Identifying Shared Software Components to Support Malware Forensics

Brian E. Ruttenberg,Craig Miles,Lee Kellogg,Vivek Notani,Michael Howard,Charles LeDoux,Arun Lakhotia,Avi Pfeffer

Identifying Shared Software Components to Support Malware Forensics

2014

Recent reports from the anti-malware industry indicate similarity between malware code resulting from code reuse can aid in developing a profile of the attackers. We describe a method for identifying shared components in a large corpus of malware, where a component is a collection of code, such as a set of procedures, that implement a unit of functionality. We develop a general architecture for identifying shared components in a corpus using a two-stage clustering technique. While our method is parametrized on any features extracted from a binary, our implementation uses features abstracting the semantics of blocks of instructions. Our system has been found to identify shared components with extremely high accuracy in a rigorous, controlled experiment conducted independently by MITLL. Our technique provides an automated method to find between malware code functional relationships that may be used to establish evolutionary relationships and aid in forensics.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations