TY - JOUR AU - Cohen, F. PY - 1987 DA - 1987// TI - Computer viruses JO - Comput. Secur VL - 6 UR - https://doi.org/10.1016/0167-4048(87)90122-2 DO - 10.1016/0167-4048(87)90122-2 ID - Cohen1987 ER - TY - CHAP AU - Mavrommatis, N. P. P. AU - Monrose, M. A. R. F. PY - 2008 DA - 2008// TI - All your iframes point to us BT - USENIX Security Symposium PB - USENIX Association CY - Berkeley ID - Mavrommatis2008 ER - TY - STD TI - McAfee:For Consumers (2014). https://www.mcafee.com/consumer/en-us/store/m0/index.html. Accessed 06 Jan 2016. UR - https://www.mcafee.com/consumer/en-us/store/m0/index.html ID - ref3 ER - TY - STD TI - NortonNorton Anti (2014). http://us.norton.com. Accessed 06 Jan 2016. UR - http://us.norton.com ID - ref4 ER - TY - CHAP AU - Christodorescu, M. AU - Jha, S. AU - Seshia, S. AU - Song, D. AU - Bryant, R. E. PY - 2005 DA - 2005// TI - Semantics-aware malware detection BT - Security and Privacy, 2005 IEEE Symposium On PB - IEEE CY - Los Alamitos UR - https://doi.org/10.1109/SP.2005.20 DO - 10.1109/SP.2005.20 ID - Christodorescu2005 ER - TY - STD TI - P Ször, P Ferrie, in Virus Bulletin Conference. Hunting for metamorphic, (2001). ID - ref6 ER - TY - STD TI - JM Drew, Mass Compromise of IIS Shared Web Hosting for Blackhat SEO: A Case Study (2014). http://blog.jakemdrew.com/2015/03/10/mass-compromise-of-iis-shared-web-hosting-for-blackhat-seo-a-case-study/. Accessed 06 Jan 2016. UR - http://blog.jakemdrew.com/2015/03/10/mass-compromise-of-iis-shared-web-hosting-for-blackhat-seo-a-case-study/ ID - ref7 ER - TY - STD TI - Wikipedia:Agobot (2014). https://en.wikipedia.org/wiki/Agobot. Accessed 06 Jan 2016. UR - https://en.wikipedia.org/wiki/Agobot ID - ref8 ER - TY - CHAP AU - Bailey, M. AU - Oberheide, J. AU - Andersen, J. AU - Mao, Z. M. AU - Jahanian, F. AU - Nazario, J. PY - 2007 DA - 2007// TI - Automated classification and analysis of internet malware BT - Recent Advances in Intrusion Detection PB - Springer CY - Heidelberg UR - https://doi.org/10.1007/978-3-540-74320-0_10 DO - 10.1007/978-3-540-74320-0_10 ID - Bailey2007 ER - TY - STD TI - V Total, File Statistics During Last 7 Days. https://www.virustotal.com/en/statistics/. Accessed 15 Jan 2015. UR - https://www.virustotal.com/en/statistics/ ID - ref10 ER - TY - JOUR AU - Altschul, S. F. AU - Gish, W. AU - Miller, W. AU - Myers, E. W. AU - Lipman, D. J. PY - 1990 DA - 1990// TI - Basic local alignment search tool JO - J. Mol. Biol VL - 215 UR - https://doi.org/10.1016/S0022-2836(05)80360-2 DO - 10.1016/S0022-2836(05)80360-2 ID - Altschul1990 ER - TY - JOUR AU - Kent, W. J. PY - 2002 DA - 2002// TI - Blat-the blast-like alignment tool JO - Genome Res VL - 12 UR - https://doi.org/10.1101/gr.229202. Article published online before March 2002 DO - 10.1101/gr.229202. Article published online before March 2002 ID - Kent2002 ER - TY - JOUR AU - Wang, Q. AU - Garrity, G. M. AU - Tiedje, J. M. AU - Cole, J. R. PY - 2007 DA - 2007// TI - Naive bayesian classifier for rapid assignment of RNA sequences into the new bacterial taxonomy JO - Appl. Environ. Microbiol VL - 73 UR - https://doi.org/10.1128/AEM.00062-07 DO - 10.1128/AEM.00062-07 ID - Wang2007 ER - TY - JOUR AU - Edgar, R. C. PY - 2010 DA - 2010// TI - Search and clustering orders of magnitude faster than blast JO - Bioinformatics VL - 26 UR - https://doi.org/10.1093/bioinformatics/btq461 DO - 10.1093/bioinformatics/btq461 ID - Edgar2010 ER - TY - CHAP AU - Drew, J. AU - Hahsler, M. PY - 2014 DA - 2014// TI - Strand: fast sequence comparison using mapreduce and locality sensitive hashing BT - Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics PB - ACM CY - New York ID - Drew2014 ER - TY - JOUR AU - Wood, D. E. AU - Salzberg, S. L. PY - 2014 DA - 2014// TI - Kraken: ultrafast metagenomic sequence classification using exact alignments JO - Genome Biol VL - 15 UR - https://doi.org/10.1186/gb-2014-15-3-r46 DO - 10.1186/gb-2014-15-3-r46 ID - Wood2014 ER - TY - JOUR AU - Ounit, R. AU - Wanamaker, S. AU - Close, T. J. AU - Lonardi, S. PY - 2015 DA - 2015// TI - Clark: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers JO - BMC Genomics VL - 16 UR - https://doi.org/10.1186/s12864-015-1419-2 DO - 10.1186/s12864-015-1419-2 ID - Ounit2015 ER - TY - STD TI - E Peterson, D Curtis, A Phillips, J Teuton, C Oehmen, in Intelligence and Security Informatics (ISI), 2013 IEEE International Conference On. A generalized bio-inspired method for discovering sequence-based signatures, (2013), pp. 330–332, doi:10.1109/ISI.2013.6578853. UR - http://dx.doi.org/10.1109/ISI.2013.6578853 ID - ref18 ER - TY - STD TI - Kaggle:Microsoft Malware Classification Challenge (BIG 2015) (2015). https://www.kaggle.com/c/malware-classification. Accessed 04 Nov 2015. UR - https://www.kaggle.com/c/malware-classification ID - ref19 ER - TY - CHAP AU - Drew, J. AU - Hahsler, M. AU - Moore, T. PY - 2016 DA - 2016// TI - Polymorphic malware detection using sequence classification methods BT - International Workshop on Bio-inspired Security, Trust, Assurance and Resilience (BioSTAR 2016) PB - IEEE CY - Los Alamitos ID - Drew2016 ER - TY - JOUR AU - Vinga, S. AU - Almeida, J. PY - 2003 DA - 2003// TI - Alignment-free sequence comparison—review JO - Bioinformatics VL - 19 UR - https://doi.org/10.1093/bioinformatics/btg005 DO - 10.1093/bioinformatics/btg005 ID - Vinga2003 ER - TY - JOUR AU - Shannon, C. E. PY - 2001 DA - 2001// TI - A mathematical theory of communication JO - ACM SIGMOBILE Mobile Comput. Commun. Rev VL - 5 UR - https://doi.org/10.1145/584091.584093 DO - 10.1145/584091.584093 ID - Shannon2001 ER - TY - STD TI - A Gionis, P Indyk, R Motwani, et al, in VLDB, 99. Similarity search in high dimensions via hashing, (1999), pp. 518–529. ID - ref23 ER - TY - STD TI - hadooptutorial.info: Combiner in MapReduce (2014). http://hadooptutorial.info/combiner-in-mapreduce/. Accessed 02 Apr 2015. UR - http://hadooptutorial.info/combiner-in-mapreduce/ ID - ref24 ER - TY - JOUR AU - Dean, J. AU - Ghemawat, S. PY - 2008 DA - 2008// TI - Mapreduce: simplified data processing on large clusters JO - Commun. ACM VL - 51 UR - https://doi.org/10.1145/1327452.1327492 DO - 10.1145/1327452.1327492 ID - Dean2008 ER - TY - CHAP AU - Ioffe, S. PY - 2010 DA - 2010// TI - Improved consistent sampling, weighted minhash and l1 sketching BT - Data Mining (ICDM), 2010 IEEE 10th International Conference On PB - IEEE CY - Los Alamitos UR - https://doi.org/10.1109/ICDM.2010.80 DO - 10.1109/ICDM.2010.80 ID - Ioffe2010 ER - TY - BOOK AU - Rajaraman, A. AU - Ullman, J. D. PY - 2012 DA - 2012// TI - Mining of Massive Datasets PB - Cambridge University Press CY - Cambridge ID - Rajaraman2012 ER - TY - BOOK AU - Leskovec, J. AU - Rajaraman, A. AU - Ullman, J. D. PY - 2014 DA - 2014// TI - Mining of Massive Datasets PB - Cambridge University Press CY - Cambridge UR - https://doi.org/10.1017/CBO9781139924801 DO - 10.1017/CBO9781139924801 ID - Leskovec2014 ER - TY - STD TI - Wikipedia:Simple Matching Coefficient. https://en.wikipedia.org/wiki/Simple_matching_coefficient. Accessed 14 Aug 2015. UR - https://en.wikipedia.org/wiki/Simple_matching_coefficient ID - ref29 ER - TY - STD TI - Kaggle:Evaluation (2016). https://www.kaggle.com/c/malware-classification/details/evaluation Accessed 14 Jan 2016. UR - https://www.kaggle.com/c/malware-classification/details/evaluation ID - ref30 ER - TY - STD TI - Kaggle:Microsoft Malware Winners’ Interview: 1st place, “NO to overfitting” (2015). http://blog.kaggle.com/2015/05/26/microsoft-malware-winners-interview-1st-place-no-to-overfitting Accessed: 02 Nov 2015. UR - http://blog.kaggle.com/2015/05/26/microsoft-malware-winners-interview-1st-place-no-to-overfitting ID - ref31 ER - TY - STD TI - L Wang, Microsoft Malware Classification Challenge (BIG 2015) First Place Team: Say No To Overfitting (2015). https://github.com/xiaozhouwang/kaggle_Microsoft_Malware/blob/master/Saynotooverfitting.pdf Accessed: 02 Nov 2015. UR - https://github.com/xiaozhouwang/kaggle_Microsoft_Malware/blob/master/Saynotooverfitting.pdf ID - ref32 ER - TY - JOUR AU - Marcais, G. AU - Kingsford, C. PY - 2011 DA - 2011// TI - A fast, lock-free approach for efficient parallel counting of occurrences of k-mers JO - Bioinformatics VL - 27 UR - https://doi.org/10.1093/bioinformatics/btr011 DO - 10.1093/bioinformatics/btr011 ID - Marcais2011 ER - TY - STD TI - F Cloutier, x86 Instruction Set Reference. http://www.felixcloutier.com/x86/. Accessed 18 Jul 2015. UR - http://www.felixcloutier.com/x86/ ID - ref34 ER -