演講公告
新聞標題: ( 2014-04-02 )
演講主題:Malicious URL Filtering – A Big Data Application
主講人:李育杰 教授 (臺灣科技大學資訊工程系)
演講日期:2014年4月8日(星期二) 下午2:30 –3:30
演講地點:(光復校區) 科學一館223室
茶會時間:當天下午3:30 (科學一館205室)
摘要內容:
Data deluge has created the Big Data Era. How to extract generalizable knowledge from data has attracted people’s attention from many research areas, industrial and business domains. In this talk, we will give a brief introduction to Big Data from data analytics point of view and demonstrate one of its applications, malicious URLs filtering that comes from information security industry. We present a novel lightweight filter to use before existing processing methods based only on the URL string itself. We run experiments on a large dataset and demonstrate a 75% reduction in workload size while retaining at least 90% of malicious URLs. Existing methods do not scale well with the hundreds of millions of URLs encountered every day as the problem is a heavily imbalanced large scale binary classification problem. Our proposed method is able to handle near two millions URLs less than five minutes. Our filter can significantly reduce the volume of URL queries on which further analysis need be performed, saving both computing time and bandwidth used for content retrieval.
相關檔案:Talk10408.doc
