Web Stats Data Mining網絡統計數據挖掘
September 22nd, 2008 · by David Bradley 2008年9月22號的戴維布拉德利
What can online businesses learn from their web logs?怎麼在線業務學習他們的網頁記錄? According to據 Xueping Li李學平 of the Intelligent Information and Systems Laboratory, at the University of Tennessee, Knoxville, and colleagues, analyzing the information you can gather about the traffic to your website, whether that’s from raw access data on your server or standalone script-based systems such as Google Analytics, GetClicky, or Statcounter.智能信息系統實驗室,田納西大學的Knoxville和同事們,分析信息,您可以收集有關的交通到您的網站,無論是從原材料獲取數據,您的服務器或獨立的腳本為基礎的系統,例如谷歌分析, GetClicky ,或Statcounter 。
As the Web has grown, so competition has intensified.隨著網絡的發展,使競爭加劇。 This is most certainly true among ecommerce sites with the likes of Amazon and Barnes & Noble vying for each other’s business just as intensely as news organizations battle for regular readers.這是最確實的電子商務網站,如亞馬遜和巴諾爭奪對方的業務一樣激烈的新聞爭奪戰組織經常性的讀者。 It is even true in the blogosphere where feedcount and post:comment ratios are apparently all important, despite the thin veneer of sociability among the A-listers.即使這是真正的博客在feedcount後:評論比率顯然是所有重要的是,儘管薄單板之間的社交性的A - listers 。 Everyone wants more visitors and more interaction because that translates into sales one way or another, whether you are selling books, news, or your personal opinions.人人都希望更多的遊客和更多的互動,因為這轉化為銷售一種或另一種方式,無論你是暢銷書 ,新聞,或您的個人意見。
This competition among businesses and the creation of an effective web presence is critical to attracting new customers/readers/community members and retaining current customers/readers/community members and so to the success of the business/community site/blog.這種企業之間的競爭和建立一個有效的網絡存在,至關重要的是吸引新客戶/讀者/社區成員和留住現有的客戶/讀者/社區成員和如此成功的企業/社區網站/博客。
The features of a website such as its design and security and how it evolves over time influence whether a customer will revisit the site or make a transaction. 該功能的網站,如它的設計和安全性以及它如何演變的影響隨著時間的推移客戶是否將重新訪問該網站,或作交易。
Li and colleagues have developed a way of looking at web logs so that webmasters can evolve their web sites to boost and retain visitors significantly.李和他的同事們開發出一種方式來看待網絡日誌,以便管理員可以改進他們的網站,以推動和留住遊客顯著。 Their approach is based on what they describe as simple, yet effective descriptive statistical techniques that reveal the relationship between traffic workload and visitor domains names and geographic locations.他們的做法是根據他們所描述那樣簡單,但有效的描述性統計技術,揭示了交通之間的關係的工作量和遊客域的名稱和地理位置。 The regularities and patterns that emerge can shed light on how to design a better web site and enhance its performance.的規律和模式,形成可以揭示如何設計更好的網站,並提高其性能。
They point out that websites that remain unchanged for months or even years after their initial inauguration, cobwebs, you might call them, do with a few notable exceptions, lose their initial burst of customers very quickly.他們指出,網站,保持不變幾個月甚至幾年後,他們最初的就職典禮,蜘蛛網,你可以給他們打電話,做了幾個明顯的例外,失去其初始爆裂的客戶非常快。 With the rapid growth of the web and intensified competition among businesses, creating and maintaining an effective web presence is a significant challenge.隨著迅速增長的網絡和加強企業之間的競爭,創造和保持一個有效的網絡存在,是一個重大的挑戰。 It is no coincidence that perhaps with some exceptions among the “essential” sites, successful websites tend to be those that are dynamic and whose information is the most sophisticated, diverse, and exciting.這並非巧合,也許有一些例外的“基本”的網站,成功的網站往往是那些充滿活力,其信息是最複雜的,多種多樣的,令人興奮。
But, there is no point in simply changing for the sake of it, if those changes are not informed by insight gleaned from current and past traffic and visitor interactions.但是,沒有任何一點變化只是為了它,如果這些變化並不了解了解見諸於現在和過去的交通和遊客的互動。 Of course, there is nothing new in suggesting that we check our stats and adapt our content and design to optimize for visitors.當然,沒有什麼新的建議,我們要檢查我們的統計和調整我們的內容和設計優化供遊人使用。 However, the system described by Li and colleagues in their IJECRM publication (full reference below) explains how universal trends might be plucked from those stats.然而,系統描述了李和他的同事在其IJECRM出版物(充分參考下文)解釋了如何普遍趨勢可能是這艘船的目的地從這些統計資料。 Their proof of principle is based on sourced historical data, but they explain that there is no reason why a site should not do data mining on its live web stats and so become iteratively dynamic.其證明的原則是根據歷史數據來源,但他們解釋說,沒有任何理由網站不應該對數據挖掘的網上實況統計資料,因此成為反复活躍。
Xueping Li, Laigang Song, and Alberto Garcia-Diaz (2008). 李學平,宋萊鋼和阿爾貝托加西亞迪亞茲( 2008年) 。 Adaptive web presence and evolution through web log analysis Int. 自適應網絡的存在和演化通過網絡日誌分析國際。 J. Electronic Customer Relationship Management , 2 (3), 195-214 學者電子客戶關係管理, 2 ( 3 ) , 195-214段


















2 responses so far ↓第2反應到目前為止↓
Hmm, intriguing.嗯,耐人尋味。
I’d have liked to have seen a little more detail about how these patterns work.我會希望看到一些更詳細的有關如何這些模式的工作。
That said, you have provided a link to the document, so I can always have a read later…這就是說,您提供了一個鏈接文件,所以我總是能夠有一個讀稍後...
Yes, maybe I could splice in some additional information.是的,也許我會剪接一些補充資料。 Drop me a line if you’d like a PDF of the paper to do a follow up for blah.我滴線如果您想要的PDF文件,以做後續的等等。
Leave a Comment發表您的評論