Results 1 - 7 of 7
Crawlzilla 是由台灣高速網路與計算中心開發的搜尋引擎, 主要功能為讓使用者輕鬆建立自訂的搜尋引擎, 不用依靠商業公司所提供的搜尋引擎. 適用於自訂範圍搜尋以及公司或是團體內部使用.
Crawlzill 特色整理如下:
Chukwa is an open source data collection system for monitoring large distributed systems. Chukwa is built on top of the Hadoop Distributed File System (HDFS) and Map/Reduce framework and inherits Hadoop’s scalability and robustness.
Platform: Linux
References: https://incubator.apache.org/chukwa/
X-RIME is a open source project devoted to provide Hadoop based solution for large scale social network analysis.
Platform: Linux;License: Apache License 2.0
References: https://xrime.sourceforge.net/
Hadoop為Apache軟體基金會(Apache Software Foundation, ASF)轄下成功的開放源碼專案。其以java進行撰寫,提供大量資料的分散式運算環境,並採用不具授權拘束特性的Apache-2.0授權釋出。Hadoop在國內外皆擁有眾多愛好者組成分享社群互相交流,並與ASF主力推動的育成專案Apache Hama在運作上密切相關(https://incubator.apache.org/),故未來發展穩定且前景可期。
Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats.
Reference: https://nutch.apache.org/
AppScale is an open-source framework for running Google App Engine applications. It is an implementation of a cloud computing platform (Platform-as-a-Service), supporting Xen, KVM, Amazon EC2 and Eucalyptus. It has been developed and is maintained by the RACELab at UC Santa Barbara.
Reference: https://appscale.cs.ucsb.edu/
Google App Engine is a platform for developing and hosting web applications in Google-managed data centers.