[PerlChina] Fwd: 用Perl统计hao123网址之家首页的网站WEB服务器种类
陈学芹
robbiecn at gmail.com
Wed Aug 15 21:45:24 PDT 2007
欢迎意见:)
---------- Forwarded message ----------
From: 陈学芹 <robbiecn at gmail.com>
Date: 2007-8-12 下午4:53
Subject: 用Perl统计hao123网址之家首页的网站WEB服务器种类
To: 福州linux用户组 <fzlug at googlegroups.com>
Cc: 每日阅读 <nkeric-daily at googlegroups.com>
用Perl写了个分析工具,从hao123网站首页提取出链接,然后向每个链接的服务器发送head请求,分析响应报文头,确定是用哪种服务器.从数据来看,Apache还是web服务器的首选.
程序见附件webanalyse.pl, 以GPL方式发布.
***************
SERVER RAW DATA
***************
888.89178.com : Apache/2.2.0 (Unix) DAV/2 PHP/5.2.1
91.shgao.com : Apache
adfarm.mediaplex.com: Apache-Coyote/1.1
allyesbjafa.allyes.com:
auto.sina.com.cn: Apache/2.0.58 (Unix)
baby.sina.com.cn: Apache/2.0.58 (Unix)
baike.baidu.com: apache 1.0.4.0
blog.sina.com.cn: Nginx/0.5.30
cang.baidu.com : apache 1.0.7.1
china.nba.com : Apache/1.3.34 (Debian) mod_layout/3.2.1
chinahrafaad.allyes.com: Microsoft-IIS/6.0
club.sohu.com : Apache/2.0.55 (Unix) PHP/5.1.6
cn.mail.yahoo.com:
cn.msn.com : Microsoft-IIS/6.0
dict.baidu.com : Apache/2.0.58 (Unix) PHP/4.4.2
download.zol.com.cn: Apache
dzh.mop.com : lighttpd
eladies.sina.com.cn: Apache/2.0.58 (Unix)
file.baidu.com : Apache/2.0.58 (Unix) PHP/4.4.2
finance.sina.com.cn: Apache/2.0.59 (Unix)
flights.ctrip.com: Microsoft-IIS/6.0
fund.eastmoney.com: Microsoft-IIS/6.0
games.sina.com.cn: Apache/2.0.54 (Unix)
geci.baidu.com : apache 1.6.6.0/httpd 1.3.27 (Unix) BAIDU_UENCODE
v1.0.0 mod_gzip/1.3.19.1a mod_image/2.0.1 mod_cache/1.0.0
mod_baidu/4.1.1 mod_baidussa/1.0.0
guba.eastmoney.com: Microsoft-IIS/6.0
hd.www.net.cn : Apache/1.3.26 (Unix) PHP/4.2.2
health.sohu.com: Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
hi.baidu.com : apache 1.1.16.0
hjsm.tom.com : Apache/1.3.34 (Debian) PHP/5.1.4-0.1
image.baidu.com: apache 1.7.7.0/httpd 1.3.27 (Unix) BAIDU_UENCODE
v1.0.0 mod_gzip/1.3.19.1a mod_image/2.0.1 mod_cache/1.0.0
mod_baidu/4.1.1
junshi.xilu.com: Apache/2.0.59 (Unix) PHP/4.3.10
lady.163.com : Apache/2.0.59 (Unix)
lady.qq.com : Apache
lady.tom.com : Apache/2.2.0 (Unix) DAV/2 PHP/5.1.2
login.mail.sohu.com: Apache/1.3.33 (Unix) Resin/2.0.5 PHP/4.4.1
love21cn.msn.com.cn: Apache
ma.baidu.com : Apache/2.0.52 (Red Hat)
mail.163.com : Apache
mail.sina.com.cn: Apache/2.2.4 (FreeBSD) PHP/5.2.1 with Suhosin-Patch
mail.tom.com : Apache/1.3.31 (Unix)
map.baidu.com : apache 1.2.3.2/httpd 1.3.27 (Unix) mod_gis/1.0.0
mod_xslt/1.0.0 mod_mapurl/1.0.0 mod_gzip/1.3.19.1a mod_cache/1.0.0
mod_baidu/4.1.1 mod_ipcheck/1.0.0
mil.news.sina.com.cn: Apache/2.0.58 (Unix)
military.china.com: Apache
mmscode2.5kcn.com:
mobile.pconline.com.cn: Apache/2.2.3 (Unix) PHP/4.4.5
mobile.pcpop.com: Microsoft-IIS/6.0
mobile.zol.com.cn: Apache
mp3.baidu.com : apache 1.6.6.0/httpd 1.3.27 (Unix) BAIDU_UENCODE
v1.0.0 mod_gzip/1.3.19.1a mod_image/2.0.1 mod_cache/1.0.0
mod_baidu/4.1.1 mod_baidussa/1.0.0
my.51job.com : Apache/1.3.37 (Unix)
news.baidu.com : apache2.0.16.0/1.3.27 (Unix) MOD_NEWSREWRITE v1.0.0
mod_ipcheck/1.0.0 mod_gzip/1.3.19.1a mod_cache/1.0.0 mod_baidu/4.1.1
BAIDU_IMAGE v1.0.2
news.phoenixtv.com: Apache/2.2.3 (Unix)
news.sina.com.cn: Apache/2.0.58 (Unix)
news.sohu.com : Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
post.baidu.com : apache 2.7.2.0/httpd 1.3.27 (Unix) mod_forum/1.0.0
mod_gzip/1.3.19.1a mod_baidu/4.1.1
quanshiafa.allyes.com: Server
quote.eastmoney.com: Microsoft-IIS/5.0
qzone.qq.com : Apache
service.12530.com: Apache
spaces.live.com: Microsoft-IIS/6.0
spcode.baidu.com: Apache-Coyote/1.1
sports.cctv.com: Sun-ONE-Web-Server/6.1
sports.sina.com.cn: Apache/2.0.59 (Unix)
sports.sohu.com: Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
sports.tom.com : Apache/2.2.0 (Unix) DAV/2 PHP/5.1.2
tech.sina.com.cn: Apache/2.0.59 (Unix)
tj.28.com : Apache/2.0.54 (Unix) DAV/2 PHP/4.3.6
top.baidu.com : Apache/1.3.29 (Unix) PHP/4.3.4
u.7town.com : Microsoft-IIS/6.0
video.baidu.com: apache 1.0.5.0/httpd 1.3.27 (Unix)
mod_gzip/1.3.19.1a mod_cache/1.0.0 mod_tn/1.0.0 mod_video/1.0.0
mod_ipcheck/1.0.0
weather.tq121.com.cn: Apache/2.0.54 (Unix) PHP/5.0.4
www.126.com : Apache
www.155.com : Apache/2.0.54 (Win32)
www.163.com : Apache/2.0.59 (Unix)
www.17173.com : Apache/2.0.54 (Unix)
www.1860ls.com : Microsoft-IIS/6.0
www.1ting.com : Apache/2.2.3 (Unix) mod_jk/1.2.19
www.21cn.com :
www.3158.cn : Microsoft-IIS/6.0
www.3533.com : Microsoft-IIS/5.0
www.3839.com : Apache
www.39.net : Microsoft-IIS/6.0
www.4399.net : Microsoft-IIS/6.0
www.51.com : Apache
www.51job.com : Apache/1.3.37 (Unix)
www.5460.net : Apache-Coyote/1.1
www.56.com : web server.56
www.6rooms.com : nginx/0.4.9.dev.2
www.96333.com :
www.abchina.com: IBM_HTTP_Server/2.0.47.1 Apache/2.0.47 (Unix)
www.aiting.com : Microsoft-IIS/6.0
www.amazon.cn : Server
www.autohome.com.cn: Microsoft-IIS/6.0
www.babytree.com: Apache
www.baidu.com : BWS/1.0
www.baihe.com : Apache/2.0.59 (Unix)
www.bankcomm.com: IBM_HTTP_SERVER/1.3.28.1 Apache/1.3.28 (Unix)
www.beijing2008.cn: Apache
www.boc.cn : IBM_HTTP_SERVER/1.3.26 Apache/1.3.26 (Unix)
www.bokee.com : Apache/1.3.31 (Unix) mod_gzip/1.3.26.1a
www.caiacai.com: Apache/2.0.59 (Unix) mod_ssl/2.0.59 OpenSSL/0.9.8d PHP/5.2.3
www.ccb.com : Apache/2.0.58 (Unix)
www.cctv.com : Sun-ONE-Web-Server/6.1
www.china.com : Apache
www.chinacars.com: Microsoft-IIS/6.0
www.chinagames.net: Microsoft-IIS/6.0
www.chinamobile.com: Apache
www.chinanews.com.cn: Apache/1.3.36 (Unix)
www.chinaren.com: Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
www.cjol.com : Microsoft-IIS/6.0
www.cmbchina.com:
www.cmfu.com : Microsoft-IIS/6.0
www.cnfol.com : Apache
www.crsky.com : Microsoft-IIS/6.0
www.ctrip.com : Microsoft-IIS/6.0
www.dangdang.com: Microsoft-IIS/6.0
www.dianping.com: Microsoft-IIS/6.0
www.disney.com.cn: Apache
www.donews.com : Microsoft-IIS/6.0
www.eachnet.com: Apache/2.2.0 (Linux/SUSE)
www.eastmoney.com: Microsoft-IIS/6.0
www.f130.net : Microsoft-IIS/6.0
www.fh21.com.cn: Apache/1.3.34 (Unix) mod_gzip/1.3.26.1a PHP/4.3.11
www.flash8.net : Microsoft-IIS/6.0
www.flowercn.com: Microsoft-IIS/6.0
www.game.com.cn: lighttpd/1.4.15
www.ganji.com : Apache/2.0.55 (Unix) PHP/5.0.5
www.google.cn : GWS/2.1
www.gov.cn : Apache
www.gznet.com : Apache/2.0.49 (Unix)
www.hao123.com : Apache/2.2.4 (Unix) PHP/5.1.4
www.hd315.gov.cn: Microsoft-IIS/5.0
www.hongxiu.com: Microsoft-IIS/6.0
www.hotmail.com: Microsoft-IIS/6.0
www.hunantv.com: Apache/2.0.54 (Unix) PHP/4.4.1
www.icbc.com.cn: Microsoft-IIS/5.0
www.imobile.com.cn: Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
www.ip138.com : Microsoft-IIS/6.0
www.jrj.com.cn : Microsoft-IIS/6.0
www.ku6.com : Apache
www.lottery.gov.cn: Microsoft-IIS/6.0
www.love21cn.com: Apache
www.marry5.com : lighttpd[L7cc]/1.5.0
www.mydrivers.com:
www.no5.com.cn : Microsoft-IIS/6.0
www.online.sh.cn: Apache/1.3.26 (Unix) mod_gzip/1.3.19.1a
www.onlinedown.net: Microsoft-IIS/6.0
www.openv.tv : Apache
www.ouou.com : lighttpd/1.4.11
www.pcauto.com.cn: Apache/2.2.3 (Unix) PHP/4.4.5
www.pcgames.com.cn: Apache/2.2.3 (Unix) PHP/4.4.5
www.pconline.com.cn: Apache/2.2.3 (Unix) PHP/4.4.5
www.people.com.cn: Apache/1.3.37 (Unix)
www.phoenixtv.com: Apache/2.2.3 (Unix)
www.qq.com : Apache
www.qq163.com : Microsoft-IIS/6.0
www.qunar.com : Apache/2.2.3 (Unix) mod_jk/1.2.18
www.rayli.com.cn: Apache
www.readnovel.com: Apache/2.2.3 (Debian) PHP/4.4.4-8+etch4
www.reuters.com.cn: Microsoft-IIS/5.0
www.rising.com.cn: Microsoft-IIS/6.0
www.rongshuxia.com: Apache/1.3.37 (Unix)
www.sina.com.cn: Apache/2.0.54 (Unix)
www.skycn.com : Who_knows?
www.sogou.com : Apache/2.0.55 (Unix)
www.sogua.com :
www.sohu.com : Apache/1.3.37 (Unix) mod_gzip/1.3.26.1a
www.sooe.cn : Apache/2.0.59 (Unix) DAV/2 PHP/5.2.1
www.spjoy.com : Microsoft-IIS/5.0
www.stockstar.com: Microsoft-IIS/6.0
www.taobao.com : Apache
www.tianya.cn : Microsoft-IIS/5.0
www.tiexue.net : Microsoft-IIS/6.0
www.tom.com : Apache/1.3.34 (Debian) PHP/5.1.2-1
www.wuhan.net.cn:
www.xcar.com.cn: Apache
www.xiaoyouxi.com: Microsoft-IIS/6.0
www.xinhuanet.com: Apache
www.xxsy.net : Microsoft-IIS/6.0
www.xywy.com : Apache/2.2.3 (Unix) DAV/2 PHP/5.1.6
www.yahoo.cn : Apache
www.yaolan.com :
www.youku.com : Apache
www.younet.com : Apache/1.3.29 (Unix) PHP/4.3.4
www.yymp3.com : Microsoft-IIS/6.0
www.zaobao.com : Apache
www.zhaopin.com: Apache/1.3.37 (Unix)
www.zhcw.com : Apache/2.0.55 (Unix) DAV/2
zhidao.baidu.com: apache 1.0.10.0
***************
SERVER STAT
***************
Apache : 105 55.85%
IIS : 48 25.53%
GWS : 1 0.53%
Others : 34 18.09%
--
/*
*@author: chen xueqin
*@email: robbiecn at gmail.com
*@see: http://robbie.bokee.com
*@see: http://groups.google.com/group/fzlug
*@love: freedom,tux,open source
*/
--
/*
*@author: chen xueqin
*@email: robbiecn at gmail.com
*@see: http://robbie.bokee.com
*@see: http://groups.google.com/group/fzlug
*@love: freedom,tux,open source
*/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: webanalyse.pl
Type: application/x-perl
Size: 6444 bytes
Desc: not available
Url : http://mail.pm.org/pipermail/china-pm/attachments/20070816/065c4064/attachment.bin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: wspie.png
Type: image/png
Size: 3647 bytes
Desc: not available
Url : http://mail.pm.org/pipermail/china-pm/attachments/20070816/065c4064/attachment.png
More information about the China-pm
mailing list