谷歌爬虫主要是用C++开发吗从谷歌创始人

从谷歌创始人的论文《The Anatomy of a Large-Scale Hypertextual Web Search Engine》里可以看到，谷歌一开始的爬虫是用Python写的。http://infolab.stanford.edu/pub/papers/google.pdf4.3 Crawling the WebIn order to scale to hundreds of millions of web pages, Google has a fast distributed crawling system. A single URLserver serves lists of URLs to a number of crawlers (we typically ran about 3). Both the URLserver and the crawlers are implemented in Python. Each crawler keeps roughly 300 connections open at once. This is necessary to retrieve web pages at a fast enough pace. At peak speeds, the system can crawl over 100 web pages per second using four crawlers. This amounts to roughly 600K per second of data. A major performance stress is DNS lookup. Each crawler maintains a its own DNS cache so it does not need to do a DNS lookup before crawling each document. Each of the hundreds of connections can be in a number of different states: looking up DNS, connecting to host, sending request, and receiving response. These factors make the crawler a complex component of the system. It uses asynchronous IO to manage events, and a number of queues to move page fetches from state to state.据说现在改成了C++，但是我没有找到明确的材料。
■网友
目前是的Why did Google move from Python to C++ for use in its crawler?

谷歌爬虫主要是用C++开发吗

推荐阅读

罂粟苗可以像食用普通青菜那样食用么

京东数科招股书背后：“to B”基因明显

DNF|DNF：2天翻2张金牌！希洛克金牌率提升，欧皇非酋差距更大了

产业气象站■支付宝季度三连涨继续领跑整个行业，外媒：中国移动支付全球领先

如何使用腹肌板呢

小鹏汽车怎么样？解锁小鹏G3新颜色享受多彩时光

亚克力浴缸价格走势

试驾车在哪里买可以便宜多少 4s店的试驾车能买吗

专家|悬崖上发现多座楚王墓，崖顶上又发现多间寺庙，专家：两者关系可大了

美国法律专家怎么看TikTok起诉美政府？特朗普政府选择了审核程序政治化

苏东坡|古有苏东坡，今有郭沫若：郭沫若为什么自称比苏轼牛，他配吗？

|与吴君如弟弟离婚6年！江美仪与前婆家人聚会，相处融洽不尴尬

心如温暖之夏|往往喜欢做3件事，福气越来越深厚，一个有福气的人

3DMGAMETB《糖豆人：终极淘汰赛》宣传片演示60人撕逼大战

聚成教育|Excel表格技巧—如何根据单元格大小自动调整文字大小

科技部：科技部：老药磷酸氯喹治疗新冠肺炎有疗效

大便的成分是啥

苹果|“iPhone 13”遭国内厂商提前发售：小刘海、侧边指纹只卖599！

沃尔夫斯堡足球俱乐部|比如进她个100球？，下赛季定个小目标

影院|黑龙江发布公告：非必要不离哈！感染者曾连续三天玩剧本杀