Blazing Fast! ByteDance Open-Sources LightSeq, a Sequence Inference Engine (Part 5)


Links:
GitHub repository:
https://github.com/bytedance/lightseq
[1] Vaswani, Ashish, et al. "Attention is all you need." Advances in Neural Information Processing Systems. 2017.
[2] Devlin, Jacob, et al. "BERT: Pre-training of deep bidirectional transformers for language understanding." arXiv preprint arXiv:1810.04805 (2018).
[3] Brown, Tom B., et al. "Language models are few-shot learners." arXiv preprint arXiv:2005.14165 (2020).
[4] WMT2020, http://www.statmt.org/wmt20/
[5] Li, Jiwei, Will Monroe, and Dan Jurafsky. "A simple, fast diverse decoding algorithm for neural generation." arXiv preprint arXiv:1611.08562 (2016).
[6] TurboTransformers, https://github.com/Tencent/TurboTransformers
[7] FasterTransformer, https://github.com/NVIDIA/DeepLearningExamples/tree/master/FasterTransformer
[8] NVIDIA Triton Inference Server, https://github.com/triton-inference-server/server
[9] LightSeq proto, https://github.com/bytedance/lightseq/tree/master/proto
[10] LightSeq performance report, https://github.com/bytedance/lightseq/blob/master/docs/performance.md
[11] LightSeq Layer Normalization, https://github.com/bytedance/lightseq/blob/master/kernels/transformerKernels.cu.cc#L269
[12] cuBLAS, https://docs.nvidia.com/cuda/cublas/index.html
[13] GPT-2: Radford, Alec, et al. "Language models are unsupervised multitask learners." OpenAI (2019).