PaddlePaddle/Serving v0.8.3

Release v0.8.3

Repository: PaddlePaddle/Serving

Tag: v0.8.3

Published: 2022-03-04T07:52:08Z

Prerelease: no

Release notes:

新特性

  • 增加C++ Serving 和 Pipeline Serving编译环境检查 #1584
  • C++ Serving 支持修改log日志生成路径 #1592
  • 使用TRT时,新增动态shape配置功能和示例 #1590
  • 新增Python Pipeline Serving 普罗米修斯监控 #1586
  • 新增C++ Serving 普罗米修斯监控 #1568 #1576 #1577
  • 支持异构硬件,包括:x86+DCU、ARM+ascend310、ARM+ascend910 #1544
  • 支持Python39

性能优化

  • C++ Serving增加请求结果缓存功能,相同的请求直接返回 #1585, #1588

功能增强

  • 更便捷的C++串联多模型方式 #1546
  • dockerfile升级,新增centos dockerfile #1618 #1594
  • 新增Pipeline Serving bf16低精度支持 #1594 #1554

文档和示例变更

  • 新增pp-shitu示例 #1572
  • 新增PaddleNLP示例 #1609
  • 新增环境检查文档 #1643
  • 新增动态TRT使用文档 #1643
  • 新增异构硬件使用文档 #1641,#1654
  • 新增请求缓存Cache使用说明文档 #1641, #1588

Bug修复

  • 修复异步框架下内存泄露问题 #1589
  • 修复Pipeline Serving中输入为list[str]的情况 #1598

For English:

New features

  • Add compilation environment checks for C++ Serving and Pipeline Serving #1584
  • C++ Serving supports changing the log output path #1592
  • Add dynamic shape configuration and examples for use with TensorRT (TRT) #1590
  • Add Prometheus monitoring for Python Pipeline Serving #1586
  • Add Prometheus monitoring for C++ Serving #1568 #1576 #1577
  • Support heterogeneous hardware, including x86 + DCU, ARM + Ascend 310, and ARM + Ascend 910 #1544
  • Support Python 3.9
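
For readers unfamiliar with Prometheus monitoring, the sketch below shows the general shape of the text exposition format that a Prometheus server scrapes. It is only an illustration: the function `render_counter` and the metric name `serving_requests_total` are hypothetical and are not taken from Paddle Serving, whose actual metric names and endpoints are described in the linked pull requests.

```python
def render_counter(name, help_text, samples):
    """Render one counter metric in the Prometheus text exposition format.

    `samples` is a list of (labels_dict, value) pairs. All names used here
    are hypothetical and chosen only for illustration.
    """
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} counter"]
    for labels, value in samples:
        # Sort labels so the output is deterministic.
        label_str = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        lines.append(f"{name}{{{label_str}}} {value}")
    return "\n".join(lines) + "\n"


# Hypothetical metric: the name and labels are assumptions, not Paddle Serving's.
text = render_counter(
    "serving_requests_total",
    "Total requests handled (hypothetical metric).",
    [({"op": "infer", "status": "ok"}, 42)],
)
print(text)
```

A monitoring endpoint would typically serve text like this over HTTP so that Prometheus can poll it at a fixed interval.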

Performance optimization

  • C++ Serving adds request result caching; an identical request is answered directly from the cache #1585, #1588
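
The idea behind request result caching can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the C++ Serving implementation: the class `RequestCache`, its LRU eviction policy, and the hashing of a JSON-serialized payload are all hypothetical choices made for this example.

```python
import hashlib
import json
from collections import OrderedDict


class RequestCache:
    """Hypothetical LRU cache keyed by a hash of the request payload."""

    def __init__(self, capacity=128):
        self.capacity = capacity
        self._store = OrderedDict()
        self.hits = 0
        self.misses = 0

    @staticmethod
    def key_for(request):
        # Serialize with sorted keys so equal payloads hash identically.
        payload = json.dumps(request, sort_keys=True).encode("utf-8")
        return hashlib.sha256(payload).hexdigest()

    def get_or_compute(self, request, infer):
        key = self.key_for(request)
        if key in self._store:
            # Cache hit: skip inference and return the stored result.
            self.hits += 1
            self._store.move_to_end(key)
            return self._store[key]
        self.misses += 1
        result = infer(request)
        self._store[key] = result
        if len(self._store) > self.capacity:
            # Evict the least recently used entry.
            self._store.popitem(last=False)
        return result


def fake_infer(request):
    # Stand-in for a real model call (assumption for this sketch).
    return {"sum": sum(request["x"])}


cache = RequestCache(capacity=2)
first = cache.get_or_compute({"x": [1, 2, 3]}, fake_infer)
second = cache.get_or_compute({"x": [1, 2, 3]}, fake_infer)
```

In this sketch the second call never reaches `fake_infer`; a cache of this kind only helps when the model is deterministic and identical requests recur.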

Enhancements

  • A more convenient way to chain multiple models in C++ Serving #1546
  • Upgrade the Dockerfiles and add a CentOS Dockerfile #1618 #1594
  • Add bf16 low-precision support for Pipeline Serving #1594 #1554

Documentation and sample changes

  • Add PP-ShiTu example #1572
  • Add PaddleNLP example #1609
  • Add environment check documentation #1643
  • Add dynamic TRT usage documentation #1643
  • Add heterogeneous hardware usage documentation #1641, #1654
  • Add request cache usage documentation #1641, #1588

Bug fixes

  • Fix a memory leak in the asynchronous framework #1589
  • Fix handling of list[str] input in Pipeline Serving #1598