ReleaseBaidu (ERNIE)Baidu (ERNIE)published May 19, 2021seen 5d

PaddlePaddle/Serving v0.6.0

PaddlePaddle/Serving

Open original ↗

Captured source

source ↗
published May 19, 2021seen 5dcaptured 9hhttp 200method plain

Release v0.6.0

Repository: PaddlePaddle/Serving

Tag: v0.6.0

Published: 2021-05-19T09:56:45Z

Prerelease: no

Release notes:

Paddle Serving v0.6.0 Release note:

  • 新特性:
  • 集成Paddle 2.1 inference, #1221
  • 支持fp16和int8的低精度推理, #1130, #1236
  • 通过Kubernetes部署Serving服务, #1139, #1184, #1193
  • 新增安全网关与Serving协同部署, #1235
  • 支持X86 + XPU环境部署Serving, #1080
  • 功能增强:
  • Python合并paddle_serving_server和paddle_serving_server_gpu成统一服务, #1082
  • Pipeline增加mini-batch推理, #1186
  • Pipeline支持日志切割, #1238
  • Pipeline优化数据传入eval处理,增加channel的跟踪日志, #1209
  • C++ Serving重构预测库调用方法,#1080
  • C++ Serving支持多模型线性组合,#1124
  • C++ Serving资源管理与优化, #1143
  • C++ Serving接口增加String类型输入, #1124
  • C++ Serving优化数据组装方法,使用memcpy替换循环拷贝, #1124
  • C++ Serving编译选型增加GDB开关, #1124
  • 增加Benchmark脚本,更新GPU benchmark数据, #1197, #1175
  • 文档升级:
  • 新增 doc/PADDLE_SERVING_ON_KUBERNETES.md
  • 新增 doc/LOD.md
  • 新增 doc/LOD_CN.md
  • 新增 doc/PROCESS_DATA.md
  • 修改 doc/PIPELINE_SERVING.md
  • 修改 doc/PIPELINE_SERVING_CN.md
  • 修改 doc/CREATING.md
  • 修改 doc/SAVE.md
  • 修改 doc/SAVE_CN.md
  • 修改 doc/TENSOR_RT.md
  • 修改 doc/TENSOR_RT_CN.md
  • 修改 doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
  • 修改 doc/ENCRYPTION.md
  • 修改 doc/ENCRYPTION_CN.md
  • 修改 doc/DESIGN_DOC.md
  • 修改 doc/DESIGN_DOC_CN.md
  • 修改 doc/DOCKER_IMAGES.md
  • 修改 doc/DOCKER_IMAGES_CN.md
  • 修改 doc/LATEST_PACKAGES.md
  • 修改 doc/COMPILE.md
  • 修改 doc/COMPILE_CN.md
  • 修改 doc/BERT_10_MINS.md
  • 修改 doc/BERT_10_MINS_CN.md
  • 修改 doc/BAIDU_KUNLUN_XPU_SERVING.md
  • 修改 doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
  • 修改 README.md
  • 修改 README_CN.md
  • Demo升级:
  • 新增 python/python/examples/low_precision/resnet50
  • 新增 python/examples/xpu/bert
  • 新增python/examples/xpu/ernie
  • 新增 python/examples/xpu/vgg19
  • 新增 python/examples/pipeline/PaddleDetection/faster_rcnn
  • 新增 python/examples/pipeline/PaddleDetection/ppyolo_mbv3
  • 新增 python/examples/pipeline/PaddleDetection/yolov3
  • 新增 python/examples/pipeline/PaddleClas/DarkNet53
  • 新增 python/examples/pipeline/PaddleClas/HRNet_W18_C
  • 新增 python/examples/pipeline/PaddleClas/MobileNetV1
  • 新增 python/examples/pipeline/PaddleClas/MobileNetV2
  • 新增 python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
  • 新增 python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
  • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd
  • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
  • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_KL
  • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
  • 新增 python/examples/pipeline/PaddleClas/ResNet_V2_50
  • 新增 python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
  • 新增 python/examples/pipeline/bert
  • 新增 python/examples/ocr/ocr_cpp_client.py
  • 修改 python/examples/bert [benchmark]
  • 修改 python/examples/pipeline/ocr[benchmark]
  • docker升级:
  • 新增docker运行镜像(CPU, cuda10.1, cuda10.2, cuda11) (Py36, Py37, Py38)
  • 新增Cuda 11环境的开发docker镜像
  • 新增Kubernetes Demo镜像
  • Bug修复:
  • 修复不规范代码命名,统一infer. h文件和paddle_engine. h中模型参数的命名规范. #1136
  • 修复C++部分框架被绕过的错误. #1124
  • 修复py35下Json.load函数异常的错误.#1124
  • 修复ssd_vgg16_300_240e_voc示例中feed_var缺少参数'im_shape'导致的预测结果异常的错误.#1180
  • 修复多个GRPC因模型路径变更导致的错误.#1147
  • 修复C++log日志打印异常的错误. #1154
  • 修复WebService漏传Thread参数的错误. #1136
  • 修复golang引入的编译错误. #1101
  • 修复Java gRPC模型下的错误. #1215

For English

  • New Features:
  • Integrated Paddle 2.1 Inference, #1221
  • Support low-precision inference of fp16 and int8, #1130, #1236
  • Deploy Serving service through Kubernetes, #1139, #1184, #1193
  • New Security gateway, #1235
  • Serving deployment in X86 + XPU environment, #1080
  • Feature Improvements:
  • Merge paddle_serving_server and paddle_serving_server_gpu into a unified paddle_serving_server, #1082
  • Pipeline supports Mini-batch inference, #1186
  • Pipeline supports log file rotating, #1238
  • Pipeline optimizes data transfer to eval for processing, and increases channel tracking logs, #1209
  • C++ Serving reconstruction prediction engine call method, #1080
  • C++ Serving supports linear combination of multiple models, #1124
  • C++ Serving interface adds direct input of String type, #1124
  • C++ Serving resource management and optimization, #1143
  • C++ Serving performance optimization, changing for loop copy to function memcpy, #1124
  • C++ Serving add GDB compilation options, #1124
  • Add Benchmark script and update GPU benchmark data, #1197, #1175
  • Document Updates:
  • Add doc/PADDLE_SERVING_ON_KUBERNETES.md
  • Add doc/LOD.md
  • Add doc/LOD_CN.md
  • Add doc/PROCESS_DATA.md
  • Modify doc/PIPELINE_SERVING.md
  • Modify doc/PIPELINE_SERVING_CN.md
  • Modify doc/CREATING.md
  • Modify doc/SAVE.md
  • Modify doc/SAVE_CN.md
  • Modify doc/TENSOR_RT.md
  • Modify doc/TENSOR_RT_CN.md
  • Modify doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
  • Modify doc/ENCRYPTION.md
  • Modify doc/ENCRYPTION_CN.md
  • Modify doc/DESIGN_DOC.md
  • Modify doc/DESIGN_DOC_CN.md
  • Modify doc/DOCKER_IMAGES.md
  • Modify doc/DOCKER_IMAGES_CN.md
  • Modify doc/LATEST_PACKAGES.md
  • Modify doc/COMPILE.md
  • Modify doc/COMPILE_CN.md
  • Modify doc/BERT_10_MINS.md
  • Modify doc/BERT_10_MINS_CN.md
  • Modify doc/BAIDU_KUNLUN_XPU_SERVING.md
  • Modify doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
  • Modify README.md
  • Modify README_CN.md
  • Demo Updates:
  • Add python/python/examples/low_precision/resnet50
  • Add python/examples/xpu/bert
  • Add python/examples/xpu/ernie
  • Add python/examples/xpu/vgg19
  • Add python/examples/pipeline/PaddleDetection/faster_rcnn
  • Add python/examples/pipeline/PaddleDetection/ppyolo_mbv3
  • Add python/examples/pipeline/PaddleDetection/yolov3
  • Add python/examples/pipeline/PaddleClas/DarkNet53
  • Add python/examples/pipeline/PaddleClas/HRNet_W18_C
  • Add python/examples/pipeline/PaddleClas/MobileNetV1
  • Add python/examples/pipeline/PaddleClas/MobileNetV2
  • Add python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
  • Add python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
  • Add python/examples/pipeline/PaddleClas/ResNet50_vd
  • Add python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
  • Add python/examples/pipeline/PaddleClas/ResNet50_vd_KL
  • Add python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
  • Add python/examples/pipeline/PaddleClas/ResNet_V2_50
  • Add python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
  • Add python/examples/pipeline/bert
  • Add python/examples/ocr/ocr_cpp_client.py
  • Modify python/examples/bert [benchmark]
  • Modify python/examples/pipeline/ocr[benchmark]
  • Docker Updates:
  • Add runtime dockers (CPU, CUDA10.1, CUDA10.2, CUDA11) (Py36, Py37, Py38)
  • Add CUDA 11 develop level docker images
  • Add kubernetes demo images
  • Bug Fixes:
  • Fixed the problem of irregular naming, #1136
  • Fixed the problem that part of C + + multithreading and framework were bypassed due to the adaptation of…

Excerpt shown — open the source for the full document.