Auto-Vectorizing TensorFlow Graphs: Jacobians, Auto-Batching And Beyond

Fei Wang, Daniel Zheng, James Decker, Xilun Wu, Grégory M. Essertel, Tiark Rompf

2019

1 reference

Abstract

We propose a static loop vectorization optimization on top of high level dataflow IR used by frameworks like TensorFlow. A new statically vectorized parallel-for abstraction is provided on top of TensorFlow, and used for applications ranging from auto-batching and per-example gradients, to jacobian computation, optimized map functions and input pipeline optimization. We report huge speedups compared to both loop based implementations, as well as run-time batching adopted by the DyNet framework.

View Paper PDF DOI

🤖 Machine Learning

1 repository

1 reference

Code References

▶ tensorflow/tensorflow

1 file

▶ tensorflow/python/ops/parallel_for/control_flow_ops.py

L459

Auto-Batching and Beyond](https://arxiv.org/pdf/1903.04243.pdf)). The idea

Link copied to clipboard!