Flashlight is much lower level and gives more fine-grained performance control. ...

dwrodri · on April 16, 2021

This is a very interesting claim. I find it credible because it stands to reason that projects like DeepSpeed[1] and TorchScript[2] wouldn't need to exist if inference performance of research PyTorch models was satisfactory for production, but often case it isn't.

It appears as though Flashlight is built on ArrayFire. I haven't seen how gradients are managed in arrayfire-ml, but perhaps it is the case that the autograd implementation in PyTorch was a bottleneck and this is a ground up approach.

Editing as I didn't address your second point. I can neither confirm or deny the motivations for creating Torch being related to FB's desire to depend on an Alphabet-managed codebase. I know there are lots of reasons why programmers prefer the UX of the PyTorch Python API (I do as well), and there are probably other reasons I can't recall off the top of my head. I am only saying that PyTorch is contributing to the ML ecosystem already by the sole virtue that it isn't a Google product.

[1] = https://github.com/microsoft/DeepSpeed [2] = https://pytorch.org/docs/stable/jit.html

ipsum2 · on April 16, 2021

PyTorch is based off of Torch, which was first released in 2002, predating TensorFlow by 13 years.

https://en.wikipedia.org/wiki/Torch_(machine_learning)

liuliu · on April 16, 2021

Torch (Lua) predates TensorFlow and is Lecun's pet project for a few years at that point already. But Lua as a language is unpopular at a time. PyTorch would be a welcome addition then. But even if no PyTorch (nor Caffe2) in an alternative timeline, I would imagine FB would be stuck with Lua Torch for quite some time.

tastyminerals2 · on April 17, 2021

Don’t forget that Lua Torch backend is C while Pytorch is C++ mostly.

sp-matcha · on April 17, 2021

Adding to the above - Tensor Comprehensions was path-finding research and is no longer maintained. The git repo is frozen (archived) as a research artifact.

Halide is still quite active, and was used in products at Adobe and Google circa 2016-2017. Not sure about the current state of industry usage though.

whimsicalism · on April 17, 2021

I'm unfamiliar with the word "path-finding" in this context. From the usage, is it similar to groundbreaking?

mgraczyk · on April 17, 2021

Halide is still used by Google in a few different places.

lunixbochs · on April 18, 2021

I ship an app (Talon) which runs many kinds of wav2letter ASR models on consumer CPUs. It has real-time inference pipelines for both pytorch and flashlight. (I wrote the pytorch code). Performance of both inference pipelines is fine and comparable between the frameworks for me. I'm not sure what you're talking about wrt speech recognition performance.

Maybe for training? But there aren't that many cpu bound components there, and you can write those in native code.

whimsicalism · on April 18, 2021

Fair enough, I've followed your stuff a bit and you definitely know more about Flashlight than I do.

What would you say the motivation is for yet another ML framework, but in C++ this time?