Overview / Usage
Its aim is to generate a natural language description of the video after reading all the frames. Inspired by Adobe Research.
Its aim is to generate a natural language description of the video after reading all the frames. Inspired by Adobe Research.