Meta’s Open-Source ImageBind Works Across Six Modalities

15May/23Off

Meta’s Open-Source ImageBind Works Across Six Modalities

Meta Platforms has built and is open-sourcing ImageBind, an artificial intelligence that combines six modalities: audio, visual, text, thermal, movement and depth data. Currently a research project, it suggests a future in which AI models generate multisensory content. “ImageBind equips machines with a holistic understanding that connects objects in a photo with how they will sound, their 3D shape, how warm or cold they are, and how they move,” Meta says. In other words, ImageBind’s approach more closely approximates human thinking by training on the relationship between things rather than ingesting massive datasets so as absorb every possibility. ...

See the full story here: https://www.etcentric.org/metas-open-source-imagebind-works-across-six-modalities/

and here https://arxiv.org/abs/2305.05665

Filed under: Non-3D stories Comments Off

Comments (0) Trackbacks (0) ( subscribe to comments on this post )

Sorry, the comment form is closed at this time.

Trackbacks are disabled.

Telly Offers Free Smart TVs Featuring Ads on Second Screen » « How AI Knows Things No One Told It

Pages

If your company is an ETC member, you can log in and see more news posts at www.etcentric.org

philip lelyveld The world of entertainment technology

Meta’s Open-Source ImageBind Works Across Six Modalities

Pages

More posts