1487
you are viewing a single comment's thread
view the rest of the comments
[-] Barbarian@sh.itjust.works -3 points 2 years ago* (last edited 2 years ago)

Could you give me an example that uses live feeds of video data, or feeds the output to another system? As far as I'm aware (I could be very wrong! Not an expert), the only things that come close to that are things like OCR systems and character recognition. Describing in machine-readable actionable terms what's happening in an image isn't a thing, as far as I know.

[-] GBU_28@lemm.ee 8 points 2 years ago* (last edited 2 years ago)

No live video no, that didn't seem the topic

But if you had the horsepower, I don't think it's impossible based on what I've worked with. It's just about snipping and distributing the images, from a bottleneck standpoint

[-] Barbarian@sh.itjust.works -2 points 2 years ago* (last edited 2 years ago)

No live videos

Well, that'd be a prerequisite to a transformer model making decisions for a ship scuttling robot, hence why I brought it up.

[-] FooBarrington@lemmy.world 3 points 2 years ago

Describing in machine-readable actionable terms what's happening in an image isn't a thing, as far as I know.

It is. That's actually the basis of multimodal transformers - they have a shared embedding space for multiple modes of data (e.g. text and images). If you encode data and take those embeddings, you suddenly have a vector describing the contents of your input.

this post was submitted on 26 Feb 2024
1487 points (94.9% liked)

Microblog Memes

10898 readers
2249 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

RULES:

  1. Your post must be a screen capture of a microblog-type post that includes the UI of the site it came from, preferably also including the avatar and username of the original poster. Including relevant comments made to the original post is encouraged.
  2. Your post, included comments, or your title/comment should include some kind of commentary or remark on the subject of the screen capture. Your title must include at least one word relevant to your post.
  3. You are encouraged to provide a link back to the source of your screen capture in the body of your post.
  4. Current politics and news are allowed, but discouraged. There MUST be some kind of human commentary/reaction included (either by the original poster or you). Just news articles or headlines will be deleted.
  5. Doctored posts/images and AI are allowed, but discouraged. You MUST indicate this in your post (even if you didn't originally know). If an image is found to be fabricated or edited in any way and it is not properly labeled, it will be deleted.
  6. Absolutely no NSFL content.
  7. Be nice. Don't take anything personally. Take political debates to the appropriate communities. Take personal disagreements & arguments to private messages.
  8. No advertising, brand promotion, or guerrilla marketing.

RELATED COMMUNITIES:

founded 2 years ago
MODERATORS