2
1
I’m all ears (i.redd.it)
submitted 12 minutes ago by [email protected] to c/[email protected]
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/grimdank by /u/Fun-Bullfrog-8542 on 2025-06-18 01:51:54+00:00.

3
1
submitted 13 minutes ago by [email protected] to c/[email protected]
The original was posted on /r/grimdank by /u/Hon1c on 2025-06-18 01:04:41+00:00.

4
1
submitted 13 minutes ago by [email protected] to c/[email protected]
The original was posted on /r/grimdank by /u/CT-4426 on 2025-06-18 00:21:48+00:00.

5
1
submitted 13 minutes ago by [email protected] to c/[email protected]
The original was posted on /r/grimdank by /u/dwaynetheaaakjohnson on 2025-06-18 00:11:39+00:00.

6
1
Well...... (i.redd.it)
submitted 13 minutes ago by [email protected] to c/[email protected]
The original was posted on /r/grimdank by /u/Mad_lens_9297 on 2025-06-17 23:50:42+00:00.

8
1
submitted 2 hours ago by [email protected] to c/[email protected]
The original was posted on /r/InternetIsBeautiful by /u/NoCommunication7 on 2025-06-17 21:44:19+00:00.

9
1
submitted 2 hours ago by [email protected] to c/[email protected]
The original was posted on /r/InternetIsBeautiful by /u/Alice_Sullivan on 2025-06-15 09:02:56+00:00.


quaint place, pleasant for a browse.

something about scrolling through respective user contributions is really fun….

11
1
submitted 1 hour ago by [email protected] to c/[email protected]
The original was posted on /r/stablediffusion by /u/Dune_Spiced on 2025-06-17 23:28:29+00:00.


ComfyUI Guide for local use

https://docs.comfy.org/tutorials/image/cosmos/cosmos-predict2-t2i

This model just dropped out of the blue and I have been performing a few tests:

1) SPEED TEST on an RTX 3090 @ 1MP (unless indicated otherwise)

FLUX.1-Dev FP16 = 1.45sec / it

Cosmos Predict2 2B = 1.2sec / it. @ 1MP & 1.5MP

Cosmos Predict2 2B = 1.8sec / it. @ 2MP

HiDream Full FP16 = 4.5sec / it.

Cosmos Predict2 14B = 4.9sec / it.

Cosmos Predict2 14B = 7.7sec / it. @ 1.5MP

Cosmos Predict2 14B = 10.65sec / it. @ 2MP

The thing to note here is that the 2B model can produce images at an impressive speed even @ 2MP (1.8sec / it), while the 14B one becomes atrociously slow (10.65sec / it).
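To put the numbers above in perspective, here is a small sketch that computes how many times slower the 14B model is than the 2B model at each resolution. The timings are copied from the list above; the 30-step count in the last line is only an assumed example, since the post does not say how many steps were used.

```python
# Seconds per iteration as reported in the speed test above (RTX 3090).
SEC_PER_IT = {
    "1MP": {"2B": 1.2, "14B": 4.9},
    "1.5MP": {"2B": 1.2, "14B": 7.7},
    "2MP": {"2B": 1.8, "14B": 10.65},
}


def slowdown(resolution: str) -> float:
    """Return how many times slower the 14B model is than the 2B model."""
    timings = SEC_PER_IT[resolution]
    return timings["14B"] / timings["2B"]


def total_seconds(sec_per_it: float, steps: int) -> float:
    """Estimate sampling time for one image, ignoring model load and VAE decode."""
    return sec_per_it * steps


if __name__ == "__main__":
    for res in SEC_PER_IT:
        print(f"{res}: 14B is {slowdown(res):.1f}x slower than 2B")
    # ASSUMPTION: 30 sampling steps, chosen only for illustration.
    print(f"2B @ 2MP, 30 steps: ~{total_seconds(1.8, 30):.0f} s")
```

The ratio grows with resolution, which matches the observation that the 14B model scales much worse than the 2B one.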

Prompt: A Photograph of a russian woman with natural blue eyes and blonde hair is walking on the beach at dusk while wearing a red bikini. She is making the peace sign with one hand and winking

2B Model

14B Model

2) PROMPT TEST:

Prompt: An ethereal elven woman stands poised in a vibrant springtime valley, draped in an ornate, skimpy armor adorned with one magical gemstone embedded in its chest. A regal cloak flows behind her, lined with pristine white fur at the neck, adding to her striking presence. She wields a mystical spear pulsating with arcane energy, its luminous aura casting shifting colors across the landscape. Western Anime Style

2B Model

Prompt: A muscled Orc stands poised in a springtime valley, draped in an ornate, leather armor adorned with a small animal skulls. A regal black cloak flows behind him, lined with matted brown fur at the neck, adding to his menacing presence. He wields a rustic large Axe with both hands

2B Model

14B Model

Prompt: A massive spaceship glides silently through the void, approaching the curvature of a distant planet. Its sleek metallic hull reflects the light of a distant star as it prepares for orbital entry. The ship’s thrusters emit a faint, glowing trail, creating a mesmerizing contrast against the deep, inky blackness of space. Wisps of atmospheric haze swirl around its edges as it crosses into the planet’s gravitational pull, the moment captured in a cinematic, hyper-realistic style, emphasizing the grand scale and futuristic elegance of the vessel.

2B Model

Prompt: Under the soft pink canopy of a blooming Sakura tree, a man and a woman stand together, immersed in an intimate exchange. The gentle breeze stirs the delicate petals, causing a flurry of blossoms to drift around them like falling snow. The man, dressed in elegant yet casual attire, gazes at the woman with a warm, knowing smile, while she responds with a shy, delighted laugh, her long hair catching the light. Their interaction is subtle yet deeply expressive—an unspoken understanding conveyed through fleeting touches and lingering glances. The setting is painted in a dreamy, semi-realistic style, emphasizing the poetic beauty of the moment, where nature and emotion intertwine in perfect harmony.

2B Model

PERSONAL CONCLUSIONS FROM THE (PRELIMINARY) TEST:

Cosmos-Predict2-2B-Text2Image: a bit weak in understanding styles (maybe it was not trained on them?), but relatively fast even at 2MP and with good prompt adherence (I'll have to test more).

Cosmos-Predict2-14B-Text2Image doesn't seem to be "better" at first glance than its 2B "mini-me", and it is HiDream sloooow.

Also, it has a text-to-video brother, but I am not testing it here yet.

The MEME:

Just don't prompt a woman laying on the grass!

Prompt: Photograph of a woman laying on the grass and eating a banana

https://preview.redd.it/9qipubalok7f1.jpg?width=1088&format=pjpg&auto=webp&s=3b7502d820964911e1ec807713ef3014d3d0a417

12
1
submitted 1 hour ago by [email protected] to c/[email protected]
The original was posted on /r/stablediffusion by /u/omni_shaNker on 2025-06-17 21:52:45+00:00.


After seeing this community post here:

https://www.reddit.com/r/StableDiffusion/comments/1ldn88o/chatterbox_audiobook_and_podcast_studio_all_local/

And this other community post:

https://www.reddit.com/r/StableDiffusion/comments/1ldu8sf/video_guide_how_to_sync_chatterbox_tts_with/

Here is my latest updated fork of Chatterbox-TTS.

NEW FEATURES:

It remembers your last settings and they will be reloaded when you restart the script.

Saves a JSON file for each audio generation that contains all your configuration data, including the seed. To reuse the same settings for other generations, load that JSON file into the JSON upload/drag-and-drop box and all the settings it contains will be applied automatically.

You can now select an alternate whisper sync validation model (faster-whisper) for faster validation and to use less VRAM. For example with the largest models: large (~10–13 GB OpenAI / ~4.5–6.5 GB faster-whisper)

Added the VOICE CONVERSION feature that some had asked for, which is already included in the original repo. You record yourself saying whatever, then take another voice and convert your voice to theirs, saying the same thing in the same way: same intonation, timing, etc.

| Category | Features |
|---|---|
| Input | Text, multi-file upload, reference audio, load/save settings |
| Output | WAV/MP3/FLAC, per-gen .json/.csv settings, downloadable & previewable in UI |
| Generation | Multi-gen, multi-candidate, random/fixed seed, voice conditioning |
| Batching | Sentence batching, smart merge, parallel chunk processing, split by punctuation/length |
| Text Preproc | Lowercase, spacing normalization, dot-letter fix, inline ref number removal, sound word edit |
| Audio Postproc | Auto-editor silence trim, threshold/margin, keep original, normalization (ebu/peak) |
| Whisper Sync | Model selection, faster-whisper, bypass, per-chunk validation, retry logic |
| Voice Conversion | Input+target voice, watermark disabled, chunked processing, crossfade, WAV output |
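For anyone curious how a per-generation settings file like the one described above could work, here is a rough sketch. It is not the fork's actual code: the function names, the file name, and every setting key except the seed are hypothetical placeholders.

```python
import json
from pathlib import Path


def save_settings(settings: dict, path: Path) -> None:
    """Write one generation's settings (including the seed) to a JSON file."""
    path.write_text(json.dumps(settings, indent=2), encoding="utf-8")


def load_settings(path: Path, defaults: dict) -> dict:
    """Load a saved settings file; values from the file override the defaults."""
    loaded = json.loads(path.read_text(encoding="utf-8"))
    return {**defaults, **loaded}


if __name__ == "__main__":
    # Hypothetical keys, for illustration only.
    defaults = {"seed": 0, "output_format": "wav"}
    path = Path("generation_0001.json")
    save_settings({"seed": 1234, "output_format": "flac"}, path)
    print(load_settings(path, defaults))
```

Merging the loaded values over a defaults dictionary means an older settings file that lacks a newer option still loads cleanly.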

13
1
submitted 1 hour ago by [email protected] to c/[email protected]
The original was posted on /r/stablediffusion by /u/hippynox on 2025-06-17 19:14:01+00:00.

submitted 5 minutes ago by [email protected] to c/[email protected]

Last year the Netherlands spent 1,105 euros per inhabitant on defence. That puts the Netherlands in 7th place among all 32 NATO countries, according to calculations by Statistics Netherlands (Centraal Bureau voor de Statistiek).

In first place is the United States with 2,641 euros per person. At the bottom is Albania with 173 euros. [...]

20
1
submitted 6 minutes ago by [email protected] to c/[email protected]

Within NATO, only Americans, Britons, Scandinavians and Finns spend more on defence than the Dutch. Per capita, the Netherlands spent more than 1,100 euros last year, Statistics Netherlands (Centraal Bureau voor de Statistiek) has calculated.

21
1
submitted 5 minutes ago by [email protected] to c/[email protected]

Blokker will open forty branches in the Netherlands in the coming period to rebuild the retail chain, which was declared bankrupt last year. The first twelve new stores open on Wednesday. The rest will follow in the coming weeks, the retail chain reports.

25
2
submitted 25 minutes ago by [email protected] to c/[email protected]
The original was posted on /r/thedeprogram by /u/119ak on 2025-06-18 01:31:14+00:00.

The Lemmy Club

1,346 readers
25 users here now

Welcome to The Lemmy Club!

Instance Rules:

  1. Don't be a dick.
  2. Do NOT make me add new rules.
  3. Racism/slurs/etc use will not be tolerated.
  4. No spamming.
  5. Don't harass other users (See rule 1)
  6. NSFW content must be marked correctly.
  7. All content must comply with US law
  8. Loli/etc. will not be tolerated. Suggestive or sexual art must be reasonably recognizable as adult subjects.
  9. Users or communities that, in the view of the admin team, jeopardize the good standing of The Lemmy Club with other instances may be removed.
  10. These rules apply to all content and users that appear on The Lemmy Club. Moderation is on an as noticed/as reported basis. If you see rule breaking content, I likely have just not seen it yet. Please report it.
  11. Instances/users/communities that tolerate, repeatedly fail to enforce, or allow content that breaks any of these rules may be banned from The Lemmy Club.
  12. The site admin team (well, just @bdonvr really as of now) has final say in interpretations of all rules.

Help contribute towards our operating costs to keep us going and growing: https://opencollective.com/thelemmyclub/

We host MLMYM (a clone of old.reddit) at https://old.thelemmy.club/

We host Voyager (a mobile optimized webapp) at https://app.thelemmy.club/

founded 2 years ago