Qwen3-4B-Thinking-2507 text encoder

    qwen34BThinking2507_v10.safetensors Checkpoint / Qwen

    Model Information
    Model Name
    Qwen3-4B-Thinking-2507 text encoder
    Version
    v1.0
    Creator
    Capitan01R
    Size
    7.49 GB
    Downloads
    166
    Torrent Details
    BTIH
    6508ABDA9012F68EE655667107A2EFD60B631CE7
    BTMH
    F068F2093EA41D97E5CD954C508995834F66251CB1E769A99F8CB3F6FF157A04
    SHA256
    C64F43DE489BF2421F8A544951C405C7FBB37E518C382BB741444AFF54EFF9C7
    Upload Date
    3 days ago
    Uploader
    CivitasBay.org
    Status
    5 Seeders
    5 Peers
    Info
    • A fully ready Qwen3-4B-Thinking-2507 build. Compared to vanilla Qwen3-4B, it delivers noticeably better prompt adherence with Z-Image models and avoids common wording misinterpretations. Highly recommended for both inference and Z-Image LoRA training.

    • Path: ComfyUI_windows_portable\ComfyUI\models\text_encoders\

    • Qwen3-4B-Thinking-2507 USAGE:

      [ANCHOR: who / what exists]
      [ROLE or STATE: what defines them conceptually]
      [ACTION or POSTURE: what they are doing or how they are positioned]
      [RELATIONSHIP: how they relate to space, objects, or viewer]
      [ENVIRONMENT: where this takes place, minimally]
      [INTENT: what the image is meant to communicate]
      [LIGHTING: chosen to support the intent]
      [CAMERA / FRAMING: how the viewer perceives it]
      [STYLE RESTRAINTS: what it should resemble, softly]
      [CONSTRAINTS: what must be avoided]
      
      example:
      a single adult man,
      calm and self-contained rather than expressive,
      standing upright with relaxed posture,
      positioned slightly off-center to create quiet tension,
      inside a simple, uncluttered interior space,
      the focus is on presence and character rather than action,
      soft indirect light so that facial features remain natural,
      eye-level camera, medium framing from the chest up,
      realistic but understated photographic style,
      no exaggerated emotion, no stylization, no dramatic effects
      
      example 2:
      [SUBJECT / ANCHOR],  
      [TRAIT / MOOD / PERSONALITY],  
      [ACTION / POSTURE / STATE],  
      [POSITION / RELATION TO SPACE / COMPOSITION],  
      [ENVIRONMENT / SETTING],  
      [INTENT / WHAT THE IMAGE SHOULD CONVEY],  
      [LIGHTING / ATMOSPHERE],  
      [CAMERA / FRAMING / PERSPECTIVE],  
      [STYLE / ARTISTIC DIRECTION],  
      [FORM CLARITY / SHAPE / TEXTURE / COLOR DIRECTIONS]
      
      example:
      a single adult man,  
      calm and self-contained,  
      standing upright with relaxed posture,  
      positioned slightly off-center to create quiet tension,  
      inside a simple, uncluttered interior space,  
      showing presence and character through posture and expression,  
      soft indirect light to enhance facial features naturally,  
      eye-level camera, medium framing from the chest up,  
      photographic style with subtle tones and understated textures,  
      featuring clear forms, natural proportions, and readable visual composition

    • You use this inside of your positive prompt; meaning the example part only. the explaining part is just for you to understand the layout not the text encoder

    **Please note that Qwen3-4B-Thinking-2507 is just experimental with this model but with right tweaks it can provide great outputs and any trained lora on the vanilla qwen3_4b will not function properly under this encoder so you will need to retrain using this text encoder.

    • full training thread can be found here

    • training config for AiToolKit with Qwen3-4B-Thinking-2507 text encoder Can be found here.

    Gallery
    Media 1
    Trackers
    • udp://tracker.opentrackr.org:1337/announce
    • udp://open.demonii.com:1337/announce
    • udp://open.stealth.si:80/announce
    • udp://tracker.torrent.eu.org:451/announce
    • udp://tracker.qu.ax:6969/announce
    • udp://tracker.bittor.pw:1337/announce