FLUX.1-Turbo-Alpha

    FLUX.1-Turbo-Alpha.safetensors LORA / Flux.1 D

    Model Information
    Model Name
    FLUX.1-Turbo-Alpha
    Version
    FLUX.1-Turbo-Alpha
    Size
    661.93 MB
    Downloads
    2,634
    Torrent Details
    BTIH
    ECB9461D667F2F8BBE8151A1CC2D033DB71E8A25
    BTMH
    EDE2F0B6EFDD52ECD111C2A1BF881071FEFDEA7D59D7B3BA7CAC246D5E1735C4
    SHA256
    77F7523A5E9C3DA6CFC730C6B07461129FA52997EA06168E9ED5312228AA0BFF
    Upload Date
    about a year ago
    Uploader
    CivitasBay.org
    Status
    8 Seeders
    4 Peers
    Info

    中文版Readme

    This repository provides a 8-step distilled lora for FLUX.1-dev model released by AlimamaCreative Team.

    Description

    This checkpoint is a 8-step distilled Lora, trained based on FLUX.1-dev model. We use a multi-head discriminator to improve the distill quality. Our model can be used for T2I, inpainting controlnet and other FLUX related models. The recommended guidance_scale=3.5 and lora_scale=1. Our Lower steps version will release later.

    • Text-to-Image.

    How to use

    diffusers

    This model can be used ditrectly with diffusers

    import torch
    from diffusers.pipelines import FluxPipeline
    
    model_id = "black-forest-labs/FLUX.1-dev"
    adapter_id = "alimama-creative/FLUX.1-Turbo-Alpha"
    
    pipe = FluxPipeline.from_pretrained(
      model_id,
      torch_dtype=torch.bfloat16
    )
    pipe.to("cuda")
    
    pipe.load_lora_weights(adapter_id)
    pipe.fuse_lora()
    
    prompt = "A DSLR photo of a shiny VW van that has a cityscape painted on it. A smiling sloth stands on grass in front of the van and is wearing a leather jacket, a cowboy hat, a kilt and a bowtie. The sloth is holding a quarterstaff and a big book."
    image = pipe(
                prompt=prompt,
                guidance_scale=3.5,
                height=1024,
                width=1024,
                num_inference_steps=8,
                max_sequence_length=512).images[0]
    

    comfyui

    Training Details

    The model is trained on 1M open source and internal sources images, with the aesthetic 6.3+ and resolution greater than 800. We use adversarial training to improve the quality. Our method fix the original FLUX.1-dev transformer as the discriminator backbone, and add multi heads to every transformer layer. We fix the guidance scale as 3.5 during training, and use the time shift as 3.

    Mixed precision: bf16

    Learning rate: 2e-5

    Batch size: 64

    Image size: 1024x1024

    Gallery
    This model contains NSFW content. Click to show gallery.
    Trackers
    • udp://tracker.opentrackr.org:1337/announce
    • udp://open.demonii.com:1337/announce
    • udp://open.stealth.si:80/announce
    • udp://tracker.torrent.eu.org:451/announce
    • udp://tracker.qu.ax:6969/announce
    • udp://tracker.bittor.pw:1337/announce