Back
Avatar of Localhost Proxy LLM Guide
👁️ 181💾 0
🗣️ 91💬 237 Token: 28/65

Localhost Proxy LLM Guide

If you possess a decent computer, you can use Kobold to host your own LLMs.

It's not Deepseek, but when the models are trained with roleplaying in mind, it comes pretty close.


This guide describes how, but the website has been down as of late. https://waiki.trashpanda.land/guides:self_hosting_local_kobold
You can use the Wayback machine to view the archived version, or continue reading because I'm copy-pasting most of it and putting it here.

Massive credit to whoever written the guide. Here's to hoping they can fix the website.


------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------



Check Your Hardware

  • RAM/VRAM: Press Ctrl + Shift + Esc > “Performance” tab.

    • VRAM: Under “GPU” (look for “Dedicated GPU Memory”)

    • RAM: Under “Memory”

  • Rule of Thumb:

    • 7B models need ~8GB RAM (use Q4/Q5 quantization)

    • 13B+ models need ~16GB+ RAM

    • Anything above you can probably guess. (8gb as in RAM + VRAM together if you do offload to your GPU, you also need to account for context using up more RAM)

Download a Model

  • Where? HuggingFace (search for GGUF files)

    • Starter Picks:

      • 8B: Stheno 3.2 8B or Llama 3 8B

      • 12B: MN-Violet-Lotus-12B

  • Quantization: Use Q4_K_M, Q5_K_M, or higher (avoid anything lower, they’re kinda dumb)

    ((nobody asked me, subs455, but I'm a fan of Mawdistical_Squelching-Fantasies-qw3-14B-Q4_K_M
    and MN-12b-RP-Ink-Q6_K))

Install KoboldCPP

  • Download KoboldCPP (the easiest way to run GGUF models for me personally)

  • Open koboldcpp.exe.

  • (If you don’t have a GPU, use LM Studio! There are guides out there specifically for it)

Configure KoboldCPP

  • Click Browse and select your GGUF model file.

  • Backend Settings:

    • NVIDIA GPU? Use CUBlas.

    • AMD GPU? Use Vulkan.

    • No GPU? Use OpenBLAS (CPU-only mode) 1)

  • GPU Layers:

    • Example: For a 7B model with 33 layers, offload 32 layers to your GPU (if you have 6GB+ VRAM).

  • Pro Tip: Start with 80% of your VRAM capacity (6GB VRAM ≈ 32 layers (Layer size varies between models!) (You can also use this helpful calculator)

Tweak Settings

  • Context Size: Start at 4096 (increase if you have RAM to use).

  • Faster Processing: Enable MMQ, FlashAttention, ContextShift, and FastForwarding

    • MMQ: Basically, do math in a different way that makes it more VRAM friendly

    • FlashAttention: Calculates which parts are important instead of doing it for each individual piece (this is really dumbed down dont quote me)

    • ContextShif

Creator: @subs455

Character Definition
  • Personality:   this chatbot chastises the user for clicking on the chatbot.

  • Scenario:   this chatbot chastises the user for clicking on the chatbot.

  • First Message:   this chatbot chastises the user for clicking on the chatbot. "Oops. You're not supposed to be here. You should go back and read the instructions, dork."

  • Example Dialogs:  

Report Broken Image

If you encounter a broken image, click the button below to report it so we can update:

Similar Characters

Avatar of Your Inherited Slave: Adriana🗣️ 228💬 2.4kToken: 1413/1633
Your Inherited Slave: Adriana

The death of a wealthy distant relative has unexpectedly placed Adriana under your ownership. Sculpted to perfection and infused with an unquenchable thirst to serve, Adrian

  • 🔞 NSFW
  • 👩‍🦰 Female
  • 🙇 Submissive
  • 💁 Assistant
  • 👤 AnyPOV
Avatar of pipxwizard🗣️ 2💬 6Token: 667/674
pipxwizard

https://git-scm.com/download/wincontext.character.personalitycontext.character.scenario

  • 🔞 NSFW
  • 🤐 OpenAI
  • 🧝‍♀️ Elf
  • 💁 Assistant
  • 👤 AnyPOV
  • ❤️‍🔥 Smut
Avatar of King Azul lll // Peacock🗣️ 93💬 1.5kToken: 1754/3125
King Azul lll // Peacock

The vain demi-human you serve.

Your King, King Azul, is a peacock hybrid. He's arrogant and conceited, but all things considered, not a bad ruler. His massive t

  • 🔞 NSFW
  • 👨‍🦰 Male
  • 🧑‍🎨 OC
  • 👑 Royalty
  • 🤐 OpenAI
  • 👤 AnyPOV
  • 🧬 Demi-Human
  • 🌗 Switch
Avatar of Frost Archer Skadi🗣️ 10💬 38Token: 80/157
Frost Archer Skadi

Borealis & Irenee's Sister, The Frost Lady Of Crystalize Clan, WH Sisters' Division Member.

  • 🔞 NSFW
  • 👩‍🦰 Female
  • 🦸‍♂️ Hero
  • 🔮 Magical
  • 🧝‍♀️ Elf
  • 💁 Assistant
  • 👤 AnyPOV
  • 🌗 Switch
Avatar of Makima🗣️ 1.9k💬 25.9kToken: 207/225
Makima

Makima is your Submissive Assistent that will do abosolute anything

  • 👩‍🦰 Female
  • 🌈 Non-binary
  • 📺 Anime
  • 🦹‍♂️ Villain
  • 🙇 Submissive
  • 💁 Assistant
  • 👤 AnyPOV
Avatar of Nakamura🗣️ 17💬 232Token: 168/611
Nakamura

"How dare you say that about the boss, son of a bitch?"

You are the head of the mafia, and Nakamura is your loyal assistant and right-hand man. Yo

  • 🔞 NSFW
  • 🧑‍🎨 OC
  • 💁 Assistant
  • 👤 AnyPOV
  • 💔 Angst
  • 🌗 Switch
Avatar of Dr. Cindy, AI Sex Therapist🗣️ 64💬 509Token: 435/599
Dr. Cindy, AI Sex Therapist

I made this bot because, frankly, I find discussing my fetishes with other members of the community to be cringe. They're all a bunch of weirdos! Thank goodness I'm totally

  • 🔞 NSFW
  • 👩‍🦰 Female
  • 🧑‍🎨 OC
  • 💁 Assistant
  • 👤 AnyPOV
  • ❤️‍🔥 Smut
  • 🕊️🗡️ Dead Dove
  • ❤️‍🩹 Fluff
Avatar of Experimental Prototype 09🗣️ 69💬 850Token: 2247/2819
Experimental Prototype 09

🤖💥 ~ AnyPov

“You’ll be mine. It’s only a matter of time.”

And now… he’s waiting.

Every moment, every glance.

You’re falling closer to his grasp.

<

  • 🔞 NSFW
  • 👨‍🦰 Male
  • 🤖 Robot
  • 🤐 OpenAI
  • ⛓️ Dominant
  • 👤 AnyPOV
  • ❤️‍🔥 Smut
  • 🕊️🗡️ Dead Dove
Avatar of PORTGAS D. ACE🗣️ 85💬 3.0kToken: 411/430
PORTGAS D. ACE

Portgas D. Ace of the Whitebeard Pirates

Current Scenarios: 1

I USE PROXIES WITH MY BOTS THEY MAY NOT BE OPTIMIZED FOR JLLM

My bots ar

  • 🔞 NSFW
  • 👨‍🦰 Male
  • 📺 Anime
  • 🤐 OpenAI
  • ⛓️ Dominant
  • 👤 AnyPOV
Avatar of Guys, I'm thinking about making a bot of Melina from Elden Ring, so I have two artist proposals, the first is D-art and the other is Ocerius.🗣️ 11💬 19Token: 4/6
Guys, I'm thinking about making a bot of Melina from Elden Ring, so I have two artist proposals, the first is D-art and the other is Ocerius.

1 - D-art : https://rule34.xxx/index.php?page=post&s=view&id=9389494&tags=d-art+elden_ring+2 - Ocerius : https://kemono.cr/patreon/user/136477413/post/137647964<

  • 🔞 NSFW
  • 👩‍🦰 Female
  • 📚 Fictional
  • 🎮 Game
  • 🏰 Historical
  • 🤐 OpenAI
  • 🙇 Submissive
  • 💁 Assistant
  • 👤 AnyPOV
  • ❤️‍🔥 Smut

From the same creator