For people self hosting LLMs… I have a couple docker images I maintain

@[email protected]

Yes that’s a good comment for an FAQ cause I get it a lot and it’s a very good question haha. The reason I use it is for image size, the base nvidia devel image is needed for a lot of compilation during python package installation and is huge, so instead I use conda, transfer it to the nvidia-runtime image which is… also pretty big, but it saves several GB of space so it’s a worthwhile hack :)

but yes avoiding CUDA messes on my bare machine is definitely my biggest motivation

@[email protected]

lollms-webui is the jankiest of the images, but that one’s newish to the scene and I’m working with the dev a bit to get it nicer (main current problem is the requirement for CLI prompts which he’ll be removing) Koboldcpp and text-gen are in a good place though, happy with how those are running

For people self hosting LLMs… I have a couple docker images I maintain

For people self hosting LLMs… I have a couple docker images I maintain

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcode

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcode

For people self hosting LLMs… I have a couple docker images I maintain

For people self hosting LLMs… I have a couple docker images I maintain

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcodeplus-square

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcodeplus-square

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcode

GitHub - jesseduffield/horcrux: Split your file into encrypted fragments so that you don’t need to remember a passcode