Project:LLMQueryService POC: Difference between revisions

From MaRDI portal
No edit summary
No edit summary
 
Line 15: Line 15:
# You might need ssh-port-forwarding
# You might need ssh-port-forwarding
## Example: ''ssh -L 8000:127.0.0.1:8501 -i OPENSTACK_KEY_FILE.pem debian@130.73.240.230''
## Example: ''ssh -L 8000:127.0.0.1:8501 -i OPENSTACK_KEY_FILE.pem debian@130.73.240.230''
=== The ZIB LLM Server ===
# Reachable only from within the ZIB network
# To see the installed models: ''curl https://SERVERNAME/api/tags | jq''
# To install a new model: ''curl https://SERVERNAME/api/pull -d '{"name": "qwen2.5:0.5b"}' | jq''

Latest revision as of 12:56, 2 October 2024

This page describes how to install the proof-of-concept LLM-based query service.

Try it here: http://130.73.240.230/ (only from the ZIB network or VPN)

Using a OpenStack VM

  1. Create a new instance (if you use Debian: at least 12)
    1. See Project:Docker_OpenStackVM
  2. Install necessary libraries
    1. apt-get update
    2. apt-get install git python3-pip python3-venv
  3. Clone the repository and follow the rest of the manual ( https://git.zib.de/bzfconra/mardi_llm_bottest )
  4. Install Ollama ( https://ollama.com/download/linux )
    1. Check the logs: journalctl -u ollama -f
  5. You might need ssh-port-forwarding
    1. Example: ssh -L 8000:127.0.0.1:8501 -i OPENSTACK_KEY_FILE.pem debian@130.73.240.230

The ZIB LLM Server

  1. Reachable only from within the ZIB network
  2. To see the installed models: curl https://SERVERNAME/api/tags | jq
  3. To install a new model: curl https://SERVERNAME/api/pull -d '{"name": "qwen2.5:0.5b"}' | jq