Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Here's a sample of running the 120b model on Ollama with my MBP:

```

total duration: 1m14.16469975s

load duration: 56.678959ms

prompt eval count: 3921 token(s)

prompt eval duration: 10.791402416s

prompt eval rate: 363.34 tokens/s

eval count: 2479 token(s)

eval duration: 1m3.284597459s

eval rate: 39.17 tokens/s

```





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: