Hacker Read
top
|
best
|
new
|
newcomments
|
leaders
|
about
|
bookmarklet
login
brucethemoose2 | karma 7874 | avg karma 2.4
2023-07-07 20:54:26
|
next
[–]
update item
Yeah, it can split weights. Whatever fraction of the weights that don't fit into vram will be computed on the CPU (with reasonable speed).
Additionally, prompt processing will work with large models even with low vram GPUs.
reply
view as:
tree
latest_first
Legal
|
privacy
Additionally, prompt processing will work with large models even with low vram GPUs.
reply