llama-cpp-turboquant/src
Justine Tunney 436787f170
llama : fix time complexity of string replacement (#9163)
This change fixes a bug where replacing text in a very long string could
cause llama.cpp to hang indefinitely. This is because the algorithm used
was quadratic, due to memmove() when s.replace() is called in a loop. It
seems most search results and LLM responses actually provide the O(n**2)
algorithm, which is a great tragedy. Using a builder string fixes things
2024-08-26 09:09:53 +03:00
..
CMakeLists.txt
llama-grammar.cpp
llama-grammar.h
llama-impl.h llama : fix time complexity of string replacement (#9163) 2024-08-26 09:09:53 +03:00
llama-sampling.cpp
llama-sampling.h
llama-vocab.cpp
llama-vocab.h
llama.cpp
unicode-data.cpp
unicode-data.h
unicode.cpp
unicode.h