'분류 전체보기' 카테고리의 글 목록 (7 Page)

아래의 doc을 참고하여 우선 GPU 서버가 없으므로 CPU에 작은 모델을 서빙 해보려고 한다. https://github.com/triton-inference-server/server GitHub - triton-inference-server/server: The Triton Inference Server provides an optimized cloud and edge inferencing solution. The Triton Inference Server provides an optimized cloud and edge inferencing solution. - GitHub - triton-inference-server/server: The Triton Inference Server provides..

카테고리 없음 2024. 4. 5. 20:15

코테 후기 - 시간 복잡도

이번에 코딩테스트를 치루며 메모리 관리에 대해서도 신경써야한다는 것을 알았다. N=1,000,000 문제를 풀었지만 계속 효율성 문제에 있어 통과하지 못했다 ㅜ.ㅜ 앞으로는 이러한 사항을 고려하면서 공부를 해야할 필요성을 매우 느꼈다. 아래의 표에 따르면 N이 백만 이였으니까 NlogN까지는 허용이 된다. 나는 그때 이중 for문을 사용하여 N2으로 풀었던 것 같다. 이제 앞으로 속도에 대해 생각하며 코딩을 해야겠다.

Programming/코딩테스트 2024. 4. 3. 09:16

[Langchain] 쿼리에 Redis 캐싱을 적용하기

사용자의 쿼리에 대하여 토큰 비용을 줄일 수 있는 방법 중에 캐싱 기법을 적용해보았다. 물론 토큰 비용이 얼마 되지 않아 그냥 해도 되지만 응답 속도는 확연하게 체감이 될 정도로 빨랐다. 랭체인에서 제공하는 라이브러리를 사용하면 캐싱 구현은 정말 간단하다. from langchain.cache import RedisCache from langchain.globals import set_llm_cache set_llm_cache(RedisCache(redis_=Redis(host='redis', port=6379, db=1))) response = rag_chain_with_source.invoke(text.question) 질문을 할때 마다 Key가 생성이 되었다. Key는 Hash로 이루어 져있는데 어..

Framework 2024. 3. 19. 11:26

[sqlalchemy] @contextmanger로 트랜젝션 관리

FastAPI + Sqlalchemy를 사용하여 개발하던 중 insert 실패 후 다른 데이터를 insert를 할때 아래 와 같은 애러가 발생하였다. sqlalchemy.exc.PendingRollbackError: This Session's transaction has been rolled back due to a previous exception during flush. To begin a new transaction with this Session, first issue Session.rollback(). Original exception was: (psycopg2.errors.UniqueViolation) duplicate key value violates unique constraint 위의 애..

Framework 2024. 3. 13. 13:09

이전 1 ··· 4 5 6 7 8 9 10 ··· 27 다음

이전 다음

공지사항

최근에 올라온 글

최근에 달린 댓글

Total

Today

Yesterday

링크

Github

TAG more

« 2025/07 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

글 보관함

Techbrad

티스토리툴바