huggingface_provider¶
HuggingFaceProvider
¶
Local HuggingFace embedding provider using sentence-transformers.
Source code in wintermute/ai/providers/huggingface_provider.py
42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 | |
count_tokens(text, model=None)
¶
Returns estimated token count. For now, using a simple heuristic as exact tokenization requires the specific tokenizer.
Source code in wintermute/ai/providers/huggingface_provider.py
110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 | |
embed(texts, model=None)
¶
Embeds texts using a local SentenceTransformer model. Args: texts: List of strings to embed. model: Model name/path (default: 'all-MiniLM-L6-v2').
Source code in wintermute/ai/providers/huggingface_provider.py
87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 | |
list_models()
¶
Returns a list of common embedding models supported.
Source code in wintermute/ai/providers/huggingface_provider.py
61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 | |
register(as_name='local_embedder')
¶
Registers the HuggingFaceProvider.
Source code in wintermute/ai/providers/huggingface_provider.py
128 129 130 131 132 133 134 135 | |