I have a lot of outliers in my project and want the probability of each sentence belonging to each topic. If I use topics, probs = topic_model.fit_transform(sentences, embeddings=embeddings), the returned probs contains every sentence, including outliers. Is there a way to extract these outliers, and do they also have topic probabilities? #2220
superseanyoung
started this conversation in
General
In the returned probs, I do not want the sentences that were assigned to the outlier topic. How should I filter them out?
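from bertopic import BERTopic

# transformer_model, umap_model, hdbscan_model, ctfidf_model, representation_model,
# sentences and embeddings are defined earlier (not shown here).
topic_model = BERTopic(
    embedding_model=transformer_model,
    min_topic_size=3,
    verbose=True,
    umap_model=umap_model,
    hdbscan_model=hdbscan_model,
    ctfidf_model=ctfidf_model,
    representation_model=representation_model,
    # top_n_words=10,
    # min_topic_size=10,
    # nr_topics=None,
    # low_memory=False,
    calculate_probabilities=True,  # request per-topic probabilities for every document
)
topics, probs = topic_model.fit_transform(sentences, embeddings=embeddings)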
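One possible way to separate the outliers is sketched below. BERTopic labels outlier documents with topic -1, so a boolean mask over topics can split the rows of probs. This assumes that, with calculate_probabilities=True, probs has one row per sentence in the same order as sentences, in which case the outlier rows still carry topic probabilities (usually low ones). The helper name drop_outliers and the small stand-in arrays are hypothetical and only there so the snippet runs without fitting a model.

```python
import numpy as np


def drop_outliers(sentences, topics, probs, outlier_label=-1):
    """Return only the documents whose assigned topic is not the outlier label.

    Hypothetical helper, not part of BERTopic. Assumes `probs` has one row
    per document, aligned with `sentences` and `topics`.
    """
    topics = np.asarray(topics)
    probs = np.asarray(probs)
    keep = topics != outlier_label  # True for documents assigned to a real topic
    kept_sentences = [s for s, k in zip(sentences, keep) if k]
    return kept_sentences, topics[keep], probs[keep]


# Stand-in values in place of the real fit_transform output (made up for illustration).
sentences = ["a", "b", "c", "d"]
topics = [0, -1, 1, -1]
probs = np.array([
    [0.90, 0.05],
    [0.20, 0.10],
    [0.10, 0.80],
    [0.00, 0.00],
])

kept_sentences, kept_topics, kept_probs = drop_outliers(sentences, topics, probs)

# The outlier rows can be inspected the same way, with the mask inverted.
outlier_probs = probs[np.asarray(topics) == -1]
```

With real output, the same call would be drop_outliers(sentences, topics, probs) right after fit_transform. Keeping the mask explicit also makes it easy to look at the outlier rows separately instead of discarding them.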