vllm.entrypoints.pooling.score.protocol ¶
RerankDocument ¶
Bases: BaseModel
Source code in vllm/entrypoints/pooling/score/protocol.py
RerankRequest ¶
Bases: OpenAIBaseModel
Source code in vllm/entrypoints/pooling/score/protocol.py
activation class-attribute instance-attribute ¶
activation: bool | None = Field(
    default=None,
    description="activation will be deprecated, please use use_activation instead.",
)
mm_processor_kwargs class-attribute instance-attribute ¶
mm_processor_kwargs: dict[str, Any] | None = Field(
    default=None,
    description="Additional kwargs to pass to the HF processor.",
)
priority class-attribute instance-attribute ¶
priority: int = Field(
    default=0,
    description="The priority of the request (lower means earlier handling; default: 0). Any priority other than 0 will raise an error if the served model does not use priority scheduling.",
)
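The ordering rule in the `priority` description (lower value is handled earlier, and a non-zero value is rejected when priority scheduling is off) can be illustrated with a small standard-library sketch. This is not vLLM's scheduler: `check_priority`, `enqueue`, `drain`, and the FIFO tie-break are hypothetical names and assumptions used only for illustration.

```python
import heapq
from itertools import count

# Hypothetical stand-ins; vLLM's real scheduler is not shown on this page.
_arrival = count()


def check_priority(priority: int, priority_scheduling_enabled: bool) -> None:
    # Mirrors the field description: non-zero priority needs priority scheduling.
    if priority != 0 and not priority_scheduling_enabled:
        raise ValueError("priority is only supported with priority scheduling")


def enqueue(queue: list, priority: int, request_id: str) -> None:
    # Lower priority value sorts first; the arrival counter breaks ties
    # in first-come order (an assumption of this sketch).
    heapq.heappush(queue, (priority, next(_arrival), request_id))


def drain(queue: list) -> list[str]:
    # Pop requests in the order they would be handled.
    order = []
    while queue:
        _, _, request_id = heapq.heappop(queue)
        order.append(request_id)
    return order


queue: list = []
enqueue(queue, 0, "default-a")
enqueue(queue, 5, "low-urgency")
enqueue(queue, 1, "mid")
enqueue(queue, 0, "default-b")
handled = drain(queue)
print(handled)  # ['default-a', 'default-b', 'mid', 'low-urgency']
```

Requests left at the default of 0 are handled before any request with a larger value in this sketch.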
softmax class-attribute instance-attribute ¶
softmax: bool | None = Field(
    default=None,
    description="softmax will be deprecated, please use use_activation instead.",
)
truncate_prompt_tokens class-attribute instance-attribute ¶
RerankResponse ¶
RerankResult ¶
Bases: BaseModel
Source code in vllm/entrypoints/pooling/score/protocol.py
RerankUsage ¶
ScoreRequest ¶
Bases: OpenAIBaseModel
Source code in vllm/entrypoints/pooling/score/protocol.py
activation class-attribute instance-attribute ¶
activation: bool | None = Field(
    default=None,
    description="activation will be deprecated, please use use_activation instead.",
)
mm_processor_kwargs class-attribute instance-attribute ¶
mm_processor_kwargs: dict[str, Any] | None = Field(
    default=None,
    description="Additional kwargs to pass to the HF processor.",
)
priority class-attribute instance-attribute ¶
priority: int = Field(
    default=0,
    description="The priority of the request (lower means earlier handling; default: 0). Any priority other than 0 will raise an error if the served model does not use priority scheduling.",
)
softmax class-attribute instance-attribute ¶
softmax: bool | None = Field(
    default=None,
    description="softmax will be deprecated, please use use_activation instead.",
)
truncate_prompt_tokens class-attribute instance-attribute ¶
ScoreResponse ¶
Bases: OpenAIBaseModel
Source code in vllm/entrypoints/pooling/score/protocol.py
created class-attribute instance-attribute ¶
id class-attribute instance-attribute ¶
id: str = Field(
    default_factory=lambda: f"embd-{random_uuid()}"
)
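Because `id` uses `default_factory`, the lambda runs once per instance, so each response gets its own `embd-`-prefixed identifier unless the caller supplies one. A minimal standard-library sketch of that behaviour follows; it uses `dataclasses` instead of Pydantic, and the `random_uuid` shown here is a stand-in assumed to return a random hex string, not vLLM's helper.

```python
import uuid
from dataclasses import dataclass, field


def random_uuid() -> str:
    # Stand-in for vLLM's helper (assumption: a random hex string).
    return uuid.uuid4().hex


@dataclass
class ResponseSketch:
    # Hypothetical class; only the id field mirrors the page above.
    id: str = field(default_factory=lambda: f"embd-{random_uuid()}")


first = ResponseSketch()
second = ResponseSketch()
custom = ResponseSketch(id="embd-fixed")

print(first.id.startswith("embd-"))  # True
print(first.id != second.id)         # True
```

Passing `id` explicitly bypasses the factory, as the `custom` instance shows.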
ScoreResponseData ¶
Bases: OpenAIBaseModel