A Look at Spring AI's Chat Model

This article takes a look at Spring AI's Chat Model.

Model

spring-ai-core/src/main/java/org/springframework/ai/model/Model.java

public interface Model<TReq extends ModelRequest<?>, TRes extends ModelResponse<?>> {

	/**
	 * Executes a method call to the AI model.
	 * @param request the request object to be sent to the AI model
	 * @return the response from the AI model
	 */
	TRes call(TReq request);

}

The Model interface defines a call method whose parameter is a ModelRequest and whose return value is a ModelResponse.
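
As a concrete illustration, here is a minimal sketch of how those type parameters get bound: ChatModel (covered later in this article) fixes TReq to Prompt and TRes to ChatResponse, so a toy implementation only has to supply call(Prompt). The EchoChatModel class is hypothetical, not part of Spring AI.

import java.util.List;

import org.springframework.ai.chat.messages.AssistantMessage;
import org.springframework.ai.chat.model.ChatModel;
import org.springframework.ai.chat.model.ChatResponse;
import org.springframework.ai.chat.model.Generation;
import org.springframework.ai.chat.prompt.Prompt;

// Hypothetical toy model: echoes the prompt contents back as a single Generation
public class EchoChatModel implements ChatModel {

	@Override
	public ChatResponse call(Prompt prompt) {
		AssistantMessage output = new AssistantMessage("echo: " + prompt.getContents());
		return new ChatResponse(List.of(new Generation(output)));
	}

}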

ModelRequest

spring-ai-core/src/main/java/org/springframework/ai/model/ModelRequest.java

public interface ModelRequest<T> {

	/**
	 * Retrieves the instructions or input required by the AI model.
	 * @return the instructions or input required by the AI model
	 */
	T getInstructions(); // required input

	/**
	 * Retrieves the customizable options for AI model interactions.
	 * @return the customizable options for AI model interactions
	 */
	ModelOptions getOptions();

}

ModelRequest defines the getInstructions and getOptions methods: getInstructions returns the input required by the model, and getOptions returns the customizable per-call options.
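
For chat models, Prompt is the ModelRequest implementation: its instructions are the list of chat messages, and its options are a ChatOptions. A short sketch, assuming Spring AI's Prompt and ChatOptions APIs (constructor overloads may vary by version):

// the message text is the "instructions", ChatOptions the per-call options
Prompt prompt = new Prompt("Hello", ChatOptions.builder().temperature(0.2).build());

List<Message> instructions = prompt.getInstructions(); // required input
var options = prompt.getOptions();                     // customizable model options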

ModelResponse

spring-ai-core/src/main/java/org/springframework/ai/model/ModelResponse.java

public interface ModelResponse<T extends ModelResult<?>> {

	/**
	 * Retrieves the result of the AI model.
	 * @return the result generated by the AI model
	 */
	T getResult();

	/**
	 * Retrieves the list of generated outputs by the AI model.
	 * @return the list of generated outputs
	 */
	List<T> getResults();

	/**
	 * Retrieves the response metadata associated with the AI model's response.
	 * @return the response metadata
	 */
	ResponseMetadata getMetadata();

}

ModelResponse defines the getResult, getResults, and getMetadata methods; each result is of type ModelResult.
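
On the chat side, ChatResponse implements ModelResponse<Generation>. A usage sketch, assuming a ChatModel bean named chatModel:

ChatResponse response = chatModel.call(new Prompt("Tell me a joke"));

Generation first = response.getResult();      // the first (or only) result
List<Generation> all = response.getResults(); // all generated results
var metadata = response.getMetadata();        // response metadata, e.g. token usage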

ModelResult

spring-ai-core/src/main/java/org/springframework/ai/model/ModelResult.java

public interface ModelResult<T> {

	/**
	 * Retrieves the output generated by the AI model.
	 * @return the output generated by the AI model
	 */
	T getOutput();

	/**
	 * Retrieves the metadata associated with the result of an AI model.
	 * @return the metadata associated with the result
	 */
	ResultMetadata getMetadata();

}

ModelResult defines the getOutput and getMetadata methods.
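
Generation is the chat-side ModelResult, with AssistantMessage as its output type. Continuing the sketch above:

Generation generation = response.getResult();
AssistantMessage output = generation.getOutput(); // the generated assistant message
String text = output.getText();                   // its text content
var resultMetadata = generation.getMetadata();    // per-result metadata, e.g. finish reason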

StreamingModel

spring-ai-core/src/main/java/org/springframework/ai/model/StreamingModel.java

public interface StreamingModel<TReq extends ModelRequest<?>, TResChunk extends ModelResponse<?>> {

	/**
	 * Executes a method call to the AI model.
	 * @param request the request object to be sent to the AI model
	 * @return the streaming response from the AI model
	 */
	Flux<TResChunk> stream(TReq request);

}

The StreamingModel interface defines a stream method whose parameter is a ModelRequest and whose return value is a Flux<ModelResponse>.
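
For chat models the emitted chunks are partial ChatResponse objects. A sketch, assuming a ChatModel bean named chatModel (the chat specialization is shown next); the guards mirror the null checks in StreamingChatModel's default methods below:

Flux<ChatResponse> chunks = chatModel.stream(new Prompt("Summarize Spring AI"));
chunks.subscribe(chunk -> {
	Generation g = chunk.getResult();
	// partial chunks may carry no output yet, hence the guards
	if (g != null && g.getOutput() != null && g.getOutput().getText() != null) {
		System.out.print(g.getOutput().getText());
	}
});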

StreamingChatModel

spring-ai-core/src/main/java/org/springframework/ai/chat/model/StreamingChatModel.java

@FunctionalInterface
public interface StreamingChatModel extends StreamingModel<Prompt, ChatResponse> {

	default Flux<String> stream(String message) {
		Prompt prompt = new Prompt(message);
		return stream(prompt).map(response -> (response.getResult() == null || response.getResult().getOutput() == null
				|| response.getResult().getOutput().getText() == null) ? ""
						: response.getResult().getOutput().getText());
	}

	default Flux<String> stream(Message... messages) {
		Prompt prompt = new Prompt(Arrays.asList(messages));
		return stream(prompt).map(response -> (response.getResult() == null || response.getResult().getOutput() == null
				|| response.getResult().getOutput().getText() == null) ? ""
						: response.getResult().getOutput().getText());
	}

	@Override
	Flux<ChatResponse> stream(Prompt prompt);

}

StreamingChatModel extends the StreamingModel interface, binding the request type to Prompt and the response type to Flux<ChatResponse>, and provides two default methods: Flux<String> stream(String message) and Flux<String> stream(Message... messages).
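
The default overloads make plain-text streaming a one-liner. A usage sketch:

Flux<String> tokens = chatModel.stream("Write a haiku about rivers");
tokens.subscribe(System.out::print); // print each text chunk as it arrives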

ChatModel

spring-ai-core/src/main/java/org/springframework/ai/chat/model/ChatModel.java

public interface ChatModel extends Model<Prompt, ChatResponse>, StreamingChatModel {

	default String call(String message) {
		Prompt prompt = new Prompt(new UserMessage(message));
		Generation generation = call(prompt).getResult();
		return (generation != null) ? generation.getOutput().getText() : "";
	}

	default String call(Message... messages) {
		Prompt prompt = new Prompt(Arrays.asList(messages));
		Generation generation = call(prompt).getResult();
		return (generation != null) ? generation.getOutput().getText() : "";
	}

	@Override
	ChatResponse call(Prompt prompt);

	default ChatOptions getDefaultOptions() {
		return ChatOptions.builder().build();
	}

	default Flux<ChatResponse> stream(Prompt prompt) {
		throw new UnsupportedOperationException("streaming is not supported");
	}

}

ChatModel extends both the Model and StreamingChatModel interfaces, binding Model's request type to Prompt and its response type to ChatResponse; it also adds call(String) and call(Message...) convenience default methods, a getDefaultOptions default, and a default stream(Prompt) that throws UnsupportedOperationException. ChatModel has different implementations in different modules, for example spring-ai-ollama (OllamaChatModel), spring-ai-openai (OpenAiChatModel), spring-ai-minimax (MiniMaxChatModel), spring-ai-moonshot (MoonshotChatModel), and spring-ai-zhipuai (ZhiPuAiChatModel).
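
A usage sketch showing that the default call(String) overload is just shorthand for wrapping the text in a UserMessage, delegating to call(Prompt), and unwrapping the generated text:

String answer = chatModel.call("What is Spring AI?");

// the explicit equivalent of the default method above
ChatResponse response = chatModel.call(new Prompt(new UserMessage("What is Spring AI?")));
String sameAnswer = response.getResult().getOutput().getText();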

OllamaAutoConfiguration

org/springframework/ai/autoconfigure/ollama/OllamaAutoConfiguration.java

@AutoConfiguration(after = RestClientAutoConfiguration.class)
@ConditionalOnClass(OllamaApi.class)
@EnableConfigurationProperties({ OllamaChatProperties.class, OllamaEmbeddingProperties.class,
		OllamaConnectionProperties.class, OllamaInitializationProperties.class })
@ImportAutoConfiguration(classes = { RestClientAutoConfiguration.class, WebClientAutoConfiguration.class })
public class OllamaAutoConfiguration {

	@Bean
	@ConditionalOnMissingBean(OllamaConnectionDetails.class)
	public PropertiesOllamaConnectionDetails ollamaConnectionDetails(OllamaConnectionProperties properties) {
		return new PropertiesOllamaConnectionDetails(properties);
	}

	@Bean
	@ConditionalOnMissingBean
	public OllamaApi ollamaApi(OllamaConnectionDetails connectionDetails,
			ObjectProvider<RestClient.Builder> restClientBuilderProvider,
			ObjectProvider<WebClient.Builder> webClientBuilderProvider) {
		return new OllamaApi(connectionDetails.getBaseUrl(),
				restClientBuilderProvider.getIfAvailable(RestClient::builder),
				webClientBuilderProvider.getIfAvailable(WebClient::builder));
	}

	@Bean
	@ConditionalOnMissingBean
	@ConditionalOnProperty(prefix = OllamaChatProperties.CONFIG_PREFIX, name = "enabled", havingValue = "true",
			matchIfMissing = true)
	public OllamaChatModel ollamaChatModel(OllamaApi ollamaApi, OllamaChatProperties properties,
			OllamaInitializationProperties initProperties, List<FunctionCallback> toolFunctionCallbacks,
			FunctionCallbackResolver functionCallbackResolver, ObjectProvider<ObservationRegistry> observationRegistry,
			ObjectProvider<ChatModelObservationConvention> observationConvention) {
		var chatModelPullStrategy = initProperties.getChat().isInclude() ? initProperties.getPullModelStrategy()
				: PullModelStrategy.NEVER;

		var chatModel = OllamaChatModel.builder()
			.ollamaApi(ollamaApi)
			.defaultOptions(properties.getOptions())
			.functionCallbackResolver(functionCallbackResolver)
			.toolFunctionCallbacks(toolFunctionCallbacks)
			.observationRegistry(observationRegistry.getIfUnique(() -> ObservationRegistry.NOOP))
			.modelManagementOptions(
					new ModelManagementOptions(chatModelPullStrategy, initProperties.getChat().getAdditionalModels(),
							initProperties.getTimeout(), initProperties.getMaxRetries()))
			.build();

		observationConvention.ifAvailable(chatModel::setObservationConvention);

		return chatModel;
	}

	@Bean
	@ConditionalOnMissingBean
	@ConditionalOnProperty(prefix = OllamaEmbeddingProperties.CONFIG_PREFIX, name = "enabled", havingValue = "true",
			matchIfMissing = true)
	public OllamaEmbeddingModel ollamaEmbeddingModel(OllamaApi ollamaApi, OllamaEmbeddingProperties properties,
			OllamaInitializationProperties initProperties, ObjectProvider<ObservationRegistry> observationRegistry,
			ObjectProvider<EmbeddingModelObservationConvention> observationConvention) {
		var embeddingModelPullStrategy = initProperties.getEmbedding().isInclude()
				? initProperties.getPullModelStrategy() : PullModelStrategy.NEVER;

		var embeddingModel = OllamaEmbeddingModel.builder()
			.ollamaApi(ollamaApi)
			.defaultOptions(properties.getOptions())
			.observationRegistry(observationRegistry.getIfUnique(() -> ObservationRegistry.NOOP))
			.modelManagementOptions(new ModelManagementOptions(embeddingModelPullStrategy,
					initProperties.getEmbedding().getAdditionalModels(), initProperties.getTimeout(),
					initProperties.getMaxRetries()))
			.build();

		observationConvention.ifAvailable(embeddingModel::setObservationConvention);

		return embeddingModel;
	}

	@Bean
	@ConditionalOnMissingBean
	public FunctionCallbackResolver springAiFunctionManager(ApplicationContext context) {
		DefaultFunctionCallbackResolver manager = new DefaultFunctionCallbackResolver();
		manager.setApplicationContext(context);
		return manager;
	}

	static class PropertiesOllamaConnectionDetails implements OllamaConnectionDetails {

		private final OllamaConnectionProperties properties;

		PropertiesOllamaConnectionDetails(OllamaConnectionProperties properties) {
			this.properties = properties;
		}

		@Override
		public String getBaseUrl() {
			return this.properties.getBaseUrl();
		}

	}

}

spring-ai-spring-boot-autoconfigure provides a series of auto-configurations; for example, OllamaAutoConfiguration auto-configures the OllamaApi client along with the OllamaChatModel and OllamaEmbeddingModel beans.
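
With the Ollama starter on the classpath, the connection and model are driven by configuration properties under the prefixes referenced above, and the auto-configured model can be injected wherever a ChatModel is needed. A sketch (property values and the /chat endpoint are illustrative):

# application.properties
spring.ai.ollama.base-url=http://localhost:11434
spring.ai.ollama.chat.options.model=llama3
spring.ai.ollama.chat.options.temperature=0.7

import org.springframework.ai.chat.model.ChatModel;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RequestParam;
import org.springframework.web.bind.annotation.RestController;

// injects the auto-configured OllamaChatModel through the ChatModel interface
@RestController
public class ChatController {

	private final ChatModel chatModel;

	public ChatController(ChatModel chatModel) {
		this.chatModel = chatModel;
	}

	@GetMapping("/chat")
	public String chat(@RequestParam String message) {
		return this.chatModel.call(message);
	}

}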

Summary

Spring AI's Model interface defines a call method that takes a ModelRequest and returns a ModelResponse; the StreamingModel interface defines a stream method that takes a ModelRequest and returns a Flux<ModelResponse>. StreamingChatModel extends StreamingModel, binding the request type to Prompt and the response type to Flux<ChatResponse>, and provides two default methods, Flux<String> stream(String message) and Flux<String> stream(Message... messages). ChatModel extends both Model and StreamingChatModel, with Prompt as the request type and ChatResponse as the response type. ChatModel has different implementations in different modules, for example spring-ai-ollama (OllamaChatModel), spring-ai-openai (OpenAiChatModel), spring-ai-minimax (MiniMaxChatModel), spring-ai-moonshot (MoonshotChatModel), and spring-ai-zhipuai (ZhiPuAiChatModel).

doc

  • chatmodel
  • chat/comparison
