首页 | 本学科首页   官方微博 | 高级检索  
     

基于图文多模态融合推理的产品创新方案设计方法研究
引用本文:马进,范明浩,马良山,胡洁. 基于图文多模态融合推理的产品创新方案设计方法研究[J]. 包装工程, 2024, 0(8): 21-28
作者姓名:马进  范明浩  马良山  胡洁
作者单位:上海交通大学 感知科学与工程学院,上海 200240;上海中软计算机系统工程有限公司,上海 200001;上海交通大学 设计学院,上海 200240
基金项目:国家自然科学基金面上(52375254);上海交通大学医工交叉项目(21X010301670)
摘    要:目的 针对当前产品创新设计领域中对基于图像-文本多模态知识支撑创新设计方法研究不足的问题,提出了一套基于图文多模态的产品创新方案设计方法。方法 首先,对设计师的设计草图与文本要求进行预处理,然后引入产品设计知识图谱来促进设计思维的发散和创新;其次,通过微调的生成式预训练变换器模型和扩散模型生成产品方案及其概念图;最后,利用深度多模态设计评估模型对产品设计方案的可行性和市场潜力进行评估。结果 通过产品设计知识图谱,及深度多模态设计评估模型的引入,该设计流程可以生成富有创新性且具备可行性的产品方案。结论 基于图文多模态的产品创新方案设计流程结合了最新的深度学习技术,不仅提高了设计的效率,还为设计师提供了更广阔的创新视角和灵感来源。

关 键 词:图文多模态  深度生成模型  知识图谱  产品创新设计
收稿时间:2023-11-10

Innovative Product Design Schemes Based on Image-text Multi-modal Fusion Reasoning
MA Jin,FAN Minghao,MA Liangshan,HU Jie. Innovative Product Design Schemes Based on Image-text Multi-modal Fusion Reasoning[J]. Packaging Engineering, 2024, 0(8): 21-28
Authors:MA Jin  FAN Minghao  MA Liangshan  HU Jie
Affiliation:School of Sensing Science and Technology Shanghai 200240, China;Shanghai China Software Computer Systems Engineering Co., Ltd., Shanghai 200001, China; School of Design, Shanghai Jiao Tong University, Shanghai 200240, China
Abstract:The work aims to propose a novel multi-modal process which integrates both image and text elements for innovative product design to address the issue of insufficient innovation and feasibility in product design schemes within the field of AI-assisted product design. The work begins with preprocessing the designer''s sketches and textual requirements, followed by the incorporation of a product design knowledge graph to facilitate divergent thinking and innovation. Subsequently, a fine-tuned generative pre-trained Transformer model and a diffusion model were employed to generate product schemes and their conceptual diagrams. Finally, a deep multi-modal design assessment model was adopted to evaluate the feasibility and market potential of the product design schemes. The results indicated that the introduction of the product design knowledge graph and the deep multi-modal design assessment model enabled the generation of innovative product schemes that also possessed feasibility. In conclusion, this multi-modal approach to innovative product scheme design, leveraging cutting-edge AI and deep learning technologies, not only enhances design efficiency but also provides designers with a broader perspective for innovation and inspiration sources.
Keywords:multi-modal image and text   deep generative models   knowledge graph   innovative product design
点击此处可从《包装工程》浏览原始摘要信息
点击此处可从《包装工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号