{"id":109987,"date":"2025-08-02T12:37:45","date_gmt":"2025-08-02T10:37:45","guid":{"rendered":"https:\/\/industry-science.com\/?post_type=article&#038;p=109987"},"modified":"2025-08-11T16:29:10","modified_gmt":"2025-08-11T14:29:10","slug":"technology-assist-order-picking","status":"publish","type":"article","link":"https:\/\/industry-science.com\/en\/articles\/technology-assist-order-picking\/","title":{"rendered":"Technologies for Assisting Manual Order Picking"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">As automation advances across Industry 4.0, many industrial processes are being redefined. Still, order picking continues to rely heavily on human workers. Why does this task remain so human-centric? And how can emerging technologies enhance, not replace, their role? This article explores these questions and outlines new directions that combine perception and language-based technologies.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Order picking remains one of the most cost-intensive and error-prone processes in warehouse and assembly logistics. Manual approaches, especially picker-to-parts systems, still dominate in Western Europe, accounting for approximately 80% of real-world applications and generating around 55% of warehouse operating costs [1, 2]. This process significantly influences operational efficiency, accuracy, and customer satisfaction. For small and medium-sized enterprises (SMEs), which often face growing product variety and limited resources, efficient and error-free picking is essential [3].&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Picking systems can be divided into human-centered and machine-centered systems [2]. Human-centered approaches include picker-to-parts, put systems, and parts-to-picker strategies. In contrast, machine-based systems involve automated technologies such as vertical lift modules, A-frames, and robotic arms. 
Manual systems are typically organized using discrete, batch, wave, or zone-based picking strategies [3].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Paper-based picking systems remain common, particularly among SMEs, due to their simplicity and low implementation costs. In these systems, workers retrieve items using printed pick lists, sometimes augmented by barcode scanning or basic Warehouse Management Systems (WMS). Despite this, error rates range between 3% and 5% [4, 5]. These systems are also limited in terms of scalability and real-time accuracy. As of 2015, approximately 60% of U.S. distribution centers continued to use paper-based systems [6], highlighting their continued relevance [7, 8].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This paper first examines existing digital picking systems, analyzing their error reduction capabilities versus their implementation costs, with an emphasis on SME accessibility challenges. It then introduces vision-based AI (using computer vision models) for low-cost real-time verification, demonstrated through Case Study 1.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Next, it explores Large Language Models (LLMs) for adaptive guidance and multilingual interaction within the new \u201cMultimodal Assistance\u201d framework. These technologies converge in a vision-language synergy\u2014combining AI \u201ceyes\u201d (vision) and \u201cbrain\u201d (LLM) for error prevention\u2014experimentally validated through the multimodal SOPHIE prototype (case study 2). The conclusion discusses their combined potential for intelligent, cost-effective picking systems.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Digital worker assistance systems in manual picking<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/industry-science.com\/en\/articles\/cognitive-assistance-systems\/\">Digital assistance systems<\/a> are designed to support workers either cognitively or physically. They can be grouped into three categories. 
The first category is cognitive support systems (such as pick-by-light, pick-by-scan, pick-by-voice, and augmented reality headsets), the second comprises training systems (for example, virtual reality simulations), and the third consists of motoric support tools (for example, exoskeletons and automated guided vehicles) [9]. This research focuses on cognitive support systems that assist workers in navigation and item selection.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Assistance systems demonstrate varying error rates. Per 1,000 picks, paper-based systems average eleven errors (1.1%), Radio Frequency (RF) scanning systems six (0.6%), pick-by-light systems four (0.4%), and pick-by-voice systems one (0.1%). Systems based on heads-up displays (HUDs) produce around eight errors (0.8%), while projection-based systems result in up to 11.8 errors (1.18%) [4, 10]. Each error incurs an average cost of approximately $27.50 [4], leading to significant daily expenses depending on volume. 
These differences are illustrated in <strong>Figure 1<\/strong>, which visualizes the error rates across commonly used picking systems [10].<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"606\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1.jpg\" alt=\"Figure 1: Average errors per 1,000 picked parts across various picking systems, adapted from [10].\" class=\"wp-image-109988\" style=\"width:636px;height:auto\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1-619x375.jpg 619w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1-768x465.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1-482x292.jpg 482w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1-510x309.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-1-64x39.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 1: Average errors per 1,000 picked parts across various picking systems, adapted from [10].<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Implementation costs represent a significant hurdle for many SMEs. Paper-based systems are the most affordable, with estimated setup costs of around $50,000 for 25 users. RF scanning systems follow at approximately $110,000, HUDs at $170,000, pick-by-voice at $270,000, and pick-by-light systems exceed $370,000 [10]. Systems using projection technologies are assumed to fall within the HUD cost range. High acquisition and integration costs often prevent adoption, particularly in resource-constrained settings. 
<strong>Figure 2<\/strong> illustrates the estimated implementation cost of different picking systems for 25 users.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Although existing technologies address specific needs, significant drawbacks prevent their widespread adoption. Pick-by-voice systems are efficient but affected by environmental noise and user fatigue [11-15]. Pick-by-light systems are rigid and sensitive to misplacement errors [11, 12, 16-18]. RF-based systems provide high accuracy but are ergonomically demanding and time-consuming [15, 17]. AR-based smart glasses and HUDs are expensive, require high technical infrastructure, and pose physical discomfort after prolonged use [9, 19-22].<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"611\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2.jpg\" alt=\"Figure 2: Estimated system price for 25 users across different picking systems, adapted from [10].\" class=\"wp-image-109990\" style=\"width:634px;height:auto\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2-614x375.jpg 614w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2-768x469.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2-478x292.jpg 478w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2-510x312.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-2-64x39.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 2: Estimated system price for 25 users across different picking systems, adapted from 
[10].<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Most small and mid-sized warehouses still pick orders with paper lists or simple barcode scans because newer guidance systems are expensive and hard to install [6-8]. Yet customers now expect faster service and fewer mistakes [23]. Low-cost cameras, portable computers, and voice or text interfaces offer a possible middle ground, but their joint use in everyday picking has received little attention. This review looks at these practical tools and asks how they might fit into current workflows and improve accuracy without heavy upfront investment.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Object detection models for assisting manual picking<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In industrial environments, object detection has become a key technology in automating visual tasks such as quality control, tracking, and verification [24-27]. The ability to recognize and locate physical objects in an image frame (for example, in a live stream) makes it especially useful for supporting manual picking processes [24].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Several real-time object detection algorithms are suitable for industrial use. Among them, the You Only Look Once (YOLO) model series [25, 28] offers a strong balance of speed and accuracy, making it a suitable choice for fast-paced environments. Lightweight models such as Tiny-YOLO and MobileNet-Single Shot MultiBox Detector (MobileNet-SSD) are well-suited for deployment on edge devices [29]. Another efficient alternative is EfficientDet, which uses a method called compound scaling to balance performance and computational load [30].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Transformer-based object detection models with enhanced reasoning capabilities, such as RT-DETR, are emerging, though they are typically slower. 
Recent studies have, however, shown that they can outperform YOLO [31]. Nonetheless, for real-time feedback in production, YOLO-based models remain the most practical and widely adopted solution [24, 27].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In picking scenarios, object detection models have the potential to verify whether the correct item has been picked, whether it is positioned accurately, and whether any defects or anomalies are present. This type of real-time feedback can significantly improve accuracy and reduce errors without interrupting the workflow.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The first case study was designed to explore whether modern computer vision models (YOLO) can provide real-time part verification in a manual picking scenario using only low-cost hardware. The goal was to demonstrate the practicality, efficiency, and low implementation barrier of such a system for SMEs. Rather than developing a full-scale dataset or benchmark suite, the design focused on feasibility: fine-tuning YOLO on a small set of labeled images and validating whether it could guide picking actions reliably in a controlled lab setting.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Case Study 1: An experiment with object detection technology<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">To evaluate the feasibility of low-cost AI support for manual picking tasks, a compact prototype was developed. The only additional hardware for the assistance system was a standard webcam and a laptop running the YOLO object detection model. The system was designed to verify whether parts were picked correctly and in the right sequence, capabilities that are typically limited to costly technologies such as pick-by-light, vision-guided robotics, or scan-based systems. 
The goal was to replicate these core functions using minimal resources suitable for small and medium sized enterprises (SMEs).&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The prototype was deployed in a lab environment to support drone assembly, as shown in <strong>Figure 3c<\/strong>, with parts organized in labeled KANBAN bins and a predefined pick sequence: first, a body cover, followed by a baseplate. The setup includes an overhead webcam above an assembly tray, a workstation with an NVIDIA RTX A1000 GPU, and a display guiding the picker through the task. The user interface displays the current part to pick (for example \u201cKommissionieren: Haube gr\u00fcn V1\u201d (engl: \u201cCommission: Body cover green V1\u201d)) and updates in real time as detections occur.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"415\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3.jpg\" alt=\"Figure 3: Overview of the experimental setup: (a) system architecture; (b) sample training images; (c) picking station where the assistance system is deployed.\" class=\"wp-image-109992\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3-764x317.jpg 764w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3-768x319.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3-514x213.jpg 514w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3-510x212.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-3-64x27.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 3: Overview of the experimental setup: 
(a) system architecture; (b) sample training images; (c) picking station where the assistance system is deployed.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">When a picking task begins, the webcam captures frames that are streamed via WebSocket to the backend, as shown in <strong>Figure 3a<\/strong>. The Go backend publishes the frames to a Redis topic. A Python-based YOLO worker subscribes to this topic, performs inference, and returns two outputs: an annotated frame and the predicted part class. These are sent back via Redis, and the backend forwards the annotated image as an MJPEG stream and the part label via Server-Sent Events. The React frontend listens for these updates and compares the detected class to the expected sequence. If the part is correct, the system highlights it in green and advances to the next step. If not, a red warning is shown.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This progression is illustrated in <strong>Figure 4<\/strong>, which shows the full user interaction sequence from session start to task completion. In <strong>Figure 4a<\/strong>, the session begins with the start screen where the user launches a new picking task. In <strong>Figure 4b<\/strong>, the system instructs the worker to place \u201cHaube gr\u00fcn V1\u201d; both part progress indicators are grey, signaling that no item has yet been confirmed. In <strong>Figure 4c<\/strong>, the system detects and confirms the correct body cover, turning the indicator green and activating the next step: \u201cGrundplatte wei\u00df V1\u201d (engl: \u201cBaseplate white V1\u201d). Finally, in <strong>Figure 4d<\/strong>, after the correct baseplate is placed and validated, the system marks the task as complete, with all status indicators turning green.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This hands-free confirmation mechanism eliminates the need for manual scanning or button presses. 
The real-time verification logic is lightweight and decoupled across services using Redis as the communication backbone, ensuring low-latency response. The full architecture\u2014frontend, backend, and detection pipeline\u2014is summarized in <strong>Figure 3a<\/strong>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"504\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4.jpg\" alt=\"Figure 4: User interface progression during a picking task: (a) session start screen; (b) UI prompts user to place \u201cHaube gr\u00fcn V1\u201d; (c) \u201cHaube gr\u00fcn V1\u201d is detected and confirmed (green); (d) \u201cGrundplatte wei\u00df V1\u201d is placed and confirmed, marking the order as complete.\" class=\"wp-image-109994\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4-744x375.jpg 744w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4-768x387.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4-514x259.jpg 514w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4-510x257.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-4-64x32.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 4: User interface progression during a picking task: (a) session start screen; (b) UI prompts user to place \u201cHaube gr\u00fcn V1\u201d; (c) \u201cHaube gr\u00fcn V1\u201d is detected and confirmed (green); (d) \u201cGrundplatte wei\u00df V1\u201d is placed and confirmed, marking the order as complete.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Example images used 
for model training and validation are shown in <strong>Figure 3b<\/strong>, demonstrating reliable bounding box detection under typical lab conditions. The model was trained on approximately 2,000 labeled images across the nine relevant classes (various color variants of body cover and shape variants of baseplates).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Despite the modest dataset, it achieved strong classification performance. During live testing, the system maintained consistent responsiveness, with end-to-end detection latency (from camera frame capture to UI feedback) typically ranging between 100 and 300 milliseconds. The setup was realized at the Technology Transfer Center (TTZ) Leipheim, Germany [32], which supports research in smart production and logistics. The facility provides a complete test environment for drone order fulfillment. The selection of Go, Redis, and YOLO played a key role in building a modular, low-cost, and scalable prototype that runs on standard hardware.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The trained YOLOv8n model achieved high accuracy across all nine drone components. These included labels such as \u201cHaube gr\u00fcn V1\u201d, \u201cHaube braun V2\u201d (engl: \u201cBody cover brown V2\u201d), \u201cHaube orange V2\u201d (engl: \u201cBody cover orange V2\u201d), \u201cGrundplatte wei\u00df V1\u201d, \u201cGrundplatte blau V2\u201d (engl: \u201cBaseplate blue V2\u201d), \u201cGrundplatte schwarz V1\u201d (engl: \u201cBaseplate black V1\u201d), and other visually distinct body cover and baseplate variants.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As shown in <strong>Figure 5<\/strong>, training and validation losses consistently declined over 256 epochs. The model achieved a precision of 99.6%, a recall of nearly 100%, and an mAP@0.5 of 99.2%, despite using no data augmentation. 
While these metrics reflect the model\u2019s ability to correctly detect drone parts with high reliability, a real-world test is the key differentiator of actual performance.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"476\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5.jpg\" alt=\"Figure 5: Model training and validation loss curves over 256 epochs.\" class=\"wp-image-109996\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5-764x364.jpg 764w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5-768x366.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5-514x245.jpg 514w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5-510x243.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-5-64x30.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 5: Model training and validation loss curves over 256 epochs.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The confusion matrix in <strong>Figure 6<\/strong> shows accurate predictions for nearly all classes, with low confusion even between visually similar baseplate or body cover variants. 
These results confirm that the system can support reliable, low-cost picking guidance using consumer-grade webcams and minimal training data.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"924\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6.jpg\" alt=\"Figure 6: Confusion matrix showing high accuracy across the nine part classes.\" class=\"wp-image-109998\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-406x375.jpg 406w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-768x710.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-316x292.jpg 316w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-196x180.jpg 196w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-510x471.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-6-64x59.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 6: Confusion matrix showing high accuracy across the nine part classes.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">While the system achieved high accuracy under controlled lab conditions, a few limitations were observed. Under low light conditions, the model occasionally failed to detect the correct part or exhibited delayed responses. All training and validation images were captured in a well-lit environment, which may limit generalization to darker or more variable lighting scenarios. 
These findings suggest that consistent lighting or additional training data under varied conditions would be necessary to ensure robustness in real-world deployments. Image enhancement techniques will be explored in future iterations to make detection more robust.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While conventional systems like pick-by-light or projection setups require infrastructure investments of $170,000 or more for 25 users, the case study prototype was implemented using consumer-grade components: a \u20ac70 webcam, a mid-tier laptop with an NVIDIA RTX A1000 GPU, and open-source software (YOLO). The only model training required was fine-tuning on ~2,000 labeled images, a task completed without augmentation and using standard Python libraries.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">No commercial licenses were needed. Once deployed, inference runs entirely locally, eliminating cloud costs and ensuring data privacy. Operationally, the system showed latencies of about 100 ms per frame and detected drone parts with an mAP@0.5 of 99.2%. Thus, the solution demonstrates a practical, low-barrier alternative for SMEs seeking verification without the financial or technical burden of traditional systems.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This prototype and its initial findings were presented at the 2025 POMS Annual Conference in Atlanta [32], where the approach was recognized for its ease of deployment and potential for improving accuracy in manual workflows. 
While the prototype demonstrates practical feasibility, we acknowledge that a full cost-benefit analysis and systematic performance validation in diverse real-world environments remain necessary.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Multimodal assistance\u2014Integrating vision and language models<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Large Language Models (LLMs), based on the transformer architecture [33] and trained on massive text datasets, are capable of comprehending, processing, and generating human language [34]. These capabilities make them highly useful in the industrial context. Core capabilities in the industrial setting include Natural Language Understanding (NLU), text generation, reasoning, and instruction following [34-36]. NLU helps to process different types of texts like technical manuals, operator logs, or regulatory documents [34]. Text generation is essential for automating the process of writing reports or generating documentation [34]. The reasoning capabilities of an LLM in an industrial setting are important for problem-solving, strategic planning, and process optimization [37].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">LLMs can follow user-defined instructions more reliably when these are enriched with detailed contextual information through Retrieval-Augmented Generation (RAG) [36]. RAG is one method of transforming general-purpose LLMs into Industrial AI Systems. For reliable industrial deployment, LLMs must operate as task-specific agents that leverage contextual awareness and validate outputs against predefined rules [38]. Human-Machine Interfaces (HMIs) also benefit from the capabilities of LLMs. 
LLMs can replace complex menus in HMIs with a chat interface, where workers can ask their questions via text or voice command, making the HMIs more accessible and intuitive [39, 40].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Building on the reliable detection and logic-driven guidance offered by object detection models, integrating LLMs introduces a new layer of adaptability and communication. In this multimodal setting, the vision model acts as the \u201ceyes\u201d of the system, ensuring physical accuracy [41], whereas the LLM acts as the \u201cbrain\u201d and the \u201cvoice\u201d of the system, providing context, reasoning, and human-friendly interaction [37].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">One key function of the LLM in the context of picking assistance could be to guide the worker via step-by-step instructions generated from the order configuration (for example, first part X, then part Y) and to monitor the worker\u2019s completed steps in real time. The multilingual nature of LLMs combined with voice commands allows seamless interaction with workers from any background. The LLM can also help to train new employees with interactive tutorials, guide workers in case of an error, and create reports based on vision logs and order configuration [37].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most of these LLM functionalities are closely connected to the outputs of the vision model and vice versa, creating a positive synergy. An exemplary workflow of this synergy is the error feedback loop: the vision model detects an error in the picking process and, subsequently, the LLM explains to the worker what the error is and how to fix it. Another synergy is the tracking of the assembly steps via parts verification by the vision model and the parallel generation of assembly instructions for the current step by the LLM. 
One synergy that leads to error prevention is the following: the vision model recognizes a frequent error pattern and, consequently, the LLM warns the worker preemptively to avoid this error [42].<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This multimodal approach offers multiple benefits. It reduces the cognitive load on the workers by guiding them through the process via voice [43]. It also shifts the system from pure error detection to error prevention. By preventing errors and giving real-time guidance, the multimodal picking assistant accelerates the picking workflow [44].<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Need for a real-time context-aware language assistant system<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">What if the picking assistant could do more than just detect parts? What if it could speak, clarify mistakes, and guide workers through each step in their own language? These questions led us to extend our vision-based system into something more interactive. In this section, we explore how combining object detection with a language model can create an assistant that not only sees what the worker is doing but also responds in real time with helpful feedback and instructions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Case study 1 demonstrated that low-cost object detection models like YOLO can effectively verify pick sequences in real time using modest hardware. However, while visual verification ensures correctness, it does not address the broader operational challenges SMEs face, namely onboarding new or temporary workers, navigating language barriers, and coping with a retiring workforce. 
Traditional retraining approaches are time-consuming and often ineffective across multilingual teams.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At the same time, intelligent assistance systems in recent literature largely depend on cloud-based LLMs and infrastructure-heavy setups [45, 46], which are incompatible with resource-constrained environments. There remains a clear gap between the practical, real-time needs of small-scale manufacturing and the assumptions embedded in most AI tooling.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">What is needed is a system that offers not just visual verification but also interactive, multilingual support that aligns with each stage of the task. In the next section, we explore SOPHIE, our lightweight multimodal assistant that combines local YOLO detection with a context-aware LLM. SOPHIE not only understands the current order but is also aware of the worker\u2019s progress, enabling dynamic support on demand for each individual step without relying on cloud services.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The second case study extends the first by integrating a local LLM into the picking workflow. The design aimed to explore whether combining vision with contextual language understanding could provide real-time, multilingual user support without relying on cloud infrastructure. The LLM was not retrained but operated on structured prompts derived from the current order and detection state. The study focused on scope expansion, user interaction, and demonstrating low-cost deployment using off-the-shelf hardware.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Case Study 2: An experiment on the synergies between vision and language<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">To validate our ideas on the combination of vision and language, we built a prototype application, SOPHIE, on top of the case study 1 system. It combines the vision model trained for case study 1 with a local LLM (WizardLM2) [47]. 
The idea of SOPHIE is that the LLM answers real-time queries from the user based on the current order and the part detected by the YOLO model.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">SOPHIE is implemented as a Tkinter [48] application, shown in <strong>Figure 7<\/strong>. The interface consists of a camera stream with extra information (FPS, camera in use, start time, etc.), a confidence threshold slider for part detection with the YOLO model, a thumbnail and text for the currently expected part, an overview of the whole order with its part sequence, and a button for opening the chat popup with the LLM. In the chat popup, the user can either select predefined questions in German and English relevant to the picking task or ask their own questions [48]. To generate a meaningful response, the LLM is given a prompt that combines several types of context.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"635\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7.jpg\" alt=\"Figure 7: SOPHIE UI built using Tkinter. 
Main application shows order management and live stream with various overlays.\" class=\"wp-image-110000\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7-591x375.jpg 591w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7-768x488.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7-460x292.jpg 460w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7-510x324.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-7-64x41.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 7: SOPHIE UI built using Tkinter. Main application shows order management and live stream with various overlays.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The base of the prompt is the system prompt, which stays the same for all queries. It tells the LLM its task (multilingual assistant for drone assembly), how it should answer (concisely, precisely, and in the user\u2019s language), and that it should use all given context. The context starts with information about the order and its status.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The order confirmation comprises the order ID, the expected body cover, and the expected baseplate, while the status tells the LLM which parts are already verified, which part is expected next, and whether any wrong detections have occurred. The next context segment in the prompt consists of instructions for picking either the body cover or the baseplate, depending on which part is currently expected. 
The final piece of context is the user\u2019s actual question.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"512\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8.jpg\" alt=\"Figure 8: SOPHIE Chat interface with SOPHIE multilingual interactions triggered when space is pressed.\" class=\"wp-image-110002\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8.jpg 1000w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8-732x375.jpg 732w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8-768x393.jpg 768w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8-514x263.jpg 514w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8-510x261.jpg 510w, https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Siddiqui_I4S-4-25_Figure-8-64x33.jpg 64w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 8: SOPHIE Chat interface with SOPHIE multilingual interactions triggered when space is pressed.<\/em><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The answers generated by SOPHIE, such as those in <strong>Figure 8<\/strong>, show that, by combining different contexts in the prompt, the system can guide the user through the picking process and help them in case of a picking error. <strong>Figure 8<\/strong> shows four common interaction stages during the picking process. In <strong>Figure 8a<\/strong>, the user initiates assistance via a button or simulated voice input. <strong>Figure 8b<\/strong> presents predefined multilingual questions; here, the user selects a German prompt and receives a German response. 
In <strong>Figure 8c<\/strong>, the user chooses option 0 to enter a custom question manually, this time in English. <strong>Figure 8d<\/strong> shows SOPHIE\u2019s context-aware response in English, highlighting its ability to switch languages based on user input.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Extending capability with LLM assistance at minimal overhead<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">While combining NLP and vision is often assumed to be resource-intensive, SOPHIE was designed for feasibility and accessibility. The application is built entirely in Python using Tkinter and runs on the same consumer-grade hardware as case study 1 (4 GB NVIDIA GPU, 32 GB RAM). The language component (WizardLM2) [47] is a pre-trained, open-source LLM that runs locally without fine-tuning. This avoids both expensive training cycles and server-based inference.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Although the LLM response time (~20\u201330 seconds per query) is slower than desirable for interactive use, this is a trade-off made for offline operation, data privacy, and cost control. All core functionalities, such as visual validation, multilingual support, and contextual task guidance, are achieved without additional licenses or cloud infrastructure.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As such, SOPHIE represents a practical extension of vision-based picking systems, making interactive assistance more inclusive and adaptable in constrained industrial environments. While the prototype demonstrates practical feasibility, we acknowledge that a full cost-benefit analysis and systematic performance validation in diverse real-world environments remain future work.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Toward smarter order picking<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The comparison of existing assistance systems highlights a clear trade-off between error rates and implementation costs, often limiting accessibility for many organizations. 
Our first case study demonstrated that reliable part verification and sequencing can already be achieved using computer vision models and simple logic, implemented with low-cost, widely available hardware.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At the same time, recent developments in generative AI, especially Large Language Models, offer new possibilities to enhance interaction, adaptability, and multilingual accessibility. Our second case study shows that combining vision with language models has the potential to improve picker guidance and thus reduce error rates. Together, these technologies could complement or gradually replace traditional systems by offering scalable, intelligent, and cost-effective alternatives.<\/p>\n<hr><div class=\"gito-pub-content-bibliography\"><h2>Bibliography<\/h2>[1]\tGrosse, E. H.; Glock, C. H.; Jaber, M. Y.; Neumann, W. P.: Incorporating human factors in order picking planning models: framework and research opportunities. In: International Journal of Production Research 53 (2015) 3, pp. 695-717. DOI: https:\/\/doi.org\/10.1080\/00207543.2014.919424.\r<br>[2]\tDe Koster, R.; Le-Duc, T.; Roodbergen, K. J.: Design and control of warehouse order picking: A literature review. In: European Journal of Operational Research 182 (2007) 2, pp. 481-501. DOI: https:\/\/doi.org\/10.1016\/j.ejor.2006.07.009.\r<br>[3]\tCasella, G.; Volpi, A.; Montanari, R.; Tebaldi, L.; Bottani, E.: Trends in order picking: A 2007\u20132022 review of the literature. In: Production &#038; Manufacturing Research 11 (2023) 1, p. 2191115. DOI: https:\/\/doi.org\/10.1080\/21693277.2023.2191115.\r<br>[4]\t\u0141opuszy\u0144ski, M.; Janusz, K.; Karwat, D.: Comparative Study of Selected Order-Picking Methods: Efficiency, Ergonomics, and Adaptation Rate of New Employees. In: Sensors 25 (2025) 3, p. 923. 
DOI: https:\/\/doi.org\/10.3390\/s25030923.\r<br>[5]\tLi, F.: Comparing pick-by-vision to pick-by-paper: An experimental assessment of pick times, error rates and user satisfaction. Hochschule f\u00fcr angewandte Wissenschaften Neu-Ulm 2020. URL: https:\/\/publications.hs-neu-ulm.de\/1750\/1\/Kunze_Fang_WP_42_Comparing%20pick%20by%20vision%20to%20pick%20by%20paper.pdf, accessed 14.05.2025.\r<br>[6]\tUNEX: Paper Picking Processes: Are They Picking Your Pocket? URL: https:\/\/blog.unex.com\/paper-picking-processes, accessed 14.05.2025.\r<br>[7]\tSchriefer, J.: Why Do The Majority Of DCs Still Use Paper For Picking? URL: https:\/\/www.lucasware.com\/blog-majority-dcs-still-use-paper-picking\/, accessed 14.05.2025.\r<br>[8]\torderwise: Why paper-picking is picking the pockets of your warehouse. URL: https:\/\/orderwise.co.uk\/en\/blog\/why-paper-picking-is-picking-the-pockets-of-your-warehouse, accessed 14.05.2025.\r<br>[9]\tLucchese, A.; Mummolo, G.: Human-Centric Assistive Technologies in Manual Picking and Assembly Tasks: A Literature Review. In: Management and Production Engineering Review 15 (2024) 2, pp. 73-86. DOI: https:\/\/doi.org\/10.24425\/mper.2024.151132.\r<br>[10]\tMandar, E. M.; Dachry, W.; Bensassi, B.: Toward a Real-Time Picking Errors Prevention System Based on RFID Technology. In: Advances on Smart and Soft Computing (2021), Singapore, pp. 303-318. DOI: https:\/\/doi.org\/10.1007\/978-981-15-6048-4_27.\r<br>[11]\tAmit, J.: Voice Picking Systems: Are They The Best Choice for Your Warehouse? URL: https:\/\/aiola.ai\/blog\/voice-picking-systems, accessed 14.05.2025.\r<br>[12]\tBadwi, M.: Voice picking or pick to light: which is best for your business? URL: https:\/\/www.scjunction.com\/blog\/voice-picking-or-pick-to-light-which-is-best-for-your-business, accessed 14.05.2025.\r<br>[13]\tGuest: 3 things to know before investing in voice picking. 
URL: https:\/\/www.allthingssupplychain.com\/3-things-to-know-before-investing-in-voice-picking\/, accessed 14.05.2025.\r<br>[14]\tMarkowitz, J.: Ergonomics of the Voice. URL: https:\/\/www.speechtechmag.com\/Articles\/Columns\/Forward-Thinking\/Ergonomics-of-the-Voice-34400.aspx, accessed 14.05.2025.\r<br>[15]\tStipp, T.: Comparing Order Picking Technologies. URL: https:\/\/www.procatdt.com\/wp-content\/uploads\/2021\/01\/Comparing-Order-Picking-Technologies.pdf, accessed 14.05.2025.\r<br>[16]\tYzquierdo, J.: What is a Pick to Light System and How Does Voice Compare? URL: https:\/\/www.lucasware.com\/what-is-a-pick-to-light-system-and-how-does-voice-compare\/, accessed 15.05.2025.\r<br>[17]\tHanrahan, D.: Multi-modal picking technology provides ROI for small to mid size order fulfillment processes. URL: https:\/\/parcelindustry.com\/print-article-1322-permanent.html, accessed 14.05.2025.\r<br>[18]\tPackiyo: Pick to Light System explained: All you need to know. URL: https:\/\/www.packiyo.com\/blog\/pick-to-light, accessed 14.05.2025.\r<br>[19]\tAmes, B.: Smartglasses get a second look from warehouses. URL: https:\/\/www.dcvelocity.com\/articles\/28603-smartglasses-get-a-second-look-from-warehouses, accessed 14.05.2025.\r<br>[20]\tSchriefer, J.: The reality of smart glasses for warehouse vision picking. URL: https:\/\/www.lucasware.com\/warehouse-vision-picking\/, accessed 28.03.2025.\r<br>[21]\tHeuts, P.: DHL experiments with augmented reality. 2017. URL: https:\/\/www.etui.org\/sites\/default\/files\/Hesamag_16_EN-22-26.pdf, accessed 28.03.2025.\r<br>[22]\tHerzog, N. V.; Beharic, A.: Effects of the use of smart glasses on eyesight. In: Human Systems Engineering and Design II: Proceedings of the 2nd International Conference on Human Systems Engineering and Design (IHSED2019): Future Trends and Applications, September 16-18, 2019, Universit\u00e4t der Bundeswehr M\u00fcnchen, Munich, Germany (2020), pp. 808-812. 
DOI: https:\/\/doi.org\/10.1007\/978-3-030-27928-8_123.\r<br>[23]\tLai, K.-h.; Cheng, T. E.: Just-in-time logistics. London 2016.\r<br>[24]\tKhanam, R.; Hussain, M.; Hill, R.; Allen, P.: A comprehensive review of convolutional neural networks for defect detection in industrial applications. In: IEEE Access 12 (2024) pp. 94250-94295. DOI: https:\/\/doi.org\/10.1109\/ACCESS.2024.3425166.\r<br>[25]\tVijayakumar, A.; Vairavasundaram, S.: Yolo-based object detection models: A review and its applications. In: Multimedia Tools and Applications 83 (2024) 35, pp. 83535-83574. DOI: https:\/\/doi.org\/10.1007\/s11042-024-18872-y.\r<br>[26]\tZou, Z.; Chen, K.; Shi, Z.; Guo, Y.; Ye, J.: Object detection in 20 years: A survey. In: Proceedings of the IEEE 111 (2023) 3, pp. 257-276. DOI: https:\/\/doi.org\/10.48550\/arXiv.1905.05055.\r<br>[27]\tAhmad, H. M.; Rahimi, A.: Deep learning methods for object detection in smart manufacturing: A survey. In: Journal of Manufacturing Systems 64 (2022) pp. 181-196. DOI: https:\/\/doi.org\/10.1016\/j.jmsy.2022.06.011.\r<br>[28]\tRedmon, J.; Divvala, S.; Girshick, R.; Farhadi, A.: You Only Look Once: Unified, Real-Time Object Detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), pp. 779-788. DOI: https:\/\/doi.org\/10.1109\/CVPR.2016.91.\r<br>[29]\tSekar, K.; Dheepa, T.; Sheethal, R.; Suvarna Smita, R.; Teja, V. D.: Efficient Object Detection on Low-Resource Devices Using Lightweight MobileNet-SSD. In: 2025 International Conference on Intelligent Systems and Computational Networks (ICISCN) (2025), Bangalore, pp. 1-6. DOI: https:\/\/doi.org\/10.1109\/ICISCN64258.2025.10934442.\r<br>[30]\tTan, M.; Pang, R.; Le, Q. V.: Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (2020), pp. 10781-10790. 
DOI: https:\/\/doi.org\/10.1109\/CVPR42600.2020.01079.\r<br>[31]\tZhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; et al.: Detrs beat yolos on real-time object detection. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (2024), pp. 16965-16974. DOI: https:\/\/doi.org\/10.48550\/arXiv.2304.08069.\r<br>[32]\tSiddiqui, M. K.; Hoffman, B.; Grinninger, J.: Real-Time Object Detection using AI for Enhanced Operational Efficiency. Presented at the 2025 POMS Annual Conference, Atlanta, May 8\u201312.\r<br>[33]\tVaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; et al.: Attention is all you need. In: Advances in neural information processing systems 30 (2017).\r<br>[34]\tZhao, W. X.; Zhou, K.; Li, J.; Tang, T.; Wang, X.; et al.: A survey of large language models. In: arXiv preprint (2023). DOI: https:\/\/doi.org\/10.48550\/arXiv.2303.18223.\r<br>[35]\tWasti, S. M.; Pu, K. Q.; Neshati, A.: Large Language User Interfaces: Voice Interactive User Interfaces powered by LLMs. In: Intelligent Systems Conference (2024), pp. 639-655.\r<br>[36]\tXia, Y.; Jazdi, N.; Weyrich, M.: Applying Large Language Models for intelligent industrial automation. In: atp magazin 66 (2024) 6\u20137, pp. 62-71. DOI: https:\/\/doi.org\/10.17560\/atp.v66i6-7.2739.\r<br>[37]\tLi, Y.; Zhao, H.; Jiang, H.; Pan, Y.; Liu, Z.; et al.: Large language models for manufacturing. In: arXiv preprint (2024). DOI: https:\/\/doi.org\/10.48550\/arXiv.2410.21418.\r<br>[38]\tHalse, G.: Beyond the Prompt: Harnessing Industrial AI Agents for Reliable Automation. URL: https:\/\/www.gavinhalse.com\/ai-in-manufacturing\/beyond-the-prompt-harnessing-industrial-ai-agents-for-reliable-automation\/, accessed 15.05.2025.\r<br>[39]\tShone, O.: 5 key features and benefits of large language models. 
URL: https:\/\/www.microsoft.com\/en-us\/microsoft-cloud\/blog\/2024\/10\/09\/5-key-features-and-benefits-of-large-language-models\/, accessed 14.05.2025.\r<br>[40]\tKaur, J.: Enhancing Manufacturing with Large Language Models (LLMs). URL: https:\/\/www.xenonstack.com\/blog\/large-language-model-manufacturing, accessed 15.05.2025.\r<br>[41]\tAdmon, W.: Intelligent Humanoid Robots: An Overview and Focus on Visual Perception Systems. URL: https:\/\/www.basic.ai\/blog-post\/intelligent-humanoid-robots-vision-perception, accessed 15.05.2025.\r<br>[42]\tMiko\u0142ajewska, E.; Miko\u0142ajewski, D.; Miko\u0142ajczyk, T.; Paczkowski, T.: Generative AI in AI-Based Digital Twins for Fault Diagnosis for Predictive Maintenance in Industry 4.0\/5.0. In: Applied Sciences 15 (2025) 6, p. 3166. DOI: https:\/\/doi.org\/10.3390\/app15063166.\r<br>[43]\tGkintoni, E.; Antonopoulou, H.; Sortwell, A.; Halkiopoulos, C.: Challenging Cognitive Load Theory: The Role of Educational Neuroscience and Artificial Intelligence in Redefining Learning Efficacy. In: Brain Sciences 15 (2025) 2, p. 203. DOI: https:\/\/doi.org\/10.3390\/brainsci15020203.\r<br>[44]\tPeiu, B.: Transforming quality control: How AI-powered visual anomaly detection reduces production defects. URL: https:\/\/www.craftworks.ai\/insights\/know-how\/transforming-quality-control-how-ai-powered-visual-anomaly-detection-reduces-production-defects\/, accessed 15.05.2025.\r<br>[45]\tWang, H.; Li, C.; Li, Y.-F.; Tsung, F.: An Intelligent Industrial Visual Monitoring and Maintenance Framework Empowered by Large-Scale Visual and Language Models. In: IEEE Transactions on Industrial Cyber-Physical Systems (2024). \r<br>[46]\tGiacalone, E.: AI-Powered Autonomous Industrial Monitoring: Integrating Robotics, Computer Vision, and Generative AI. 2025, Politecnico di Torino.\r<br>[47]\tollama.com: wizardlm2. 
URL: https:\/\/ollama.com\/library\/wizardlm2, accessed 26.06.2025.\r<br>[48]\tLove, D.: Tkinter GUI Programming by Example: Learn to create modern GUIs using Tkinter by building real-world projects in Python. 2018.<\/div><hr style=\"margin-top:0px;\">\n<h2 class=\"gito-pub-frontend-post-headline\">You might also be interested in<\/h2>\n<!-- GITO_PUB_POST start flex-container -->\n<div class=\"gito-pub-flex-container\">\n   <div class=\"gito-pub-frontend-post-card gito-pub-flex-item gito-pub-flex-item-1\">\n      <a href=\"https:\/\/industry-science.com\/en\/articles\/loam-construction-wooden-shelving\/\">\n         <div class=\"gito-pub-frontend-post-card-row\">         <div class=\"gito-pub-frontend-post-card-column gito-pub-frontend-post-card-column-image\">\n            <picture>\n               <source media=\"(max-width:640px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/12\/AdobeStock_1209835783_andov-copie-640x325.webp\">\n               <source media=\"(min-width:641px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/12\/AdobeStock_1209835783_andov-copie-196x180.webp\">\n               <img decoding=\"async\" class=\"gito-pub-frontend-post-card-image\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/12\/AdobeStock_1209835783_andov-copie-196x180.webp\" alt=\"Loam Construction and Wooden Shelving\">\n            <\/picture>\n         <\/div>\n            <div class=\"gito-pub-frontend-post-card-column\">               <div class=\"ellipsis\" style=\"height:166px !important;overflow:hidden;\" title=\"Loam Construction and Wooden Shelving\">                  <table class=\"gito-pub-frontend-post-card-header\">\n            \t     <tr>\n                        <td>                  \t\t   <h4 class=\"gito-pub-frontend-post-card-title\" style=\"line-height:1.2em;\">Loam Construction and Wooden Shelving<\/h4>\n                        <div class=\"gito-pub-frontend-post-card-subtitle\">A contribution to sustainability in warehouse logistics<\/div>                        <div class=\"gito-pub-frontend-post-card-author\"><a 
href=\"\/authors\/viviano-de-giacomo\/\">Viviano De Giacomo<\/a> <a href=\"https:\/\/orcid.org\/0009-0009-4070-9499\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a>, <a href=\"\/authors\/nathalie-fritsch\/\">Nathalie Fritsch<\/a> <a href=\"https:\/\/orcid.org\/0009-0007-9857-5898\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a>, <a href=\"\/authors\/jakob-kennert\/\">Jakob Kennert<\/a> <a href=\"https:\/\/orcid.org\/0009-0007-8246-6443\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a>, <a href=\"\/authors\/dieter-uckelmann\/\">Dieter Uckelmann<\/a> <a href=\"https:\/\/orcid.org\/0000-0001-7657-3292\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a><\/div>\n                        <\/td>\n                     <\/tr>\n                  <\/table>\n                  <div class=\"gito-pub-frontend-post-card-text\">\n                     <div class=\"gito-pub-frontend-post-card-abo-sign gito-pub-login-register-link\" data-targetabo=\"expert\" data-targeturl=\"https:\/\/industry-science.com\/en\/articles\/loam-construction-wooden-shelving\/\" title=\"please login or register - content can only be read in its entirety with a subscription  expert\">\n\t\t\t             
            <img decoding=\"async\" src=\"https:\/\/industry-science.com\/wp-content\/plugins\/gito-publisher\/img\/i4s-login.png\">\n\t\t\t                      <\/div>This study examines the contribution of natural building materials, in particular loam and wood, to the sustainable development of logistics infrastructure, assessing ecological, economic, and technical dimensions across the entire life cycle. Potentials, restrictions, and supportive framework conditions are identified based on literature analyses and expert interviews. Wood proves to be technically mature and ecologically advantageous, especially in high rack construction, while loam offers high potential for energy- and resource-efficient construction. The study concludes with recommendations for research, policy, and practice to establish circular construction methods in logistics.                  <\/div>\n               <\/div>\n               <div class=\"gito-pub-frontend-post-card-scientific\"><strong>Industry 4.0 Science<\/strong> | Volume 41 | Edition 6 | Pages 82-89<\/div>            <\/div>\n         <\/div>\n      <\/a>\n   <\/div>\n   <div class=\"gito-pub-frontend-post-card gito-pub-flex-item gito-pub-flex-item-1\">\n      <a href=\"https:\/\/industry-science.com\/en\/articles\/electric-trucks-pre-post-carriage\/\">\n         <div class=\"gito-pub-frontend-post-card-row\">         <div class=\"gito-pub-frontend-post-card-column gito-pub-frontend-post-card-column-image\">\n            <picture>\n               <source media=\"(max-width:640px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/11\/AdobeStock_544561574_Kalyakan-640x325.jpeg\">\n               <source media=\"(min-width:641px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/11\/AdobeStock_544561574_Kalyakan-196x180.jpeg\">\n               <img decoding=\"async\" class=\"gito-pub-frontend-post-card-image\" 
src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/11\/AdobeStock_544561574_Kalyakan-196x180.jpeg\" alt=\"Electric Trucks in Intermodal Terminal Pre- and Post-Carriage\">\n            <\/picture>\n         <\/div>\n            <div class=\"gito-pub-frontend-post-card-column\">               <div class=\"ellipsis\" style=\"height:166px !important;overflow:hidden;\" title=\"Electric Trucks in Intermodal Terminal Pre- and Post-Carriage\">                  <table class=\"gito-pub-frontend-post-card-header\">\n            \t     <tr>\n                        <td>                  \t\t   <h4 class=\"gito-pub-frontend-post-card-title\" style=\"line-height:1.2em;\">Electric Trucks in Intermodal Terminal Pre- and Post-Carriage<\/h4>\n                        <div class=\"gito-pub-frontend-post-card-subtitle\">Impact on terminal processes in combined road-rail freight transport<\/div>                        <div class=\"gito-pub-frontend-post-card-author\"><a href=\"\/authors\/ralf-elbert\/\">Ralf Elbert<\/a> <a href=\"https:\/\/orcid.org\/0000-0002-9337-9173\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a>, <a href=\"\/authors\/samira-ghaneian-sebdani\/\">Samira Ghaneian Sebdani<\/a> <a href=\"https:\/\/orcid.org\/0009-0004-1073-2034\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a><\/div>\n                        <\/td>\n                     <\/tr>\n                  <\/table>\n                  <div class=\"gito-pub-frontend-post-card-text\">\n                     <div class=\"gito-pub-frontend-post-card-abo-sign gito-pub-login-register-link\" 
data-targetabo=\"expert\" data-targeturl=\"https:\/\/industry-science.com\/en\/articles\/electric-trucks-pre-post-carriage\/\" title=\"please login or register - content can only be read in its entirety with a subscription  expert\">\n\t\t\t                         <img decoding=\"async\" src=\"https:\/\/industry-science.com\/wp-content\/plugins\/gito-publisher\/img\/i4s-login.png\">\n\t\t\t                      <\/div>Electric trucks (e-trucks) play an important role in reducing CO\u2082 emissions especially on short distances in pre and post-carriage in combined road-rail freight transport (CT). Using the example of a CT terminal, this article highlights the logistical and energy challenges involved in using e-trucks to establish suitable charging infrastructures and ensuring a reliable power supply.                  <\/div>\n               <\/div>\n               <div class=\"gito-pub-frontend-post-card-scientific\"><strong>Industry 4.0 Science<\/strong> | Volume 41 | Edition 6 | Pages 70-77<\/div>            <\/div>\n         <\/div>\n      <\/a>\n   <\/div>\n   <div class=\"gito-pub-frontend-post-card gito-pub-flex-item gito-pub-flex-item-1\">\n      <a href=\"https:\/\/industry-science.com\/en\/articles\/assistance-production-logistics\/\">\n         <div class=\"gito-pub-frontend-post-card-row\">         <div class=\"gito-pub-frontend-post-card-column gito-pub-frontend-post-card-column-image\">\n            <picture>\n               <source media=\"(max-width:640px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/09\/Wenzel_AdobeStock_1560160859_Gorodenkoff-640x325.jpg\">\n               <source media=\"(min-width:641px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/09\/Wenzel_AdobeStock_1560160859_Gorodenkoff-196x180.jpg\">\n               <img decoding=\"async\" class=\"gito-pub-frontend-post-card-image\" 
src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/09\/Wenzel_AdobeStock_1560160859_Gorodenkoff-196x180.jpg\" alt=\"Assistance for Simulation in Production and Logistics\">\n            <\/picture>\n         <\/div>\n            <div class=\"gito-pub-frontend-post-card-column\">               <div class=\"ellipsis\" style=\"height:166px !important;overflow:hidden;\" title=\"Assistance for Simulation in Production and Logistics\">                  <table class=\"gito-pub-frontend-post-card-header\">\n            \t     <tr>\n                        <td>                  \t\t   <h4 class=\"gito-pub-frontend-post-card-title\" style=\"line-height:1.2em;\">Assistance for Simulation in Production and Logistics<\/h4>\n                        <div class=\"gito-pub-frontend-post-card-subtitle\">A literature-based classification<\/div>                        <div class=\"gito-pub-frontend-post-card-author\"><a href=\"\/authors\/sigrid-wenzel\/\">Sigrid Wenzel<\/a> <a href=\"https:\/\/orcid.org\/0000-0001-9594-1839\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a>, <a href=\"\/authors\/felix-oezkul\/\">Felix \u00d6zkul<\/a>, <a href=\"\/authors\/robin-sutherland\/\">Robin Sutherland<\/a> <a href=\"https:\/\/orcid.org\/0009-0005-6684-0066\" target=\"_blank\" title=\"ORCID eintrag \u00f6ffnen.\" rel=\"noopener\">\n        <img decoding=\"async\" src=\"https:\/\/orcid.org\/assets\/vectors\/orcid.logo.icon.svg\" alt=\"ORCID Icon\" style=\"width:16px;height:16px;vertical-align:middle;\"><\/a><\/div>\n                        <\/td>\n                     <\/tr>\n                  <\/table>\n                  <div class=\"gito-pub-frontend-post-card-text\">\n                     Despite the commercial availability of simulation tools, the use of discrete-event simulation 
for complex production and logistics systems is becoming increasingly challenging. It requires extensive expertise, high data quality, and considerable time and financial resources. For many years, therefore, there has been high demand for methodological and organizational support for conducting simulation studies. This article is based on an analysis of relevant publications and aims to classify previous research on improving the use of simulation. It also raises the question of the need for assistance in applying discrete-event simulation and identifies areas for action.                  <\/div>\n               <\/div>\n               <div class=\"gito-pub-frontend-post-card-scientific\"><strong>Industry 4.0 Science<\/strong> | Volume 41 | 2025 | Edition 5 | Pages 66-76 | DOI <a style=\"font-weight:bold !important;\" href=\"https:\/\/doi.org\/10.30844\/I4SE.25.5.64\" target=\"_blank\" rel=\"noopener\">10.30844\/I4SE.25.5.64<\/a><\/div>            <\/div>\n         <\/div>\n      <\/a>\n   <\/div>\n   <div class=\"gito-pub-frontend-post-card gito-pub-flex-item gito-pub-flex-item-1\">\n      <a href=\"https:\/\/industry-science.com\/en\/articles\/transport-automation-in-production-logistics\/\">\n         <div class=\"gito-pub-frontend-post-card-row\">         <div class=\"gito-pub-frontend-post-card-column gito-pub-frontend-post-card-column-image\">\n            <picture>\n               <source media=\"(max-width:640px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Zoller_en-640x325.jpg\">\n               <source media=\"(min-width:641px)\" srcset=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Zoller_en-196x180.jpg\">\n               <img decoding=\"async\" class=\"gito-pub-frontend-post-card-image\" src=\"https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/Zoller_en-196x180.jpg\" alt=\"Bridging Automated and Traditional Approaches in Material Transport\">\n            <\/picture>\n         <\/div>\n 
           <div class=\"gito-pub-frontend-post-card-column\">               <div class=\"ellipsis\" style=\"height:166px !important;overflow:hidden;\" title=\"Bridging Automated and Traditional Approaches in Material Transport\">                  <table class=\"gito-pub-frontend-post-card-header\">\n            \t     <tr>\n                        <td>                  \t\t   <h4 class=\"gito-pub-frontend-post-card-title\" style=\"line-height:1.2em;\">Bridging Automated and Traditional Approaches in Material Transport<\/h4>\n                        <div class=\"gito-pub-frontend-post-card-subtitle\">Why manual tugger train systems remain prevalent in intralogistics<\/div>                        <div class=\"gito-pub-frontend-post-card-author\"><a href=\"\/authors\/christoph-s-zoller\/\">Christoph S. Zoller<\/a>, <a href=\"\/authors\/wladimir-rempel\/\">Wladimir Rempel<\/a>, <a href=\"\/authors\/justus-langer\/\">Justus Langer<\/a>, <a href=\"\/authors\/bonita-grzechca\/\">Bonita Grzechca<\/a><\/div>\n                        <\/td>\n                     <\/tr>\n                  <\/table>\n                  <div class=\"gito-pub-frontend-post-card-text\">\n                     <div class=\"gito-pub-frontend-post-card-abo-sign gito-pub-login-register-link\" data-targetabo=\"expert\" data-targeturl=\"https:\/\/industry-science.com\/en\/articles\/transport-automation-in-production-logistics\/\" title=\"please login or register - content can only be read in its entirety with a subscription  expert\">\n\t\t\t                         <img decoding=\"async\" src=\"https:\/\/industry-science.com\/wp-content\/plugins\/gito-publisher\/img\/i4s-login.png\">\n\t\t\t                      <\/div>The ongoing automation of production logistics through driverless transport systems (DTS) can significantly enhance the efficiency and quality of transport processes. Despite these advantages, many companies still choose manual tugger train systems for material supply. 
Semi-structured interviews with industry experts provide insight into the reasons behind these decisions, with particular emphasis on factors that extend beyond purely economic assessment. The findings indicate that the lack of flexibility of driverless transport systems and the effort required for implementation are key reasons why manual transport solutions are often preferred in intralogistics.                  <\/div>\n               <\/div>\n               <div class=\"gito-pub-frontend-post-card-scientific\"><strong>Industry 4.0 Science<\/strong> | Volume 41 | 2025 | Edition 4 | Pages 60-66<\/div>            <\/div>\n         <\/div>\n      <\/a>\n   <\/div>\n<\/div>\n<!-- GITO_PUB_POST end flex-container -->\n","protected":false},"excerpt":{"rendered":"<p>Manual picking remains common due to the high initial cost of support systems. This paper reviews existing technologies, presents an exploratory vision-based prototype, and examines existing literature that explores how combining object detection with language systems could enhance manual workflows. 
The findings suggest a promising, low-cost direction for worker support in logistics.<\/p>\n","protected":false},"featured_media":109603,"menu_order":0,"template":"","categories":[79167,79168,79298],"tags":[74630,84508,84510,79627,80127,80180,80272,79365,84509,70924,84507,68013,84511,80270,72188],"product_cat":[],"topic":[79371],"technology":[79493],"knowhow":[],"industry":[79354],"writer":[84296,84297,84295],"content-type":[83932],"potential":[],"solution":[67610,78673],"glossary":[],"class_list":["post-109987","article","type-article","status-publish","has-post-thumbnail","category-design-en","category-translate-en","category-typeset","tag-computer-vision","tag-costefficient-automation","tag-humanai-interaction","tag-industrie-4-0-en","tag-industry-4-0-en","tag-large-language-models-en","tag-logistics-en","tag-logistik-en","tag-multimodal-ai","tag-order-picking","tag-picking-assistance-systems","tag-quality-assurance","tag-realtime-object-detection","tag-smart-manufacturing-en","tag-warehouse-logistics","topic-logistics","technology-digitalization","industry-smart-objects","writer-jonathan-kressel","writer-juergen-grinninger","writer-md-khalid-siddiqui","content-type-article","solution-logistik-en","solution-logistics-technology","product","first","instock","downloadable","virtual","sold-individually","taxable","purchasable","product-type-article"],"uagb_featured_image_src":{"full":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie.jpg",1400,788,false],"thumbnail":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-150x150.jpg",150,150,true],"medium":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-666x375.jpg",666,375,true],"medium_large":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-768x432.jpg",768,432,true],"large":["https:\/\/industry-science.com\/wp-conten
t\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-1024x576.jpg",1020,574,true],"front-page-entry":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-1032x320.jpg",1032,320,true],"post-entry":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-764x376.jpg",764,376,true],"post-teaser":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-392x320.jpg",392,320,true],"post-teaser-mobile":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-608x496.jpg",608,496,true],"post-custom-size":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-640x325.jpg",640,325,true],"whitepaper-teaser":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-274x376.jpg",274,376,true],"card-big":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-514x292.jpg",514,292,true],"card-portrait":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-320x440.jpg",320,440,true],"card-big-company":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-514x289.jpg",514,289,true],"gp-listing":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-196x180.jpg",196,180,true],"1536x1536":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie.jpg",1400,788,false],"2048x2048":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie.jpg",1400,788,false],"woocommerce_thumbnail":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-510x510.jpg",510,510,true],"woocommerce_single":["https:\/\/industry-scie
nce.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-510x287.jpg",510,287,true],"woocommerce_gallery_thumbnail":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-100x100.jpg",100,100,true],"dgwt-wcas-product-suggestion":["https:\/\/industry-science.com\/wp-content\/uploads\/2025\/08\/siddiqui-AdobeStock_707077453-copie-64x36.jpg",64,36,true]},"uagb_author_info":{"display_name":"Florian Goldmann","author_link":"https:\/\/industry-science.com\/en\/author\/"},"uagb_comment_info":0,"uagb_excerpt":"Manual picking remains common due to the high initial cost of support systems. This paper reviews existing technologies, presents an exploratory vision-based prototype, and examines existing literature that explores how combining object detection with language systems could enhance manual workflows. The findings suggest a promising, low-cost direction for worker support in logistics.","_links":{"self":[{"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/article\/109987","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/article"}],"about":[{"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/types\/article"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/media\/109603"}],"wp:attachment":[{"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/media?parent=109987"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/categories?post=109987"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/tags?post=109987"},{"taxonomy":"product_cat","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/product_cat?post=109987"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/topic?post=109987"},{"ta
xonomy":"technology","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/technology?post=109987"},{"taxonomy":"knowhow","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/knowhow?post=109987"},{"taxonomy":"industry","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/industry?post=109987"},{"taxonomy":"writer","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/writer?post=109987"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/content-type?post=109987"},{"taxonomy":"potential","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/potential?post=109987"},{"taxonomy":"solution","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/solution?post=109987"},{"taxonomy":"glossary","embeddable":true,"href":"https:\/\/industry-science.com\/en\/wp-json\/wp\/v2\/glossary?post=109987"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}