diff --git a/README.md b/README.md index 5148e78e..6dad3c48 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@
+
+
+
+ January 2026 | AAAI-26 | Singapore EXPO
++ Agentic AI is poised to transform decision-making in Industry 4.0 by enabling autonomous agents to reason over multimodal inputsโsuch as sensor streams, structured knowledge bases, and unstructured maintenance logsโand act adaptively under uncertainty. Yet, real-world adoption remains challenging due to data fragmentation, integration complexity, limited explainability, and lack of evaluation workflows. +
++ This hands-on tutorial offers a full lifecycle walkthrough for building trustworthy agentic AI systems in industrial settings. Participants will engage in two interactive labs: (i) resolving data silos in smart manufacturing using an open-source platform, and (ii) benchmarking agent performance, reasoning, and explainability in an enterprise-scale industrial simulation. Capabilities such as trace visualizations, real-time introspection, and comparative reasoning will be demonstrated. The session concludes with best practices for governance, monitoring, and reusable evaluation workflows. Participants will leave with practical skills and modular tools to build explainable, robust, and deployable agentic AI systems for real-world Industry 4.0 applications. +
++ Agentic AI is rapidly becoming a cornerstone of intelligent decision-making in Industry 4.0. These agents must reason across heterogeneous data sourcesโincluding sensor time series, structured knowledge graphs, and unstructured logsโwhile adapting under uncertainty. Despite advances in large language models and multimodal learning, building deployable and trustworthy systems remains difficult due to fragmented data, lack of explainability, and limited evaluation protocols. +
++ This tutorial provides a comprehensive, hands-on walkthrough of the lifecycle of multimodal agentic AIโfrom design to deploymentโfeaturing lab sessions on data integration and benchmarking. We explore reasoning strategies, evaluation methods, and governance tools that ensure trustworthy and auditable deployments. +
+| Time | Activity | Presenter(s) |
|---|---|---|
| 10 mins | Introduction and Overview: Multimodal AI agents, use cases in Industry 4.0, objectives. | Amit Sheth, Dhaval Patel |
| 20 mins | Multimodal Agents in Industry 4.0: Overview of architectures (symbolic + neural integration). | Ruwan Wickramarachchi |
| 20 mins | Lab Session 1: Addressing Data Silos and Integration Complexity. | Chathurangi Shyalika |
| 20 mins | Operationalizing and Governing Multimodal Agents: Evaluation and governance techniques. | Dhaval Patel, Saumya Ahuja |
| 20 mins | Lab Session 2: Evaluation Benchmarking at Scale for Industrial Multi-Agent Systems. | Shuxin Lin |
| 15 mins | Q&A and Wrap-up | All Presenters |
Participants should bring a laptop with Python 3 installed. Pre-configured environments and setup instructions will be provided. The tutorial uses:
+
+ Ph.D. student at AIISC, University of South Carolina. Research in Deep Learning, Multimodal-AI, Neurosymbolic-AI, anomaly detection, event understanding.
+
+ AI Engineer Lead, IBM WatsonX ASEAN. Leads Generative AI and Agentic AI projects across APAC. Experienced in LLMs, RAG systems, and enterprise AI deployments.
+
+ Researcher at IBM with expertise in AI for Industry 4.0, agent evaluation, multimodal reasoning, and large-scale industrial AI benchmarks.
+
+ Research Scientist at Bosch Center for AI. Ph.D. from AIISC, USC. Research in Generative AI, Neurosymbolic AI, knowledge graphs, and multimodal representation learning.
+
+ Senior Technical Staff Member, IBM Research. Expert in Data Mining, Machine Learning, Time Series, and industrial AI platforms such as Maximo and AutoAI-TS.
+
+ NCR Chair & Professor, AIISC, USC. Fellow of IEEE, AAAI, ACM, AAAS. Research in trustworthy, explainable, and safe neuro-symbolic AI.
+| \n", - " | title | \n", - "description | \n", - "cluster | \n", - "failure mode | \n", - "
|---|---|---|---|---|
| 0 | \n", - "Repeated Invalid Action Execution | \n", - "The agent persistently and unsuccessfully executed the same invalid 'Finish[answer]' action multiple times despite explicit feedback about its invalidity, indicating a lack of corrective behavior mechanism to recover from action selection errors. | \n", - "2 | \n", - "Invalid Action Formatting | \n", - "
| 1 | \n", - "Specification Hallucination | \n", - "The agent listed failure modes as detectable by specified sensors without cross-verifying with the relevancy mapping, leading to incorrect inclusions in its final answer. | \n", - "3 | \n", - "Overstatement of Task Completion | \n", - "
| 2 | \n", - "Ineffective Remediation Loop | \n", - "Despite receiving direct feedback and suggestions after every attempt, the agent repeats the same ineffective action, indicating a lack of adaptive remediation to past failure. | \n", - "5 | \n", - "Ineffective Error Recovery | \n", - "
| 3 | \n", - "Inadequate Fallback Reasoning | \n", - "When the tools failed, the agent did not adequately default to self-ask and logical deduction, instead returning generic content and not addressing the asset-specific sensor/failure mode mapping as required. | \n", - "3 | \n", - "Overstatement of Task Completion | \n", - "
| 4 | \n", - "Omitted Stepwise Transparency | \n", - "The agent failed to explicitly walk through the manual filtering process, skipping step-by-step reasoning, which made it difficult to verify or understand how the result was achieved. | \n", - "3 | \n", - "Overstatement of Task Completion | \n", - "
| ... | \n", - "... | \n", - "... | \n", - "... | \n", - "... | \n", - "
| 508 | \n", - "Repetitive Error Handling without Escalation | \n", - "The agent persistently retries the same failing action (accessing a non-existent file) without altering its approach, attempting recovery, or providing helpful guidance to the user. | \n", - "5 | \n", - "Ineffective Error Recovery | \n", - "
| 509 | \n", - "Lack of User-Facing Error Explanation | \n", - "Despite multiple failures, the agent does not provide a clear, user-facing summary of the problem or request for user intervention, impeding resolution and transparency. | \n", - "5 | \n", - "Ineffective Error Recovery | \n", - "
| 510 | \n", - "Ineffective Error Recovery | \n", - "Despite identification of errors (e.g., missing or erroneous parameters), the agent persistently retries without substantial changes or escalation to alternative strategies such as seeking help or generating error reports. | \n", - "5 | \n", - "Ineffective Error Recovery | \n", - "
| 511 | \n", - "Lack of Escalation or Assistance Request | \n", - "The agent fails to escalate the issue or request human intervention after repeated failures, as suggested in the agent's own review feedback, resulting in an unproductive loop. | \n", - "3 | \n", - "Overstatement of Task Completion | \n", - "
| 512 | \n", - "Unproductive Error Loop | \n", - "The agent repeatedly attempts the same failing action (reading a missing file) without adapting strategy or escalating error handling, resulting in wasted cycles. | \n", - "5 | \n", - "Ineffective Error Recovery | \n", - "
513 rows ร 4 columns
\n", - "| \n", - " | Failure Mode | \n", - "False Count | \n", - "True Count | \n", - "True Percentage | \n", - "
|---|---|---|---|---|
| 0 | \n", - "1.1 Disobey Task Specification | \n", - "619.0 | \n", - "262.0 | \n", - "13.87 | \n", - "
| 1 | \n", - "1.2 Disobey Role Specification | \n", - "879.0 | \n", - "2.0 | \n", - "0.11 | \n", - "
| 2 | \n", - "1.3 Step Repetition | \n", - "571.0 | \n", - "310.0 | \n", - "16.41 | \n", - "
| 3 | \n", - "1.4 Loss of Conversation History | \n", - "881.0 | \n", - "NaN | \n", - "0.00 | \n", - "
| 4 | \n", - "1.5 Unaware of Termination Conditions | \n", - "749.0 | \n", - "132.0 | \n", - "6.99 | \n", - "
| 5 | \n", - "2.1 Conversation Reset | \n", - "881.0 | \n", - "NaN | \n", - "0.00 | \n", - "
| 6 | \n", - "2.2 Fail to Ask for Clarification | \n", - "688.0 | \n", - "193.0 | \n", - "10.22 | \n", - "
| 7 | \n", - "2.3 Task Derailment | \n", - "799.0 | \n", - "82.0 | \n", - "4.34 | \n", - "
| 8 | \n", - "2.4 Information Withholding | \n", - "839.0 | \n", - "42.0 | \n", - "2.22 | \n", - "
| 9 | \n", - "2.5 Ignored Other Agent's Input | \n", - "842.0 | \n", - "39.0 | \n", - "2.06 | \n", - "
| 10 | \n", - "2.6 Action-Reasoning Mismatch | \n", - "717.0 | \n", - "164.0 | \n", - "8.68 | \n", - "
| 11 | \n", - "3.1 Premature Termination | \n", - "807.0 | \n", - "74.0 | \n", - "3.92 | \n", - "
| 12 | \n", - "3.2 No or Incorrect Verification | \n", - "587.0 | \n", - "294.0 | \n", - "15.56 | \n", - "
| 13 | \n", - "3.3 Weak Verification | \n", - "586.0 | \n", - "295.0 | \n", - "15.62 | \n", - "