AI Infra Summit - Workshop Agenda | Kisaco Research
Agenda Days: 
  • Wednesday, 10 Sep, 2025
    09:00 AM

    Location: Room 201

    Duration: 40 minutes

    1:30 PM

    Join us in this hands-on workshop to learn how to deploy and optimize large language models (LLMs) for scalable inference at enterprise scale. Participants will learn to orchestrate distributed LLM serving with vLLM on Amazon EKS, enabling robust, flexible, and highly available deployments. The session demonstrates how to utilize AWS Trainium hardware within EKS to maximize throughput and cost efficiency, leveraging Kubernetes-native features for automated scaling, resource management, and seamless integration with AWS services.

    Location: Room 206

    Duration: 1 hour

    Author:

    Asheesh Goja

    Principal GenAI Solutions Architect
    AWS

    Asheesh Goja

    Principal GenAI Solutions Architect
    AWS

    Author:

    Pinak Panigrahi

    Sr. Machine Learning Architect - Annapurna ML
    AWS

    Pinak Panigrahi

    Sr. Machine Learning Architect - Annapurna ML
    AWS
    2:45 PM

    Experience the future of GenAI inference architecture with NeuReality’s fully integrated, enterprise-ready NR1® Inference Appliance. In this hands-on workshop, you'll go from cold start to live GenAI applications in under 30 minutes using our AI-CPU-powered system. The NR1® Chip – the world’s first AI-CPU purpose built for interference – pairs with any GPU or AI accelerator and optimizes any AI data workload. We’ll walk you through setup, deployment, and real-time inference using models like LLaMA, Mistral, and DeepSeek on our disaggregated architecture—built for smooth scalability, superior price/performance and near 100% GPU utilization (vs <50% with traditional CPU/NIC architecture). Join us to see how NeuReality eliminates infrastructure complexity and delivers enterprise-ready performance and ROI today.

    Location: Room 201

    Duration: 1 hour

    Author:

    Paul Piezzo

    Enterprise Sales Director
    NeuReality

    Paul Piezzo

    Enterprise Sales Director
    NeuReality

    Author:

    Gaurav Shah

    VP of Business Development
    NeuReality

    Gaurav Shah

    VP of Business Development
    NeuReality

    Author:

    Naveh Grofi

    Customer Success Engineer
    NeuReality

    Naveh Grofi

    Customer Success Engineer
    NeuReality

    Location: Room 206

    Duration: 1 hour

    Location: Room 207

    Duration: 1 hour

    4:00 PM

    The rapid evolution of high-performance computing (HPC) clusters has been instrumental in driving transformative advancements in AI research and applications. These sophisticated systems enable the processing of complex datasets and support groundbreaking innovation. However, as their adoption grows, so do the critical security challenges they face, particularly when handling sensitive data in multi-tenant environments where diverse users and workloads coexist. Organizations are increasingly turning to Confidential Computing as a framework to protect AI workloads, emphasizing the need for robust HPC architectures that incorporate runtime attestation capabilities to ensure trust and integrity.

    In this session, we present an advanced HPC cluster architecture designed to address these challenges, focusing on how runtime attestation of critical components – such as the kernel, Trusted Execution Environments (TEEs), and eBPF layers – can effectively fortify HPC clusters for AI applications operating across disjoint tenants. This architecture leverages cutting-edge security practices, enabling real-time verification and anomaly detection without compromising the performance essential to HPC systems.

    Through use cases and examples, we will illustrate how runtime attestation integrates seamlessly into HPC environments, offering a scalable and efficient solution for securing AI workloads. Participants will leave this session equipped with a deeper understanding of how to leverage runtime attestation and Confidential Computing principles to build secure, reliable, and high-performing HPC clusters tailored for AI innovations.

    Location: Room 201

    Duration: 1 hour

    Author:

    Jason Rogers

    CEO
    Invary

    Jason Rogers is the Chief Executive Officer of Invary, a cybersecurity company that ensures the security and confidentiality of critical systems by verifying their Runtime Integrity. Leveraging NSA-licensed technology, Invary detects hidden threats and reinforces confidence in an existing security posture. Previously, Jason served as the Vice President of Platform at Matterport, successfully launched a consumer-facing IoT platform for Lowe's, and developed numerous IoT and network security software products for Motorola.

    Jason Rogers

    CEO
    Invary

    Jason Rogers is the Chief Executive Officer of Invary, a cybersecurity company that ensures the security and confidentiality of critical systems by verifying their Runtime Integrity. Leveraging NSA-licensed technology, Invary detects hidden threats and reinforces confidence in an existing security posture. Previously, Jason served as the Vice President of Platform at Matterport, successfully launched a consumer-facing IoT platform for Lowe's, and developed numerous IoT and network security software products for Motorola.

    Author:

    Ayal Yogev

    CEO & Co-founder
    Anjuna

    Ayal Yogev

    CEO & Co-founder
    Anjuna

    Location: Room 206

    Duration: 1 hour

    Location: Room 207

    Duration: 1 hour