26/05/2026
🎤 It’s GMUM Seminar time again! We have three speakers this week! 🤩
➡️who? Dmytro Polishchuk
➡️where? Wydział Matematyki i Informatyki UJ
➡️when? Thursday (28 Thursday 2026) at 14.15
➡️Talk: “QModel: Time-Aware GitHub Mining Framework for Empirical Software Quality Studies”
Abstract:
Quality-oriented mining software repositories studies require more than access to commits, issues, pull requests, and CI logs. They require a reproducible way to represent how technical, social, temporal, and defect-related evidence is linked across repository history. Existing MSR infrastructures provide important support for repository mining, but they do not directly expose a project-level, time-aware empirical model in which cross-artifact quality evidence can be queried, inspected, and reused across alternative study designs.
This paper presents QModel, an emerging infrastructure for reproducible GitHub mining in empirical software quality studies. QModel stores Git, GitHub, CI, timeline, reaction, graph, churn, and SZZ-style candidate-defect evidence in a project-centric relational schema. Its SQL-based compilation layer allows researchers to define feature-target datasets over arbitrary analysis units, including pull requests, issues, commits, files, time windows, and custom cross-artifact objects.
We evaluate QModel on ansible/ansible and facebook/react, mining 76,475 commits, 70,509 pull requests, 45,908 issues, 2,174,020 timeline events, 315,235 file-change records, and 446,161 CI records. The early results show that QModel can materialize analysis-ready datasets with computable target, graph, churn, and provenance features, while also making evidence gaps explicit when historical links, CI records, or fixing evidence are incomplete. These results support QModel as an empirical infrastructure for making quality-oriented repository operationalizations explicit, reusable, and systematically evaluable.
➡️who? Turhan Can Kargin
➡️where? Wydział Matematyki i Informatyki UJ
➡️when? Thursday (28 Thursday 2026) at 14.15
➡️Talk: “SpaRRTa: A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models”
➡️Project page: https://sparrta.gmum.net/
Abstract:
Visual Foundation Models (VFMs), such as DINO and CLIP, excel in semantic understanding of images but exhibit limited spatial reasoning capabilities, which limits their applicability to embodied systems. As a result, recent work incorporates some 3D tasks (such as depth estimation) into VFM training. However, VFM performance remains inconsistent across other spatial tasks, raising the question of whether these models truly possess spatial awareness or overfit to specific 3D objectives. To address this question, we introduce the Spatial Relation Recognition Task (SpaRRTa) benchmark, which evaluates the ability of VFMs to identify static directional relations between visible objects. Unlike traditional 3D objectives that focus on precise metric prediction (e.g., surface normal estimation), SpaRRTa probes a fundamental capability underpinning more advanced forms of human-like spatial understanding. SpaRRTa generates an arbitrary number of photorealistic images with diverse scenes and fully controllable object arrangements, along with freely accessible spatial annotations. Evaluating a range of state-of-the-art VFMs, we reveal significant disparities in their ability to encode directional spatial relations. Through our analysis, we provide insights into the mechanisms that support or hinder this form of spatial awareness in modern VFMs. We hope that SpaRRTa will serve as a useful tool for guiding the development of future spatially aware visual models.
➡️who? Piotr Helm
➡️where? Wydział Matematyki i Informatyki UJ
➡️when? Thursday (28 Thursday 2026) at 14.15
➡️Talk: “SHIELD: Secure Hypernetworks for Incremental Expansion Learning Defense”
Abstract:
Continual learning under adversarial conditions remains an open problem, as existing methods often compromise either robustness, scalability, or both. We propose a novel framework that integrates Interval Bound Propagation (IBP) with a hypernetwork-based architecture to enable certifiably robust continual learning across sequential tasks. Our method, SHIELD, generates task-specific model parameters via a shared hypernetwork conditioned solely on compact task embeddings, eliminating the need for replay buffers or full model copies and enabling efficient over time. To further enhance robustness, we introduce Interval MixUp, a novel training strategy that blends virtual examples represented as ℓ∞ balls centered around MixUp points. Leveraging interval arithmetic, this technique guarantees certified robustness while mitigating the wrapping effect, resulting in smoother decision boundaries. We evaluate SHIELD under strong white-box adversarial attacks, including PGD and AutoAttack, across multiple benchmarks. It consistently outperforms existing robust continual learning methods, achieving state-of-the-art average accuracy while maintaining both scalability and certification. These results represent a significant step toward practical and theoretically grounded continual learning in adversarial settings.