Updated on 2025.06.13
Path Planning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-10 | Optimal Task Offloading with Firm Deadlines for Mobile Edge Computing Systems | Khai Doan et.al. | 2506.09180 | null |
2025-06-10 | Reinforce LLM Reasoning through Multi-Agent Reflection | Yurun Yuan et.al. | 2506.08379 | null |
2025-06-10 | Dynamical System Optimization | Emo Todorov et.al. | 2506.08340 | null |
2025-06-09 | Modelling Nonstationary Time Series using Trend-Stationary Hypothesis | Zhandos Abdikhadir et.al. | 2506.07987 | null |
2025-06-08 | Stochastic Quadratic Dynamic Programming | Vincent Guigues et.al. | 2506.07314 | null |
2025-06-05 | Resilient Pattern Mining | Pengxin Bian et.al. | 2506.04935 | null |
2025-06-05 | Composing Agents to Minimize Worst-case Risk | Guruprerana Shabadi et.al. | 2506.04632 | null |
2025-06-04 | Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models | Fangrui Zhu et.al. | 2506.04220 | null |
2025-05-28 | Large Neighborhood and Hybrid Genetic Search for Inventory Routing Problems | Jingyi Zhao et.al. | 2506.03172 | null |
2025-06-03 | Dynamic Programming Techniques for Enhancing Cognitive Representation in Knowledge Tracing | Lixiang Xu et.al. | 2506.02949 | null |
2025-06-03 | Reachability Weighted Offline Goal-conditioned Resampling | Wenyan Yang et.al. | 2506.02577 | null |
2025-06-03 | Multi-agent Markov Entanglement | Shuze Chen et.al. | 2506.02385 | null |
2025-06-02 | Scalable In-Context Q-Learning | Jinmei Liu et.al. | 2506.01299 | null |
2025-06-01 | Trilevel Memetic Algorithm for the Electric Vehicle Routing Problem | Ivan Milinović et.al. | 2506.01065 | null |
2025-06-01 | Q-learning with Posterior Sampling | Priyank Agrawal et.al. | 2506.00917 | null |
2025-05-30 | GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments | Kechen Li et.al. | 2505.24306 | null |
2025-05-30 | Winners vs. Losers: Momentum-based Strategies with Intertemporal Choice for ESG Portfolios | Ayush Jha et.al. | 2505.24250 | null |
2025-05-30 | CLaSp: In-Context Layer Skip for Self-Speculative Decoding | Longze Chen et.al. | 2505.24196 | null |
2025-05-29 | Spoken Language Modeling with Duration-Penalized Self-Supervised Units | Nicol Visser et.al. | 2505.23494 | link |
2025-05-29 | Offline Map Matching Based on Localization Error Distribution Modeling | Ruilin Xu et.al. | 2505.23123 | null |
2025-05-29 | DINGO: Constrained Inference for Diffusion LLMs | Tarun Suresh et.al. | 2505.23061 | null |
2025-05-27 | Learning-Based Tracking Perimeter Control for Two-region Macroscopic Traffic Dynamics | Can Chen et.al. | 2505.21818 | null |
2025-05-27 | When to Deceive: A Cross-Layer Stackelberg Game Framework for Strategic Timing of Cyber Deception | Ya-Ting Yang et.al. | 2505.21244 | null |
2025-05-23 | Evaluating the Energy-Efficiency of the Code Generated by LLMs | Md Arman Islam et.al. | 2505.20324 | null |
2025-05-23 | URB – Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles | Ahmet Onur Akman et.al. | 2505.17734 | null |
2025-05-23 | Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras | Masataka Kobayashi et.al. | 2505.17582 | null |
2025-05-22 | Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms | Baran Hashemi et.al. | 2505.17190 | null |
2025-05-22 | Quantum Routing and Entanglement Dynamics Through Bottlenecks | Dhruv Devulapalli et.al. | 2505.16948 | null |
2025-05-22 | Reward-Aware Proto-Representations in Reinforcement Learning | Hon Tik Tse et.al. | 2505.16217 | null |
2025-05-21 | Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging | Weiguo Gao et.al. | 2505.16024 | null |
2025-05-21 | Families of tractable problems with respect to vertex-interval-membership width and its generalisations | Jessica Enright et.al. | 2505.15699 | null |
2025-05-21 | Deep Learning for Continuous-time Stochastic Control with Jumps | Patrick Cheridito et.al. | 2505.15602 | null |
2025-05-19 | Finding Maximum Independent Sets in Dynamic Graphs using Unsupervised Learning | Devendra Parkar et.al. | 2505.13754 | null |
2025-05-24 | Learning to Program Quantum Measurements for Machine Learning | Samuel Yen-Chi Chen et.al. | 2505.13525 | null |
2025-05-19 | Dynamic programming and dimensionality in convex stochastic optimization and control | Teemu Pennanen et.al. | 2505.12787 | null |
2025-05-18 | Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning | Junzhe Jiang et.al. | 2505.12465 | null |
2025-05-16 | Co-Evolutionary Defence of Active Directory Attack Graphs via GNN-Approximated Dynamic Programming | Diksha Goel et.al. | 2505.11710 | null |
2025-05-15 | Multi-Objective Memory Bandwidth Regulation and Cache Partitioning for Multicore Real-Time Systems | Binqi Sun et.al. | 2505.11554 | null |
2025-05-16 | Sobolev Training of End-to-End Optimization Proxies | Andrew W. Rosemberg et.al. | 2505.11342 | null |
2025-05-16 | Beyond KL-divergence: Risk Aware Control Through Cross Entropy and Adversarial Entropy Regularization | Menno van Zutphen et.al. | 2505.11068 | null |
2025-05-15 | Scalable Approximate Biclique Counting over Large Bipartite Graphs | Jingbang Chen et.al. | 2505.10471 | null |
2025-05-14 | Reflected stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems | Lu Liu et.al. | 2505.09070 | null |
2025-05-13 | Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering | Jason Zalev et.al. | 2505.08724 | null |
2025-05-13 | Distributionally Robust LQG with Kullback-Leibler Ambiguity Sets | Marta Fochesato et.al. | 2505.08370 | null |
2025-05-11 | Optimal control of convective Brinkman-Forchheimer equations: Dynamic programming equation and Viscosity solutions | Sagar Gautam et.al. | 2505.07095 | null |
2025-05-10 | Optimizing Railcar Movements to Create Outbound Trains in a Freight Railyard | Ruonan Zhao et.al. | 2505.06510 | null |
2025-05-09 | Scheduled Jacobian Chaining | Simon Märtens et.al. | 2505.06056 | link |
2025-05-09 | Universal Approximation Theorem for Deep Q-Learning via FBSDE System | Qian Qi et.al. | 2505.06023 | null |
2025-05-09 | Data-driven pressure field prediction for ships in regular sea states | Malte Loft et.al. | 2505.06014 | null |
2025-05-09 | Multi-armed Bandit for Stochastic Shortest Path in Mixed Autonomy | Yu Bai et.al. | 2505.05878 | null |
2025-05-10 | Driving with Context: Online Map Matching for Complex Roads Using Lane Markings and Scenario Recognition | Xin Bi et.al. | 2505.05007 | link |
2025-05-08 | Chain-of-Thought Tokens are Computer Program Variables | Fangwei Zhu et.al. | 2505.04955 | link |
2025-05-08 | Network Digital Twin for Route Optimization in 5G/B5G Transport Slicing with What-If Analysis | Rebecca Aben-Athar et.al. | 2505.04879 | null |
2025-05-06 | Stochastic scheduling with Bernoulli-type jobs through policy stratification | Antonios Antoniadis et.al. | 2505.03349 | null |
2025-05-05 | A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness and Stability | Leilei Cui et.al. | 2505.02970 | null |
2025-05-03 | Multistage stochastic optimization for drayage procurement in container logistics using stochastic dual dynamic programming | Georgios Vassos et.al. | 2505.01813 | null |
2025-05-03 | Integrated optimization of operations and capacity planning under uncertainty for drayage procurement in container logistics | Georgios Vassos et.al. | 2505.01808 | link |
2025-05-03 | Evaluating Input Modalities for Pilot-Centered Taxiway Navigation: Insights from a Wizard-of-Oz Simulation | Chan Chea Mean et.al. | 2505.01679 | null |
2025-05-03 | Morello: Compiling Fast Neural Networks with Dynamic Programming and Spatial Compression | Samuel J. Kaufman et.al. | 2505.01637 | link |
2025-05-02 | Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing | Fahong Zhang et.al. | 2505.01385 | null |
2025-05-02 | Power System Transition Planning: An Industry-Aligned Framework for Long-Term Optimization | Ahmed Al-Shafei et.al. | 2505.01331 | null |
2025-05-02 | A stochastic Gordon-Loeb model for optimal cybersecurity investment under clustered attacks | Giorgia Callegaro et.al. | 2505.01221 | null |
2025-05-02 | Remote Estimation over Packet-Dropping Wireless Channels with Partial State Information | Ioannis Tzortzis et.al. | 2505.01132 | null |
2025-05-01 | Quantum Computing in Industrial Environments: Where Do We Stand and Where Are We Headed? | Eneko Osaba et.al. | 2505.00891 | null |
2025-05-01 | Platoon Coordination and Leader Selection in Mixed Transportation Systems via Dynamic Programming | Ying Wang et.al. | 2505.00847 | null |
2025-04-24 | Optimal Blackjack Betting Strategies Through Dynamic Programming and Expected Utility Theory | Lucas Bordeu et.al. | 2505.00724 | null |
2025-04-30 | Galvatron: An Automatic Distributed System for Efficient Foundation Model Training | Xinyi Liu et.al. | 2504.21411 | link |
2025-04-29 | DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction | Chris Child et.al. | 2504.20535 | null |
2025-04-28 | Warm-Starting QAOA with XY Mixers: A Novel Approach for Quantum-Enhanced Vehicle Routing Optimization | Rafael S. do Carmo et.al. | 2504.19934 | null |
2025-04-30 | The frequency $K_i$ s for symmetrical traveling salesman problem | Yong Wang et.al. | 2504.19608 | null |
2025-04-28 | Symmetric Policy Design for Multi-Agent Dispatch Coordination in Supply Chains | Sagar Sudhakara et.al. | 2504.19397 | null |
2025-04-24 | Efficient Tree Generation for Globally Optimal Decisions under Probabilistic Outcomes | Berk Ozturk et.al. | 2504.17983 | null |
2025-04-24 | Ergodic control of McKean-Vlasov systems on the Wasserstein space | Marco Fuhrman et.al. | 2504.17958 | null |
2025-04-24 | Fréchet Distance in Unweighted Planar Graphs | Ivor van der Hoog et.al. | 2504.17342 | null |
2025-04-24 | Advancing Frontiers of Path Integral Theory for Stochastic Optimal Control | Apurva Patil et.al. | 2504.17154 | null |
2025-04-22 | Distributed model predictive control without terminal cost under inexact distributed optimization | Xiaoyu Liu et.al. | 2504.15768 | null |
2025-04-22 | Stochastic Programming for Dynamic Temperature Control of Refrigerated Road Transport | Francesco Giliberto et.al. | 2504.15741 | null |
2025-04-22 | Exploring Inevitable Waypoints for Unsolvability Explanation in Hybrid Planning Problems | Mir Md Sajid Sarwar et.al. | 2504.15668 | null |
2025-04-24 | A Quadratic Control Framework for Dynamic Systems | Igor Ladnik et.al. | 2504.15396 | null |
2025-04-21 | The Iterative Chainlet Partitioning Algorithm for the Traveling Salesman Problem with Drone and Neural Acceleration | Jae Hyeok Lee et.al. | 2504.15147 | null |
2025-04-23 | Feedback Stackelberg-Nash equilibria in difference games with quasi-hierarchical interactions and inequality constraints | Partha Sarathi Mohapatra et.al. | 2504.15019 | null |
2025-04-19 | Optimal Operation and Valuation of Electricity Storages | Jean-Philippe Chancelier et.al. | 2504.14292 | null |
2025-04-18 | Code generation for solving and differentiating through convex optimization problems | Maximilian Schaller et.al. | 2504.14099 | null |
2025-04-16 | Beyond ISAC: Toward Integrated Heterogeneous Service Provisioning via Elastic Multi-Dimensional Multiple Access | Jie Chen et.al. | 2504.11692 | null |
2025-04-18 | Traffic Adaptive Moving-window Service Patrolling for Real-time Incident Management during High-impact Events | Haozhe Lei et.al. | 2504.11570 | null |
2025-04-15 | TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification | Kaicong Huang et.al. | 2504.11500 | null |
2025-04-15 | Integration of a high-fidelity model of quantum sensors with a map-matching filter for quantum-enhanced navigation | Samuel Lellouch et.al. | 2504.11119 | null |
2025-04-22 | Breaking the Dimensional Barrier: A Pontryagin-Guided Direct Policy Optimization for Continuous-Time Multi-Asset Portfolio | Jeonggyu Huh et.al. | 2504.11116 | null |
2025-04-15 | Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control | Hyojun Ahn et.al. | 2504.10831 | null |
2025-04-11 | A Nonlinear Hash-based Optimization Method for SpMV on GPUs | Chen Yan et.al. | 2504.08860 | null |
2025-04-07 | A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size | Jorge A. Huertas et.al. | 2504.08793 | null |
2025-04-05 | SLOs-Serve: Optimized Serving of Multi-SLO LLMs | Siyuan Chen et.al. | 2504.08784 | null |
2025-04-11 | Interior Point Differential Dynamic Programming, Redux | Ming Xu et.al. | 2504.08278 | link |
2025-04-10 | Quantum-assured magnetic navigation achieves positioning accuracy better than a strategic-grade INS in airborne and ground-based field trials | Murat Muradoglu et.al. | 2504.08167 | null |
2025-04-10 | Low-Thrust Many-Revolution Transfer between Near Rectilinear Halo Orbit and Low Lunar Orbit Using Hybrid Differential Dynamic Programming | Kohei Oue et.al. | 2504.07723 | null |
2025-04-10 | Joint Travel Route Optimization Framework for Platooning | Akif Adas et.al. | 2504.07623 | null |
2025-04-09 | Rounding the Lovász Theta Function with a Value Function Approximation | Rui Gong et.al. | 2504.07204 | null |
2025-04-09 | Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety | Chad Melton et.al. | 2504.07022 | null |
2025-04-17 | Maximizing Battery Storage Profits via High-Frequency Intraday Trading | David Schaurecker et.al. | 2504.06932 | null |
2025-04-08 | Linear-space LCS enumeration with quadratic-time delay for two strings | Yoshifumi Sakai et.al. | 2504.05742 | null |
2025-04-09 | DDT: Decoupled Diffusion Transformer | Shuai Wang et.al. | 2504.05741 | null |
2025-04-08 | Hamilton-Jacobi-Bellman equation and Viscosity solutions for an optimal control problem for stochastic convective Brinkman-Forchheimer equations | Sagar Gautam et.al. | 2504.05707 | null |
2025-04-06 | Optimized Path Planning for Logistics Robots Using Ant Colony Algorithm under Multiple Constraints | Haopeng Zhao et.al. | 2504.05339 | null |
2025-04-07 | Maximum Shortest Path Interdiction Problem by Upgrading Nodes on Trees under Unit Cost | Qiao Zhang et.al. | 2504.05190 | null |
2025-04-06 | Memetic Search for Green Vehicle Routing Problem with Private Capacitated Refueling Stations | Rui Xu et.al. | 2504.04527 | null |
2025-04-05 | Improving Question Embeddings with Cognitiv Representation Optimization for Knowledge Tracing | Lixiang Xu et.al. | 2504.04121 | null |
2025-04-04 | NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices | Zhe Wang et.al. | 2504.03415 | null |
2025-04-04 | Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting | Wan Tian et.al. | 2504.03322 | null |
2025-04-04 | Quantum Optimization-Based Route Compression for Efficient Navigation Systems | Shunsuke Sotobayashi et.al. | 2504.03227 | null |
2025-04-11 | Dynamic Treewidth in Logarithmic Time | Tuukka Korhonen et.al. | 2504.02790 | null |
2025-04-04 | Controlled Social Learning: Altruism vs. Bias | Raghu Arghal et.al. | 2504.02648 | null |
2025-04-03 | Reinforcement Learning for Solving the Pricing Problem in Column Generation: Applications to Vehicle Routing | Abdo Abouelrous et.al. | 2504.02383 | null |
2025-04-03 | AI-Driven Framework for Multi-Service Multi-Modal Devices in NextG ORAN Systems | Mrityunjoy Gain et.al. | 2504.01730 | null |
2025-04-01 | A Parametric Model for Near-Optimal Online Synthesis with Robust Reach-Avoid Guarantees | Mario Gleirscher et.al. | 2504.01006 | null |
2025-04-01 | Linear models of dynamic optimization with linear constraints | Somdeb Lahiri et.al. | 2504.00630 | null |
2025-03-31 | QUADRO: A Hybrid Quantum Optimization Framework for Drone Delivery | James B. Holliday et.al. | 2503.24301 | null |
2025-04-02 | Unraveling tensor structures in correct-by-design controller synthesis | Ruohan Wang et.al. | 2503.24085 | null |
2025-03-31 | Bi-Level Route Optimization and Path Planning with Hazard Exploration | Jimin Choi et.al. | 2503.24044 | null |
2025-03-31 | Tree-Guided $L_1$ -Convex Clustering | Bingyuan Zhang et.al. | 2503.24012 | link |
2025-03-30 | A Systematic Decade Review of Trip Route Planning with Travel Time Estimation based on User Preferences and Behavior | Nikil Jayasuriya et.al. | 2503.23486 | null |
2025-03-29 | A convergence technique for the game i-Mark | Gabriel Nivasch et.al. | 2503.23196 | null |
2025-03-29 | PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference | Guanqiao Qu et.al. | 2503.22982 | null |
2025-03-28 | Policy Optimization and Multi-agent Reinforcement Learning for Mean-variance Team Stochastic Games | Junkai Hu et.al. | 2503.22779 | null |
2025-04-04 | The Price of Simplicity: Analyzing Decoupled Policies for Multi-Location Inventory Control | Yohan John et.al. | 2503.22639 | null |
2025-03-28 | Scheduling problem of aircrafts on a same runway and dual runways | Peng Lin et.al. | 2503.22124 | null |
2025-03-27 | Optimal Stepsize for Diffusion Sampling | Jianning Pei et.al. | 2503.21774 | link |
2025-03-26 | A Hopf-Lax Type Formula for Multi-Agent Path Planning with Pattern Coordination | Christian Parkinson et.al. | 2503.20974 | link |
2025-03-26 | Infinite Time Horizon Optimal Control of McKean-Vlasov SDEs | Silvia Rudà et.al. | 2503.20572 | null |
2025-03-26 | Optimal reinsurance in a competitive market | Lea Enzi et.al. | 2503.20555 | null |
2025-03-26 | Beyond Worst-Case Subset Sum: An Adaptive, Structure-Aware Solver with Sub- $2^{n/2}$ Enumeration | Jesus Salas et.al. | 2503.20162 | null |
2025-03-31 | Graph neural networks extrapolate out-of-distribution for shortest paths | Robert R. Nerem et.al. | 2503.19173 | null |
2025-03-29 | An Efficient Frequency-Based Approach for Maximal Square Detection in Binary Matrices | Swastik Bhandari et.al. | 2503.18974 | null |
2025-03-23 | Agent-Based Models for Two Stocks with Superhedging | Dario Crisci et.al. | 2503.18165 | null |
2025-03-21 | A New Segment Routing method with Swap Node Selection Strategy Based on Deep Reinforcement Learning for Software Defined Network | Miao Ye et.al. | 2503.16914 | null |
2025-03-20 | Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming | Minori Narita et.al. | 2503.16371 | link |
2025-03-19 | On the Functoriality of Belief Propagation Algorithms on finite Partially Ordered Sets | Grégoire Sergeant-Perthuis et.al. | 2503.15705 | null |
2025-03-24 | Distribution and Purification of Entanglement States in Quantum Networks | Xiaojie Fan et.al. | 2503.14712 | null |
2025-03-18 | Designing and Deploying AI Models for Sustainable Logistics Optimization: A Case Study on Eco-Efficient Supply Chains in the USA | Reza E Rabbi Shawon et.al. | 2503.14556 | null |
2025-03-17 | Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning | Thomas Banker et.al. | 2503.13289 | null |
2025-03-17 | Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Xueying Jiang et.al. | 2503.12974 | null |
2025-03-17 | Navigating Heat Exposure: Simulation of Route Planning Based on Visual Language Model Agents | Haoran Ma et.al. | 2503.12731 | null |
2025-03-16 | Routing Guidance for Emerging Transportation Systems with Improved Dynamic Trip Equity | Ting Bai et.al. | 2503.12601 | null |
2025-03-14 | Discrete Effort Distribution via Regrettable Greedy Algorithm | Song Cao et.al. | 2503.11107 | null |
2025-03-13 | Dynamic Programming Algorithms for Finding Cost-Optimal Trajectory on the Terrain | Majid E. Abbasov et.al. | 2503.10922 | null |
2025-03-13 | Enhanced Route Planning with Calibrated Uncertainty Set | Lingxuan Tang et.al. | 2503.10088 | null |
2025-03-12 | PairVDN - Pair-wise Decomposed Value Functions | Zak Buzzard et.al. | 2503.09521 | link |
2025-03-11 | Large Neighborhood Search and Bitmask Dynamic Programming for Wireless Mobile Charging Electric Vehicle Routing Problems in Medical Transportation | Jingyi Zhao et.al. | 2503.08752 | null |
2025-03-11 | DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning | Sergey Alyaev et.al. | 2503.08509 | link |
2025-03-10 | Multi-Objective Routing Optimization Using Coherent Ising Machine in Wireless Multihop Networks | Yu-Xuan Lin et.al. | 2503.07924 | null |
2025-03-10 | Co-Optimizing Distributed Energy Resources under Demand Charges and Bi-Directional Power Flow | Ruixiao Yang et.al. | 2503.07907 | null |
2025-03-10 | Operational route planning under uncertainty for Demand Adaptive Systems | Benedikt Lienkamp et.al. | 2503.07812 | link |
2025-03-09 | Pull-Based Query Scheduling for Goal-Oriented Semantic Communication | Pouya Agheli et.al. | 2503.06725 | null |
2025-03-08 | A Neural Score Follower for Computer Accompaniment of Polyphonic Musical Instruments | Ashwin Pillay et.al. | 2503.06348 | null |
2025-03-11 | Optimal Output Feedback Learning Control for Discrete-Time Linear Quadratic Regulation | Kedi Xie et.al. | 2503.06226 | null |
2025-03-08 | Dynamic Programming in Ordered Vector Space | Nisha Peng et.al. | 2503.06055 | null |
2025-03-04 | Establishment and Solution of a Multi-Stage Decision Model Based on Hypothesis Testing and Dynamic Programming Algorithm | Ziyang Liu et.al. | 2503.05807 | null |
2025-03-07 | On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations | Vittorio Bilò et.al. | 2503.05695 | null |
2025-03-06 | Efficient Algorithms for Verifying Kruskal Rank in Sparse Linear Regression and Related Applications | Fengqin Zhou et.al. | 2503.04986 | null |
2025-03-06 | Mean field optimal stopping with uncontrolled state | Andrea Cosso et.al. | 2503.04269 | null |
2025-03-05 | Endpoint-Explicit Differential Dynamic Programming via Exact Resolution | Maria Parilli et.al. | 2503.03897 | null |
2025-03-05 | Composite Nonlinear Trajectory Tracking Control of Co-Driving Vehicles Using Self-Triggered Adaptive Dynamic Programming | Chuan Hu et.al. | 2503.03348 | null |
2025-03-04 | Optimal power procurement for green cellular wireless networks under uncertainty and chance constraints | Nadhir Ben Rached et.al. | 2503.03051 | null |
2025-03-04 | On the optimal stopping problem for diffusions and an approximation result for stopping times | Andrea Cosso et.al. | 2503.02514 | null |
2025-03-04 | JPDS-NN: Reinforcement Learning-Based Dynamic Task Allocation for Agricultural Vehicle Routing Optimization | Yixuan Fan et.al. | 2503.02369 | null |
2025-03-04 | Optimal Control for Remote Patient Monitoring with Multidimensional Health States | Siddharth Chandak et.al. | 2503.02292 | null |
2025-03-03 | CorrA: Leveraging Large Language Models for Dynamic Obstacle Avoidance of Autonomous Vehicles | Shanting Wang et.al. | 2503.02076 | null |
2025-03-03 | Mapping Spiking Neural Networks to Heterogeneous Crossbar Architectures using Integer Linear Programming | Devin Pohl et.al. | 2503.02033 | null |
2025-02-25 | Tracking Control of Euler-Lagrangian Systems with Prescribed State, Input, and Temporal Constraints | Chidre Shravista Kashyap et.al. | 2503.01866 | null |
2025-03-03 | CacheQuant: Comprehensively Accelerated Diffusion Models | Xuewen Liu et.al. | 2503.01323 | null |
2025-03-03 | Parameter-free Video Segmentation for Vision and Language Understanding | Louis Mahon et.al. | 2503.01201 | null |
2025-03-02 | Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching | Jinyu Miao et.al. | 2503.00862 | null |
2025-03-07 | Llamarine: Open-source Maritime Industry-specific Large Language Model | William Nguyen et.al. | 2503.00203 | null |
2025-02-28 | Time-optimal problem in the space of probabilities measures | Yurii Averboukh et.al. | 2502.20871 | null |
2025-02-27 | Dynamic Program Slices Change How Developers Diagnose Gradual Run-Time Type Errors | Felipe Bañados Schwerter et.al. | 2502.20533 | null |
2025-02-27 | Efficient Risk-sensitive Planning via Entropic Risk Measures | Alexandre Marthe et.al. | 2502.20423 | null |
2025-02-27 | Pontryagin-Bellman Differential Dynamic Programming for Low-Thrust Trajectory Optimization with Path Constraints | Yanis Sidhoum et.al. | 2502.20291 | null |
2025-02-27 | SSD: A State-based Stealthy Backdoor Attack For Navigation System in UAV Route Planning | Zhaoxuan Wang et.al. | 2502.20178 | null |
2025-02-27 | GraphSparseNet: a Novel Method for Large Scale Trafffic Flow Prediction | Weiyang Kong et.al. | 2502.19823 | null |
2025-03-04 | Off-Policy Temporal Difference Learning for Perturbed Markov Decision Processes: Theoretical Insights and Extensive Simulations | Ali Forootani et.al. | 2502.18415 | null |
2025-02-25 | Dynamic Factor Model-Based Multiperiod Mean-Variance Portfolio Selection with Portfolio Constraints | Jianjun Gao et.al. | 2502.17915 | link |
2025-02-24 | A Deterministic and Linear Model of Dynamic Optimization | Somdeb Lahiri et.al. | 2502.17012 | null |
2025-02-24 | Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators | Shixin Zhao et.al. | 2502.17006 | null |
2025-02-23 | Volume Optimality in Conformal Prediction with Structured Prediction Sets | Chao Gao et.al. | 2502.16658 | null |
2025-02-21 | Near Optimal Decision Trees in a SPLIT Second | Varun Babbar et.al. | 2502.15988 | null |
2025-02-21 | Zweistein: A Dynamic Programming Evaluation Function for Einstein Würfelt Nicht! | Wei Lin. Hsueh et.al. | 2502.15547 | null |
2025-02-21 | Learning Maritime Inventory Routing Optimization | Rui Chen et.al. | 2502.15244 | null |
2025-02-19 | Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning | Antoine Moulin et.al. | 2502.13900 | null |
2025-02-19 | FPT algorithms over linear delta-matroids with applications | Eduard Eiben et.al. | 2502.13654 | null |
2025-03-01 | Value Gradient Sampler: Sampling as Sequential Decision Making | Sangwoong Yoon et.al. | 2502.13280 | link |
2025-02-18 | Autonomous Vehicles Using Multi-Agent Reinforcement Learning for Routing Decisions Can Harm Urban Traffic | Anastasia Psarou et.al. | 2502.13188 | null |
2025-02-18 | GPU Memory Usage Optimization for Backward Propagation in Deep Network Training | Ding-Yong Hong et.al. | 2502.12499 | null |
2025-02-17 | Logarithmic Approximation for Road Pricing on Grids | Andrei Constantinescu et.al. | 2502.11979 | null |
2025-02-17 | Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing | Site Qu et.al. | 2502.11715 | null |
2025-02-16 | The Q-Spellbook: Crafting Surface Code Layouts and Magic State Protocols for Large-Scale Quantum Computing | Avimita Chatterjee et.al. | 2502.11253 | null |
2025-02-14 | Customizable Contraction Hierarchies – A Survey | Thomas Bläsius et.al. | 2502.10519 | null |
2025-02-14 | Scheduling Strategies for Partially-Replicable Task Chains on Two Types of Resources | Diane Orhan et.al. | 2502.10000 | null |
2025-02-14 | Thompson Sampling for Repeated Newsvendor | Weizhou Zhang et.al. | 2502.09900 | null |
2025-02-26 | A quantum speedup algorithm for TSP based on quantum dynamic programming with very few qubits | Bai Xujun et.al. | 2502.08853 | null |
2025-02-12 | Self-Evaluation for Job-Shop Scheduling | Imanol Echeverria et.al. | 2502.08684 | null |
2025-02-11 | TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation | Navid Rajabi et.al. | 2502.07306 | null |
2025-02-05 | RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning | Minxiao Chen et.al. | 2502.06825 | null |
2025-02-08 | Counting Tree-Like Multigraphs with a Given Number of Vertices and Multiple Edges | Muhammad Ilyas et.al. | 2502.05529 | null |
2025-02-06 | Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers | Adam Stooke et.al. | 2502.05232 | null |
2025-02-07 | Stochastic internal habit formation and optimality | Michele Aleandri et.al. | 2502.05081 | null |
2025-02-07 | Preference-aware compensation policies for crowdsourced on-demand services | Georgina Nouli et.al. | 2502.05060 | null |
2025-02-07 | A non-zero-sum game with reinforcement learning under mean-variance framework | Junyi Guo et.al. | 2502.04788 | null |
2025-02-06 | Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Hongliang Chi et.al. | 2502.04554 | null |
2025-02-06 | Solvability of Approximate Reach-Avoid Games | Mario Gleirscher et.al. | 2502.04544 | null |
2025-02-06 | On the Number of Control Nodes in Boolean Networks with Degree Constraints | Liangjie Sun et.al. | 2502.03839 | null |
2025-02-06 | Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence | Jacob Fein-Ashley et.al. | 2502.03787 | null |
2025-02-06 | Cascaded Learned Bloom Filter for Optimal Model-Filter Size Balance and Fast Rejection | Atsuki Sato et.al. | 2502.03696 | null |
2025-02-06 | Improving polynomial bounds for the Graphical Traveling Salesman Problem with release dates on paths | Thailsson Clementino et.al. | 2502.02680 | null |
2025-02-04 | Optimal Routing in the Presence of Hooks: Three Case Studies | Tarun Chitra et.al. | 2502.02059 | link |
2025-02-03 | Trajectory Map-Matching in Urban Road Networks Based on RSS Measurements | Zheng Xing et.al. | 2502.01280 | null |
2025-02-08 | Minimum Riesz s-Energy Subset Selection in Ordered Point Sets via Dynamic Programming | Michael Emmerich et.al. | 2502.01163 | null |
2025-02-01 | Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs | Cédric Join et.al. | 2502.00443 | null |
2025-02-01 | A polynomial-based constrained solver for fuel-optimal low-thrust trajectory optimization | Thomas Caleb et.al. | 2502.00398 | null |
2025-02-01 | Left-Deep Join Order Selection with Higher-Order Unconstrained Binary Optimization on Quantum Computers | Valter Uotila et.al. | 2502.00362 | null |
2025-01-31 | Epi-Consistent Approximation of Stochastic Dynamic Programs | Dominic S. T. Keehan et.al. | 2501.19028 | null |
2025-01-30 | Model-Adaptive Approach to Dynamic Discrete Choice Models with Large State Spaces | Ertian Chen et.al. | 2501.18746 | null |
2025-02-05 | Solving Drone Routing Problems with Quantum Computing: A Hybrid Approach Combining Quantum Annealing and Gate-Based Paradigms | Eneko Osaba et.al. | 2501.18432 | null |
2025-01-29 | Stochastic scattering control of spider diffusion governed by an optimal diffraction probability measure selected from its own local-time | Isaac Ohavi et.al. | 2501.18057 | null |
2025-01-15 | Low-Thrust Many-Revolution Trajectory Design Under Operational Uncertainties for DESTINY+ Mission | Naoya Ozaki et.al. | 2501.17867 | null |
2025-02-06 | On characterizing optimal learning trajectories in a class of learning problems | Getachew K Befekadu et.al. | 2501.16521 | null |
2025-01-22 | Modified Patankar Semi-Lagrangian Scheme for the Optimal Control of Production-Destruction systems | Simone Cacace et.al. | 2501.13085 | null |
2025-01-22 | Optimizing Return Distributions with Distributional Dynamic Programming | Bernardo Ávila Pires et.al. | 2501.13028 | null |
2025-01-30 | Pontryagin-Guided Deep Learning for Large-Scale Constrained Dynamic Portfolio Choice | Jeonggyu Huh et.al. | 2501.12600 | null |
2025-01-23 | Treefix: Enabling Execution with a Tree of Prefixes | Beatriz Souza et.al. | 2501.12339 | null |
2025-01-21 | A Dynamic Programming Framework for Generating Approximately Diverse and Optimal Solutions | Waldo Gálvez et.al. | 2501.12261 | null |
2025-01-21 | Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis | Weile Luo et.al. | 2501.12084 | null |
2025-01-20 | Routing Optimization Based on Distributed Intelligent Network Softwarization for the Internet of Things | Mohamed Ali Zormati et.al. | 2501.11484 | null |
2025-02-01 | OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors | Dominik Kulmer et.al. | 2501.11111 | link |
2025-01-25 | BOOST: Microgrid Sizing using Ordinal Optimization | Mohamad Fares El Hajj Chehade et.al. | 2501.10842 | null |
2025-01-17 | Multiclass Queue Scheduling Under Slowdown: An Approximate Dynamic Programming Approach | Jing Dong et.al. | 2501.10523 | null |
2025-01-17 | Complexity of the Virtual Network Embedding with uniform demands | Amal Benhamiche et.al. | 2501.10154 | null |
2025-01-16 | A Dynamic Unmanned Aerial Vehicle Routing Framework for Urban Traffic Monitoring | Yumeng Bai et.al. | 2501.09249 | null |
2025-01-15 | Stochastic Optimal Control of Prosumers in a District Heating System | Maalvladédon Ganet Somé et.al. | 2501.09088 | null |
2025-01-15 | Family-wise Error Rate Control with E-values | Will Hartog et.al. | 2501.09015 | null |
2025-01-31 | Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design | Zhi Zheng et.al. | 2501.08603 | link |
2025-01-14 | Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning | Juan Palma-Borda et.al. | 2501.08020 | link |
2025-01-14 | Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound | Catalin E. Brita et.al. | 2501.07903 | link |
2025-01-09 | A Multi-Layer CNN-GRUSKIP model based on transformer for spatial TEMPORAL traffic flow prediction | Karimeh Ibrahim Mohammad Ata et.al. | 2501.07593 | null |
2025-01-13 | An Alternating Approach to Approximate Dynamic Programming | Di Zhang et.al. | 2501.06983 | null |
2025-01-11 | A Linear Complexity Algorithm for Optimal Transport Problem with Log-type Cost | Ziyuan Lyu et.al. | 2501.06578 | null |
2025-01-10 | Exploratory Randomization for Discrete-Time Linear Exponential Quadratic Gaussian (LEQG) Problem | Sebastien Lleo et.al. | 2501.06275 | null |
2025-01-09 | Linear Algebraic Truncation Algorithm with A Posteriori Error Bounds for Computing Markov Chain Equilibrium Gradients | Saied Mahdian et.al. | 2501.06266 | null |
2025-01-09 | ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries | Keke Huang et.al. | 2501.04901 | null |
2025-01-08 | Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties | Yuchao Li et.al. | 2501.04668 | null |
2025-01-08 | HypeRL: Parameter-Informed Reinforcement Learning for Parametric PDEs | Nicolò Botteghi et.al. | 2501.04538 | null |
2025-01-08 | Probabilistic Greedy Algorithm Solver Using Magnetic Tunneling Junctions for Traveling Salesman Problem | Ran Zhang et.al. | 2501.04447 | null |
2025-01-07 | Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study | Ramya Jonnala et.al. | 2501.03904 | null |
2025-01-07 | Young domination on Hamming rectangles | Janko Gravner et.al. | 2501.03788 | null |
2025-01-06 | Distributionally Robust Control Synthesis for Stochastic Systems with Safety and Reach-Avoid Specifications | Yu Chen et.al. | 2501.03137 | null |
2025-01-06 | MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs | Hui Sun et.al. | 2501.02885 | null |
2025-01-06 | Local Reactive Control for Mobile Manipulators with Whole-Body Safety in Complex Environments | Chunxin Zheng et.al. | 2501.02815 | null |
2025-01-06 | Enhancing Robot Route Optimization in Smart Logistics with Transformer and GNN Integration | Hao Luo et.al. | 2501.02749 | null |
2025-01-05 | Approximate Dynamic Programming for a Remanufacture-to-Order System | Amirreza Pashapour et.al. | 2501.02656 | null |
2025-01-05 | Neural Error Covariance Estimation for Precise LiDAR Localization | Minoo Dolatabadi et.al. | 2501.02558 | null |
2025-01-01 | Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Shoutao Guo et.al. | 2501.00868 | link |
2024-12-30 | A randomisation method for mean-field control problems with common noise | Robert Denkert et.al. | 2412.20782 | null |
2024-12-28 | RFPPO: Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning | Rongkun Xue et.al. | 2412.20098 | null |
2024-12-27 | Game theoretical asymptotic mean value properties for non-homogeneous $p$ -Laplace problems | Félix del Teso et.al. | 2412.19410 | null |
2024-12-24 | Hybrid Many-Objective Optimization in Probabilistic Mission Design for Compliant and Effective UAV Routing | Simon Kohaut et.al. | 2412.18514 | null |
2024-12-23 | AI-Driven Control of Chaos: A Transformer-Based Approach for Dynamical Systems | David Valle et.al. | 2412.17357 | link |
2024-12-21 | A Bayesian Composite Risk Approach for Stochastic Optimal Control and Markov Decision Processes | Wentao Ma et.al. | 2412.16488 | null |
2024-12-20 | Battery valuation on electricity intraday markets with liquidity costs | Enzo Cognéville et.al. | 2412.15959 | null |
2024-12-19 | Robustness Evaluation of a Physical Internet-based Intermodal Logistic Network | Federico Gallo et.al. | 2412.14658 | null |
2024-12-17 | A Scalable Method for Optimal Path Planning on Manifolds via a Hopf-Lax Type Formula | Edward Huynh et.al. | 2412.13346 | link |
2024-12-16 | Using machine learning to inform harvest control rule design in complex fishery settings | Felipe Montealegre-Mora et.al. | 2412.12400 | link |
2024-12-12 | SprayCraft: Graph-Based Route Optimization for Variable Rate Precision Spraying | Kiran K. Kethineni et.al. | 2412.12176 | null |
2024-12-16 | Witty: An Efficient Solver for Computing Minimum-Size Decision Trees | Luca Pascal Staus et.al. | 2412.11954 | null |
2024-12-16 | LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests | Lillian Wassim et.al. | 2412.11672 | null |
2024-12-14 | An Active Parameter Learning Approach to The Identification of Safe Regions | Aneesh Raghavan et.al. | 2412.10627 | null |
2024-12-12 | On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration | Serap A. Savari et.al. | 2412.09741 | null |
2024-12-20 | MAPLE: A Framework for Active Preference Learning Guided by Large Language Models | Saaduddin Mahmud et.al. | 2412.07207 | null |
2024-12-09 | Phaedrus: Exploring Dynamic Application Behavior with Lightweight Generative Models and Large-Language Models | Bodhisatwa Chatterjee et.al. | 2412.06994 | null |
2024-12-07 | Timely reliable Bayesian decision-making enabled using memristors | Lekai Song et.al. | 2412.06838 | null |
2024-12-08 | DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments | Juwon Kim et.al. | 2412.05839 | null |
2024-12-08 | SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Shuzhao Xie et.al. | 2412.05808 | null |
2024-12-07 | Controlled rough SDEs, pathwise stochastic control and dynamic programming principles | Peter K. Friz et.al. | 2412.05698 | null |
2024-12-07 | Quantum Annealing and Tensor Networks: a Powerful Combination to Solve Optimization Problems | Miquel Albertí Binimelis et.al. | 2412.05595 | link |
2024-12-07 | Optimizing Returns from Experimentation Programs | Timothy Sudijono et.al. | 2412.05508 | null |
2024-12-06 | Nonmyopic Global Optimisation via Approximate Dynamic Programming | Filippo Airaldi et.al. | 2412.04882 | link |
2024-12-05 | Generating graph states with a single quantum emitter and the minimum number of fusions | Matthias C. Löbl et.al. | 2412.04587 | null |
2024-12-04 | Summa Summarum: Moessner’s Theorem without Dynamic Programming | Olivier Danvy et.al. | 2412.03127 | null |
2024-11-21 | Quantum Annealing based Hybrid Strategies for Real Time Route Optimization | Sushil Mario et.al. | 2412.02720 | null |
2024-11-30 | A Second Soul: Celebrating the Many Languages of Programming – Festschrift in Honor of Peter Thiemann’s Sixtieth Birthday | Annette Bieniusa et.al. | 2412.01856 | null |
2024-12-01 | Optimization of Delivery Routes for Fresh E-commerce in Pre-warehouse Mode | Alice Harward et.al. | 2412.00634 | null |
2024-11-29 | An Optimal Switching Approach for Bird Migration | Jiawei Chu et.al. | 2411.19467 | null |
2024-11-28 | SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing | Rong-Cheng Tu et.al. | 2411.18983 | null |
2024-11-27 | SCoTT: Wireless-Aware Path Planning with Vision Language Models and Strategic Chains-of-Thought | Aladin Djuhera et.al. | 2411.18212 | null |
2024-11-26 | Structural Parameterization of Locating-Dominating Set and Test Cover | Dipayan Chakraborty et.al. | 2411.17948 | null |
2024-11-26 | Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Vladimir Malinovskii et.al. | 2411.17525 | null |
2024-11-26 | Weakly acyclic diagrams: A data structure for infinite-state symbolic verification | Michael Blondin et.al. | 2411.17250 | null |
2024-11-26 | Dynamic Programming-Based Offline Redundancy Resolution of Redundant Manipulators Along Prescribed Paths with Real-Time Adjustment | Zhihang Yin et.al. | 2411.17052 | null |
2024-11-26 | Dynamic Programming-Based Redundancy Resolution for Path Planning of Redundant Manipulators Considering Breakpoints | Zhihang Yin et.al. | 2411.17034 | null |
2024-11-26 | Entropy-Based Dynamic Programming for Efficient Vehicle Parking | Jean-Luc Lupien et.al. | 2411.17014 | null |
2024-11-25 | Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking | Phuc Nguyen et.al. | 2411.16183 | null |
2024-11-25 | Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach | Shijie Pan et.al. | 2411.16144 | null |
2024-11-24 | Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution | Haiquan Wang et.al. | 2411.15871 | null |
2024-11-24 | Revenue Maximization in Choice-Based Matching Markets | Dan Nissim et.al. | 2411.15727 | null |
2024-11-22 | Jovis: A Visualization Tool for PostgreSQL Query Optimizer | Yoojin Choi et.al. | 2411.14788 | null |
2024-11-22 | Construction and Preliminary Validation of a Dynamic Programming Concept Inventory | Matthew Ferland et.al. | 2411.14655 | null |
2024-11-18 | Controlled Occupied Processes and Viscosity Solutions | H. Mete Soner et.al. | 2411.12080 | null |
2024-11-18 | A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources | Zixuan He et.al. | 2411.11698 | null |
2024-11-18 | gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs | Bertil Schmidt et.al. | 2411.11547 | link |
2024-11-17 | Dynamic Programming: Optimality at a Point Implies Optimality Everywhere | John Stachurski et.al. | 2411.11062 | null |
2024-11-15 | AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment | Yonggan Fu et.al. | 2411.10606 | link |
2024-11-14 | Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control | Qianqian Zhang et.al. | 2411.09600 | null |
2024-11-13 | On the numerical integration of the Fokker-Planck equation driven by a mechanical force and the Bismut-Elworthy-Li formula | Julia Sanders et.al. | 2411.08518 | link |
2024-11-13 | Tractable Robust Markov Decision Processes | Julien Grand-Clément et.al. | 2411.08435 | null |
2024-11-12 | dpvis: A Visual and Interactive Learning Tool for Dynamic Programming | David H. Lee et.al. | 2411.07705 | link |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-11 | Resilient control under denial-of-service and uncertainty: An adaptive dynamic programming approach | Weinan Gao et.al. | 2411.06689 | null |
2024-11-11 | Two Kinds of Learning Algorithms for Continuous-Time VWAP Targeting Execution | Xingyu Zhou et.al. | 2411.06645 | null |
2024-11-10 | Robust optimal stopping with regime switching | Siyu Lv et.al. | 2411.06522 | null |
2024-11-07 | Optimal control under unknown intensity with Bayesian learning | Nicolas Baradel et.al. | 2411.04917 | null |
2024-11-07 | Structure Matters: Dynamic Policy Gradient | Sara Klein et.al. | 2411.04913 | null |
2024-11-07 | Minimax Linear Regulator Problems for Positive Systems | Alba Gurpegui et.al. | 2411.04809 | null |
2024-11-07 | Optimal Execution under Incomplete Information | Etienne Chevalier et.al. | 2411.04616 | null |
2024-11-07 | Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator | Bowen Song et.al. | 2411.04548 | link |
2024-11-05 | DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics | Yingqi Cao et.al. | 2411.03398 | link |
2024-11-04 | Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage | Eric Pilling et.al. | 2411.02211 | null |
2024-11-03 | ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis | Xinyu Geng et.al. | 2411.01564 | null |
2024-10-31 | EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Mujin Cheon et.al. | 2411.00171 | null |
2024-10-31 | Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis | Jia Lin Hau et.al. | 2410.24128 | link |
2024-10-31 | A dynamic programming principle for multiperiod control problems with bicausal constraints | Ruslan Mirmominov et.al. | 2410.23927 | null |
2024-10-30 | Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Ruhan Wang et.al. | 2410.23450 | null |
2024-10-29 | Approximately Counting Knapsack Solutions in Subquadratic Time | Weiming Feng et.al. | 2410.22267 | null |
2024-10-29 | Beating Bellman’s Algorithm for Subset Sum | Karl Bringmann et.al. | 2410.21942 | null |
2024-10-28 | Analysis of Different Algorithmic Design Techniques for Seam Carving | Owais Aijaz et.al. | 2410.21207 | null |
2024-10-27 | A New Method for Inserting Train Paths into a Timetable | David Dekker et.al. | 2410.20561 | link |
2024-10-27 | On the I/O Complexity of the CYK Algorithm and of a Family of Related DP Algorithms | Lorenzo De Stefani et.al. | 2410.20337 | null |
2024-10-25 | An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration | Gengyuan Cai et.al. | 2410.19373 | null |
2024-10-24 | Stochastic dynamic programming under recursive Epstein-Zin preferences | Anna Jaśkiewicz et.al. | 2410.19181 | null |
2024-10-24 | A Counterexample in Cross-Correlation Template Matching | Serap A. Savari et.al. | 2410.19085 | null |
2024-10-23 | Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing | Mikhail Khrenov et.al. | 2410.18207 | null |
2024-10-24 | Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices | Chanwoo Chun et.al. | 2410.17998 | null |
2024-10-21 | Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach | Xinjie Liu et.al. | 2410.16441 | null |
2024-10-21 | All You Need is an Improving Column: Enhancing Column Generation for Parallel Machine Scheduling via Transformers | Amira Hijazi et.al. | 2410.15601 | null |
2024-10-21 | How to Find the Exact Pareto Front for Multi-Objective MDPs? | Yining Li et.al. | 2410.15557 | null |
2024-10-20 | CASET: Complexity Analysis using Simple Execution Traces for CS* submissions | Aaryen Mehta et.al. | 2410.15419 | null |
2024-10-19 | The Constrained Layer Tree Problem and Applications to Solar Farm Cabling | Thomas Bläsius et.al. | 2410.15031 | null |
2024-10-18 | On picking operations in e-commerce warehouses: Insights from the complete-information counterpart | Catherine Lorenz et.al. | 2410.14316 | null |
2024-10-17 | Quasi-quantum states and the quasi-quantum PCP theorem | Itai Arad et.al. | 2410.13549 | null |
2024-10-17 | Joint Antenna Selection and Covariance Matrix Optimization for ISAC Systems | Michail Palaiologos et.al. | 2410.13446 | null |
2024-10-17 | Membership Testing for Semantic Regular Expressions | Yifei Huang et.al. | 2410.13262 | null |
2024-10-22 | Research on Travel Route Planing Problems Based on Greedy Algorithm | Yiquan Wang et.al. | 2410.13226 | link |
2024-10-17 | Algorithmic Content Selection and the Impact of User Disengagement | Emilio Calvano et.al. | 2410.13108 | null |
2024-10-16 | Learning Representations for Reasoning: Generalizing Across Diverse Structures | Zhaocheng Zhu et.al. | 2410.13018 | null |
2024-10-16 | Vehicle Localization in GPS-Denied Scenarios Using Arc-Length-Based Map Matching | Nur Uddin Javed et.al. | 2410.12208 | null |
2024-10-15 | Incremental computation of the set of period sets | Eric Rivals et.al. | 2410.12077 | null |
2024-10-15 | Routing and Scheduling Optimization for Urban Air Mobility Fleet Management using Quantum Annealing | Renichiro Haba et.al. | 2410.11231 | null |
2024-10-16 | SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | Akrit Mudvari et.al. | 2410.10759 | null |
2024-10-14 | Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics | Andreas Boltres et.al. | 2410.10377 | null |
2024-10-09 | Rapid Computation of the Assembly Index of Molecular Graphs | Ian Seet et.al. | 2410.09100 | null |
2024-10-11 | Deep Learning Algorithms for Mean Field Optimal Stopping in Finite Space and Discrete Time | Lorenzo Magnino et.al. | 2410.08850 | null |
2024-10-11 | Hybrid Filtering Heuristic for the Sensor-Placement Problem to Discretize 2D Continuous Environments | Jan Mikula et.al. | 2410.08784 | link |
2024-10-10 | Dynamic Programming based Local Search approaches for Multi-Agent Path Finding problems on Directed Graphs | Irene Saccani et.al. | 2410.07954 | null |
2024-10-10 | Partitioning Trillion Edge Graphs on Edge Devices | Adil Chhabra et.al. | 2410.07732 | null |
2024-10-11 | Q-WSL:Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL | Xing Lei et.al. | 2410.06648 | null |
2024-10-08 | Solvability of Equilibrium Riccati Equations: A Direct Approach | Bowen Ma et.al. | 2410.06090 | null |
2024-10-07 | Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming | Shubham Gupta et.al. | 2410.05455 | link |
2024-10-07 | A Predictive and Optimization Approach for Enhanced Urban Mobility Using Spatiotemporal Data | Shambhavi Mishra et.al. | 2410.05358 | null |
2024-10-05 | AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text | Ximing Lu et.al. | 2410.04265 | link |
2024-10-05 | A branch-&-price approach to the unrooted maximum agreement forest problem | Martin Frohn et.al. | 2410.04122 | null |
2024-10-02 | Electrification of Transportation: A Hybrid Benders/SDDP Algorithm for Optimal Charging Station Trading | Farnaz Sohrabi et.al. | 2410.03763 | null |
2024-10-02 | Effects of eco-driving on energy consumption and battery degradation for electric vehicles at signalized intersections | Yongqiang Wang et.al. | 2410.01685 | null |
2024-10-02 | Krylov-Safonov theory for Pucci-type extremal inequalities on random data clouds | Ángel Arroyo et.al. | 2410.01642 | null |
2024-10-02 | Automated Curvy Waveguide Routing for Large-Scale Photonic Integrated Circuits | Hongjian Zhou et.al. | 2410.01260 | link |
2024-09-30 | Generalised mixed effects models for changepoint analysis of biomedical time series data | Mark B. Fiecas et.al. | 2410.00183 | null |
2024-09-30 | Opt2Skill: Imitating Dynamically-feasible Whole-Body Trajectories for Versatile Humanoid Loco-Manipulation | Fukang Liu et.al. | 2409.20514 | null |
2024-09-28 | On Computing Elastic Shape Distances between Curves in d-dimensional Space | Javier Bernal et.al. | 2409.19380 | null |
2024-09-25 | MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Katharina Anderer et.al. | 2409.16765 | link |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-24 | Partial Elastic Shape Registration of 3D Surfaces using Dynamic Programming | Javier Bernal et.al. | 2409.16462 | null |
2024-09-25 | Efficient Nearest Neighbor Search Using Dynamic Programming | Pengfei Wang et.al. | 2409.15023 | null |
2024-09-22 | Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming | Simon Malan et.al. | 2409.14486 | null |
2024-09-24 | Batch Predictive Inference | Yonghoon Lee et.al. | 2409.13990 | link |
2024-09-20 | A Modified Algorithm for Optimal Picker Routing in a Single Block Warehouse | George Dunn et.al. | 2409.13219 | null |
2024-09-19 | Program Slicing in the Era of Large Language Models | Kimya Khakzad Shahandashti et.al. | 2409.12369 | null |
2024-09-18 | Differential dynamic programming with stagewise equality and inequality constraints using interior point method | Siddharth Prabhu et.al. | 2409.12048 | link |
2024-09-20 | Second-Order Constrained Dynamic Optimization | Yuichiro Aoyama et.al. | 2409.11649 | null |
2024-09-18 | Multi-stage stochastic linear programming for shared autonomous vehicle system operation and design with on-demand and pre-booked requests | Riki Kawase et.al. | 2409.11611 | null |
2024-09-17 | Optimal Investment with Costly Expert Opinions | Christoph Knochenhauer et.al. | 2409.11569 | null |
2024-09-20 | Exact Wavefront Propagation for Globally Optimal One-to-All Path Planning on 2D Cartesian Grids | Ibrahim Ibrahim et.al. | 2409.11545 | link |
2024-09-17 | Neural Networks for Vehicle Routing Problem | László Kovács et.al. | 2409.11290 | null |
2024-09-17 | Selective algorithm processing of subset sum distributions | Nick Dawes et.al. | 2409.11076 | null |
2024-09-17 | Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching | Yixiang Dai et.al. | 2409.11004 | null |
2024-09-17 | Relationship between stochastic maximum principle and dynamic programming principle under convex expectation | Xiaojuan Li et.al. | 2409.10987 | null |
2024-09-16 | Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees | Ramin Esmzad et.al. | 2409.10703 | null |
2024-09-20 | Motion Forecasting via Model-Based Risk Minimization | Aron Distelzweig et.al. | 2409.10585 | null |
2024-09-16 | Estimates for Optimal Multistage Group Partition Testing | Guojiang Shao et.al. | 2409.10410 | null |
2024-09-16 | Pareto Sums of Pareto Sets: Lower Bounds and Algorithms | Daniel Funke et.al. | 2409.10232 | null |
2024-09-12 | Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Teng Yan et.al. | 2409.08062 | null |
2024-09-12 | Super Monotonic Alignment Search | Junhyeok Lee et.al. | 2409.07704 | link |
2024-09-10 | Design of Threshold-Constrained Indirect Quantizers | Ariel Doubchak et.al. | 2409.06839 | null |
2024-09-10 | Cooptimizing Safety and Performance with a Control-Constrained Formulation | Hao Wang et.al. | 2409.06696 | link |
2024-09-12 | Valuation Model of Chinese Convertible Bonds Based on Monte Carlo Simulation | Yu Liu et.al. | 2409.06496 | null |
2024-09-09 | OTFS-MDMA: An Elastic Multi-Domain Resource Utilization Mechanism for High Mobility Scenarios | Jie Chen et.al. | 2409.05724 | null |
2024-09-09 | Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Misalignment in Emotional Perception | Linh H Nghiem et.al. | 2409.05343 | null |
2024-09-08 | Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks | Khai Doan et.al. | 2409.05025 | null |
2024-09-08 | Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels | Wenqian Xue et.al. | 2409.04945 | null |
2024-09-17 | Second-Order Stein Variational Dynamic Optimization | Yuichiro Aoyama et.al. | 2409.04644 | null |
2024-09-06 | Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning | Yunus Emre Demirci et.al. | 2409.04351 | null |
2024-09-05 | Space-Efficient Algorithm for Integer Programming with Few Constraints | Lars Rohwedder et.al. | 2409.03681 | null |
2024-09-05 | Fine-Grained Equivalence for Problems Related to Integer Linear Programming | Lars Rohwedder et.al. | 2409.03675 | null |
2024-09-06 | Revenue Management with Calendar-Aware and Dependent Demands: Asymptotically Tight Fluid Approximations | Weiyuan Li et.al. | 2409.02637 | null |
2024-09-03 | FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Liqun Yang et.al. | 2409.01944 | link |
2024-09-03 | Quantum Algorithms for One-Sided Crossing Minimization | Susanna Caroppo et.al. | 2409.01942 | null |
2024-09-02 | Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning | Hongpei Li et.al. | 2409.00968 | link |
2024-09-02 | Multistage Robust Average Randomized Spectral Risk Optimization | Qiong Wu et.al. | 2409.00892 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-09-01 | Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning | Jiaming Yin et.al. | 2409.00754 | null |
2024-09-01 | The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming | Jihun Kim et.al. | 2409.00655 | null |
2024-08-31 | Foundations of Multivariate Distributional Reinforcement Learning | Harley Wiltzer et.al. | 2409.00328 | null |
2024-08-30 | Approximation Algorithms for Anchored Multiwatchman Routes | Joseph S. B. Mitchell et.al. | 2408.17343 | null |
2024-08-30 | Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR | Xihong Su et.al. | 2408.17286 | link |
2024-08-30 | A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation | Camila Martinez Parra et.al. | 2408.17113 | null |
2024-08-29 | Optimization Models for the Quadratic Traveling Salesperson Problem | Yuxiao Chen et.al. | 2408.16680 | null |
2024-08-27 | On the parameterized complexity of computing good edge-labelings | Davi de Andrade et.al. | 2408.15181 | null |
2024-08-26 | Achieving designed texture and flows in bulk active nematics using optimal control theory | Saptorshi Ghosh et.al. | 2408.14596 | null |
2024-08-25 | Decentralized Stochastic Control in Standard Borel Spaces: Centralized MDP Reductions, Near Optimality of Finite Window Local Information, and Q-Learning | Omar Mrani-Zentar et.al. | 2408.13828 | null |
2024-08-23 | The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Venkatesh Balavadhani Parthasarathy et.al. | 2408.13296 | null |
2024-08-18 | An Introduction to Cognidynamics | Marco Gori et.al. | 2408.13112 | null |
2024-08-20 | Optimal Guarantees for Online Selection Over Time | Sebastian Perez-Salazar et.al. | 2408.11224 | null |
2024-08-20 | Fault Tolerant Dynamic Task Assignment for UAV-based Search Teams | Ali Nasir et.al. | 2408.10564 | null |
2024-08-19 | Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm | Nikolai Rozanov et.al. | 2408.10055 | null |
2024-08-19 | Continuous-Time Dynamic Decision Making with Costly Information | Christoph Knochenhauer et.al. | 2408.09693 | null |
2024-08-19 | Solving stochastic climate-economy models: A deep least-squares Monte Carlo approach | Aleksandar Arandjelović et.al. | 2408.09642 | null |
2024-08-18 | Exploratory Optimal Stopping: A Singular Control Formulation | Jodi Dianetti et.al. | 2408.09335 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-17 | Twin Sorting Dynamic Programming Assisted User Association and Wireless Bandwidth Allocation for Hierarchical Federated Learning | Rung-Hung Gau et.al. | 2408.09076 | null |
2024-08-17 | Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) | Mingkuan Xu et.al. | 2408.09055 | null |
2024-08-15 | Optimal control problems with generalized mean-field dynamics and viscosity solution to Master Bellman equation | Rainer Buckdahn et.al. | 2408.08046 | null |
2024-08-14 | Differentiating Policies for Non-Myopic Bayesian Optimization | Darian Nwankwo et.al. | 2408.07812 | null |
2024-08-11 | Moderate Exponential-time Quantum Dynamic Programming Across the Subsets for Scheduling Problems | Camille Grange et.al. | 2408.05741 | null |
2024-08-10 | Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward | Zetong Xuan et.al. | 2408.05438 | null |
2024-08-09 | MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling | Drew Edwards et.al. | 2408.05024 | null |
2024-08-09 | A Comprehensive System Architecture using Field Programmable Gate Arrays Technology, Dijkstra’s Algorithm, and Edge Computing for Emergency Response in Smart Cities | Mahamat Abdel Aziz Assoul et.al. | 2408.04924 | null |
2024-08-08 | Mathematical Programming For Adaptive Experiments | Ethan Che et.al. | 2408.04570 | null |
2024-08-08 | Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Simon Dima et.al. | 2408.04385 | null |
2024-08-08 | Enhanced Traffic Flow Prediction with Multi-Segment Fusion Tensor Graph Convolutional Networks | Wei Zhang et.al. | 2408.04232 | null |
2024-08-06 | A Course in Dynamic Optimization | Bar Light et.al. | 2408.03034 | null |
2024-08-05 | Positive Dynamic Programming: A Critique | Aaqib Peerzada et.al. | 2408.02809 | null |
2024-08-05 | Multi-level Traffic-Responsive Tilt Camera Surveillance through Predictive Correlated Online Learning | Tao Li et.al. | 2408.02208 | null |
2024-08-04 | Non-local Hamilton-Jacobi-Bellman equations for the stochastic optimal control of path-dependent piecewise deterministic processes | Elena Bandini et.al. | 2408.02147 | null |
2024-08-03 | Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Balázs Opra et.al. | 2408.01640 | null |
2024-08-02 | Occasionally Observed Piecewise-deterministic Markov Processes | Marissa Gee et.al. | 2408.01335 | null |
2024-08-02 | The Impact of Program Reduction on Automated Program Repair | Linas Vidziunas et.al. | 2408.01134 | null |
2024-08-11 | Deep Learning Approach for Changepoint Detection: Penalty Parameter Optimization | Tung L Nguyen et.al. | 2408.00856 | link |
2024-07-31 | Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation | Taehyun Cho et.al. | 2407.21260 | null |
2024-07-30 | A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling | Gabriele Agliardi et.al. | 2407.20802 | null |
2024-07-30 | Generalized replicator dynamics based on mean-field pairwise comparison dynamic | Hidekazu Yoshioka et.al. | 2407.20751 | null |
2024-08-10 | A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks | Dongbin Jiao et.al. | 2407.20585 | null |
2024-07-29 | A Differential Dynamic Programming Framework for Inverse Reinforcement Learning | Kun Cao et.al. | 2407.19902 | null |
2024-07-27 | Map-Matching Queries under Fréchet Distance on Low-Density Spanners | Kevin Buchin et.al. | 2407.19304 | null |
2024-07-26 | RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity | David Zenati et.al. | 2407.18683 | null |
2024-07-26 | Mean-field control of non exchangeable systems | Anna De Crescenzo et.al. | 2407.18635 | null |
2024-08-01 | Stochastic Games with Minimally Bounded Action Costs | David Mguni et.al. | 2407.18010 | null |
2024-07-25 | Personalized and Context-aware Route Planning for Edge-assisted Vehicles | Dinesh Cyril Selvaraj et.al. | 2407.17980 | null |
2024-07-23 | Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings | Petar Bevanda et.al. | 2407.16407 | null |
2024-07-23 | Data-driven Multistage Distributionally Robust Linear Optimization with Nested Distance | Rui Gao et.al. | 2407.16346 | null |
2024-07-22 | Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search | Redha Taguelmimt et.al. | 2407.16092 | null |
2024-07-22 | Scheduling on a Stochastic Number of Machines | Moritz Buchem et.al. | 2407.15737 | null |
2024-07-20 | Interdiction of minimum spanning trees and other matroid bases | Noah Weninger et.al. | 2407.14906 | link |
2024-07-20 | A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems | Kamran Razavi et.al. | 2407.14843 | null |
2024-07-19 | Dynamic Programming Techniques for Planar Orbital Transfer of Low Earth Orbit Satellites | C. Ciancarelli et.al. | 2407.14675 | null |
2024-07-19 | Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs | Du Ouyang et.al. | 2407.14566 | null |
2024-07-19 | On Policy Evaluation Algorithms in Distributional Reinforcement Learning | Julian Gerstenberg et.al. | 2407.14175 | null |
2024-07-18 | Shaded Route Planning Using Active Segmentation and Identification of Satellite Images | Longchao Da et.al. | 2407.13689 | null |
2024-07-18 | The Madness of Multiple Entries in March Madness | Jeff Decary et.al. | 2407.13438 | null |
2024-07-18 | Double interdiction problem on trees on the sum of root-leaf distances by upgrading edges | Xiao Li et.al. | 2407.13391 | null |
2024-07-18 | Deterministic Trajectory Optimization through Probabilistic Optimal Control | Mohammad Mahmoudi Filabadi et.al. | 2407.13316 | null |
2024-07-18 | Integrated Hardware Architecture and Device Placement Search | Irene Wang et.al. | 2407.13143 | link |
2024-07-18 | Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II | Rixin Wu et.al. | 2407.13113 | null |
2024-07-17 | Dynamic Programming Principle and Hamilton-Jacobi-Bellman Equation for Optimal Control Problems with Uncertainty | M. Soledad Aronna et.al. | 2407.13045 | null |
2024-07-17 | Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics | Kevin L. McKinney et.al. | 2407.12775 | null |
2024-07-16 | Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic | Ziyan An et.al. | 2407.10820 | null |
2024-07-14 | Fine Grained Lower Bounds for Multidimensional Knapsack | Ilan Doron-Arad et.al. | 2407.10146 | null |
2024-07-12 | Investigating the Interplay of Prioritized Replay and Generalization | Parham Mohammad Panahi et.al. | 2407.09702 | null |
2024-07-12 | An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands | Ahmed Shalaby et.al. | 2407.09676 | null |
2024-07-12 | Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey | Milan Ganai et.al. | 2407.09645 | null |
2024-07-12 | Integer programs with nearly totally unimodular matrices: the cographic case | Manuel Aprile et.al. | 2407.09477 | null |
2024-07-12 | A new approach to principal-agent problems with volatility control | Alessandro Chiusolo et.al. | 2407.09471 | null |
2024-07-12 | CAACS: A Carbon Aware Ant Colony System | Marina Lin et.al. | 2407.09404 | null |
2024-07-12 | Structure and Independence in Hyperbolic Uniform Disk Graphs | Thomas Bläsius et.al. | 2407.09362 | null |
2024-07-12 | KUNPENG: An Embodied Large Model for Intelligent Maritime | Naiyao Wang et.al. | 2407.09048 | link |
2024-07-09 | Trajectory Data Mining and Trip Travel Time Prediction on Specific Roads | Muhammad Awais Amin et.al. | 2407.07030 | null |
2024-07-08 | Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming | Xihong Su et.al. | 2407.06329 | link |
2024-07-08 | Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization | Daniil Tiapkin et.al. | 2407.05704 | null |
2024-07-06 | Advancing Algorithmic Approaches to Probabilistic Argumentation under the Constellation Approach | Andrei Popescu et.al. | 2407.05058 | null |
2024-07-05 | Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning | Eric Pasewark et.al. | 2407.04787 | link |
2024-07-05 | GOALPlace: Begin with the End in Mind | Anthony Agnesina et.al. | 2407.04579 | null |
2024-07-04 | Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms | Hariram Sampath Kumar et.al. | 2407.04087 | null |
2024-07-04 | Multi-Time Scale Service Caching and Pricing in MEC Systems with Dynamic Program Popularity | Yiming Chen et.al. | 2407.03804 | null |
2024-07-03 | Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios | Alexandra Kapp et.al. | 2407.03237 | null |
2024-07-12 | A Two-stage Identification Method for Switched Linear Systems | Zheng Wenju et.al. | 2407.02743 | null |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-06-28 | Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints | Arash Mozhdehi et.al. | 2407.01615 | null |
2024-07-02 | Contractual Reinforcement Learning: Pulling Arms with Invisible Hands | Jibang Wu et.al. | 2407.01458 | null |
2024-07-01 | Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach | Stef Baas et.al. | 2407.01055 | null |
2024-06-30 | Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models | Sangwoong Yoon et.al. | 2407.00626 | link |
2024-06-30 | Your Car Tells Me Where You Drove: A Novel Path Inference Attack via CAN Bus and OBD-II Data | Tommaso Bianchi et.al. | 2407.00585 | null |
2024-06-29 | A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation | Aicheng Gong et.al. | 2407.00496 | link |
2024-06-29 | Vector-valued robust stochastic control | Igor Cialenco et.al. | 2407.00266 | null |
2024-06-28 | Leveraging Fixed-Parameter Tractability for Robot Inspection Planning | Yosuke Mizutani et.al. | 2407.00251 | null |
2024-06-28 | Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations | Bahar Cavdar et.al. | 2407.00173 | null |
2024-06-28 | Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing | Rui Li et.al. | 2406.19613 | null |
2024-06-27 | Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features | Halil Utku Unlu et.al. | 2406.19461 | link |
2024-06-27 | Cuts in Graphs with Matroid Constraints | Aritra Banik et.al. | 2406.19134 | null |
2024-06-27 | State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems | Tochukwu Elijah Ogri et.al. | 2406.18804 | null |
2024-06-26 | Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem | Malgorzata M. O’Reilly et.al. | 2406.18618 | null |
2024-06-26 | Tiered Service Architecture for Remote Patient Monitoring | Siddharth Chandak et.al. | 2406.18000 | null |
2024-06-25 | Splitting Guarantees for Prophet Inequalities via Nonlinear Systems | Johannes Brustle et.al. | 2406.17767 | null |
2024-06-25 | Using iterated local alignment to aggregate GPS trajectories into a traffic flow map | Tarn Duong et.al. | 2406.17500 | null |
2024-06-24 | A multiplicative surface signature through its Magnus expansion | Ilya Chevyrev et.al. | 2406.16856 | null |
2024-06-24 | Stochastic Path-Dependent Volatility Models for Price-Storage Dynamics in Natural Gas Markets and Discrete-Time Swing Option Pricing | Jinniao Qiu et.al. | 2406.16400 | null |
2024-06-21 | Exact discovery is polynomial for sparse causal Bayesian networks | Felix L. Rios et.al. | 2406.15012 | link |
2024-06-19 | A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials | Jichao Fan et.al. | 2406.13190 | null |
2024-06-14 | Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction | Wenzhao Jiang et.al. | 2406.12923 | null |
2024-06-26 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | link |
2024-06-17 | LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications | Syed Salauddin Mohammad Tariq et.al. | 2406.11734 | null |
2024-06-17 | Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces | Shengbo Wang et.al. | 2406.11281 | null |
2024-06-16 | WeShap: Weak Supervision Source Evaluation with Shapley Values | Naiqing Guan et.al. | 2406.11010 | null |
2024-06-16 | Solving Co-Path/Cycle Packing Faster than $3^k$ | Yuxi Liu et.al. | 2406.10829 | null |
2024-06-15 | Scheduling two types of jobs with minimum makespan | Song Cao et.al. | 2406.10467 | null |
2024-06-14 | CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment | Meihui Wang et.al. | 2406.10069 | link |
2024-06-13 | Optimal Control of Agent-Based Dynamics under Deep Galerkin Feedback Laws | Frederik Kelbel et.al. | 2406.09141 | link |
2024-06-13 | Coordinated Trading Strategies for Battery Storage in Reserve and Spot Markets | Paul E. Seifert et.al. | 2406.08390 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507 | null |
2024-06-11 | Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces | Salvatore Federico et.al. | 2406.07242 | null |
2024-06-10 | Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents | Federico Rossi et.al. | 2406.06724 | null |
2024-06-10 | Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation | Chun-Hsiang Chuang et.al. | 2406.06327 | null |
2024-06-09 | Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study | Babak Javadi et.al. | 2406.05803 | null |
2024-06-09 | Heart Sound Segmentation Using Deep Learning Techniques | Manas Madine et.al. | 2406.05653 | null |
2024-06-11 | Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently | Sergio Calo et.al. | 2406.04056 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-21 | Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees | Ayman Chaouki et.al. | 2406.02175 | link |
2024-06-03 | An efficient solution to Hidden Markov Models on trees with coupled branches | Farzan Vafa et.al. | 2406.01663 | null |
2024-06-03 | A New View on Planning in Online Reinforcement Learning | Kevin Roice et.al. | 2406.01562 | null |
2024-06-02 | Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems | Jiaqi Liang et.al. | 2406.00868 | null |
2024-06-02 | Computing Optimal Equilibria in Repeated Games with Restarts | Ratip Emin Berker et.al. | 2406.00851 | null |
2024-06-02 | A Lazy Abstraction Algorithm for Markov Decision Processes: Theory and Initial Evaluation | Dániel Szekeres et.al. | 2406.00824 | null |
2024-06-10 | Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming | Dimitri P. Bertsekas et.al. | 2406.00592 | null |
2024-06-01 | Optimal Transmission Power Scheduling for Networked Control System under DoS Attack | Siyi Wang et.al. | 2406.00540 | null |
2024-06-01 | A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes | Zhenwei Lin et.al. | 2406.00274 | link |
2024-05-31 | Finding Diverse Solutions Parameterized by Cliquewidth | Karolina Drabik et.al. | 2405.20931 | null |
2024-05-29 | A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost | Chunhui Chen et.al. | 2405.19246 | null |
2024-05-28 | A Pontryagin Perspective on Reinforcement Learning | Onno Eberhard et.al. | 2405.18100 | null |
2024-05-27 | Q-value Regularized Transformer for Offline Reinforcement Learning | Shengchao Hu et.al. | 2405.17098 | null |
2024-05-25 | A Bi-Objective Approach to Last-Mile Delivery Routing Considering Driver Preferences | Juan Pablo Mesa et.al. | 2405.16051 | null |
2024-06-03 | Inference of Utilities and Time Preference in Sequential Decision-Making | Haoyang Cao et.al. | 2405.15975 | null |
2024-05-31 | Stability and Performance Analysis of Model Predictive Control of Uncertain Linear Systems | Changrui Liu et.al. | 2405.15552 | link |
2024-05-24 | An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking | Pratyusha Musunuru et.al. | 2405.15137 | null |
2024-05-23 | Two-Stage ML-Guided Decision Rules for Sequential Decision Making under Uncertainty | Andrew Rosemberg et.al. | 2405.14973 | null |
2024-05-23 | A rolling horizon heuristic approach for a multi-stage stochastic waste collection problem | Andrea Spinelli et.al. | 2405.14499 | link |
2024-05-23 | EdgeShard: Efficient LLM Inference via Collaborative Edge Computing | Mingjin Zhang et.al. | 2405.14371 | null |
2024-05-23 | Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction | Federica Storiale et.al. | 2405.14363 | null |
2024-05-23 | Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time | Jeremy McMahan et.al. | 2405.14183 | null |
2024-05-22 | Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning | Maximilian Nägele et.al. | 2405.13609 | link |
2024-05-21 | Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods | Ryoya Yamasaki et.al. | 2405.12756 | link |
2024-05-21 | Short and simple introduction to Bellman filtering and smoothing | Rutger-Jan Lange et.al. | 2405.12668 | null |
2024-05-21 | Data-driven Coordinated AC/DC Control Strategy for Frequency Safety | Qianni Cao et.al. | 2405.12546 | null |
2024-05-20 | Semantic Trajectory Data Mining with LLM-Informed POI Classification | Yifan Liu et.al. | 2405.11715 | null |
2024-05-18 | On the Trajectory Regularity of ODE-based Diffusion Sampling | Defang Chen et.al. | 2405.11326 | link |
2024-05-15 | Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task | Shurong Wang et.al. | 2405.09477 | null |
2024-05-14 | Treatment Effect Estimation for User Interest Exploration on Recommender Systems | Jiaju Chen et.al. | 2405.08582 | link |
2024-05-27 | Dynamic Programming for Symbolic Boolean Realizability and Synthesis | Yi Lin et.al. | 2405.07975 | null |
2024-05-13 | Space Domain based Ecological Cooperative and Adaptive Cruise Control on Rolling Terrain | Mingyue Lei et.al. | 2405.07553 | null |
2024-05-12 | Deciding regular games: a playground for exponential time algorithms | Zihui Liang et.al. | 2405.07188 | null |
2024-05-12 | Trade execution games in a Markovian environment | Masamitsu Ohnishi et.al. | 2405.07184 | null |
2024-05-10 | Dynamic programming principle and computable prices in financial market models with transaction costs | Emmanuel Lepinette et.al. | 2405.06623 | null |
2024-05-09 | Change point localisation and inference in fragmented functional data | Gengyu Xue et.al. | 2405.05730 | link |
2024-05-09 | Infinite horizon stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems | Sheng Luo et.al. | 2405.05561 | null |
2024-05-14 | Robust Reward Placement under Uncertainty | Petros Petsinis et.al. | 2405.05433 | null |
2024-05-06 | Novel Tour Construction Heuristic for Pick-Up and Delivery Routing Problems | Mithun Goutham et.al. | 2405.03774 | null |
2024-05-05 | TSP Escapes the $O(2^n n^2)$ Curse | Mihail Stoian et.al. | 2405.03018 | link |
2024-05-02 | DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines | Ye Tian et.al. | 2405.01248 | null |
2024-05-02 | Lipschitz constant estimation for general neural network architectures using control tools | Patricia Pauli et.al. | 2405.01125 | link |
2024-05-01 | A biased random-key genetic algorithm with variable mutants to solve a vehicle routing problem | Paola Festa et.al. | 2405.00268 | null |
2024-04-28 | Bi-objective optimization of a VRP problem applied to urban solid waste collection through a model that includes the visual attraction of routes | Diego Rossit et.al. | 2405.00068 | null |
2024-04-26 | Energy Storage Arbitrage in Two-settlement Markets: A Transformer-Based Approach | Saud Alghumayjan et.al. | 2404.17683 | null |
2024-04-25 | Path integral control under McKean-Vlasov dynamics | Timothy Bennett et.al. | 2404.17006 | null |
2024-04-25 | Parallel and (Nearly) Work-Efficient Dynamic Programming | Xiangyun Ding et.al. | 2404.16314 | link |
2024-04-23 | Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes | Yanjun Han et.al. | 2404.15454 | null |
2024-04-26 | Variational Dynamic Programming for Stochastic Optimal Control | Marc Lambert et.al. | 2404.14806 | link |
2024-04-22 | Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 $^\circ$ VR Video Streaming | Haopeng Wang et.al. | 2404.14573 | null |
2024-04-21 | Stochastic Multi-round Submodular Optimization with Budget | Vincenzo Auletta et.al. | 2404.13737 | null |
2024-04-21 | Planning of Truck Platooning for Road-Network Capacitated Vehicle Routing Problem | Yilang Hao et.al. | 2404.13512 | null |
2024-04-20 | Liquidity Pool Design on Automated Market Makers | Xue Dong He et.al. | 2404.13291 | null |
2024-04-19 | Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning | Daniel May et.al. | 2404.13142 | null |
2024-04-18 | NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model | Sevin Mohammadi et.al. | 2404.12460 | null |
2024-04-18 | Recursive stochastic differential games with non-Lipschitzian generators and viscosity solutions of Hamilton-Jacobi-Bellman-Isaacs equation | Guangchen Wang et.al. | 2404.12129 | null |
2024-04-18 | Actor-Critic Reinforcement Learning with Phased Actor | Ruofan Wu et.al. | 2404.11834 | null |
2024-04-18 | Itō and Itō-Wentzell chain rule for flows of conditional laws of continuous semimartingales: an easy approach | Assil Fadle et.al. | 2404.11010 | null |
2024-04-16 | Zero-Sum Games for Volterra Integral Equations and Viscosity Solutions of Path-Dependent Hamilton-Jacobi Equations | Mikhail I. Gomoyunov et.al. | 2404.10428 | null |
2024-04-16 | Urban Water Sprinkler Routing: A Multi-Depot Mixed Capacitated Arc Routing Problem Incorporating Real-Time Demands | Hongtai Yang et.al. | 2404.10230 | null |
2024-04-13 | Fast Gradient Computation for Gromov-Wasserstein Distance | Wei Zhang et.al. | 2404.08970 | null |
2024-04-12 | A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees | Aaresh Bhathena et.al. | 2404.08178 | link |
2024-04-06 | Viscosity solutions for mean field optimal switching with a two-time-scale Markov chain | Tian Chen et.al. | 2404.07998 | null |
2024-04-11 | Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach | Hyun Joe Jeong et.al. | 2404.07431 | null |
2024-04-09 | Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes | Matilde Gargiani et.al. | 2404.06136 | null |
2024-04-09 | fastcpd: Fast Change Point Detection in R | Xingchi Li et.al. | 2404.05933 | link |
2024-04-08 | Non-concave distributionally robust stochastic control in a discrete time finite horizon setting | Ariel Neufeld et.al. | 2404.05230 | link |
2024-04-07 | Percentile Criterion Optimization in Offline Reinforcement Learning | Elita A. Lobo et.al. | 2404.05055 | link |
2024-04-05 | A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping | Javier Rodriguez-Sanchez et.al. | 2404.04404 | null |
2024-04-04 | Forecasting with Neuro-Dynamic Programming | Pedro Afonso Fernandes et.al. | 2404.03737 | null |
2024-04-03 | Reinforcement Learning in Categorical Cybernetics | Jules Hedges et.al. | 2404.02688 | null |
2024-04-03 | Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization | Chanyeong Kim et.al. | 2404.02583 | null |
2024-04-01 | Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Gengyu Zhang et.al. | 2404.02176 | null |
2024-03-31 | Adversarially-Robust Inference on Trees via Belief Propagation | Samuel B. Hopkins et.al. | 2404.00768 | null |
2024-03-28 | A Faster Algorithm for Pigeonhole Equal Sums | Ce Jin et.al. | 2403.19117 | null |
2024-03-27 | Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees | Jonathan de Brusse et.al. | 2403.19007 | null |
2024-03-27 | A Dynamic Programming Approach for Road Traffic Estimation | Mattia Laurini et.al. | 2403.18561 | null |
2024-03-26 | Generalized Maximum Entropy Differential Dynamic Programming | Yuichiro Aoyama et.al. | 2403.18130 | null |
2024-03-26 | Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer | Jeong-Yoon Kim et.al. | 2403.17327 | link |
2024-03-25 | State-Augmented Linear Games with Antagonistic Error for High-Dimensional, Nonlinear Hamilton-Jacobi Reachability | Will Sharpless et.al. | 2403.16982 | link |
2024-03-25 | Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints | Jiping Luo et.al. | 2403.16855 | null |
2024-03-24 | On the Navier-Stokes equations and the Hamilton-Jacobi-Bellman equation on the group of volume preserving diffeomorphisms | Xiang-Dong Li et.al. | 2403.15997 | null |
2024-03-23 | On Merton’s Optimal Portfolio Problem under Sporadic Bankruptcy | Yaacov Kopeliovich et.al. | 2403.15923 | link |
2024-03-22 | Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards | Daniel C. May et.al. | 2403.15617 | null |
2024-03-19 | Most Likely Sequence Generation for $n$ -Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms | Yuchao Li et.al. | 2403.15465 | null |
2024-03-21 | Conservative Linear Envelopes for High-Dimensional, Hamilton-Jacobi Reachability for Nonlinear Systems via the Hopf Formula | Will Sharpless et.al. | 2403.14184 | null |
2024-03-20 | Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements | Hamed Taghavian et.al. | 2403.13605 | null |
2024-03-19 | Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models | Quang Minh Bui et.al. | 2403.12923 | null |
2024-03-18 | AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | SooHwan Eom et.al. | 2403.11578 | null |
2024-03-17 | Multiscale Quantile Regression with Local Error Control | Zhi Liu et.al. | 2403.11356 | link |
2024-03-15 | Fast Generation of Feasible Trajectories in Direct Optimal Control | David Kiessling et.al. | 2403.10115 | link |
2024-03-14 | Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems | Ralf Römer et.al. | 2403.09504 | link |
2024-03-14 | Quantum Dynamic Programming | Jeongrak Son et.al. | 2403.09187 | null |
2024-03-15 | Relationship between General MP and DPP for the Stochastic Recursive Optimal Control Problem With Jumps: Viscosity Solution Framework | Bin Wang et.al. | 2403.09044 | null |
2024-03-13 | Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | Jiajun Shen et.al. | 2403.08948 | null |
2024-03-13 | Online Multi-Contact Feedback Model Predictive Control for Interactive Robotic Tasks | Seo Wook Han et.al. | 2403.08302 | null |
2024-03-12 | Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | Maqsood Hussain Shah et.al. | 2403.07964 | null |
2024-03-12 | The Primal Pathwidth SETH | Michael Lampis et.al. | 2403.07239 | null |
2024-03-10 | A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units | Liyue Chen et.al. | 2403.07022 | link |
2024-03-11 | Domain-Independent Dynamic Programming and Constraint Programming Approaches for Assembly Line Balancing Problems with Setups | Jiachen Zhang et.al. | 2403.06780 | null |
2024-03-11 | Balanced Substructures in Bicolored Graphs | P. S. Ardra et.al. | 2403.06608 | null |
2024-03-11 | An Efficient Solution to the 2D Visibility Problem in Cartesian Grid Maps and its Application in Heuristic Path Planning | Ibrahim Ibrahim et.al. | 2403.06494 | link |
2024-03-11 | AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping | Seongyeon Park et.al. | 2403.06478 | link |
2024-03-09 | Spatial Clustering Approach for Vessel Path Identification | Mohamed Abuella et.al. | 2403.05778 | link |
2024-03-07 | On $[1,2]$ -Domination in Interval and Circle Graphs | Mohsen Alambardar Meybodi et.al. | 2403.04694 | null |
2024-03-07 | Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Sadegh Sadeghi Tabas et.al. | 2403.04195 | null |
2024-03-06 | Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling | Nicholas Kunz et.al. | 2403.03489 | link |
2024-03-06 | SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization | Juntong Chen et.al. | 2403.03449 | link |
2024-03-06 | Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health | Yuanzhe Huang et.al. | 2403.03414 | null |
2024-03-04 | Dynamic programming principle in cost-efficient sequential design: application to switching measurements | Jeongmin Han et.al. | 2403.02245 | null |
2024-03-04 | Cooperative and Interaction-aware Driver Model for Lane Change Maneuver | Jemin Woo et.al. | 2403.01752 | null |
2024-03-01 | DyPyBench: A Benchmark of Executable Python Software | Islem Bouzenia et.al. | 2403.00539 | link |
2024-03-01 | Graph Construction with Flexible Nodes for Traffic Demand Prediction | Jinyan Hou et.al. | 2403.00276 | link |
2024-02-29 | Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress | Ameya Prabhu et.al. | 2402.19472 | link |
2024-02-27 | Globally Convergent Distributed Sequential Quadratic Programming with Overlapping Decomposition and Exact Augmented Lagrangian Merit Function | Runxin Ni et.al. | 2402.17170 | null |
2024-02-24 | Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems | Abdelkarim Ben Sada et.al. | 2402.16904 | null |
2024-02-25 | IKLink: End-Effector Trajectory Tracking with Minimal Reconfigurations | Yeping Wang et.al. | 2402.16154 | link |
2024-02-25 | Evolving E-commerce Logistics Planning- Integrating Embedded Technology and Ant Colony Algorithm for Enhanced Efficiency | Lynn Huang et.al. | 2402.15965 | null |
2024-02-25 | Budget-Constrained Tool Learning with Planning | Yuanhang Zheng et.al. | 2402.15960 | link |
2024-02-23 | Neural optimal controller for stochastic systems via pathwise HJB operator | Zhe Jiao et.al. | 2402.15592 | null |
2024-02-23 | Curve fitting on a quantum annealer for an advanced navigation method | Philipp Isserstedt et.al. | 2402.15308 | null |
2024-02-22 | Quantum Markov Decision Processes Part II: Optimal Solutions and Algorithms | Naci Saldi et.al. | 2402.14651 | null |
2024-02-22 | Quantum Markov Decision Processes Part I: General Theory, Approximations, and Classes of Policies | Naci Saldi et.al. | 2402.14649 | null |
2024-02-21 | Quantum Annealing and Graph Neural Networks for Solving TSP with QUBO | Haoqi He et.al. | 2402.14036 | null |
2024-02-21 | Do Efficient Transformers Really Save Computation? | Kai Yang et.al. | 2402.13934 | null |
2024-02-21 | Benchmarking and Dissecting the Nvidia Hopper GPU Architecture | Weile Luo et.al. | 2402.13499 | null |
2024-02-20 | An Improved Lower Bound on the Number of Pseudoline Arrangements | Fernando Cortés Kühnast et.al. | 2402.13107 | null |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-19 | An algorithm for counting number of all (normal) fuzzy subgroups in $U_{6n}$ | Marek Hyčko et.al. | 2402.12543 | null |
2024-02-29 | Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Zhuoming Chen et.al. | 2402.12374 | link |
2024-02-19 | Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method | Zhijian Duan et.al. | 2402.11904 | null |
2024-02-19 | Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic | Jeremy J. Lin et.al. | 2402.11866 | null |
2024-02-18 | A Fisher Information based Receding Horizon Control Method for Signal Strength Model Estimation | Yancheng Zhu et.al. | 2402.11483 | null |
2024-02-16 | Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior | Hao Liu et.al. | 2402.10768 | null |
2024-02-15 | Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys | Augustin Bouquillard et.al. | 2402.10247 | null |
2024-02-14 | Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem | Wenhan Cao et.al. | 2402.09575 | null |
2024-02-13 | Approximate Sequential Optimization for Informative Path Planning | Joshua Ott et.al. | 2402.08841 | link |
2024-02-13 | Sequence graphs realizations and ambiguity in language models | Sammy Khalife et.al. | 2402.08830 | null |
2024-02-11 | GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of Feature Domains | Yan Lin et.al. | 2402.07232 | link |
2024-02-09 | High-Precision Geosteering via Reinforcement Learning and Particle Filters | Ressi Bonti Muhammad et.al. | 2402.06377 | null |
2024-02-09 | Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series | Zitong Yang et.al. | 2402.05203 | link |
2024-02-04 | Empowering Computing and Networks Convergence System with Distributed Cooperative Routing | Yujiao Hu et.al. | 2402.02381 | null |
2024-02-03 | Multiple sequences Prophet Inequality Under Observation Constraints | Aristomenis Tsopelakos et.al. | 2402.02059 | null |
2024-02-02 | Capturing waste collection planning expert knowledge in a fitness function through preference learning | Laura Fernández Díaz et.al. | 2402.01849 | null |
2024-02-02 | Dynamic programming for the stochastic matching model on general graphs: the case of the `N-graph’ | Loïc Jean et.al. | 2402.01803 | null |
2024-02-01 | AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems | Ruihan Zhou et.al. | 2402.00907 | null |
2024-02-01 | Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization | Zhanhong Tan et.al. | 2402.00629 | null |
2024-02-02 | Branch and Price for the Length-Constrained Cycle Partition Problem | Mohammed Ghannam et.al. | 2401.17937 | link |
2024-01-31 | Revisiting speech segmentation and lexicon learning with better features | Herman Kamper et.al. | 2401.17902 | null |
2024-02-16 | The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games | Jingqi Li et.al. | 2401.15745 | link |
2024-01-28 | HappyRouting: Learning Emotion-Aware Route Trajectories for Scalable In-The-Wild Navigation | David Bethge et.al. | 2401.15695 | null |
2024-01-28 | Constrained Markov decision processes for response-adaptive procedures in clinical trials with binary outcomes | Stef Baas et.al. | 2401.15694 | null |
2024-01-27 | Fair and Efficient Ridesharing: A Dynamic Programming-based Relocation Approach | Aqsa Ashraf Makhdomi et.al. | 2401.15363 | null |
2024-01-27 | Optimal Sparse Survival Trees | Rui Zhang et.al. | 2401.15330 | link |
2024-01-25 | Domain-Independent Dynamic Programming | Ryo Kuroiwa et.al. | 2401.13883 | link |
2024-01-27 | Deep multitask neural networks for solving some stochastic optimal control problems | Christian Yeo et.al. | 2401.12923 | link |
2024-01-23 | Optimal Stopping of Branching Diffusion Processes | Idris Kharroubi et.al. | 2401.12811 | null |
2024-01-22 | On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms | Sergey S. Ketkov et.al. | 2401.12010 | null |
2024-01-22 | Finite horizon optimal control of reaction-diffusion SIV epidemic system with stochastic environment | Zong Wang et.al. | 2401.11744 | null |
2024-01-20 | Closing the Gap between TD Learning and Supervised Learning – A Generalisation Point of View | Raj Ghugare et.al. | 2401.11237 | link |
Large Language Model
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-11 | Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | Tim Z. Xiao et.al. | 2506.09998 | null |
2025-06-11 | From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring | Yang Li et.al. | 2506.09996 | null |
2025-06-11 | Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages | Amel Muminovic et.al. | 2506.09992 | null |
2025-06-11 | Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation | Xinyu Yang et.al. | 2506.09991 | null |
2025-06-11 | EditInspector: A Benchmark for Evaluation of Text-Guided Image Edits | Ron Yosef et.al. | 2506.09988 | null |
2025-06-11 | A Shortcut-aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs | Benno Krojer et.al. | 2506.09987 | null |
2025-06-11 | V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning | Mido Assran et.al. | 2506.09985 | null |
2025-06-11 | Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Hiroshi Matsuda et.al. | 2506.09983 | null |
2025-06-11 | AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation | Zijie Wu et.al. | 2506.09982 | null |
2025-06-11 | SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance | Wentao Ge et.al. | 2506.09968 | null |
2025-06-11 | Resa: Transparent Reasoning Models via SAEs | Shangshang Wang et.al. | 2506.09967 | null |
2025-06-11 | Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing | Junfei Wu et.al. | 2506.09965 | null |
2025-06-11 | Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy | Sushant Gautam et.al. | 2506.09958 | null |
2025-06-11 | LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge | Sahar Abdelnabi et.al. | 2506.09956 | null |
2025-06-11 | Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking | Wuwei Zhang et.al. | 2506.09944 | null |
2025-06-11 | VerIF: Verification Engineering for Reinforcement Learning in Instruction Following | Hao Peng et.al. | 2506.09942 | null |
2025-06-11 | From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models | Irving Fang et.al. | 2506.09930 | null |
2025-06-11 | PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants | Zheng Zhao et.al. | 2506.09902 | null |
2025-06-11 | The Emergence of Abstract Thought in Large Language Models Beyond Any Language | Yuxin Chen et.al. | 2506.09890 | null |
2025-06-11 | Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs | Rodion Oblovatny et.al. | 2506.09886 | null |
2025-06-10 | VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Li Kang et.al. | 2506.09049 | null |
2025-06-10 | Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs | Yaniv Nikankin et.al. | 2506.09047 | null |
2025-06-10 | Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation | Xiaowen Ma et.al. | 2506.09046 | null |
2025-06-10 | Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Xuanchi Ren et.al. | 2506.09042 | null |
2025-06-10 | Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better | Dianyi Wang et.al. | 2506.09040 | null |
2025-06-10 | AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions | Polina Kirichenko et.al. | 2506.09038 | null |
2025-06-10 | FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed | Sizhe Dang et.al. | 2506.09034 | null |
2025-06-10 | Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Haozhen Zhang et.al. | 2506.09033 | null |
2025-06-10 | Do MIL Models Transfer? | Daniel Shao et.al. | 2506.09022 | null |
2025-06-10 | SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning | Ruiqi Zhang et.al. | 2506.09016 | null |
2025-06-10 | Learning to Reason Across Parallel Samples for LLM Reasoning | Jianing Qi et.al. | 2506.09014 | null |
2025-06-10 | Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models | Bei Chu et.al. | 2506.09002 | null |
2025-06-10 | Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Chenyu Lian et.al. | 2506.08990 | null |
2025-06-10 | SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning | Xiao Liang et.al. | 2506.08989 | null |
2025-06-10 | On Finetuning Tabular Foundation Models | Ivan Rubachev et.al. | 2506.08982 | null |
2025-06-10 | AdaDec: Uncertainty-Guided Adaptive Decoding for LLM-based Code Generation | Kaifeng He et.al. | 2506.08980 | null |
2025-06-10 | Propositional Logic for Probing Generalization in Neural Networks | Anna Langedijk et.al. | 2506.08978 | null |
2025-06-10 | Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System | Yuan Guo et.al. | 2506.08972 | null |
2025-06-10 | ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Amirreza Rouhi et.al. | 2506.08968 | null |
2025-06-10 | Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Ailin Huang et.al. | 2506.08967 | null |
2025-06-09 | GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior | Penghao Wu et.al. | 2506.08012 | null |
2025-06-09 | Play to Generalize: Learning to Reason Through Game Play | Yunfei Xie et.al. | 2506.08011 | null |
2025-06-09 | Vision Transformers Don’t Need Trained Registers | Nick Jiang et.al. | 2506.08010 | null |
2025-06-09 | Hidden in plain sight: VLMs overlook their visual representations | Stephanie Fu et.al. | 2506.08008 | null |
2025-06-09 | Reinforcement Pre-Training | Qingxiu Dong et.al. | 2506.08007 | null |
2025-06-09 | Reparameterized LLM Training via Orthogonal Equivalence Transformation | Zeju Qiu et.al. | 2506.08001 | null |
2025-06-09 | Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System | Fan Yang et.al. | 2506.07997 | null |
2025-06-09 | HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Hongzheng Chen et.al. | 2506.07972 | null |
2025-06-09 | CyberV: Cybernetics for Test-time Scaling in Video Understanding | Jiahao Meng et.al. | 2506.07971 | null |
2025-06-09 | SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence | Ziyang Gong et.al. | 2506.07966 | null |
2025-06-09 | Reinforcing Multimodal Understanding and Generation with Dual Self-rewards | Jixiang Hong et.al. | 2506.07963 | null |
2025-06-09 | Correlated Errors in Large Language Models | Elliot Kim et.al. | 2506.07962 | null |
2025-06-09 | BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models | Peiyan Li et.al. | 2506.07961 | null |
2025-06-09 | Language Models over Canonical Byte-Pair Encodings | Tim Vieira et.al. | 2506.07956 | null |
2025-06-09 | TokenBreak: Bypassing Text Classification Models Through Token Manipulation | Kasimir Schulz et.al. | 2506.07948 | null |
2025-06-09 | Statistical Hypothesis Testing for Auditing Robustness in Language Models | Paulius Rauba et.al. | 2506.07947 | null |
2025-06-09 | ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols | Arnav Sheth et.al. | 2506.07945 | null |
2025-06-09 | Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | Yizhen Li et.al. | 2506.07943 | null |
2025-06-09 | Adversarial Attack Classification and Robustness Testing for Large Language Models for Code | Yang Liu et.al. | 2506.07942 | null |
2025-06-09 | Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation | Christopher Subia-Waud et.al. | 2506.07940 | null |
2025-06-06 | TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation | Muhammad Sohail Danish et.al. | 2506.06281 | null |
2025-06-06 | Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Yuanzhe Hu et.al. | 2506.06280 | null |
2025-06-06 | CoMemo: LVLMs Need Image Context with Image Memory | Shi Liu et.al. | 2506.06279 | null |
2025-06-06 | Movie Facts and Fibs (MF $^2$ ): A Benchmark for Long Movie Understanding | Emmanouil Zaranis et.al. | 2506.06275 | null |
2025-06-06 | AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization | Mukur Gupta et.al. | 2506.06273 | null |
2025-06-06 | RecGPT: A Foundation Model for Sequential Recommendation | Yangqin Jiang et.al. | 2506.06270 | null |
2025-06-09 | Cartridges: Lightweight and general-purpose long context representations via self-study | Sabri Eyuboglu et.al. | 2506.06266 | null |
2025-06-06 | PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | Weizhi Zhang et.al. | 2506.06254 | null |
2025-06-06 | DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation | Jingyu Xiao et.al. | 2506.06251 | null |
2025-06-06 | Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models | Zahra Babaiee et.al. | 2506.06242 | null |
2025-06-06 | Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge | Yi Sui et.al. | 2506.06240 | null |
2025-06-06 | Explaining Matters: Leveraging Definitions and Semantic Expansion for Sexism Detection | Sahrish Khan et.al. | 2506.06238 | null |
2025-06-06 | Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study | Leon Mayer et.al. | 2506.06232 | null |
2025-06-06 | CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports | Peter Pirkelbauer et.al. | 2506.06227 | null |
2025-06-06 | PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems | Yi Huang et.al. | 2506.06226 | null |
2025-06-06 | GenIR: Generative Visual Feedback for Mental Image Retrieval | Diji Yang et.al. | 2506.06220 | null |
2025-06-06 | STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving | Christian Fruhwirth-Reisinger et.al. | 2506.06218 | null |
2025-06-06 | Corrector Sampling in Language Models | Itai Gat et.al. | 2506.06215 | null |
2025-06-06 | Can Theoretical Physics Research Benefit from Language Agents? | Sirui Lu et.al. | 2506.06214 | null |
2025-06-06 | PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts | Hengzhi Li et.al. | 2506.06211 | null |
2025-06-05 | Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets | Lei Hsiung et.al. | 2506.05346 | null |
2025-06-05 | SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs | Jiahui Wang et.al. | 2506.05344 | null |
2025-06-05 | Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning | Xingjian Ran et.al. | 2506.05341 | null |
2025-06-05 | Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models | Anirudh Bharadwaj et.al. | 2506.05339 | null |
2025-06-05 | VideoMolmo: Spatio-Temporal Grounding Meets Pointing | Ghazi Shazan Ahmad et.al. | 2506.05336 | null |
2025-06-05 | Search Arena: Analyzing Search-Augmented LLMs | Mihran Miroyan et.al. | 2506.05334 | null |
2025-06-05 | Unleashing Hour-Scale Video Training for Long Video-Language Understanding | Jingyang Lin et.al. | 2506.05332 | null |
2025-06-05 | MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning | Xinyan Chen et.al. | 2506.05331 | null |
2025-06-05 | LSM-2: Learning from Incomplete Wearable Sensor Data | Maxwell A. Xu et.al. | 2506.05321 | null |
2025-06-06 | Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs | Haoyuan Li et.al. | 2506.05318 | null |
2025-06-05 | Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay | Yifan Sun et.al. | 2506.05316 | null |
2025-06-05 | Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models | Taha Entesari et.al. | 2506.05314 | null |
2025-06-05 | ProRefine: Inference-time Prompt Refinement with Textual Feedback | Deepak Pandita et.al. | 2506.05305 | null |
2025-06-05 | Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos | Weifeng Lin et.al. | 2506.05302 | null |
2025-06-05 | Power Law Guided Dynamic Sifting for Efficient Attention | Nirav Koley et.al. | 2506.05300 | null |
2025-06-05 | Control Tax: The Price of Keeping AI in Check | Mikhail Terekhov et.al. | 2506.05296 | null |
2025-06-05 | Sample Complexity and Representation Ability of Test-time Scaling Paradigms | Baihe Huang et.al. | 2506.05295 | null |
2025-06-05 | EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? | Yuqian Yuan et.al. | 2506.05287 | null |
2025-06-05 | Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning | Nan Huo et.al. | 2506.05278 | null |
2025-06-06 | Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams | Mohammed Almutairi et.al. | 2506.05265 | null |
2025-06-04 | OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Junting Chen et.al. | 2506.04217 | null |
2025-06-04 | Language-Image Alignment with Fixed Text Encoders | Jingfeng Yang et.al. | 2506.04209 | null |
2025-06-04 | Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning | Shuang Chen et.al. | 2506.04207 | null |
2025-06-04 | EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation | Jinghan Jia et.al. | 2506.04205 | null |
2025-06-04 | Cascadia: A Cascade Serving System for Large Language Models | Youhe Jiang et.al. | 2506.04203 | null |
2025-06-04 | TracLLM: A Generic Framework for Attributing Long Context LLMs | Yanting Wang et.al. | 2506.04202 | null |
2025-06-04 | R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning | Qingfei Zhao et.al. | 2506.04185 | null |
2025-06-04 | SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models | Yuhao Wu et.al. | 2506.04180 | null |
2025-06-04 | SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling | Anhao Zhao et.al. | 2506.04179 | null |
2025-06-04 | Does Prompt Design Impact Quality of Data Imputation by LLMs? | Shreenidhi Srinivasan et.al. | 2506.04172 | null |
2025-06-04 | VISCA: Inferring Component Abstractions for Automated End-to-End Testing | Parsa Alian et.al. | 2506.04161 | null |
2025-06-04 | Image Editing As Programs with Diffusion Models | Yujia Hu et.al. | 2506.04158 | null |
2025-06-04 | A Dataset for Addressing Patient’s Information Needs related to Clinical Course of Hospitalization | Sarvesh Soni et.al. | 2506.04156 | null |
2025-06-04 | Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis | Kejian Zhu et.al. | 2506.04142 | null |
2025-06-04 | MMR-V: What’s Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos | Kejian Zhu et.al. | 2506.04141 | null |
2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
2025-06-04 | Recent Advances in Medical Image Classification | Loan Dao et.al. | 2506.04129 | null |
2025-06-04 | Guided Speculative Inference for Efficient Test-Time Alignment of LLMs | Jonathan Geuter et.al. | 2506.04118 | null |
2025-06-05 | Rectified Sparse Attention | Yutao Sun et.al. | 2506.04108 | null |
2025-06-04 | TextAtari: 100K Frames Game Playing with Language Agents | Wenhao Li et.al. | 2506.04098 | link |
2025-06-03 | Causal Estimation of Tokenisation Bias | Pietro Lesci et.al. | 2506.03149 | null |
2025-06-04 | UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation | Bin Lin et.al. | 2506.03147 | null |
2025-06-03 | Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM | Pralaypati Ta et.al. | 2506.03145 | null |
2025-06-03 | Not All Tokens Are Meant to Be Forgotten | Xiangyu Zhou et.al. | 2506.03142 | null |
2025-06-03 | SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation | Siqi Chen et.al. | 2506.03139 | null |
2025-06-03 | OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models | Mengdi Jia et.al. | 2506.03135 | null |
2025-06-03 | Native-Resolution Image Synthesis | Zidong Wang et.al. | 2506.03131 | null |
2025-06-03 | AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation | Lu Qiu et.al. | 2506.03126 | null |
2025-06-03 | AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation | Prashanth Vijayaraghavan et.al. | 2506.03122 | null |
2025-06-03 | Targeted Forgetting of Image Subgroups in CLIP Models | Zeliang Zhang et.al. | 2506.03117 | null |
2025-06-04 | Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback | Xiaoying Zhang et.al. | 2506.03106 | null |
2025-06-03 | Beyond Text Compression: Evaluating Tokenizers Across Scales | Jonas F. Lotz et.al. | 2506.03101 | null |
2025-06-03 | TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Chetwin Low et.al. | 2506.03099 | null |
2025-06-03 | EgoVLM: Policy Optimization for Egocentric Video Understanding | Ashwin Vinod et.al. | 2506.03097 | null |
2025-06-03 | DPO Learning with LLMs-Judge Signal for Computer Use Agents | Man Luo et.al. | 2506.03095 | null |
2025-06-03 | From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit | Valérie Costa et.al. | 2506.03093 | null |
2025-06-03 | Literary Evidence Retrieval via Long-Context Language Models | Katherine Thai et.al. | 2506.03090 | null |
2025-06-03 | StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs | Qijun Luo et.al. | 2506.03077 | null |
2025-06-03 | LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM | Roman Titkov et.al. | 2506.03073 | null |
2025-06-03 | EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models | Mingzhe Li et.al. | 2506.03067 | null |
2025-05-30 | ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | Yu Zhang et.al. | 2505.24875 | null |
2025-05-30 | The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models | Adam Stein et.al. | 2505.24874 | null |
2025-05-30 | ProxyThinker: Test-Time Guidance through Small Visual Reasoners | Zilin Xiao et.al. | 2505.24872 | null |
2025-05-30 | MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning | Yiqing Liang et.al. | 2505.24871 | null |
2025-05-30 | GenSpace: Benchmarking Spatially-Aware Image Generation | Zehan Wang et.al. | 2505.24870 | null |
2025-05-30 | SiLVR: A Simple Language-based Video Reasoning Framework | Ce Zhang et.al. | 2505.24869 | link |
2025-05-30 | Time Blindness: Why Video-Language Models Can’t See What Humans Can? | Ujjwal Upadhyay et.al. | 2505.24867 | null |
2025-05-30 | ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models | Mingjie Liu et.al. | 2505.24864 | link |
2025-05-30 | Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization | Joschka Braun et.al. | 2505.24859 | null |
2025-05-30 | Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | Heli Ben-Hamu et.al. | 2505.24857 | null |
2025-05-30 | MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning | Jingyan Shen et.al. | 2505.24846 | null |
2025-05-30 | Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning | Wanyun Xie et.al. | 2505.24844 | null |
2025-05-30 | Cascading Adversarial Bias from Injection to Distillation in Language Models | Harsh Chaudhari et.al. | 2505.24842 | null |
2025-05-30 | Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck | Yuwen Tan et.al. | 2505.24840 | null |
2025-05-30 | VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software | Brandon Man et.al. | 2505.24838 | null |
2025-06-02 | How much do language models memorize? | John X. Morris et.al. | 2505.24832 | null |
2025-05-30 | Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs | Juraj Vladika et.al. | 2505.24830 | null |
2025-05-30 | LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text | Li yunhan et.al. | 2505.24826 | null |
2025-05-30 | PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models | Yinggan Xu et.al. | 2505.24823 | null |
2025-05-30 | Bi-Manual Joint Camera Calibration and Scene Representation | Haozhan Tang et.al. | 2505.24819 | null |
2025-05-29 | TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | Yao Xiao et.al. | 2505.23769 | link |
2025-05-29 | Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought | Yunze Man et.al. | 2505.23766 | null |
2025-05-29 | From Chat Logs to Collective Insights: Aggregative Question Answering | Wentao Zhang et.al. | 2505.23765 | null |
2025-05-29 | MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence | Sihan Yang et.al. | 2505.23764 | null |
2025-05-29 | ZeroGUI: Automating Online GUI Learning at Zero Human Cost | Chenyu Yang et.al. | 2505.23762 | link |
2025-05-29 | Differential Information: An Information-Theoretic Perspective on Preference Optimization | Yunjae Won et.al. | 2505.23761 | null |
2025-05-29 | Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint | Heekyung Lee et.al. | 2505.23759 | link |
2025-05-29 | DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning | Ziyin Zhang et.al. | 2505.23754 | link |
2025-05-29 | ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | Akashah Shabbir et.al. | 2505.23752 | link |
2025-05-29 | Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences? | Paul Gölz et.al. | 2505.23749 | null |
2025-05-29 | Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence | Diankun Wu et.al. | 2505.23747 | null |
2025-05-29 | To Trust Or Not To Trust Your Vision-Language Model’s Prediction | Hao Dong et.al. | 2505.23745 | link |
2025-05-29 | LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization | Ronghuan Wu et.al. | 2505.23740 | null |
2025-05-29 | ATLAS: Learning to Optimally Memorize the Context at Test Time | Ali Behrouz et.al. | 2505.23735 | null |
2025-05-29 | Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time | Mohamad Chehade et.al. | 2505.23729 | null |
2025-05-29 | PixelThink: Towards Efficient Chain-of-Pixel Reasoning | Song Wang et.al. | 2505.23727 | null |
2025-05-29 | FMG-Det: Foundation Model Guided Robust Object Detection | Darryl Hannan et.al. | 2505.23726 | null |
2025-05-29 | MuLoCo: Muon is a practical inner optimizer for DiLoCo | Benjamin Thérien et.al. | 2505.23725 | null |
2025-05-29 | SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA | Minrui Luo et.al. | 2505.23724 | null |
2025-05-29 | ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | Zexi Liu et.al. | 2505.23723 | link |
2025-05-28 | Zero-Shot Vision Encoder Grafting via LLM Surrogates | Kaiyu Yue et.al. | 2505.22664 | link |
2025-05-28 | Training Free Stylized Abstraction | Aimon Rahman et.al. | 2505.22663 | null |
2025-05-28 | AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models | Feng Luo et.al. | 2505.22662 | null |
2025-05-28 | GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning | Qingchen Yu et.al. | 2505.22661 | null |
2025-05-29 | Maximizing Confidence Alone Improves Reasoning | Mihir Prabhudesai et.al. | 2505.22660 | null |
2025-05-28 | 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | Wenbo Hu et.al. | 2505.22657 | null |
2025-05-28 | Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | Michael Kirchhof et.al. | 2505.22655 | null |
2025-05-28 | VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models | Ce Zhang et.al. | 2505.22654 | null |
2025-05-28 | The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason | Ang Lv et.al. | 2505.22653 | null |
2025-05-28 | Sherlock: Self-Correcting Reasoning in Vision-Language Models | Yi Ding et.al. | 2505.22651 | null |
2025-05-28 | Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese | Hanjia Lyu et.al. | 2505.22645 | link |
2025-05-28 | Understanding (Un)Reliability of Steering Vectors in Language Models | Joschka Braun et.al. | 2505.22637 | null |
2025-05-28 | Learning Composable Chains-of-Thought | Fangcong Yin et.al. | 2505.22635 | null |
2025-05-28 | Spatial Knowledge Graph-Guided Multimodal Synthesis | Yida Xue et.al. | 2505.22633 | null |
2025-05-28 | Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs | Ziling Cheng et.al. | 2505.22630 | null |
2025-05-28 | Principled Out-of-Distribution Generalization via Simplicity | Jiawei Ge et.al. | 2505.22622 | null |
2025-05-28 | Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding | Chengyue Wu et.al. | 2505.22618 | null |
2025-05-28 | The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models | Ganqu Cui et.al. | 2505.22617 | null |
2025-05-28 | RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction | Yuchi Wang et.al. | 2505.22613 | null |
2025-05-28 | Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates | Haoning Xu et.al. | 2505.22608 | null |
2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
2025-05-27 | ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models | Dingming Li et.al. | 2505.21500 | null |
2025-05-27 | AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery | Haowei Wang et.al. | 2505.21499 | link |
2025-05-27 | Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment | Xiaojun Jia et.al. | 2505.21494 | link |
2025-05-27 | Reinforcing General Reasoning without Verifiers | Xiangxin Zhou et.al. | 2505.21493 | null |
2025-05-27 | Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming | Yang Yang et.al. | 2505.21486 | null |
2025-05-27 | Are Language Models Consequentialist or Deontological Moral Reasoners? | Keenan Samway et.al. | 2505.21479 | null |
2025-05-27 | Policy Optimized Text-to-Image Pipeline Design | Uri Gadot et.al. | 2505.21478 | null |
2025-05-27 | Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration | Mehrdad Fazli et.al. | 2505.21472 | null |
2025-05-27 | Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | Zijun Liu et.al. | 2505.21471 | link |
2025-05-27 | Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | Zhanqiu Hu et.al. | 2505.21467 | null |
2025-05-27 | ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models | Bozhou Li et.al. | 2505.21465 | null |
2025-05-27 | LazyVLM: Neuro-Symbolic Approach to Video Analytics | Xiangru Jian et.al. | 2505.21459 | null |
2025-05-27 | Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance | Shintaro Ozaki et.al. | 2505.21458 | null |
2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
2025-05-27 | Can Large Reasoning Models Self-Train? | Sheikh Shafayat et.al. | 2505.21444 | null |
2025-05-27 | Towards Better Instruction Following Retrieval Models | Yuchen Zhuang et.al. | 2505.21439 | null |
2025-05-27 | Hume: Introducing System-2 Thinking in Visual-Language-Action Model | Haoming Song et.al. | 2505.21432 | null |
2025-05-27 | Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning | Xianling Mu et.al. | 2505.21427 | null |
2025-05-27 | GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation | Naizhu Jin et.al. | 2505.21425 | null |
2025-05-26 | On Path to Multimodal Historical Reasoning: HistBench and HistAgent | Jiahao Qiu et.al. | 2505.20246 | link |
2025-05-26 | KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing | Rui Li et.al. | 2505.20245 | link |
2025-05-26 | It’s High Time: A Survey of Temporal Information Retrieval and Question Answering | Bhawna Piryani et.al. | 2505.20243 | null |
2025-05-26 | RedAHD: Reduction-Based End-to-End Automatic Heuristic Design with Large Language Models | Nguyen Thach et.al. | 2505.20242 | null |
2025-05-26 | DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning | Qi Cao et.al. | 2505.20241 | null |
2025-05-26 | Efficient Speech Translation through Model Compression and Knowledge Distillation | Yasmin Moslem et.al. | 2505.20237 | link |
2025-05-26 | Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models | Weihao Xuan et.al. | 2505.20236 | null |
2025-05-26 | FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models | Hao Kang et.al. | 2505.20225 | link |
2025-05-26 | Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | Yixin Cui et.al. | 2505.20223 | link |
2025-05-26 | Fine-grained List-wise Alignment for Generative Medication Recommendation | Chenxiao Fan et.al. | 2505.20218 | link |
2025-05-26 | Parameter-Efficient Fine-Tuning with Column Space Projection | Junseo Hwang et.al. | 2505.20211 | null |
2025-05-26 | How to Improve the Robustness of Closed-Source Models on NLI | Joe Stacey et.al. | 2505.20209 | null |
2025-05-26 | Evaluating Large Language Models for Code Review | Umut Cihan et.al. | 2505.20206 | null |
2025-05-26 | PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology | Jiabo Ma et.al. | 2505.20202 | null |
2025-05-26 | Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations | Mohit Chandra et.al. | 2505.20201 | null |
2025-05-26 | Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking | Pengxiang Li et.al. | 2505.20199 | link |
2025-05-26 | Temporal Sampling for Forgotten Reasoning in LLMs | Yuetai Li et.al. | 2505.20196 | link |
2025-05-26 | FunReason: Enhancing Large Language Models’ Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement | Bingguang Hao et.al. | 2505.20192 | link |
2025-05-26 | THiNK: Can Large Language Models Think-aloud? | Yongan Yu et.al. | 2505.20184 | link |
2025-05-26 | An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation | Shubham Gandhi et.al. | 2505.20182 | link |
2025-05-26 | Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs | Hanting Chen et.al. | 2505.20155 | null |
2025-05-26 | UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | Xueyan Zhang et.al. | 2505.20154 | null |
2025-05-26 | MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents | Ziming Wei et.al. | 2505.20148 | link |
2025-05-26 | FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities | Jin Wang et.al. | 2505.20147 | null |
2025-05-26 | SeMe: Training-Free Language Model Merging via Semantic Alignment | Jian Gu et.al. | 2505.20144 | null |
2025-05-26 | StructEval: Benchmarking LLMs’ Capabilities to Generate Structural Outputs | Jialin Yang et.al. | 2505.20139 | null |
2025-05-26 | AweDist: Attention-aware Embedding Distillation for New Input Token Embeddings | Konstantin Dobler et.al. | 2505.20133 | null |
2025-05-26 | Agentic 3D Scene Generation with Spatially Contextualized VLMs | Xinhang Liu et.al. | 2505.20129 | null |
2025-05-26 | Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Zhengliang Shi et.al. | 2505.20128 | link |
2025-05-26 | Agentic AI Process Observability: Discovering Behavioral Variability | Fabiana Fournier et.al. | 2505.20127 | null |
2025-05-26 | MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models | Anh Thai et.al. | 2505.20122 | null |
2025-05-26 | TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent | Dominik Meier et.al. | 2505.20118 | link |
2025-05-26 | Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi’s Zibaldone | Cristian Santini et.al. | 2505.20113 | null |
2025-05-26 | ResSVD: Residual Compensated SVD for Large Language Model Compression | Haolei Bai et.al. | 2505.20112 | null |
2025-05-26 | Language-Agnostic Suicidal Risk Detection Using Large Language Models | June-Woo Kim et.al. | 2505.20109 | null |
2025-05-26 | Adaptive Deep Reasoning: Triggering Deep Thinking When Needed | Yunhao Wang et.al. | 2505.20101 | null |
2025-05-26 | AdaTP: Attention-Debiased Token Pruning for Video Large Language Models | Fengyuan Sun et.al. | 2505.20100 | null |
2025-05-26 | Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities | Chuangtao Ma et.al. | 2505.20099 | link |
2025-05-26 | S2LPP: Small-to-Large Prompt Prediction across LLMs | Liang Cheng et.al. | 2505.20097 | null |
2025-05-26 | Multi-Domain Explainability of Preferences | Nitay Calderon et.al. | 2505.20088 | null |
2025-05-23 | Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs | Wafa Alghallabi et.al. | 2505.18152 | link |
2025-05-23 | First Finish Search: Efficient Test-Time Scaling in Large Language Models | Aradhye Agarwal et.al. | 2505.18149 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148 | null |
2025-05-23 | Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection | Mykola Trokhymovych et.al. | 2505.18136 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135 | link |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120 | null |
2025-05-23 | Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM | Zinuo Li et.al. | 2505.18110 | null |
2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$ -LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079 | null |
2025-05-22 | CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms | Shilin Yan et.al. | 2505.17020 | link |
2025-05-22 | Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework | Chenhao Zhang et.al. | 2505.17019 | link |
2025-05-22 | SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward | Kaixuan Fan et.al. | 2505.17018 | link |
2025-05-22 | Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO | Chengzhuo Tong et.al. | 2505.17017 | link |
2025-05-22 | Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models | Runsen Xu et.al. | 2505.17015 | null |
2025-05-22 | SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Haoning Wu et.al. | 2505.17012 | link |
2025-05-22 | R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | Huatong Song et.al. | 2505.17005 | link |
2025-05-22 | Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | Jin Jiang et.al. | 2505.16998 | link |
2025-05-22 | DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization | Chao Zhang et.al. | 2505.16995 | null |
2025-05-22 | Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding | Runpeng Yu et.al. | 2505.16990 | link |
2025-05-22 | T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning | Amartya Chakraborty et.al. | 2505.16986 | null |
2025-05-22 | UFT: Unifying Supervised and Reinforcement Fine-Tuning | Mingyang Liu et.al. | 2505.16984 | link |
2025-05-22 | LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding | Junlong Tong et.al. | 2505.16983 | link |
2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982 | null |
2025-05-22 | HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation | Weizhi Tang et.al. | 2505.16978 | link |
2025-05-22 | SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Yaxin Du et.al. | 2505.16975 | link |
2025-05-22 | CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark | Ahmed Heakl et.al. | 2505.16968 | link |
2025-05-22 | Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models | Junjie Xiong et.al. | 2505.16957 | null |
2025-05-22 | On Multilingual Encoder Language Model Compression for Low-Resource Languages | Daniil Gurgurov et.al. | 2505.16956 | null |
2025-05-22 | A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | Shengyu Feng et.al. | 2505.16952 | null |
2025-05-21 | InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | Yijie Zheng et.al. | 2505.15818 | link |
2025-05-21 | On the creation of narrow AI: hierarchy and nonlocality of neural network skills | Eric J. Michaud et.al. | 2505.15811 | link |
2025-05-21 | MMaDA: Multimodal Large Diffusion Language Models | Ling Yang et.al. | 2505.15809 | link |
2025-05-21 | The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation | Patrick Kahardipraja et.al. | 2505.15807 | link |
2025-05-21 | Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | Hwan Chang et.al. | 2505.15805 | link |
2025-05-21 | STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs | Zongzhao Li et.al. | 2505.15804 | null |
2025-05-21 | VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models | Yuchen Yan et.al. | 2505.15801 | null |
2025-05-21 | Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning | Taehoon Kim et.al. | 2505.15798 | null |
2025-05-21 | Reverse Engineering Human Preferences with Reinforcement Learning | Lisa Alazraki et.al. | 2505.15795 | null |
2025-05-21 | HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | Zhiwen Chen et.al. | 2505.15793 | null |
2025-05-21 | Large Language Models as Computable Approximations to Solomonoff Induction | Jun Wan et.al. | 2505.15784 | null |
2025-05-21 | dKV-Cache: The Cache for Diffusion Language Models | Xinyin Ma et.al. | 2505.15781 | link |
2025-05-21 | ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning | Changtai Zhu et.al. | 2505.15776 | link |
2025-05-21 | Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention | Huanxuan Liao et.al. | 2505.15774 | link |
2025-05-21 | MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | Cheng Yifan et.al. | 2505.15772 | null |
2025-05-21 | An Empirical Analysis of Vulnerability Detection Tools for Solidity Smart Contracts Using Line Level Manually Annotated Vulnerabilities | Francesco Salzano et.al. | 2505.15756 | null |
2025-05-21 | Exploring The Visual Feature Space for Multimodal Neural Decoding | Weihao Xia et.al. | 2505.15755 | null |
2025-05-21 | Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | Taiye Chen et.al. | 2505.15753 | null |
2025-05-21 | Multi-modal Integration Analysis of Alzheimer’s Disease Using Large Language Models and Knowledge Graphs | Kanan Kiguchi et.al. | 2505.15747 | null |
2025-05-21 | Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications | Dikshit Chauhan et.al. | 2505.15741 | null |
2025-05-20 | Language Models use Lookbacks to Track Beliefs | Nikhil Prakash et.al. | 2505.14685 | null |
2025-05-21 | Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | Haolei Xu et.al. | 2505.14684 | null |
2025-05-20 | Emerging Properties in Unified Multimodal Pretraining | Chaorui Deng et.al. | 2505.14683 | null |
2025-05-20 | UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation | Rui Tian et.al. | 2505.14682 | null |
2025-05-20 | UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | Xiaojie Gu et.al. | 2505.14679 | link |
2025-05-20 | Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning | Jiaer Xia et.al. | 2505.14677 | null |
2025-05-20 | Reward Reasoning Model | Jiaxin Guo et.al. | 2505.14674 | null |
2025-05-20 | UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens | Ruichuan An et.al. | 2505.14671 | null |
2025-05-20 | Quartet: Native FP4 Training Can Be Optimal for Large Language Models | Roberto L. Castro et.al. | 2505.14669 | link |
2025-05-20 | ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions | Bufang Yang et.al. | 2505.14668 | null |
2025-05-20 | Beyond Words: Multimodal LLM Knows When to Speak | Zikai Liao et.al. | 2505.14654 | null |
2025-05-21 | General-Reasoner: Advancing LLM Reasoning Across All Domains | Xueguang Ma et.al. | 2505.14652 | null |
2025-05-20 | Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits | Tiantian Feng et.al. | 2505.14648 | link |
2025-05-20 | CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | Anna C. Doris et.al. | 2505.14646 | link |
2025-05-21 | Think Only When You Need with Large Hybrid-Reasoning Models | Lingjie Jiang et.al. | 2505.14631 | null |
2025-05-20 | KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models | Fnu Mohbat et.al. | 2505.14629 | link |
2025-05-20 | Debating for Better Reasoning: An Unsupervised Multimodal Approach | Ashutosh Adhikari et.al. | 2505.14627 | null |
2025-05-20 | TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning | Zhangchen Xu et.al. | 2505.14625 | link |
2025-05-20 | Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs | Morgan Lindsay Heisler et.al. | 2505.14620 | null |
2025-05-20 | Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models | Sahar Abdelnabi et.al. | 2505.14617 | link |
2025-05-19 | CIE: Controlling Language Model Text Generations Using Continuous Signals | Vinay Samuel et.al. | 2505.13448 | link |
2025-05-19 | Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards | Xiaoyuan Liu et.al. | 2505.13445 | link |
2025-05-19 | ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models | Liyan Tang et.al. | 2505.13444 | null |
2025-05-19 | GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation | Abhay Deshpande et.al. | 2505.13441 | null |
2025-05-19 | Optimizing Anytime Reasoning via Budget Relative Policy Optimization | Penghui Qi et.al. | 2505.13438 | link |
2025-05-19 | SMOTExT: SMOTE meets Large Language Models | Mateusz Bystroński et.al. | 2505.13434 | null |
2025-05-19 | Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | Sifeng Shang et.al. | 2505.13430 | null |
2025-05-19 | MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision | Lingxiao Du et.al. | 2505.13427 | link |
2025-05-19 | G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Liang Chen et.al. | 2505.13426 | link |
2025-05-19 | Learnware of Language Models: Specialized Small Language Models Can Do Big | Zhi-Hao Tan et.al. | 2505.13425 | link |
2025-05-19 | Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard | Si-Yang Liu et.al. | 2505.13421 | null |
2025-05-19 | FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning | Zhuozhao Hu et.al. | 2505.13419 | link |
2025-05-19 | CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process | Jinhe Bi et.al. | 2505.13408 | null |
2025-05-19 | AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database | Rong Bian et.al. | 2505.13406 | null |
2025-05-19 | MR. Judge: Multimodal Reasoner as a Judge | Renjie Pi et.al. | 2505.13403 | null |
2025-05-19 | R3: Robust Rubric-Agnostic Reward Models | David Anugraha et.al. | 2505.13388 | link |
2025-05-19 | CompeteSMoE – Statistically Guaranteed Mixture of Experts Training via Competition | Nam V. Nguyen et.al. | 2505.13380 | link |
2025-05-19 | Thinkless: LLM Learns When to Think | Gongfan Fang et.al. | 2505.13379 | link |
2025-05-19 | Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots | Dan BW Choe et.al. | 2505.13376 | null |
2025-05-19 | Multi-Armed Bandits Meet Large Language Models | Djallel Bouneffouf et.al. | 2505.13355 | null |
2025-05-16 | Modeling cognitive processes of natural reading with transformer-based Language Models | Bruno Bianchi et.al. | 2505.11485 | null |
2025-05-16 | msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML | Zhaolan Huang et.al. | 2505.11483 | link |
2025-05-16 | Improving Assembly Code Performance with Large Language Models via Reinforcement Learning | Anjiang Wei et.al. | 2505.11480 | null |
2025-05-16 | HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages | Zhilin Wang et.al. | 2505.11475 | null |
2025-05-16 | Disentangling Reasoning and Knowledge in Medical Large Language Models | Rahul Thapa et.al. | 2505.11462 | null |
2025-05-16 | ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks | Zhixiong Zhuang et.al. | 2505.11459 | null |
2025-05-16 | LLMs unlock new paths to monetizing exploits | Nicholas Carlini et.al. | 2505.11449 | null |
2025-05-16 | Is Compression Really Linear with Code Intelligence? | Xianzhen Luo et.al. | 2505.11441 | null |
2025-05-16 | GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art | Chenkai Zhang et.al. | 2505.11436 | link |
2025-05-16 | MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production | Chao Jin et.al. | 2505.11432 | null |
2025-05-16 | Mergenetic: a Simple Evolutionary Model Merging Library | Adrian Robert Minut et.al. | 2505.11427 | link |
2025-05-16 | When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs | Xiaomin Li et.al. | 2505.11423 | null |
2025-05-16 | Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model | Phan Tran Minh Dat et.al. | 2505.11421 | null |
2025-05-16 | EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions | Patryk Bartkowiak et.al. | 2505.11417 | link |
2025-05-16 | MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems | Yinsicheng Jiang et.al. | 2505.11415 | null |
2025-05-16 | CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs | Sijia Chen et.al. | 2505.11413 | null |
2025-05-16 | Visual Planning: Let’s Think Only with Images | Yi Xu et.al. | 2505.11409 | link |
2025-05-16 | Large Language Model Use Impact Locus of Control | Jenny Xiyu Fu et.al. | 2505.11406 | null |
2025-05-16 | EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models | Bohao Xing et.al. | 2505.11405 | link |
2025-05-16 | Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner | Wenchuan Zhang et.al. | 2505.11404 | link |
2025-05-15 | End-to-End Vision Tokenizer Tuning | Wenxuan Wang et.al. | 2505.10562 | null |
2025-05-15 | Neural Thermodynamic Laws for Large Language Model Training | Ziming Liu et.al. | 2505.10559 | null |
2025-05-15 | Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data | Yiwen Liu et.al. | 2505.10551 | link |
2025-05-15 | Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning | Milan Ganai et.al. | 2505.10547 | null |
2025-05-15 | Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | Annie Wong et.al. | 2505.10543 | link |
2025-05-15 | Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis | Pengfei Wang et.al. | 2505.10541 | link |
2025-05-15 | S3C2 Summit 2024-09: Industry Secure Software Supply Chain Summit | Imranur Rahman et.al. | 2505.10538 | null |
2025-05-15 | WorldPM: Scaling Human Preference Modeling | Binghai Wang et.al. | 2505.10527 | link |
2025-05-15 | MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models | Mugilan Ganesan et.al. | 2505.10526 | null |
2025-05-15 | Multi-Token Prediction Needs Registers | Anastasios Gerontopoulos et.al. | 2505.10518 | link |
2025-05-15 | RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs | Vibha Belavadi et.al. | 2505.10495 | null |
2025-05-15 | Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective | Yutao Mou et.al. | 2505.10494 | link |
2025-05-15 | CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning | Shaohan Wang et.al. | 2505.10493 | null |
2025-05-15 | Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns | Leon Hannig et.al. | 2505.10490 | null |
2025-05-15 | Parallel Scaling Law for Language Models | Mouxiang Chen et.al. | 2505.10475 | link |
2025-05-15 | Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI | Agnik Saha et.al. | 2505.10472 | null |
2025-05-15 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge | Ranjan Sapkota et.al. | 2505.10468 | null |
2025-05-15 | Superposition Yields Robust Neural Scaling | Yizhou liu et.al. | 2505.10465 | link |
2025-05-15 | Vision language models have difficulty recognizing virtual objects | Tyler Tran et.al. | 2505.10453 | null |
2025-05-15 | Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models | Zemin Huang et.al. | 2505.10446 | null |
2025-05-14 | Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? | Anthony GX-Chen et.al. | 2505.09614 | null |
2025-05-14 | Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors | Nicolas Dupuis et.al. | 2505.09610 | null |
2025-05-14 | Adversarial Suffix Filtering: a Defense Pipeline for LLMs | David Khachaturov et.al. | 2505.09602 | null |
2025-05-14 | How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference | Nidhal Jegham et.al. | 2505.09598 | null |
2025-05-14 | WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models | Abdullah Mushtaq et.al. | 2505.09595 | null |
2025-05-14 | Variational Visual Question Answering | Tobias Jan Wieczorek et.al. | 2505.09591 | null |
2025-05-15 | Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media | Yuchen Wu et.al. | 2505.09583 | null |
2025-05-14 | VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation | Chaofan Zhang et.al. | 2505.09577 | null |
2025-05-14 | Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach | Shannon Lodoen et.al. | 2505.09576 | null |
2025-05-14 | MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8 | Linbo Liu et.al. | 2505.09569 | link |
2025-05-14 | Using Foundation Models as Pseudo-Label Generators for Pre-Clinical 4D Cardiac CT Segmentation | Anne-Marie Rickmann et.al. | 2505.09564 | null |
2025-05-14 | WavReward: Spoken Dialogue Models With Generalist Reward Evaluators | Shengpeng Ji et.al. | 2505.09558 | link |
2025-05-14 | PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | Zongqian Li et.al. | 2505.09519 | link |
2025-05-15 | Towards Fair In-Context Learning with Tabular Foundation Models | Patrik Kenfack et.al. | 2505.09503 | null |
2025-05-14 | Layered Unlearning for Adversarial Relearning | Timothy Qian et.al. | 2505.09500 | link |
2025-05-14 | Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput | Bo Zhang et.al. | 2505.09498 | null |
2025-05-14 | Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models | Eduard Kuric et.al. | 2505.09478 | null |
2025-05-14 | Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities | Zachary Ravichandran et.al. | 2505.09477 | null |
2025-05-14 | Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment | Paul Tschisgale et.al. | 2505.09438 | null |
2025-05-14 | CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios | Raghav Garg et.al. | 2505.09436 | link |
2025-05-13 | CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | Shanda Li et.al. | 2505.08783 | link |
2025-05-13 | HealthBench: Evaluating Large Language Models Towards Improved Human Health | Rahul K. Arora et.al. | 2505.08775 | link |
2025-05-14 | Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology | Yatai Ji et.al. | 2505.08765 | null |
2025-05-13 | Aya Vision: Advancing the Frontier of Multilingual Multimodality | Saurabh Dash et.al. | 2505.08751 | null |
2025-05-13 | AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models | Yanxi Zhang et.al. | 2505.08750 | link |
2025-05-13 | DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models | Xiaoyang Chen et.al. | 2505.08744 | link |
2025-05-13 | Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies | Xiaoliang Luo et.al. | 2505.08739 | link |
2025-05-13 | Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data | James Giroux et.al. | 2505.08736 | link |
2025-05-13 | NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context | Ben Yao et.al. | 2505.08734 | null |
2025-05-13 | Securing RAG: A Risk Assessment and Mitigation Framework | Lukas Ammann et.al. | 2505.08728 | null |
2025-05-13 | Memorization-Compression Cycles Improve Generalization | Fangyuan Yu et.al. | 2505.08727 | null |
2025-05-13 | Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Zongchuang Zhao et.al. | 2505.08725 | link |
2025-05-13 | TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series | Xiaolei Qin et.al. | 2505.08723 | link |
2025-05-13 | PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts | Yang Su et.al. | 2505.08719 | null |
2025-05-13 | Controllable Image Colorization with Instance-aware Texts and Masks | Yanru An et.al. | 2505.08705 | null |
2025-05-13 | LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs | K M Sajjadul Islam et.al. | 2505.08704 | null |
2025-05-14 | Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities | George Saon et.al. | 2505.08699 | null |
2025-05-13 | VizCV: AI-assisted visualization of researchers’ publications tracks | Vladimír Lazárik et.al. | 2505.08691 | null |
2025-05-13 | Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation | Sheng Liang et.al. | 2505.08690 | null |
2025-05-13 | A Social Robot with Inner Speech for Dietary Guidance | Valerio Belcamino et.al. | 2505.08664 | link |
2025-05-12 | DanceGRPO: Unleashing GRPO on Visual Generation | Zeyue Xue et.al. | 2505.07818 | null |
2025-05-12 | Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models | Seungjae Lee et.al. | 2505.07815 | null |
2025-05-12 | Learning Dynamics in Continual Pre-Training for Large Language Models | Xingjin Wang et.al. | 2505.07796 | null |
2025-05-12 | Domain Regeneration: How well do LLMs match syntactic properties of text domains? | Da Ju et.al. | 2505.07784 | null |
2025-05-12 | Relative Overfitting and Accept-Reject Framework | Yanxin Liu et.al. | 2505.07783 | null |
2025-05-12 | MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering | Rushi Qiang et.al. | 2505.07782 | link |
2025-05-12 | Must Read: A Systematic Survey of Computational Persuasion | Nimet Beyza Bozdag et.al. | 2505.07775 | link |
2025-05-12 | Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving | Xinji Mai et.al. | 2505.07773 | link |
2025-05-12 | Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding | Yifeng Di et.al. | 2505.07768 | link |
2025-05-12 | BodyGPS: Anatomical Positioning System | Halid Ziya Yerebakan et.al. | 2505.07744 | null |
2025-05-12 | Assessing the Chemical Intelligence of Large Language Models | Nicholas T. Runcie et.al. | 2505.07735 | link |
2025-05-12 | Spoken Language Understanding on Unseen Tasks With In-Context Learning | Neeraj Agrawal et.al. | 2505.07731 | null |
2025-05-12 | Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction | Jingfen Qiao et.al. | 2505.07730 | link |
2025-05-12 | Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations | Pranav Sinha et.al. | 2505.07711 | null |
2025-05-12 | Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images | Elisei Rykov et.al. | 2505.07704 | null |
2025-05-12 | PatchTrack: A Comprehensive Analysis of ChatGPT’s Influence on Pull Request Outcomes | Daniel Ogenrwot et.al. | 2505.07700 | null |
2025-05-12 | Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models | Songlin Dong et.al. | 2505.07690 | null |
2025-05-12 | S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models | Muzhi Dai et.al. | 2505.07686 | null |
2025-05-12 | Multimodal Survival Modeling in the Age of Foundation Models | Steven Song et.al. | 2505.07683 | link |
2025-05-12 | SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models | Hang Wu et.al. | 2505.07680 | null |
2025-05-09 | Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks | Christos Plachouras et.al. | 2505.06224 | link |
2025-05-09 | Adapting a Segmentation Foundation Model for Medical Image Classification | Pengfei Gu et.al. | 2505.06217 | null |
2025-05-09 | From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling | Vahid Rahimzadeh et.al. | 2505.06184 | null |
2025-05-09 | A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows | Linjiang Cao et.al. | 2505.06178 | null |
2025-05-09 | MonetGPT: Solving Puzzles Enhances MLLMs’ Image Retouching Skills | Niladri Shekhar Dutt et.al. | 2505.06176 | null |
2025-05-09 | Turbo-ICL: In-Context Learning-Based Turbo Equalization | Zihang Song et.al. | 2505.06175 | null |
2025-05-09 | MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | Wenqi Zeng et.al. | 2505.06152 | link |
2025-05-09 | A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets | Ryan Lagasse et.al. | 2505.06150 | null |
2025-05-09 | Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study | Faeze Ghorbanpour et.al. | 2505.06149 | null |
2025-05-09 | LLMs Get Lost In Multi-Turn Conversation | Philippe Laban et.al. | 2505.06120 | link |
2025-05-09 | LLMs Outperform Experts on Challenging Biology Benchmarks | Lennart Justen et.al. | 2505.06108 | null |
2025-05-09 | Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs | Sam Bush et.al. | 2505.06096 | null |
2025-05-09 | Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities | Hiari Pizzini Cavagna et.al. | 2505.06085 | null |
2025-05-09 | Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information | Joshua Harris et.al. | 2505.06046 | null |
2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | link |
2025-05-09 | Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation | Stefan Vasilev et.al. | 2505.06027 | null |
2025-05-09 | ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding | Shuai Wang et.al. | 2505.06020 | null |
2025-05-09 | Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models | Dawid Wisniewski et.al. | 2505.06004 | link |
2025-05-09 | Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition | Congqi Cao et.al. | 2505.06002 | link |
2025-05-09 | Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models | Lennart Stöpler et.al. | 2505.05970 | null |
2025-05-08 | Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation | Chao Liao et.al. | 2505.05472 | null |
2025-05-08 | Generating Physically Stable and Buildable LEGO Designs from Text | Ava Pun et.al. | 2505.05469 | link |
2025-05-08 | StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | Haibo Wang et.al. | 2505.05467 | null |
2025-05-08 | ComPO: Preference Alignment via Comparison Oracles | Peter Chen et.al. | 2505.05465 | null |
2025-05-08 | Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging | Shiqi Chen et.al. | 2505.05464 | link |
2025-05-08 | UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections | Fatima Haouari et.al. | 2505.05459 | null |
2025-05-08 | SITE: towards Spatial Intelligence Thorough Evaluation | Wenqi Wang et.al. | 2505.05456 | null |
2025-05-08 | Conversational Process Model Redesign | Nataliia Klievtsova et.al. | 2505.05453 | null |
2025-05-08 | clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | Chalamalasetti Kranti et.al. | 2505.05445 | null |
2025-05-08 | GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality | Xiyun Hu et.al. | 2505.05441 | null |
2025-05-09 | EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | Biao Yi et.al. | 2505.05440 | null |
2025-05-08 | Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data | Yudong Wang et.al. | 2505.05427 | null |
2025-05-09 | LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering | Ran Zhang et.al. | 2505.05423 | link |
2025-05-08 | Crosslingual Reasoning through Test-Time Scaling | Zheng-Xin Yong et.al. | 2505.05408 | link |
2025-05-08 | Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? | Valeria Pastorino et.al. | 2505.05406 | null |
2025-05-08 | A Pain Assessment Framework based on multimodal data and Deep Machine Learning methods | Stefanos Gkikas et.al. | 2505.05396 | null |
2025-05-08 | DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning | Wenru Liu et.al. | 2505.05360 | null |
2025-05-08 | Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | Sooyoung Park et.al. | 2505.05343 | link |
2025-05-08 | FLAM: Frame-Wise Language-Audio Modeling | Yusong Wu et.al. | 2505.05335 | null |
2025-05-08 | ICon: In-Context Contribution for Automatic Data Selection | Yixin Yang et.al. | 2505.05327 | null |
2025-05-07 | EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning | Zhenghao Xing et.al. | 2505.04623 | link |
2025-05-07 | On Path to Multimodal Generalist: General-Level and General-Bench | Hao Fei et.al. | 2505.04620 | null |
2025-05-07 | OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution | Lianghong Guo et.al. | 2505.04606 | link |
2025-05-07 | OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning | Xianhang Li et.al. | 2505.04601 | null |
2025-05-08 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-07 | ZeroSearch: Incentivize the Search Capability of LLMs without Searching | Hao Sun et.al. | 2505.04588 | link |
2025-05-07 | SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions | Chloe Qianhui Zhao et.al. | 2505.04584 | link |
2025-05-07 | Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization | Wenjun Cao et.al. | 2505.04578 | null |
2025-05-07 | Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules | Michail Theologitis et.al. | 2505.04535 | link |
2025-05-07 | Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review | Josh McGiff et.al. | 2505.04531 | null |
2025-05-07 | Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development | Kuen Sum Cheung et.al. | 2505.04521 | null |
2025-05-07 | Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs | Yehui Tang et.al. | 2505.04519 | null |
2025-05-07 | “I Can See Forever!”: Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments | Ziyi Zhang et.al. | 2505.04488 | null |
2025-05-07 | CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation | Jiahao Li et.al. | 2505.04481 | null |
2025-05-07 | TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution | Zhikai Zhao et.al. | 2505.04480 | link |
2025-05-07 | Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration | Shigeki Karita et.al. | 2505.04457 | link |
2025-05-07 | M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation | Qianru Zhang et.al. | 2505.04445 | null |
2025-05-07 | Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs | Mirazul Haque et.al. | 2505.04441 | null |
2025-05-07 | OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models | Xiaoyu Xu et.al. | 2505.04416 | null |
2025-05-07 | DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2505.04410 | link |
2025-05-06 | VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model | Zuwei Long et.al. | 2505.03739 | link |
2025-05-06 | Decentralized Nonconvex Optimization under Heavy-Tailed Noise: Normalization and Optimal Convergence | Shuhua Yu et.al. | 2505.03736 | null |
2025-05-06 | Meta-Optimization and Program Search using Language Models for Task and Motion Planning | Denis Shcherba et.al. | 2505.03725 | null |
2025-05-06 | Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning | François Role et.al. | 2505.03703 | null |
2025-05-06 | Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech | Susmita Bhattacharjee et.al. | 2505.03697 | null |
2025-05-06 | Graph Drawing for LLMs: An Empirical Evaluation | Walter Didimo et.al. | 2505.03678 | null |
2025-05-06 | Distribution-Conditional Generation: From Class Distribution to Creative Generation | Fu Feng et.al. | 2505.03667 | null |
2025-05-06 | Binding threshold units with artificial oscillatory neurons | Vladimir Fanaskov et.al. | 2505.03648 | link |
2025-05-06 | PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing | Yiping Xie et.al. | 2505.03621 | null |
2025-05-06 | Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images | Fangling Jiang et.al. | 2505.03611 | null |
2025-05-06 | Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection | Fangling Jiang et.al. | 2505.03610 | null |
2025-05-06 | DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes | Sergey Linok et.al. | 2505.03581 | link |
2025-05-06 | LlamaFirewall: An open source guardrail system for building secure AI agents | Sahana Chennabasappa et.al. | 2505.03574 | null |
2025-05-06 | Say It Another Way: A Framework for User-Grounded Paraphrasing | Cléa Chataigner et.al. | 2505.03563 | null |
2025-05-06 | A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges | Feibo Jiang et.al. | 2505.03556 | link |
2025-05-06 | A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning | Kolawole E. Ogunsina et.al. | 2505.03553 | null |
2025-05-06 | STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game | Eric Zhou et.al. | 2505.03547 | null |
2025-05-06 | Faster MoE LLM Inference for Extremely Large Models | Haoqi Yang et.al. | 2505.03531 | null |
2025-05-06 | Ruled by the Representation Space: On the University’s Embrace of Large Language Models | Katia Schwerzmann et.al. | 2505.03513 | null |
2025-05-06 | BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models | Zihan Wang et.al. | 2505.03501 | null |
2025-05-05 | Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation | Lu Ling et.al. | 2505.02836 | null |
2025-05-05 | R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning | Yi-Fan Zhang et.al. | 2505.02835 | link |
2025-05-05 | No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves | Dengyang Jiang et.al. | 2505.02831 | link |
2025-05-05 | LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery | Jerome Quenum et.al. | 2505.02829 | null |
2025-05-05 | ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations | Dmitriy Shopkhoev et.al. | 2505.02819 | link |
2025-05-05 | Knowing You Don’t Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | Diji Yang et.al. | 2505.02811 | link |
2025-05-05 | Towards Quantifying the Hessian Structure of Neural Networks | Zhaorui Dong et.al. | 2505.02809 | link |
2025-05-05 | Generating HomeAssistant Automations Using an LLM-based Chatbot | Mathyas Giudici et.al. | 2505.02802 | null |
2025-05-05 | HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models | Zheng Lin et.al. | 2505.02795 | null |
2025-05-05 | Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control | Nam H. Le et.al. | 2505.02766 | null |
2025-05-05 | Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models | Matthew Dahl et.al. | 2505.02763 | null |
2025-05-05 | Using Knowledge Graphs to harvest datasets for efficient CLIP model training | Simon Ging et.al. | 2505.02746 | link |
2025-05-06 | Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation | Gerard Pons et.al. | 2505.02737 | null |
2025-05-05 | FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | Zhouliang Yu et.al. | 2505.02735 | link |
2025-05-05 | Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry | Junu Kim et.al. | 2505.02722 | link |
2025-05-05 | Less is More: Efficient Weight Farcasting with 1-Layer Neural Network | Xiao Shou et.al. | 2505.02714 | null |
2025-05-05 | Technical Report: Evaluating Goal Drift in Language Model Agents | Rauno Arike et.al. | 2505.02709 | null |
2025-05-05 | Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Yemin Shi et.al. | 2505.02707 | link |
2025-05-05 | AI Standardized Patient Improves Human Conversations in Advanced Cancer Care | Kurtis Haut et.al. | 2505.02694 | link |
2025-05-05 | Predicting Movie Hits Before They Happen with LLMs | Shaghayegh Agah et.al. | 2505.02693 | null |
2025-05-02 | How Effective are Large Time Series Models in Hydrology? A Study on Water Level Forecasting in Everglades | Rahuul Rangaraj et.al. | 2505.01415 | null |
2025-05-02 | Dynamic Robot Tool Use with Vision Language Models | Noah Trupin et.al. | 2505.01399 | null |
2025-05-02 | FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | Chenxi Li et.al. | 2505.01322 | null |
2025-05-02 | Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System | Sheikh Samit Muhaimin et.al. | 2505.01315 | null |
2025-05-02 | Enhancing SPARQL Query Rewriting for Complex Ontology Alignments | Anicet Lepetit Ondo et.al. | 2505.01309 | null |
2025-05-02 | Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | Regan Bolton et.al. | 2505.01307 | null |
2025-05-02 | FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing | Gaoxiang Cong et.al. | 2505.01263 | null |
2025-05-02 | Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research | Flavio Lichtenstein et.al. | 2505.01259 | null |
2025-05-02 | Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging | Elena Mulero Ayllón et.al. | 2505.01239 | null |
2025-05-02 | CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning | Tsai-Ning Wang et.al. | 2505.01199 | null |
2025-05-02 | Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods | Mahdi Dhaini et.al. | 2505.01198 | link |
2025-05-05 | TSTMotion: Training-free Scene-aware Text-to-motion Generation | Ziyan Guo et.al. | 2505.01182 | null |
2025-05-02 | LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures | Francisco Aguilera-Martínez et.al. | 2505.01177 | null |
2025-05-02 | On the Limitations of Steering in Language Model Alignment | Chebrolu Niranjan et.al. | 2505.01162 | null |
2025-05-02 | Methodological Foundations for AI-Driven Survey Question Generation | Ted K. Mburu et.al. | 2505.01150 | null |
2025-05-02 | Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications | Jiawei He et.al. | 2505.01146 | null |
2025-05-02 | MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning | Murtadha Ahmed et.al. | 2505.01110 | null |
2025-05-02 | Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study | Ali Mammadov et.al. | 2505.01109 | link |
2025-05-02 | Nesterov Method for Asynchronous Pipeline Parallel Optimization | Thalaiyasingam Ajanthan et.al. | 2505.01099 | link |
2025-05-02 | Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages | Marco Salmè et.al. | 2505.01096 | null |
2025-05-01 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | Dongzhi Jiang et.al. | 2505.00703 | link |
2025-05-01 | Robotic Visual Instruction | Yanbang Li et.al. | 2505.00693 | null |
2025-05-01 | Visual Test-time Scaling for GUI Agent Grounding | Tiange Luo et.al. | 2505.00684 | link |
2025-05-01 | Steering Large Language Models with Register Analysis for Arbitrary Style Transfer | Xinchen Yang et.al. | 2505.00679 | null |
2025-05-01 | Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions | Yiming Du et.al. | 2505.00675 | link |
2025-05-01 | DeepCritic: Deliberate Critique with Large Language Models | Wenkai Yang et.al. | 2505.00662 | link |
2025-05-01 | On the generalization of language models from in-context learning and finetuning: a controlled study | Andrew K. Lampinen et.al. | 2505.00661 | null |
2025-05-01 | Large Language Models Understanding: an Inherent Ambiguity Barrier | Daniel N. Nissani et.al. | 2505.00654 | null |
2025-05-01 | Open-Source LLM-Driven Federated Transformer for Predictive IoV Management | Yazan Otoum et.al. | 2505.00651 | null |
2025-05-01 | Investigating Task Arithmetic for Zero-Shot Information Retrieval | Marco Braga et.al. | 2505.00649 | link |
2025-05-01 | Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis | Zhongying Deng et.al. | 2505.00627 | null |
2025-05-01 | The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) | Zihao Wang et.al. | 2505.00626 | null |
2025-05-01 | FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation | Chaitali Bhattacharyya et.al. | 2505.00624 | null |
2025-05-01 | Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction | Simon Giebenhain et.al. | 2505.00615 | null |
2025-05-01 | Combining LLMs with Logic-Based Framework to Explain MCTS | Ziyan An et.al. | 2505.00610 | null |
2025-05-01 | Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 | Phanish Puranam et.al. | 2505.00603 | null |
2025-05-02 | Fast and Low-Cost Genomic Foundation Models via Outlier Removal | Haozheng Luo et.al. | 2505.00598 | link |
2025-05-01 | Block Circulant Adapter for Large Language Models | Xinyu Ding et.al. | 2505.00582 | null |
2025-05-01 | Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors | Xinyu Ding et.al. | 2505.00580 | null |
2025-05-01 | FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension | Jushi Kai et.al. | 2505.00570 | null |
2025-04-30 | TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments | Sichang Tu et.al. | 2504.21851 | null |
2025-04-30 | COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning | Xindi Wu et.al. | 2504.21850 | null |
2025-04-30 | Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization | Anas Anwarul Haq Khan et.al. | 2504.21831 | null |
2025-04-30 | Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields | Yixin Gao et.al. | 2504.21814 | null |
2025-04-30 | A simple and effective approach for body part recognition on CT scans based on projection estimation | Franko Hrzic et.al. | 2504.21810 | null |
2025-04-30 | An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding | Xiuwei Shang et.al. | 2504.21803 | null |
2025-04-30 | DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Z. Z. Ren et.al. | 2504.21801 | link |
2025-04-30 | SWE-smith: Scaling Data for Software Engineering Agents | John Yang et.al. | 2504.21798 | null |
2025-04-30 | MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness | Junsheng Huang et.al. | 2504.21773 | null |
2025-04-30 | LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs | Baleegh Ahmad et.al. | 2504.21770 | null |
2025-04-30 | LLM-based Interactive Imitation Learning for Robotic Manipulation | Jonas Werner et.al. | 2504.21769 | link |
2025-04-30 | Investigating Literary Motifs in Ancient and Medieval Novels with Large Language Models | Emelie Hallenberg et.al. | 2504.21742 | null |
2025-04-30 | TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training | Shengqian Wang et.al. | 2504.21735 | null |
2025-04-30 | XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs | Marco Arazzi et.al. | 2504.21700 | null |
2025-04-30 | Visual Text Processing: A Comprehensive Review and Unified Evaluation | Yan Shu et.al. | 2504.21682 | link |
2025-04-30 | Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs | Pan Suo et.al. | 2504.21680 | null |
2025-04-30 | Traceback of Poisoning Attacks to Retrieval-Augmented Generation | Baolei Zhang et.al. | 2504.21668 | null |
2025-04-30 | From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising | Jingwen Cai et.al. | 2504.21667 | null |
2025-04-30 | AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization | Haotian Luo et.al. | 2504.21659 | link |
2025-04-30 | Sadeed: Advancing Arabic Diacritization Through Small Language Model | Zeina Aldallal et.al. | 2504.21635 | null |
2025-04-29 | Toward Efficient Exploration by Large Language Model Agents | Dilip Arumugam et.al. | 2504.20997 | null |
2025-04-29 | X-Fusion: Introducing New Modality to Frozen Large Language Models | Sicheng Mo et.al. | 2504.20996 | null |
2025-04-29 | ACE: A Security Architecture for LLM-Integrated App Systems | Evan Li et.al. | 2504.20984 | null |
2025-04-29 | Real-Time Wayfinding Assistant for Blind and Low-Vision Users | Dabbrata Das et.al. | 2504.20976 | null |
2025-04-29 | SetKE: Knowledge Editing for Knowledge Elements Overlap | Yifan Wei et.al. | 2504.20972 | null |
2025-04-29 | OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification | Shangyu Li et.al. | 2504.20964 | link |
2025-04-29 | Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models | Maryna Vyshnyvetska et.al. | 2504.20951 | null |
2025-04-29 | Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models | Tyler McDonald et.al. | 2504.20946 | null |
2025-04-29 | ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification | Ziqing Fan et.al. | 2504.20930 | link |
2025-04-29 | An Empirical Study on the Capability of LLMs in Decomposing Bug Reports | Zhiyuan Chen et.al. | 2504.20911 | null |
2025-04-29 | Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers | Quentin Guimard et.al. | 2504.20902 | null |
2025-04-29 | LELANTE: LEveraging LLM for Automated ANdroid TEsting | Shamit Fatin et.al. | 2504.20896 | null |
2025-04-29 | FedMVP: Federated Multi-modal Visual Prompt Tuning for Vision-Language Models | Mainak Singha et.al. | 2504.20860 | null |
2025-04-29 | X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation | Guy Hadad et.al. | 2504.20859 | null |
2025-04-29 | JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry | Anum Afzal et.al. | 2504.20849 | null |
2025-04-29 | Language Model for Large-Text Transmission in Noisy Quantum Communications | Yuqi Li et.al. | 2504.20842 | null |
2025-04-29 | Universal language model with the intervention of quantum theory | D. -F. Qin et.al. | 2504.20839 | null |
2025-04-29 | Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning | Hongfei Xue et.al. | 2504.20835 | null |
2025-04-29 | Reinforcement Learning for LLM Reasoning Under Memory Constraints | Alan Lee et.al. | 2504.20834 | null |
2025-04-30 | Ascendra: Dynamic Request Prioritization for Efficient LLM Serving | Azam Ikram et.al. | 2504.20828 | null |
2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | null |
2025-04-28 | AutoJudge: Judge Decoding Without Manual Annotation | Roman Garipov et.al. | 2504.20039 | null |
2025-04-28 | SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning | Wufei Ma et.al. | 2504.20024 | null |
2025-04-28 | Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages | Pritika Rohera et.al. | 2504.20022 | null |
2025-04-28 | Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models | Xin Wang et.al. | 2504.20020 | null |
2025-04-29 | LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation | Beizhe Hu et.al. | 2504.20013 | null |
2025-04-28 | Towards Automated Scoping of AI for Social Good Projects | Jacob Emmerson et.al. | 2504.20010 | null |
2025-04-28 | Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom | Rishika Sen et.al. | 2504.20000 | null |
2025-04-28 | HJRNO: Hamilton-Jacobi Reachability with Neural Operators | Yankai Li et.al. | 2504.19989 | null |
2025-04-28 | TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons | Emre Can Acikgoz et.al. | 2504.19982 | null |
2025-04-28 | Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Adam Younsi et.al. | 2504.19981 | null |
2025-04-29 | From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification | Junhao Ye et.al. | 2504.19959 | null |
2025-04-28 | Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI | Hugo Georgenthum et.al. | 2504.19918 | null |
2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null |
2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null |
2025-04-28 | CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition | Quynh Phung et.al. | 2504.19894 | null |
2025-04-28 | semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Ke Hong et.al. | 2504.19867 | null |
2025-04-28 | CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback | Chenhan Jiang et.al. | 2504.19860 | null |
2025-04-28 | Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language | Anastasia Zhukova et.al. | 2504.19856 | null |
2025-04-29 | The Automation Advantage in AI Red Teaming | Rob Mulla et.al. | 2504.19855 | null |
2025-04-25 | Generalization Capability for Imitation Learning | Yixiao Wang et.al. | 2504.18538 | null |
2025-04-25 | TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation | Gwen Yidou Weng et.al. | 2504.18535 | null |
2025-04-25 | Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation | Shivam Duggal et.al. | 2504.18509 | null |
2025-04-25 | Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues | Leandra Fichtel et.al. | 2504.18483 | null |
2025-04-25 | Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions | James D. Finch et.al. | 2504.18474 | null |
2025-04-25 | Fast-Slow Thinking for Large Vision-Language Model Reasoning | Wenyi Xiao et.al. | 2504.18458 | null |
2025-04-25 | Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training | Hiroki Naganuma et.al. | 2504.18454 | null |
2025-04-25 | Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation | Peiyuan Jing et.al. | 2504.18453 | null |
2025-04-25 | Kimi-Audio Technical Report | KimiTeam et.al. | 2504.18425 | link |
2025-04-25 | LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection | Rajesh Yarra et.al. | 2504.18423 | null |
2025-04-25 | BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs | Hongyu Wang et.al. | 2504.18415 | null |
2025-04-25 | An Empirical Study of Evaluating Long-form Question Answering | Ning Xian et.al. | 2504.18413 | link |
2025-04-25 | Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Jared Moore et.al. | 2504.18412 | link |
2025-04-25 | HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? | Yusen Zhang et.al. | 2504.18406 | null |
2025-04-25 | Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization | Kesen Zhao et.al. | 2504.18397 | null |
2025-04-25 | Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation | Qidong Liu et.al. | 2504.18383 | null |
2025-04-25 | Pushing the boundary on Natural Language Inference | Pablo Miralles-González et.al. | 2504.18376 | null |
2025-04-25 | Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Lei Shen et.al. | 2504.18373 | link |
2025-04-25 | ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications | Felix Viktor Jedrzejewski et.al. | 2504.18369 | null |
2025-04-25 | Testing Individual Fairness in Graph Neural Networks | Roya Nasiri et.al. | 2504.18353 | null |
2025-04-24 | Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models | Xu Ma et.al. | 2504.17789 | null |
2025-04-24 | Replay to Remember: Retaining Domain Knowledge in Streaming Language Models | Sneh Pillai et.al. | 2504.17780 | null |
2025-04-24 | Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT | Anuja Tayal et.al. | 2504.17753 | null |
2025-04-24 | Towards Robust LLMs: an Adversarial Robustness Measurement Framework | Natan Levy et.al. | 2504.17723 | null |
2025-04-24 | Multilingual Performance Biases of Large Language Models in Education | Vansh Gupta et.al. | 2504.17720 | null |
2025-04-24 | PICO: Reconstructing 3D People In Contact with Objects | Alpár Cseke et.al. | 2504.17695 | null |
2025-04-24 | Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks | Haru-Tada Sato et.al. | 2504.17685 | null |
2025-04-24 | INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models | Jarne Thys et.al. | 2504.17677 | null |
2025-04-24 | Energy Considerations of Large Language Model Inference and Efficiency Optimizations | Jared Fernandez et.al. | 2504.17674 | null |
2025-04-24 | Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation | Ying Zhu et.al. | 2504.17672 | null |
2025-04-25 | Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | Yuanchang Ye et.al. | 2504.17671 | null |
2025-04-24 | Towards a HIPAA Compliant Agentic AI System in Healthcare | Subash Neupane et.al. | 2504.17669 | null |
2025-04-24 | Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics | Zena Al-Khalili et.al. | 2504.17665 | null |
2025-04-24 | Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models | Julius Vetter et.al. | 2504.17660 | null |
2025-04-24 | Portability of Optimizations from SC to TSO | Akshay Gopalakrishnan et.al. | 2504.17646 | null |
2025-04-24 | L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference | Qingyuan Liu et.al. | 2504.17584 | null |
2025-04-25 | DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training | Xiaoyu Tian et.al. | 2504.17565 | null |
2025-04-24 | When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars | Rei Higuchi et.al. | 2504.17562 | null |
2025-04-24 | HalluLens: LLM Hallucination Benchmark | Yejin Bang et.al. | 2504.17550 | null |
2025-04-24 | A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task | Jiaqi Deng et.al. | 2504.17547 | null |
2025-04-23 | Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light | Ali Hassani et.al. | 2504.16922 | null |
2025-04-23 | IberBench: LLM Evaluation on Iberian Languages | José Ángel González et.al. | 2504.16921 | null |
2025-04-23 | Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text | Shifali Agrahari et.al. | 2504.16913 | null |
2025-04-23 | Do Large Language Models know who did what to whom? | Joseph M. Denning et.al. | 2504.16884 | null |
2025-04-23 | Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models | Xuyang Zhu et.al. | 2504.16883 | null |
2025-04-23 | Context-Enhanced Vulnerability Detection Based on Large Language Model | Yixin Yang et.al. | 2504.16877 | null |
2025-04-24 | Exploring How LLMs Capture and Represent Domain-Specific Knowledge | Mirian Hipolito Garcia et.al. | 2504.16871 | null |
2025-04-23 | Common Functional Decompositions Can Mis-attribute Differences in Outcomes Between Populations | Manuel Quintero et.al. | 2504.16864 | null |
2025-04-23 | Planning with Diffusion Models for Target-Oriented Dialogue Systems | Hanwen Du et.al. | 2504.16858 | null |
2025-04-23 | Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification | Alexander Shvets et.al. | 2504.16856 | null |
2025-04-23 | Monte Carlo Planning with Large Language Model for Text-Based Game Agents | Zijing Shi et.al. | 2504.16855 | null |
2025-04-23 | Improving Significant Wave Height Prediction Using Chronos Models | Yilin Zhai et.al. | 2504.16834 | null |
2025-04-23 | LRASGen: LLM-based RESTful API Specification Generation | Sida Deng et.al. | 2504.16833 | null |
2025-04-23 | GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning | Luu Quy Tung et.al. | 2504.16832 | null |
2025-04-23 | Decoupled Global-Local Alignment for Improving Compositional Understanding | Xiaoxing Hu et.al. | 2504.16801 | null |
2025-04-23 | MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores | Fengwei Zhou et.al. | 2504.16786 | null |
2025-04-23 | Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation | Tixiao Shan et.al. | 2504.16782 | null |
2025-04-23 | How Effective are Generative Large Language Models in Performing Requirements Classification? | Waad Alhoshan et.al. | 2504.16768 | null |
2025-04-23 | Lightweight Latent Verifiers for Efficient Meta-Generation Strategies | Bartosz Piotrowski et.al. | 2504.16760 | null |
2025-04-23 | HEMA : A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations | Kwangseob Ahn et.al. | 2504.16754 | null |
2025-04-22 | TTRL: Test-Time Reinforcement Learning | Yuxin Zuo et.al. | 2504.16084 | link |
2025-04-22 | MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention | Yucheng Li et.al. | 2504.16083 | null |
2025-04-22 | MR. Video: “MapReduce” is the Principle for Long Video Understanding | Ziqi Pang et.al. | 2504.16082 | null |
2025-04-22 | From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning | Le Zhuo et.al. | 2504.16080 | null |
2025-04-22 | LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities | Thomas Schmied et.al. | 2504.16078 | null |
2025-04-22 | PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models | Shi Qiu et.al. | 2504.16074 | null |
2025-04-22 | Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation | Zhiyuan Hu et.al. | 2504.16073 | null |
2025-04-22 | Describe Anything: Detailed Localized Image and Video Captioning | Long Lian et.al. | 2504.16072 | null |
2025-04-22 | A Python Tool for Reconstructing Full News Text from GDELT | A. Fronzetti Colladon et.al. | 2504.16063 | link |
2025-04-22 | Vision language models are unreliable at trivial spatial cognition | Sangeet Khemlani et.al. | 2504.16061 | null |
2025-04-22 | Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation | Ziqiao Ma et.al. | 2504.16060 | link |
2025-04-22 | Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach | Penghui Li et.al. | 2504.16057 | null |
2025-04-22 | Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability | Daniel Hendriks et.al. | 2504.16056 | null |
2025-04-22 | LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement | Zhifan Ye et.al. | 2504.16053 | link |
2025-04-22 | Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis | Frank Li et.al. | 2504.16047 | null |
2025-04-23 | Certified Mitigation of Worst-Case LLM Copyright Infringement | Jingyu Zhang et.al. | 2504.16046 | null |
2025-04-22 | LLMs meet Federated Learning for Scalable and Secure IoT Management | Yazan Otoum et.al. | 2504.16032 | null |
2025-04-22 | LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Joya Chen et.al. | 2504.16030 | null |
2025-04-22 | Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | Ahmed R. Sadik et.al. | 2504.16027 | null |
2025-04-22 | Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework | Xinyuan Song et.al. | 2504.16016 | null |
2025-04-21 | Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Chun-Hsiao Yeh et.al. | 2504.15280 | link |
2025-04-21 | VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models | Weiye Xu et.al. | 2504.15279 | null |
2025-04-21 | Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning | Jie Cheng et.al. | 2504.15275 | link |
2025-04-21 | Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models | Guo Chen et.al. | 2504.15271 | null |
2025-04-21 | Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction | Vaishnavh Nagarajan et.al. | 2504.15266 | link |
2025-04-21 | Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning | Ehsan Ahmadi et.al. | 2504.15263 | null |
2025-04-21 | Leveraging Language Models for Automated Patient Record Linkage | Mohammad Beheshti et.al. | 2504.15261 | null |
2025-04-21 | CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation | Anirudh Khatry et.al. | 2504.15254 | link |
2025-04-21 | Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators | Yilun Zhou et.al. | 2504.15253 | link |
2025-04-21 | MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning | Yahan Yang et.al. | 2504.15241 | null |
2025-04-21 | Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions | Saffron Huang et.al. | 2504.15236 | null |
2025-04-21 | A Self-Improving Coding Agent | Maxime Robeyns et.al. | 2504.15228 | null |
2025-04-21 | EvalAgent: Discovering Implicit Evaluation Criteria from the Web | Manya Wadhwa et.al. | 2504.15219 | null |
2025-04-21 | Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs | Marina Sakharova et.al. | 2504.15210 | null |
2025-04-21 | Compute-Optimal LLMs Provably Generalize Better With Scale | Marc Finzi et.al. | 2504.15208 | null |
2025-04-21 | Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges | Nandan Thakur et.al. | 2504.15205 | null |
2025-04-22 | Synergistic Weak-Strong Collaboration by Aligning Preferences | Yizhu Jiao et.al. | 2504.15188 | null |
2025-04-21 | DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution | Miaomiao Cai et.al. | 2504.15176 | null |
2025-04-21 | The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks | Joan C. Timoneda et.al. | 2504.15160 | null |
2025-04-21 | KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking | Juyeon Kim et.al. | 2504.15135 | link |
2025-04-18 | Generative AI Act II: Test Time Scaling Drives Cognition Engineering | Shijie Xia et.al. | 2504.13828 | link |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Yang Yue et.al. | 2504.13820 | link |
2025-04-18 | Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning | Yixuan Even Xu et.al. | 2504.13818 | null |
2025-04-18 | BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models | Zhengxian Wu et.al. | 2504.13775 | null |
2025-04-18 | DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs | Tamim Al Mahmud et.al. | 2504.13774 | link |
2025-04-18 | Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy? | Motunrayo Ibiyo et.al. | 2504.13769 | null |
2025-04-18 | Decoding Vision Transformers: the Diffusion Steering Lens | Ryota Takatsuki et.al. | 2504.13763 | link |
2025-04-18 | Scaling sparse feature circuit finding for in-context learning | Dmitrii Kharlapenko et.al. | 2504.13756 | null |
2025-04-18 | Learning to Attribute with Attention | Benjamin Cohen-Wang et.al. | 2504.13752 | link |
2025-04-18 | Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence | Paul K. Mandal et.al. | 2504.13730 | link |
2025-04-18 | OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation | Yichen Wu et.al. | 2504.13707 | null |
2025-04-18 | Exploring Multimodal Prompt for Visualization Authoring with Large Language Models | Zhen Wen et.al. | 2504.13700 | null |
2025-04-18 | Analysing the Robustness of Vision-Language-Models to Common Corruptions | Muhammad Usama et.al. | 2504.13690 | null |
2025-04-18 | Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation | Xiangrong et.al. | 2504.13684 | null |
2025-04-18 | Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results | Andrea Santilli et.al. | 2504.13677 | null |
2025-04-18 | Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm | Russell Beale et.al. | 2504.13667 | null |
2025-04-18 | Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code | Antonio Della Porta et.al. | 2504.13656 | null |
2025-04-18 | EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model | Sijing Li et.al. | 2504.13650 | link |
2025-04-18 | Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs | Gabriel Freedman et.al. | 2504.13644 | link |
2025-04-17 | Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya et.al. | 2504.13181 | null |
2025-04-17 | PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding | Jang Hyun Cho et.al. | 2504.13180 | link |
2025-04-17 | It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization | Ali Behrouz et.al. | 2504.13173 | null |
2025-04-17 | Sleep-time Compute: Beyond Inference Scaling at Test-time | Kevin Lin et.al. | 2504.13171 | link |
2025-04-17 | Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling | Tsung-Han Wu et.al. | 2504.13169 | link |
2025-04-17 | CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training | Shizhe Diao et.al. | 2504.13161 | null |
2025-04-17 | Digital Twin Generation from Visual Data: A Survey | Andrew Melnik et.al. | 2504.13159 | link |
2025-04-17 | MIB: A Mechanistic Interpretability Benchmark | Aaron Mueller et.al. | 2504.13151 | link |
2025-04-17 | Exploring Expert Failures Improves LLM Agent Tuning | Li-Cheng Lan et.al. | 2504.13145 | null |
2025-04-17 | Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo | João Loula et.al. | 2504.13139 | null |
2025-04-17 | Energy-Based Reward Models for Robust Language Model Alignment | Anamika Lochab et.al. | 2504.13134 | link |
2025-04-17 | LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard | Varun Rao et.al. | 2504.13125 | null |
2025-04-17 | Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Xinsong Zhang et.al. | 2504.13123 | null |
2025-04-17 | VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Haojian Huang et.al. | 2504.13122 | link |
2025-04-17 | Probing and Inducing Combinational Creativity in Vision-Language Models | Yongqian Peng et.al. | 2504.13120 | null |
2025-04-17 | Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration | Yusi Sun et.al. | 2504.13119 | null |
2025-04-17 | Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Kumar Manas et.al. | 2504.13111 | null |
2025-04-17 | EventVAD: Training-Free Event-Aware Video Anomaly Detection | Yihua Shao et.al. | 2504.13092 | null |
2025-04-17 | Retrieval-Augmented Generation with Conflicting Evidence | Han Wang et.al. | 2504.13079 | link |
2025-04-18 | SkyReels-V2: Infinite-length Film Generative Model | Guibin Chen et.al. | 2504.13074 | link |
2025-04-16 | BitNet b1.58 2B4T Technical Report | Shuming Ma et.al. | 2504.12285 | null |
2025-04-16 | HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Stefan Abi-Karam et.al. | 2504.12268 | link |
2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | link |
2025-04-16 | AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection | Xinyu Li et.al. | 2504.12250 | null |
2025-04-16 | MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models | Hang Yuan et.al. | 2504.12234 | null |
2025-04-16 | Watermarking Needs Input Repetition Masking | David Khachaturov et.al. | 2504.12229 | null |
2025-04-16 | d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning | Siyan Zhao et.al. | 2504.12216 | null |
2025-04-16 | What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure | Céline Budding et.al. | 2504.12187 | null |
2025-04-16 | SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data | Suyoung Bae et.al. | 2504.12185 | null |
2025-04-16 | Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification | Jaime E. Cuellar et.al. | 2504.12180 | null |
2025-04-16 | Multilingual Contextualization of Large Language Models for Document-Level Machine Translation | Miguel Moura Ramos et.al. | 2504.12140 | null |
2025-04-16 | Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - | Laura Fieback et.al. | 2504.12137 | null |
2025-04-16 | Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation | Anfu Tang et.al. | 2504.12113 | null |
2025-04-16 | Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation | Shizhan Cai et.al. | 2504.12108 | null |
2025-04-16 | Logits DeConfusion with CLIP for Few-Shot Learning | Shuo Li et.al. | 2504.12104 | link |
2025-04-16 | Gauging Overprecision in LLMs: An Empirical Study | Adil Bahaj et.al. | 2504.12098 | null |
2025-04-16 | Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework | Jack Preuveneers et.al. | 2504.12090 | null |
2025-04-16 | Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization | Pritam Sarkar et.al. | 2504.12083 | null |
2025-04-16 | Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection | Yumin Kim et.al. | 2504.12082 | null |
2025-04-16 | Subitizing-Inspired_Large_Language_Models_for_Floorplanning | Shao-Chien Lu et.al. | 2504.12076 | null |
2025-04-16 | Elucidating the Design Space of Multimodal Protein Language Models | Cheng-Yen Hsieh et.al. | 2504.11454 | null |
2025-04-15 | TextArena | Leon Guertler et.al. | 2504.11442 | link |
2025-04-15 | Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models | Maria Teleki et.al. | 2504.11431 | link |
2025-04-15 | A Dual-Space Framework for General Knowledge Distillation of Large Language Models | Xue Zhang et.al. | 2504.11426 | null |
2025-04-15 | Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts | Quanyu Long et.al. | 2504.11420 | null |
2025-04-15 | Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | Ali Taghibakhshi et.al. | 2504.11409 | null |
2025-04-15 | DataDecide: How to Predict Best Pretraining Data with Small Experiments | Ian Magnusson et.al. | 2504.11393 | null |
2025-04-15 | RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models | Juan Diego Rodriguez et.al. | 2504.11381 | link |
2025-04-15 | Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Wang Bill Zhu et.al. | 2504.11373 | link |
2025-04-15 | OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution | Lucio La Cava et.al. | 2504.11369 | null |
2025-04-15 | From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation | Jingkun Chen et.al. | 2504.11368 | null |
2025-04-15 | Teaching Large Language Models to Reason through Learning and Forgetting | Tianwei Ni et.al. | 2504.11364 | link |
2025-04-15 | Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Haiming Wang et.al. | 2504.11354 | link |
2025-04-16 | Seedream 3.0 Technical Report | Yu Gao et.al. | 2504.11346 | null |
2025-04-15 | A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce | Wei Xiong et.al. | 2504.11343 | link |
2025-04-15 | REWARD CONSISTENCY: Improving Multi-Objective Alignment from a Data-Centric Perspective | Zhihao Xu et.al. | 2504.11337 | null |
2025-04-15 | Looking beyond the next token | Abitha Thankaraj et.al. | 2504.11336 | null |
2025-04-15 | Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Ruicheng Ao et.al. | 2504.11320 | link |
2025-04-15 | Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | Yangyang Zhuang et.al. | 2504.11301 | null |
2025-04-16 | Automated Python Translation | Joshua Otten et.al. | 2504.11290 | null |
2025-04-14 | InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models | Jinguo Zhu et.al. | 2504.10479 | link |
2025-04-14 | Weight Ensembling Improves Reasoning in Language Models | Xingyu Dang et.al. | 2504.10478 | null |
2025-04-14 | MIEB: Massive Image Embedding Benchmark | Chenghao Xiao et.al. | 2504.10471 | link |
2025-04-14 | Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding | Tao Zhang et.al. | 2504.10465 | link |
2025-04-14 | The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Weixian Lei et.al. | 2504.10462 | link |
2025-04-15 | GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents | Xiaobo Xia et.al. | 2504.10458 | null |
2025-04-14 | M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | Junxiong Wang et.al. | 2504.10449 | link |
2025-04-14 | Multimodal Long Video Modeling Based on Temporal Dynamic Context | Haoran Hao et.al. | 2504.10443 | link |
2025-04-14 | LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models | Minqian Liu et.al. | 2504.10430 | null |
2025-04-14 | Foundation models for electronic health records: representation dynamics and transferability | Michael C. Burkhart et.al. | 2504.10422 | link |
2025-04-14 | Can We Edit LLMs for Long-Tail Biomedical Knowledge? | Xinhao Yi et.al. | 2504.10421 | link |
2025-04-15 | Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA | Michał Turski et.al. | 2504.10419 | link |
2025-04-14 | CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation | Jing Chen et.al. | 2504.10418 | null |
2025-04-14 | LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Parshin Shojaee et.al. | 2504.10415 | link |
2025-04-14 | Performance of Large Language Models in Supporting Medical Diagnosis and Treatment | Diogo Sousa et.al. | 2504.10405 | null |
2025-04-14 | Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks | Yan zhu et.al. | 2504.10403 | null |
2025-04-14 | Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling? | Olha Shaposhnyk et.al. | 2504.10397 | null |
2025-04-14 | SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning | Yiting Wang et.al. | 2504.10369 | null |
2025-04-14 | DICE: A Framework for Dimensional and Contextual Evaluation of Language Models | Aryan Shrivastava et.al. | 2504.10359 | null |
2025-04-14 | Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis | Yifan Yang et.al. | 2504.10352 | null |
2025-04-11 | Quantum Large Language Model Fine-Tuning | Sang Hyub Kim et.al. | 2504.08732 | null |
2025-04-11 | DocAgent: A Multi-Agent System for Automated Code Documentation Generation | Dayu Yang et.al. | 2504.08725 | link |
2025-04-11 | SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling | Krishna C. Puvvada et.al. | 2504.08719 | null |
2025-04-11 | SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents | Muhammad Shihab Rashid et.al. | 2504.08703 | link |
2025-04-11 | Large Language Models as Span Annotators | Zdeněk Kasner et.al. | 2504.08697 | null |
2025-04-11 | TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | Hang Ni et.al. | 2504.08694 | null |
2025-04-11 | Fast-Slow-Thinking: Complex Task Solving with Large Language Models | Yiliu Sun et.al. | 2504.08690 | null |
2025-04-11 | Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing | Jiho Kim et.al. | 2504.08687 | null |
2025-04-11 | Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model | Team Seawead et.al. | 2504.08685 | null |
2025-04-11 | Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis | Alexandre Bazin et.al. | 2504.08666 | null |
2025-04-11 | Quality evaluation of Tabby coding assistant using real source code snippets | Marta Borek et.al. | 2504.08650 | link |
2025-04-11 | Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents | Alessio Buscemi et.al. | 2504.08640 | null |
2025-04-11 | Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Gabriele Lozupone et.al. | 2504.08635 | link |
2025-04-11 | MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation | Tao Zhang et.al. | 2504.08621 | link |
2025-04-11 | Analyzing 16,193 LLM Papers for Fun and Profits | Zhiqiu Xia et.al. | 2504.08619 | null |
2025-04-11 | Playpen: An Environment for Exploring Learning Through Conversational Interaction | Nicola Horst et.al. | 2504.08590 | link |
2025-04-11 | AstroLLaVA: towards the unification of astronomical data and natural language | Sharaf Zaman et.al. | 2504.08583 | null |
2025-04-11 | UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection | Frances Laureano De Leon et.al. | 2504.08543 | null |
2025-04-11 | Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions | Tommaso Galliena et.al. | 2504.08531 | null |
2025-04-11 | On The Landscape of Spoken Language Models: A Comprehensive Survey | Siddhant Arora et.al. | 2504.08528 | null |
2025-04-10 | Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments | Lorenz Linhardt et.al. | 2504.07965 | null |
2025-04-10 | C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing | Zhongyang Li et.al. | 2504.07964 | link |
2025-04-10 | GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Lang Lin et.al. | 2504.07962 | null |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | link |
2025-04-10 | MM-IFEngine: Towards Multimodal Instruction Following | Shengyuan Ding et.al. | 2504.07957 | link |
2025-04-10 | VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning | Yukun Qi et.al. | 2504.07956 | null |
2025-04-10 | Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory | Mirac Suzgun et.al. | 2504.07952 | link |
2025-04-10 | We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy | Jordi Linares-Pellicer et.al. | 2504.07936 | null |
2025-04-10 | Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining | Rosie Zhao et.al. | 2504.07912 | link |
2025-04-10 | Porting an LLM based Application from ChatGPT to an On-Premise Environment | Teemu Paloniemi et.al. | 2504.07907 | null |
2025-04-10 | Redefining Machine Translation on Social Network Services with Large Language Models | Hongcheng Guo et.al. | 2504.07901 | link |
2025-04-10 | How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective | Qi Liu et.al. | 2504.07898 | link |
2025-04-10 | Fast Adaptation with Behavioral Foundation Models | Harshit Sikchi et.al. | 2504.07896 | null |
2025-04-10 | Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Riccardo Cantini et.al. | 2504.07887 | link |
2025-04-11 | An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | Xinyang Zhou et.al. | 2504.07881 | null |
2025-04-10 | Token Level Routing Inference System for Edge Devices | Jianshu She et.al. | 2504.07878 | null |
2025-04-10 | SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos | Joshua Li et.al. | 2504.07867 | null |
2025-04-11 | Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs | Yichun Yin et.al. | 2504.07866 | null |
2025-04-10 | Robust Hallucination Detection in LLMs via Adaptive Token Selection | Mengjia Niu et.al. | 2504.07863 | null |
2025-04-10 | 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization | Mengyang Li et.al. | 2504.07856 | null |
2025-04-09 | Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning | Nikhil Shivakumar Nayak et.al. | 2504.07097 | link |
2025-04-09 | OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens | Jiacheng Liu et.al. | 2504.07096 | null |
2025-04-09 | Are We Done with Object-Centric Learning? | Alexander Rubinstein et.al. | 2504.07092 | link |
2025-04-09 | KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs | Elan Markowitz et.al. | 2504.07087 | null |
2025-04-09 | A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility | Andreas Hochlehnert et.al. | 2504.07086 | null |
2025-04-09 | Self-Steering Language Models | Gabriel Grand et.al. | 2504.07081 | null |
2025-04-09 | DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning | Atharva Pandey et.al. | 2504.07080 | null |
2025-04-09 | Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation | Israfel Salazar et.al. | 2504.07072 | null |
2025-04-09 | A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models | Zhouhang Xie et.al. | 2504.07070 | null |
2025-04-09 | HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification | Bibek Paudel et.al. | 2504.07069 | null |
2025-04-09 | Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer | Shi Pan et.al. | 2504.07061 | null |
2025-04-09 | TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Liang-Hsuan Tseng et.al. | 2504.07053 | link |
2025-04-09 | To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning | Tian Qin et.al. | 2504.07052 | null |
2025-04-09 | Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety | Chad Melton et.al. | 2504.07022 | null |
2025-04-09 | LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware | Nowfel Mashnoor et.al. | 2504.07015 | null |
2025-04-09 | Towards LLMs Robustness to Changes in Prompt Format Styles | Lilian Ngweta et.al. | 2504.06969 | null |
2025-04-09 | Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation | Thomas Kerdreux et.al. | 2504.06962 | null |
2025-04-10 | VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning | Xinhao Li et.al. | 2504.06958 | null |
2025-04-09 | Adaptive Computation Pruning for the Forgetting Transformer | Zhixuan Lin et.al. | 2504.06949 | null |
2025-04-09 | RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts | Natalia Loukachevitch et.al. | 2504.06947 | link |
2025-04-08 | GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization | Bojana Ranković et.al. | 2504.06265 | link |
2025-04-08 | OmniSVG: A Unified Scalable Vector Graphics Generation Model | Yiying Yang et.al. | 2504.06263 | null |
2025-04-09 | Hogwild! Inference: Parallel LLM Generation via Concurrent Attention | Gleb Rodionov et.al. | 2504.06261 | null |
2025-04-08 | FEABench: Evaluating Language Models on Multiphysics Reasoning Ability | Nayantara Mudur et.al. | 2504.06260 | link |
2025-04-08 | Orb-v3: atomistic simulation at scale | Benjamin Rhodes et.al. | 2504.06231 | link |
2025-04-08 | LExT: Towards Evaluating Trustworthiness of Natural Language Explanations | Krithi Shailya et.al. | 2504.06227 | null |
2025-04-08 | Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation | Biao Zhang et.al. | 2504.06225 | null |
2025-04-09 | Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Xiaoxing Hu et.al. | 2504.06220 | link |
2025-04-08 | Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs | Dongyang Fan et.al. | 2504.06219 | null |
2025-04-08 | From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models | Chejian Xu et.al. | 2504.06214 | null |
2025-04-08 | TxGemma: Efficient and Agentic LLMs for Therapeutics | Eric Wang et.al. | 2504.06196 | null |
2025-04-08 | A Self-Supervised Framework for Space Object Behaviour Characterisation | Ian Groves et.al. | 2504.06176 | null |
2025-04-08 | Assessing how hyperparameters impact Large Language Models’ sarcasm detection performance | Montgomery Gole et.al. | 2504.06166 | null |
2025-04-09 | Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups | Rijul Magu et.al. | 2504.06160 | null |
2025-04-08 | A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning | Akash Kumar et.al. | 2504.06153 | null |
2025-04-08 | V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models | Xiangxi Zheng et.al. | 2504.06148 | link |
2025-04-08 | ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs | Tooraj Helmi et.al. | 2504.06143 | null |
2025-04-08 | Adversarial Training of Reward Models | Alexander Bukharin et.al. | 2504.06141 | null |
2025-04-08 | A Multimedia Analytics Model for the Foundation Model Era | Marcel Worring et.al. | 2504.06138 | null |
2025-04-08 | QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform | Movina Moses et.al. | 2504.06136 | null |
2025-04-07 | URECA: Unique Region Caption Anything | Sangbeom Lim et.al. | 2504.05305 | null |
2025-04-07 | InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | Sai Kumar Dwivedi et.al. | 2504.05303 | link |
2025-04-07 | SmolVLM: Redefining small and efficient multimodal models | Andrés Marafioti et.al. | 2504.05299 | null |
2025-04-07 | Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations | Pedro Ferreira et.al. | 2504.05294 | null |
2025-04-07 | The challenge of uncertainty quantification of large language models in medicine | Zahra Atf et.al. | 2504.05278 | null |
2025-04-07 | Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation | Yucheng Chu et.al. | 2504.05276 | null |
2025-04-07 | Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models | Yang Yan et.al. | 2504.05262 | null |
2025-04-07 | Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | Adrián Bazaga et.al. | 2504.05258 | null |
2025-04-07 | Explaining Low Perception Model Competency with High-Competency Counterfactuals | Sara Pohland et.al. | 2504.05254 | null |
2025-04-07 | LLM-based Automated Grading with Human-in-the-Loop | Hang Li et.al. | 2504.05239 | null |
2025-04-08 | NoveltyBench: Evaluating Language Models for Humanlike Diversity | Yiming Zhang et.al. | 2504.05228 | null |
2025-04-07 | A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text? | Julio Silva-Rodríguez et.al. | 2504.05227 | null |
2025-04-07 | Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation | Jiaming Chen et.al. | 2504.05225 | link |
2025-04-08 | Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG | Hengran Zhang et.al. | 2504.05220 | null |
2025-04-07 | Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling | Hengran Zhang et.al. | 2504.05216 | null |
2025-04-07 | Post-Training Language Models for Continual Relation Extraction | Sefika Efeoglu et.al. | 2504.05214 | null |
2025-04-07 | Quantum Program Linting with LLMs: Emerging Results from a Comparative Study | Seung Yeob Shin et.al. | 2504.05204 | null |
2025-04-07 | Training state-of-the-art pathology foundation models with orders of magnitude less data | Mikhail Karasikov et.al. | 2504.05186 | null |
2025-04-07 | Concise Reasoning via Reinforcement Learning | Mehdi Fatemi et.al. | 2504.05185 | link |
2025-04-07 | BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks | Wei Li et.al. | 2504.05180 | null |
2025-04-04 | Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions | Ting-Hsuan Liao et.al. | 2504.03639 | null |
2025-04-04 | Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning | Xinyi Wang et.al. | 2504.03635 | null |
2025-04-04 | Align to Structure: Aligning Large Language Models with Structural Information | Zae Myung Kim et.al. | 2504.03622 | null |
2025-04-04 | VISTA-OCR: Towards generative and interactive end to end OCR models | Laziz Hamdi et.al. | 2504.03621 | null |
2025-04-04 | Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task | Leonardo Ranaldi et.al. | 2504.03616 | null |
2025-04-04 | AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset | Bingxiang He et.al. | 2504.03612 | null |
2025-04-04 | MedSAM2: Segment Anything in 3D Medical Images and Videos | Jun Ma et.al. | 2504.03600 | link |
2025-04-04 | EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline | Peter Baile Chen et.al. | 2504.03598 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-04 | Agentic Knowledgeable Self-awareness | Shuofei Qiao et.al. | 2504.03553 | link |
2025-04-04 | RANa: Retrieval-Augmented Navigation | Gianluca Monaci et.al. | 2504.03524 | null |
2025-04-04 | Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles | Chen Wei Kuo et.al. | 2504.03520 | null |
2025-04-04 | SpectR: Dynamically Composing LM Experts with Spectral Routing | William Fleshman et.al. | 2504.03454 | null |
2025-04-04 | Optimizing Specific and Shared Parameters for Efficient Parameter Tuning | Van-Anh Nguyen et.al. | 2504.03450 | null |
2025-04-04 | LLMSched: Uncertainty-Aware Workload Scheduling for Compound LLM Applications | Botao Zhu et.al. | 2504.03444 | null |
2025-04-04 | Know What You do Not Know: Verbalized Uncertainty Estimation Robustness on Corrupted Images in Vision-Language Models | Mirko Borszukovszki et.al. | 2504.03440 | null |
2025-04-04 | Locations of Characters in Narratives: Andersen and Persuasion Datasets | Batuhan Ozyurt et.al. | 2504.03434 | link |
2025-04-04 | Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning | Sanghwan Bae et.al. | 2504.03380 | null |
2025-04-04 | MultiClear: Multimodal Soft Exoskeleton Glove for Transparent Object Grasping Assistance | Chen Hu et.al. | 2504.03379 | null |
2025-04-04 | Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency | Erik Johannes Husom et.al. | 2504.03360 | null |
2025-04-03 | STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection | Divya Velayudhan et.al. | 2504.02823 | null |
2025-04-03 | Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models | Mateusz Pach et.al. | 2504.02821 | link |
2025-04-03 | Generative Evaluation of Complex Reasoning in Large Language Models | Haowei Lin et.al. | 2504.02810 | link |
2025-04-03 | MegaMath: Pushing the Limits of Open Math Corpora | Fan Zhou et.al. | 2504.02807 | link |
2025-04-03 | F-ViTA: Foundation Model Guided Visible to Thermal Translation | Jay N. Paranjape et.al. | 2504.02801 | link |
2025-04-04 | A Survey of Large Language Models in Mental Health Disorder Detection on Social Media | Zhuohan Ge et.al. | 2504.02800 | null |
2025-04-03 | Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence | Anita Rau et.al. | 2504.02799 | null |
2025-04-03 | A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models | Gaurav Verma et.al. | 2504.02793 | null |
2025-04-03 | Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets | Chuning Zhu et.al. | 2504.02792 | null |
2025-04-03 | A Framework for Robust Cognitive Evaluation of LLMs | Karin de Langis et.al. | 2504.02789 | null |
2025-04-03 | From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks | Joshua Holstein et.al. | 2504.02780 | null |
2025-04-03 | BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs | Alexander Leszczynski et.al. | 2504.02779 | link |
2025-04-03 | How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? | Andres Algaba et.al. | 2504.02767 | link |
2025-04-03 | Robot-Led Vision Language Model Wellbeing Assessment of Children | Nida Itrat Abbasi et.al. | 2504.02765 | null |
2025-04-03 | Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study | Aryan Agrawal et.al. | 2504.02733 | link |
2025-04-04 | Why do LLMs attend to the first token? | Federico Barbero et.al. | 2504.02732 | null |
2025-04-03 | ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization | Kehua Feng et.al. | 2504.02725 | null |
2025-04-03 | TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models | Xinquan Wang et.al. | 2504.02712 | null |
2025-04-03 | The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context | Nikhil Verma et.al. | 2504.02708 | null |
2025-04-03 | LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems | Zishuo Liu et.al. | 2504.02671 | null |
2025-04-02 | Slot-Level Robotic Placement via Visual Imitation from Single Human Video | Dandan Shan et.al. | 2504.01959 | null |
2025-04-02 | Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Jing Liu et.al. | 2504.01954 | null |
2025-04-02 | The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data | Massimiliano Luca et.al. | 2504.01951 | null |
2025-04-02 | Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction | Daniel Becking et.al. | 2504.01947 | null |
2025-04-02 | OpenCodeReasoning: Advancing Data Distillation for Competitive Coding | Wasi Uddin Ahmad et.al. | 2504.01943 | null |
2025-04-02 | Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? | Celine Lee et.al. | 2504.01935 | link |
2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | link |
2025-04-02 | Gen-C: Populating Virtual Worlds with Generative Crowds | Andreas Panayiotou et.al. | 2504.01924 | null |
2025-04-02 | Is Less Really More? Fake News Detection with Limited Information | Zhaoyang Cao et.al. | 2504.01922 | link |
2025-04-03 | Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation | Baban Gain et.al. | 2504.01919 | null |
2025-04-02 | FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs | Mothilal Asokan et.al. | 2504.01916 | link |
2025-04-02 | Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning | Yinggan Xu et.al. | 2504.01911 | null |
2025-04-02 | Is Temporal Prompting All We Need For Limited Labeled Action Recognition? | Shreyank N Gowda et.al. | 2504.01890 | null |
2025-04-02 | TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables | Abhilash Shankarampeta et.al. | 2504.01879 | null |
2025-04-02 | From Code Generation to Software Testing: AI Copilot with Context-Based RAG | Yuchen Wang et.al. | 2504.01866 | null |
2025-04-02 | Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models | Zhiwei Yu et.al. | 2504.01857 | null |
2025-04-02 | Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks | Ali Al-Kaswan et.al. | 2504.01850 | null |
2025-04-02 | LARGE: Legal Retrieval Augmented Generation Evaluation Tool | Minhu Park et.al. | 2504.01840 | link |
2025-04-02 | Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images | Nusrat Munia et.al. | 2504.01838 | link |
2025-04-02 | YourBench: Easy Custom Evaluation Sets for Everyone | Sumuk Shashidhar et.al. | 2504.01833 | link |
2025-03-31 | Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation | Shengqiong Wu et.al. | 2503.24379 | null |
2025-03-31 | ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning | Harsha Kokel et.al. | 2503.24378 | null |
2025-03-31 | Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models | Rui Wang et.al. | 2503.24377 | link |
2025-03-31 | Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Yi Chen et.al. | 2503.24376 | link |
2025-03-31 | Effectively Controlling Reasoning Models through Thinking Intervention | Tong Wu et.al. | 2503.24370 | null |
2025-03-31 | Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation | Xiaoran Zhang et.al. | 2503.24368 | null |
2025-03-31 | ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion | Rana Muhammad Shahroz Khan et.al. | 2503.24354 | null |
2025-03-31 | PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks | Fang Yan et.al. | 2503.24345 | null |
2025-03-31 | Can Test-Time Scaling Improve World Foundation Model? | Wenyan Cong et.al. | 2503.24320 | link |
2025-03-31 | BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models | Alok Abhishek et.al. | 2503.24310 | null |
2025-03-31 | A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG | Arshia Kermani et.al. | 2503.24307 | null |
2025-03-31 | Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Jiacheng Lin et.al. | 2503.24289 | link |
2025-03-31 | Style Quantization for Data-Efficient GAN Training | Jian Wang et.al. | 2503.24282 | null |
2025-03-31 | Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality | Sewoong Lee et.al. | 2503.24277 | link |
2025-03-31 | Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | Dun Yuan et.al. | 2503.24245 | null |
2025-03-31 | What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models | Qiyuan Zhang et.al. | 2503.24235 | link |
2025-03-31 | Synthetic News Generation for Fake News Classification | Abdul Sittar et.al. | 2503.24206 | null |
2025-03-31 | TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance | Jingxian Xu et.al. | 2503.24198 | null |
2025-04-02 | Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval | Enrico Palumbo et.al. | 2503.24193 | null |
2025-03-31 | Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms | Shuoming Zhang et.al. | 2503.24191 | null |
2025-03-28 | Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Weiqi Li et.al. | 2503.22679 | link |
2025-03-28 | QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? | Belinda Z. Li et.al. | 2503.22674 | link |
2025-03-28 | Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers | Francesca Pezzuti et.al. | 2503.22672 | link |
2025-03-28 | Understanding Co-speech Gestures in-the-wild | Sindhu B Hegde et.al. | 2503.22668 | null |
2025-03-28 | Unicorn: Text-Only Data Synthesis for Vision Language Model Training | Xiaomin Yu et.al. | 2503.22655 | link |
2025-03-28 | Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users | Antonia Karamolegkou et.al. | 2503.22610 | null |
2025-03-28 | On the Alignment of Post-Publication Reviews & Bibliometric and Altmetric Impact – A Case Study on Expert Statements from the Science Media Center Germany | Dirk Tunger et.al. | 2503.22594 | null |
2025-03-28 | LLM-enabled Instance Model Generation | Fengjunjie Pan et.al. | 2503.22587 | null |
2025-03-28 | Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish | Kevin Cohen et.al. | 2503.22585 | link |
2025-03-28 | Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation | Sarubi Thillainathan et.al. | 2503.22582 | null |
2025-03-28 | Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization | Iñigo Pikabea et.al. | 2503.22577 | null |
2025-03-28 | Niyama : Breaking the Silos of LLM Inference Serving | Kanishk Goel et.al. | 2503.22562 | null |
2025-03-28 | Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation | Zhuo-Yang Song et.al. | 2503.22547 | null |
2025-03-28 | Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities | Raman Dutt et.al. | 2503.22517 | null |
2025-03-28 | Assessing Foundation Models for Sea Ice Type Segmentation in Sentinel-1 SAR Imagery | Samira Alkaee Taleghan et.al. | 2503.22516 | null |
2025-03-28 | Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model | Wangtao Sun et.al. | 2503.22480 | null |
2025-03-28 | WorkTeam: Constructing Workflows from Natural Language with Multi-Agents | Hanchao Liu et.al. | 2503.22473 | null |
2025-03-28 | Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | Shengyue Guan et.al. | 2503.22458 | null |
2025-03-28 | Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning | Abdullah Vanlioglu et.al. | 2503.22456 | null |
2025-03-28 | STADE: Standard Deviation as a Pruning Metric | Diego Coello de Portugal Mecke et.al. | 2503.22451 | link |
2025-03-27 | Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Abdelrahman Shaker et.al. | 2503.21782 | link |
2025-03-27 | Video-R1: Reinforcing Video Reasoning in MLLMs | Kaituo Feng et.al. | 2503.21776 | link |
2025-03-27 | Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence | Haolin Liu et.al. | 2503.21766 | null |
2025-03-27 | Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video | David Yifan Yao et.al. | 2503.21761 | link |
2025-03-27 | MemInsight: Autonomous Memory Augmentation for LLM Agents | Rana Salama et.al. | 2503.21760 | null |
2025-03-27 | Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck | Adrian Bulat et.al. | 2503.21757 | null |
2025-03-27 | GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics | Arsham Gholamzadeh Khoee et.al. | 2503.21735 | null |
2025-03-27 | Effective Skill Unlearning through Intervention and Abstention | Yongce Li et.al. | 2503.21730 | link |
2025-03-27 | Collab: Controlled Decoding using Mixture of Agents for LLM Alignment | Souradip Chakraborty et.al. | 2503.21720 | null |
2025-03-28 | Outlier dimensions favor frequent tokens in language models | Iuri Macocco et.al. | 2503.21718 | null |
2025-03-27 | As easy as PIE: understanding when pruning causes language models to disagree | Pietro Tropeano et.al. | 2503.21714 | link |
2025-03-27 | Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs | Boyang Yang et.al. | 2503.21710 | null |
2025-03-27 | LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning | Hui Wang et.al. | 2503.21683 | null |
2025-03-27 | JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community | Yunze Xiao et.al. | 2503.21679 | null |
2025-03-27 | How do language models learn facts? Dynamics, curricula and hallucinations | Nicolas Zucchet et.al. | 2503.21676 | null |
2025-03-27 | Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base | Satvik Verma et.al. | 2503.21674 | link |
2025-03-27 | Model Assembly Learning with Heterogeneous Layer Weight Merging | Yi-Kai Zhang et.al. | 2503.21657 | null |
2025-03-27 | UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning | Zhengxi Lu et.al. | 2503.21620 | link |
2025-03-27 | Leveraging Language Models for Analyzing Longitudinal Experiential Data in Education | Ahatsham Hayat et.al. | 2503.21617 | null |
2025-03-27 | Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach | Javier Coronado-Blázquez et.al. | 2503.21613 | null |
2025-03-26 | Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark | Sondos Mahmoud Bsharat et.al. | 2503.20786 | link |
2025-03-26 | Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency | Tianqi Liu et.al. | 2503.20785 | link |
2025-03-26 | Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields | Shijie Zhou et.al. | 2503.20776 | null |
2025-03-26 | ASGO: Adaptive Structured Gradient Optimization | Kang An et.al. | 2503.20762 | null |
2025-03-26 | MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Yunhai Hu et.al. | 2503.20757 | null |
2025-03-27 | Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning | Huajie Tan et.al. | 2503.20752 | null |
2025-03-26 | UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines | Chen Tang et.al. | 2503.20748 | null |
2025-03-26 | MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams | Yanpeng Sun et.al. | 2503.20745 | null |
2025-03-26 | Dynamic Motion Blending for Versatile Motion Editing | Nan Jiang et.al. | 2503.20724 | null |
2025-03-26 | From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models | Nikita Neveditsin et.al. | 2503.20715 | null |
2025-03-26 | MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion | Saron Samuel et.al. | 2503.20698 | null |
2025-03-26 | Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control | Eloy Anguiano Batanero et.al. | 2503.20688 | null |
2025-03-27 | Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound | Yuhao Huang et.al. | 2503.20685 | null |
2025-03-27 | Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy | Yinan Sun et.al. | 2503.20673 | null |
2025-03-26 | TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews | Huimin Xu et.al. | 2503.20666 | null |
2025-03-26 | AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction | Sadaf Khademi et.al. | 2503.20662 | null |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging | Han Wu et.al. | 2503.20641 | link |
2025-03-26 | Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions | Alessandro Maisto et.al. | 2503.20623 | null |
2025-03-26 | IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting | Hao Fu et.al. | 2503.20612 | link |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912 | link |
2025-03-25 | CoLLM: A Large Language Model for Composed Image Retrieval | Chuong Huynh et.al. | 2503.19910 | link |
2025-03-25 | FullDiT: Multi-Task Video Generative Foundation Model with Full Attention | Xuan Ju et.al. | 2503.19907 | null |
2025-03-25 | CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Hao Yu et.al. | 2503.19900 | link |
2025-03-25 | A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design | Jie Tian et.al. | 2503.19889 | null |
2025-03-25 | CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation | Nengbo Wang et.al. | 2503.19878 | null |
2025-03-25 | Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators | Seungone Kim et.al. | 2503.19877 | null |
2025-03-25 | SLA-Awareness for AI-assisted coding | Kishanthan Thangarajah et.al. | 2503.19876 | null |
2025-03-25 | Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking | Xiaoyu Tian et.al. | 2503.19855 | null |
2025-03-25 | Towards Online Multi-Modal Social Interaction Understanding | Xinpeng Li et.al. | 2503.19851 | link |
2025-03-25 | FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | Carlos Plou et.al. | 2503.19850 | null |
2025-03-25 | A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 | Zhao Fang et.al. | 2503.19844 | null |
2025-03-25 | FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Jun Zhou et.al. | 2503.19839 | null |
2025-03-25 | Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning | Pratibha Kumari et.al. | 2503.19819 | null |
2025-03-25 | SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI | Zhiyang Liu et.al. | 2503.19801 | null |
2025-03-25 | SemEval-2025 Task 9: The Food Hazard Detection Challenge | Korbinian Randl et.al. | 2503.19800 | null |
2025-03-25 | PAVE: Patching and Adapting Video Large Language Models | Zhuoming Liu et.al. | 2503.19794 | link |
2025-03-25 | Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models | Kartik Thakral et.al. | 2503.19783 | null |
2025-03-25 | LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation | Vladan Stojnić et.al. | 2503.19777 | link |
2025-03-25 | OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations | Christina Kassab et.al. | 2503.19764 | null |
2025-03-24 | DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Karim Abou Zeid et.al. | 2503.18944 | link |
2025-03-24 | SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding | Mingze Xu et.al. | 2503.18943 | null |
2025-03-24 | Video-T1: Test-Time Scaling for Video Generation | Fangfu Liu et.al. | 2503.18942 | null |
2025-03-24 | Exploring Training and Inference Scaling Laws in Generative Retrieval | Hongru Cai et.al. | 2503.18941 | link |
2025-03-24 | CoMP: Continual Multimodal Pre-training for Vision Foundation Models | Yitong Chen et.al. | 2503.18931 | link |
2025-03-24 | Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training | Brian R. Bartoldson et.al. | 2503.18929 | null |
2025-03-24 | Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models | Meng Cao et.al. | 2503.18923 | null |
2025-03-24 | FFN Fusion: Rethinking Sequential Computation in Large Language Models | Akhiad Bercovich et.al. | 2503.18908 | null |
2025-03-24 | xKV: Cross-Layer SVD for KV-Cache Compression | Chi-Chih Chang et.al. | 2503.18893 | link |
2025-03-24 | AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration | Zhexuan Wang et.al. | 2503.18891 | link |
2025-03-24 | Toward building next-generation Geocoding systems: a systematic review | Zhengcong Yin et.al. | 2503.18888 | null |
2025-03-24 | I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders | Andrey Galichin et.al. | 2503.18878 | link |
2025-03-24 | Efficient Self-Supervised Adaptation for Medical Image Analysis | Moein Sorkhei et.al. | 2503.18873 | link |
2025-03-24 | Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design | Rui Xie et.al. | 2503.18869 | null |
2025-03-24 | Reasoning to Learn from Latent Thoughts | Yangjun Ruan et.al. | 2503.18866 | null |
2025-03-25 | Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations | Junlan Chen et.al. | 2503.18865 | null |
2025-03-25 | MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Ruichuan An et.al. | 2503.18854 | link |
2025-03-24 | Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations | Jeonghyeon Kim et.al. | 2503.18817 | link |
2025-03-24 | Defeating Prompt Injections by Design | Edoardo Debenedetti et.al. | 2503.18813 | null |
2025-03-24 | SKDU at De-Factify 4.0: Vision Transformer with Data Augmentation for AI-Generated Image Detection | Shrikant Malviya et.al. | 2503.18812 | link |
2025-03-21 | Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique | Yansi Li et.al. | 2503.17363 | null |
2025-03-21 | HCAST: Human-Calibrated Autonomy Software Tasks | David Rein et.al. | 2503.17354 | link |
2025-03-21 | NdLinear Is All You Need for Representation Learning | Alex Reneau et.al. | 2503.17353 | link |
2025-03-21 | OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement | Yihe Deng et.al. | 2503.17352 | link |
2025-03-21 | Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models | Jianing Qi et.al. | 2503.17349 | null |
2025-03-21 | Capturing Individual Human Preferences with Reward Features | André Barreto et.al. | 2503.17338 | null |
2025-03-21 | Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs | Reem Gody et.al. | 2503.17336 | null |
2025-03-21 | CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities | Yuxuan Zhu et.al. | 2503.17332 | link |
2025-03-21 | LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Kun Chu et.al. | 2503.17309 | link |
2025-03-21 | Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests | John Naulty et.al. | 2503.17302 | null |
2025-03-21 | FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models | Mingyang Song et.al. | 2503.17287 | link |
2025-03-21 | CASE – Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement | Gaifan Zhang et.al. | 2503.17279 | null |
2025-03-21 | Revisiting End To End Sparse Autoencoder Training – A Short Finetune is All You Need | Adam Karvonen et.al. | 2503.17272 | link |
2025-03-21 | SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging | Aladin Djuhera et.al. | 2503.17239 | link |
2025-03-21 | Slide-Level Prompt Learning with Vision Language Models for Few-Shot Multiple Instance Learning in Histopathology | Devavrat Tomar et.al. | 2503.17238 | link |
2025-03-21 | FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Albert Sawczyn et.al. | 2503.17229 | null |
2025-03-21 | Automating Adjudication of Cardiovascular Events Using Large Language Models | Sonish Sivarajkumar et.al. | 2503.17222 | null |
2025-03-21 | A Language Anchor-Guided Method for Robust Noisy Domain Generalization | Zilin Dai et.al. | 2503.17211 | null |
2025-03-21 | TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning | Sheng Wang et.al. | 2503.17195 | null |
2025-03-21 | LLMs Love Python: A Study of LLMs’ Bias for Programming Languages and Libraries | Lukas Twist et.al. | 2503.17181 | link |
2025-03-20 | DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Keyan Chen et.al. | 2503.16426 | link |
2025-03-20 | Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models | Yang Sui et.al. | 2503.16419 | link |
2025-03-20 | M3: 3D-Spatial MultiModal Memory | Xueyan Zou et.al. | 2503.16413 | link |
2025-03-20 | The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Yifan Sun et.al. | 2503.16402 | link |
2025-03-20 | Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | Guanyu Chen et.al. | 2503.16401 | null |
2025-03-20 | Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation | Yijia Luo et.al. | 2503.16385 | link |
2025-03-20 | LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images | Leyang Wang et.al. | 2503.16376 | null |
2025-03-20 | JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | Muyao Li et.al. | 2503.16365 | null |
2025-03-20 | CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners | Yunzhi Yao et.al. | 2503.16356 | link |
2025-03-20 | Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences | Krithik Ramesh et.al. | 2503.16351 | null |
2025-03-20 | LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Ying Shen et.al. | 2503.16334 | null |
2025-03-20 | OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence | Long Yuan et.al. | 2503.16326 | null |
2025-03-20 | Issue2Test: Generating Reproducing Test Cases from Issue Reports | Noor Nashid et.al. | 2503.16320 | null |
2025-03-21 | Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | Peiran Gu et.al. | 2503.16304 | null |
2025-03-20 | Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Zhaochong An et.al. | 2503.16282 | link |
2025-03-21 | Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens | Shuqi Lu et.al. | 2503.16278 | link |
2025-03-20 | Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data | Zijian Li et.al. | 2503.16260 | null |
2025-03-20 | Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models | Keda Tao et.al. | 2503.16257 | null |
2025-03-21 | Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Zhaowei Liu et.al. | 2503.16252 | link |
2025-03-20 | Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t | Quy-Anh Dang et.al. | 2503.16219 | link |
2025-03-19 | TULIP: Towards Unified Language-Image Pretraining | Zineng Tang et.al. | 2503.15485 | null |
2025-03-19 | SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Yifei Zhou et.al. | 2503.15478 | link |
2025-03-19 | What Makes a Reward Model a Good Teacher? An Optimization Perspective | Noam Razin et.al. | 2503.15477 | link |
2025-03-19 | Cube: A Roblox View of 3D Intelligence | Foundation AI Team et.al. | 2503.15475 | link |
2025-03-19 | EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Boshen Xu et.al. | 2503.15470 | link |
2025-03-19 | From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment | Jia-Nan Li et.al. | 2503.15463 | link |
2025-03-19 | SkyLadder: Better and Faster Pretraining via Context Window Scheduling | Tongyao Zhu et.al. | 2503.15450 | link |
2025-03-19 | VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Yang Tan et.al. | 2503.15438 | link |
2025-03-19 | Visual Position Prompt for MLLM based Visual Grounding | Wei Tang et.al. | 2503.15426 | link |
2025-03-19 | Probing the topology of the space of tokens with structured prompts | Michael Robinson et.al. | 2503.15421 | null |
2025-03-19 | Visual Persona: Foundation Model for Full-Body Human Customization | Jisu Nam et.al. | 2503.15406 | null |
2025-03-19 | FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation | Yumin Zhang et.al. | 2503.15390 | null |
2025-03-19 | EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models | Yinan Liang et.al. | 2503.15369 | null |
2025-03-19 | SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation | Thomas Pickard et.al. | 2503.15358 | null |
2025-03-19 | SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models | I-Fan Lin et.al. | 2503.15351 | null |
2025-03-19 | TruthLens:A Training-Free Paradigm for DeepFake Detection | Ritabrata Chakraborty et.al. | 2503.15342 | null |
2025-03-19 | Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs | Yuqi Zhu et.al. | 2503.15341 | null |
2025-03-19 | Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context | Junyi Ao et.al. | 2503.15338 | link |
2025-03-19 | Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport | Hao Tan et.al. | 2503.15337 | link |
2025-03-19 | Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model | Euclid Collaboration et.al. | 2503.15312 | link |
2025-03-18 | Aligning Multimodal LLM with Human Preference: A Survey | Tao Yu et.al. | 2503.14504 | link |
2025-03-18 | Engineering Scientific Assistants using Interactive Structured Induction of Programs | Shraddha Surana et.al. | 2503.14488 | null |
2025-03-18 | Gricean Norms as a Basis for Effective Collaboration | Fardin Saad et.al. | 2503.14484 | link |
2025-03-19 | Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM | Xinyu Fang et.al. | 2503.14478 | link |
2025-03-18 | Characterizing Data Visualization Literacy: a Systematic Literature Review | Sara Beschi et.al. | 2503.14468 | null |
2025-03-18 | RWKV-7 “Goose” with Expressive Dynamic State Evolution | Bo Peng et.al. | 2503.14456 | link |
2025-03-18 | EnvBench: A Benchmark for Automated Environment Setup | Aleksandra Eliseeva et.al. | 2503.14443 | link |
2025-03-18 | LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers | Nikhil Abhyankar et.al. | 2503.14434 | link |
2025-03-18 | PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play | Wei Fang et.al. | 2503.14432 | null |
2025-03-18 | ExDDV: A New Dataset for Explainable Deepfake Detection in Video | Vlad Hondru et.al. | 2503.14421 | link |
2025-03-18 | Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models | Siwei Zhang et.al. | 2503.14411 | null |
2025-03-18 | Large Language Models for Virtual Human Gesture Selection | Parisa Ghanad Torshizi et.al. | 2503.14408 | null |
2025-03-18 | DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers | Mert Bulent Sariyildiz et.al. | 2503.14405 | null |
2025-03-18 | From “Hallucination” to “Suture”: Insights from Language Philosophy to Enhance Large Language Models | Qiantong Wang et.al. | 2503.14392 | null |
2025-03-18 | How much do LLMs learn from negative examples? | Shadi Hamdan et.al. | 2503.14391 | null |
2025-03-18 | Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Rikuto Tsuchida et.al. | 2503.14382 | null |
2025-03-18 | On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller? | Pouria Sarhadi et.al. | 2503.14379 | link |
2025-03-18 | Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Maximilian Beck et.al. | 2503.14376 | link |
2025-03-18 | MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts | Runqi Meng et.al. | 2503.14355 | null |
2025-03-19 | MoonCast: High-Quality Zero-Shot Podcast Generation | Zeqian Ju et.al. | 2503.14345 | link |
2025-03-17 | MetaScale: Test-Time Scaling with Evolving Meta-Thoughts | Qin Liu et.al. | 2503.13447 | null |
2025-03-17 | MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation | Zhenyu Wu et.al. | 2503.13446 | null |
2025-03-17 | Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance | Noah Y. Siegel et.al. | 2503.13445 | null |
2025-03-17 | VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning | Ye Liu et.al. | 2503.13444 | link |
2025-03-17 | DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models | Haoyang Li et.al. | 2503.13443 | link |
2025-03-18 | MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Yingyue Li et.al. | 2503.13440 | link |
2025-03-17 | xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference | Maximilian Beck et.al. | 2503.13427 | link |
2025-03-17 | SuperBPE: Space Travel for Language Models | Alisa Liu et.al. | 2503.13423 | null |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-18 | DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective | Dengyun Peng et.al. | 2503.13413 | link |
2025-03-17 | Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis | Alexander Ku et.al. | 2503.13401 | null |
2025-03-17 | MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research | James Burgess et.al. | 2503.13399 | link |
2025-03-17 | Aligned Probing: Relating Toxic Behavior and Model Internals | Andreas Waldis et.al. | 2503.13390 | null |
2025-03-17 | Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning | Mengyao Lyu et.al. | 2503.13383 | null |
2025-03-17 | Sightation Counts: Leveraging Sighted User Feedback in Building a BLV-aligned Dataset of Diagram Descriptions | Wan Ju Kang et.al. | 2503.13369 | null |
2025-03-17 | Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning | Hai-Long Sun et.al. | 2503.13360 | null |
2025-03-17 | Agents Play Thousands of 3D Video Games | Zhongwen Xu et.al. | 2503.13356 | null |
2025-03-17 | Valid Text-to-SQL Generation with Unification-based DeepStochLog | Ying Jiao et.al. | 2503.13342 | link |
2025-03-17 | LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support | Xinyu Jessica Wang et.al. | 2503.13340 | null |
2025-03-17 | Reliable and Efficient Amortized Model-based Evaluation | Sang Truong et.al. | 2503.13335 | null |
2025-03-14 | Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense | Shuyang Hao et.al. | 2503.11619 | null |
2025-03-14 | ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning | Xinyi Wang et.al. | 2503.11617 | link |
2025-03-14 | Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages | Matteo Farina et.al. | 2503.11609 | link |
2025-03-14 | Do Construction Distributions Shape Formal Language Learning In German BabyLMs? | Bastian Bunzeck et.al. | 2503.11593 | null |
2025-03-14 | Pathology Image Compression with Pre-trained Autoencoders | Srikar Yellapragada et.al. | 2503.11591 | null |
2025-03-14 | Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space | Zhiliang Chen et.al. | 2503.11586 | link |
2025-03-14 | SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion | Ahmed Nassar et.al. | 2503.11576 | null |
2025-03-14 | Synthesizing Access Control Policies using Large Language Models | Adarsh Vatsa et.al. | 2503.11573 | null |
2025-03-14 | Implicit Bias-Like Patterns in Reasoning Models | Messi H. J. Lee et.al. | 2503.11572 | null |
2025-03-14 | VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity | Jing Bi et.al. | 2503.11557 | null |
2025-03-14 | Similarity-Aware Token Pruning: Your VLM but Faster | Ahmadreza Jeddi et.al. | 2503.11549 | link |
2025-03-14 | Potential of large language model-powered nudges for promoting daily water and energy conservation | Zonghan Li et.al. | 2503.11531 | null |
2025-03-14 | Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models | Hao Cheng et.al. | 2503.11519 | null |
2025-03-14 | HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Ziqin Zhou et.al. | 2503.11513 | null |
2025-03-14 | V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning | Zixu Cheng et.al. | 2503.11495 | null |
2025-03-14 | A Review of DeepSeek Models’ Key Innovative Techniques | Chengen Wang et.al. | 2503.11486 | null |
2025-03-14 | Integrating LLMs in Gamified Systems | Carlos J. Costa et.al. | 2503.11458 | null |
2025-03-14 | D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning | Jia Zhang et.al. | 2503.11441 | null |
2025-03-14 | Text Compression for Efficient Language Generation | David Gu et.al. | 2503.11426 | null |
2025-03-14 | Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models | Xu Liu et.al. | 2503.11411 | null |
2025-03-13 | GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Rongyao Fang et.al. | 2503.10639 | link |
2025-03-13 | A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 | Zhaoyi Li et.al. | 2503.10635 | link |
2025-03-13 | HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model | Jiaming Liu et.al. | 2503.10631 | null |
2025-03-13 | UniGoal: Towards Universal Zero-shot Goal-oriented Navigation | Hang Yin et.al. | 2503.10630 | null |
2025-03-13 | Transformers without Normalization | Jiachen Zhu et.al. | 2503.10622 | null |
2025-03-13 | From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM | Kshitij Ambilduke et.al. | 2503.10620 | link |
2025-03-13 | Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | Andy Zhou et.al. | 2503.10619 | null |
2025-03-13 | Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models | Andy Zhou et.al. | 2503.10617 | null |
2025-03-13 | R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization | Yi Yang et.al. | 2503.10615 | link |
2025-03-13 | CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing | Advait Gupta et.al. | 2503.10613 | link |
2025-03-13 | TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention | Jinhao Duan et.al. | 2503.10602 | link |
2025-03-13 | GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Rui Hu et.al. | 2503.10596 | link |
2025-03-13 | Unlock the Power of Unlabeled Data in Language Driving Model | Chaoqun Wang et.al. | 2503.10586 | null |
2025-03-13 | VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search | Yiming Jia et.al. | 2503.10582 | null |
2025-03-13 | Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models | Afrar Jahin et.al. | 2503.10573 | null |
2025-03-13 | ASIDE: Architectural Separation of Instructions and Data in Language Models | Egor Zverev et.al. | 2503.10566 | null |
2025-03-13 | Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations | Brett Puppart et.al. | 2503.10556 | null |
2025-03-13 | KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation | Zixian Liu et.al. | 2503.10546 | null |
2025-03-13 | DP-GPL: Differentially Private Graph Prompt Learning | Jing Xu et.al. | 2503.10544 | null |
2025-03-13 | Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More | Arvid Frydenlund et.al. | 2503.10542 | null |
2025-03-12 | MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System | Jihao Zhao et.al. | 2503.09600 | link |
2025-03-12 | How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Ruohao Guo et.al. | 2503.09598 | link |
2025-03-12 | SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Katrin Renz et.al. | 2503.09594 | null |
2025-03-12 | BIMBA: Selective-Scan Compression for Long-Range Video Question Answering | Md Mohaiminul Islam et.al. | 2503.09590 | link |
2025-03-12 | Cost-Optimal Grouped-Query Attention for Long-Context LLMs | Yingfa Chen et.al. | 2503.09579 | link |
2025-03-12 | Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Marianne Arriola et.al. | 2503.09573 | link |
2025-03-12 | Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks | Lutfi Eren Erdogan et.al. | 2503.09572 | null |
2025-03-13 | Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models | Qiguang Chen et.al. | 2503.09567 | null |
2025-03-12 | PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs | Oskar van der Wal et.al. | 2503.09543 | link |
2025-03-13 | Large Language Models for Multi-Facility Location Mechanism Design | Nguyen Thach et.al. | 2503.09533 | null |
2025-03-13 | SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability | Adam Karvonen et.al. | 2503.09532 | null |
2025-03-12 | Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning | Bowen Jin et.al. | 2503.09516 | link |
2025-03-12 | Reinforcement Learning is all You Need | Yongsheng Lian et.al. | 2503.09512 | null |
2025-03-12 | ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning | Ziyu Wan et.al. | 2503.09501 | link |
2025-03-12 | MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions | Zhe Xu et.al. | 2503.09499 | link |
2025-03-12 | Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection | Romain Thoreau et.al. | 2503.09493 | null |
2025-03-12 | Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness | Beier Zhu et.al. | 2503.09487 | null |
2025-03-12 | BAMBI: Developing Baby Language Models for Italian | Alice Suozzi et.al. | 2503.09481 | null |
2025-03-12 | SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery | Jiayuan Huang et.al. | 2503.09474 | null |
2025-03-12 | Explicit Learning and the LLM in Machine Translation | Malik Marmonier et.al. | 2503.09454 | link |
2025-03-11 | QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension | Yongdong Luo et.al. | 2503.08689 | link |
2025-03-11 | Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs | Ariba Khan et.al. | 2503.08688 | link |
2025-03-11 | Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents | Haoyu Wang et.al. | 2503.08684 | link |
2025-03-11 | Self-Taught Self-Correction for Small Language Models | Viktor Moskvoretskii et.al. | 2503.08681 | null |
2025-03-11 | Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields | Tobias Kreiman et.al. | 2503.08674 | null |
2025-03-11 | Generating Robot Constitutions & Benchmarks for Semantic Safety | Pierre Sermanet et.al. | 2503.08663 | null |
2025-03-11 | Exploring the Word Sense Disambiguation Capabilities of Large Language Models | Pierpaolo Basile et.al. | 2503.08662 | null |
2025-03-11 | YuE: Scaling Open Foundation Models for Long-Form Music Generation | Ruibin Yuan et.al. | 2503.08638 | link |
2025-03-11 | LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization | Xianfeng Wu et.al. | 2503.08619 | link |
2025-03-11 | EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments | Dongping Li et.al. | 2503.08604 | link |
2025-03-11 | NSF-SciFy: Mining the NSF Awards Database for Scientific Claims | Delip Rao et.al. | 2503.08600 | null |
2025-03-11 | Proc4Gem: Foundation models for physical agency through procedural generation | Yixin Lin et.al. | 2503.08593 | null |
2025-03-11 | BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Xin Xu et.al. | 2503.08588 | link |
2025-03-11 | HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding | Shehreen Azad et.al. | 2503.08585 | null |
2025-03-11 | RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding | Xichen Tan et.al. | 2503.08576 | null |
2025-03-11 | DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process | Minjun Zhu et.al. | 2503.08569 | null |
2025-03-11 | Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs | Wanyong Feng et.al. | 2503.08551 | null |
2025-03-11 | Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling | Craig Messner et.al. | 2503.08550 | null |
2025-03-11 | Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation | Xian Gao et.al. | 2503.08549 | null |
2025-03-11 | TLA: Tactile-Language-Action Model for Contact-Rich Manipulation | Peng Hao et.al. | 2503.08548 | null |
2025-03-10 | Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru | Dunant Cusipuma et.al. | 2503.07587 | null |
2025-03-10 | Talking to GDELT Through Knowledge Graphs | Audun Myers et.al. | 2503.07584 | null |
2025-03-10 | VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models | Jen-tse Huang et.al. | 2503.07575 | link |
2025-03-10 | AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning | Yangzhe Kong et.al. | 2503.07557 | null |
2025-03-10 | Junior Software Developers’ Perspectives on Adopting LLMs for Software Engineering: a Systematic Literature Review | Samuel Ferino et.al. | 2503.07556 | null |
2025-03-10 | KSOD: Knowledge Supplement for LLMs On Demand | Haoran Li et.al. | 2503.07550 | null |
2025-03-10 | Bi-Directional Mental Model Reconciliation for Human-Robot Interaction with Large Language Models | Nina Moorman et.al. | 2503.07547 | null |
2025-03-10 | Queueing, Predictions, and LLMs: Challenges and Open Problems | Michael Mitzenmacher et.al. | 2503.07545 | null |
2025-03-10 | XIFBench: Evaluating Large Language Models on Multilingual Instruction Following | Zhenyu Li et.al. | 2503.07539 | null |
2025-03-10 | Building English ASR model with regional language support | Purvi Agrawal et.al. | 2503.07522 | null |
2025-03-10 | GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Justus-Jonas Erker et.al. | 2503.07519 | link |
2025-03-10 | TokenButler: Token Importance is Predictable | Yash Akhauri et.al. | 2503.07518 | link |
2025-03-10 | Language Models Fail to Introspect About Their Knowledge of Language | Siyuan Song et.al. | 2503.07513 | link |
2025-03-10 | Plume: Scaffolding Text Composition in Dashboards | Maxim Lisnic et.al. | 2503.07512 | null |
2025-03-10 | Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations | Hari Shankar et.al. | 2503.07510 | link |
2025-03-10 | Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts | Shiu-hong Kao et.al. | 2503.07503 | null |
2025-03-10 | V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Guiwei Zhang et.al. | 2503.07493 | link |
2025-03-10 | LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? | Bangyan Li et.al. | 2503.07487 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link |
2025-03-10 | VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models | Jiacheng Ruan et.al. | 2503.07478 | link |
2025-03-07 | Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints | Parameswaran Kamalaruban et.al. | 2503.05684 | null |
2025-03-07 | Understanding the Limits of Lifelong Knowledge Editing in LLMs | Lukas Thede et.al. | 2503.05683 | null |
2025-03-07 | A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Yu Zhang et.al. | 2503.05659 | link |
2025-03-07 | Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings | Xuanqing Liu et.al. | 2503.05620 | null |
2025-03-07 | A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models | Dong Shu et.al. | 2503.05613 | null |
2025-03-07 | From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing | Prashant K. Jha et.al. | 2503.05598 | link |
2025-03-07 | R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning | Huatong Song et.al. | 2503.05592 | null |
2025-03-07 | Quantifying the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data | Shiping Yang et.al. | 2503.05587 | null |
2025-03-07 | Evaluating open-source Large Language Models for automated fact-checking | Nicolo’ Fontana et.al. | 2503.05565 | null |
2025-03-07 | Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | Bryan Etzine et.al. | 2503.05551 | null |
2025-03-07 | Leveraging Approximate Caching for Faster Retrieval-Augmented Generation | Shai Bergman et.al. | 2503.05530 | null |
2025-03-07 | PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs | Roberto Cerina et.al. | 2503.05529 | null |
2025-03-07 | Cognitive Bias Detection Using Advanced Prompt Engineering | Frederic Lemieux et.al. | 2503.05516 | null |
2025-03-07 | Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs? | Qingyuan Liang et.al. | 2503.05507 | null |
2025-03-07 | Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering | Yusong Ke et.al. | 2503.05505 | null |
2025-03-07 | Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders | Qijiong Liu et.al. | 2503.05493 | null |
2025-03-07 | Maximum Hallucination Standards for Domain-Specific Large Language Models | Tingmingke Lu et.al. | 2503.05481 | null |
2025-03-07 | The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence | Noah Mamie et.al. | 2503.05473 | null |
2025-03-07 | Soft Policy Optimization: Online Off-Policy RL for Sequence Models | Taco Cohen et.al. | 2503.05453 | null |
2025-03-07 | LLM-based Iterative Approach to Metamodeling in Automotive | Nenad Petrovic et.al. | 2503.05449 | null |
2025-03-06 | L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling | Zhuo Chen et.al. | 2503.04725 | link |
2025-03-06 | LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM | Sambal Shikhar et.al. | 2503.04724 | null |
2025-03-07 | Shifting Long-Context LLMs Research from Input to Output | Yuhao Wu et.al. | 2503.04723 | null |
2025-03-06 | Enough Coin Flips Can Make LLMs Act Bayesian | Ritwik Gupta et.al. | 2503.04722 | null |
2025-03-06 | Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Guan-Ting Lin et.al. | 2503.04721 | link |
2025-03-06 | Predictable Scale: Part I – Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Houyi Li et.al. | 2503.04715 | null |
2025-03-06 | Scaling Rich Style-Prompted Text-to-Speech Datasets | Anuj Diwan et.al. | 2503.04713 | link |
2025-03-06 | Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size | Alireza Behtash et.al. | 2503.04704 | null |
2025-03-06 | L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Pranjal Aggarwal et.al. | 2503.04697 | null |
2025-03-06 | UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets | Wenyu Wang et.al. | 2503.04693 | null |
2025-03-06 | Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases | Pengcheng Qiu et.al. | 2503.04691 | null |
2025-03-06 | LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue | Sangyeop Kim et.al. | 2503.04675 | null |
2025-03-06 | An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding | Dou Hu et.al. | 2503.04667 | link |
2025-03-06 | CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models | Shengzhuang Chen et.al. | 2503.04655 | link |
2025-03-06 | Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators | Blaine Quackenbush et.al. | 2503.04649 | link |
2025-03-06 | Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment | Wen Yang et.al. | 2503.04647 | null |
2025-03-06 | Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation | Aishik Konwer et.al. | 2503.04639 | null |
2025-03-06 | Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking | Yijie Xu et.al. | 2503.04636 | null |
2025-03-06 | Better Process Supervision with Bi-directional Rewarding Signals | Wenxiang Chen et.al. | 2503.04618 | null |
2025-03-06 | Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning | Mohammad Amin Ghanizadeh et.al. | 2503.04611 | null |
2025-03-05 | The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems | Richard Ren et.al. | 2503.03750 | null |
2025-03-05 | Process-based Self-Rewarding Language Models | Shimao Zhang et.al. | 2503.03746 | link |
2025-03-05 | CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning | Yuqi Zhou et.al. | 2503.03743 | link |
2025-03-05 | Towards Understanding Distilled Reasoning Models: A Representational Approach | David D. Baek et.al. | 2503.03730 | null |
2025-03-05 | Improving LLM Safety Alignment with Dual-Objective Optimization | Xuandong Zhao et.al. | 2503.03710 | link |
2025-03-05 | Effective LLM Knowledge Learning via Model Generalization | Mingkang Zhu et.al. | 2503.03705 | null |
2025-03-05 | A Practical Memory Injection Attack against LLM Agents | Shen Dong et.al. | 2503.03704 | null |
2025-03-05 | Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models | Jiyue Jiang et.al. | 2503.03702 | null |
2025-03-05 | Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks | Zihao Zhao et.al. | 2503.03687 | link |
2025-03-05 | Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models | Bar Karov et.al. | 2503.03669 | link |
2025-03-05 | Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction | Gustaw Opiełka et.al. | 2503.03666 | link |
2025-03-05 | Robust Learning of Diverse Code Edits | Tushar Aggarwal et.al. | 2503.03656 | null |
2025-03-05 | Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset | Jessica Hoffmann et.al. | 2503.03654 | null |
2025-03-05 | Token-Level Privacy in Large Language Models | Re’em Harel et.al. | 2503.03652 | null |
2025-03-05 | Psy-Copilot: Visual Chain of Thought for Counseling | Keqi Chen et.al. | 2503.03645 | null |
2025-03-05 | Large language models in finance: estimating financial sentiment for stock prediction | Kemal Kirtac et.al. | 2503.03612 | null |
2025-03-05 | Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering | Lingli Cao et.al. | 2503.03609 | link |
2025-03-05 | Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling | Keqi Chen et.al. | 2503.03607 | null |
2025-03-05 | Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders | Kristian Kuznetsov et.al. | 2503.03601 | null |
2025-03-05 | Small but Mighty: Enhancing Time Series Forecasting with Lightweight LLMs | Haoran Fan et.al. | 2503.03594 | link |
2025-03-04 | Wikipedia in the Era of LLMs: Evolution and Risks | Siming Huang et.al. | 2503.02879 | link |
2025-03-04 | Language Models can Self-Improve at State-Value Estimation for Better Search | Ethan Mendes et.al. | 2503.02878 | link |
2025-03-04 | SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models | Dmitry Nechaev et.al. | 2503.02876 | link |
2025-03-04 | The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models | Ke Ji et.al. | 2503.02875 | null |
2025-03-04 | Prompting Generative AI with Interaction-Augmented Instructions | Leixian Shen et.al. | 2503.02874 | null |
2025-03-04 | FairSense-AI: Responsible AI Meets Sustainability | Shaina Raza et.al. | 2503.02865 | null |
2025-03-04 | Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework | Ziang Zhou et.al. | 2503.02863 | null |
2025-03-04 | Privacy and Accuracy-Aware AI/ML Model Deduplication | Hong Guan et.al. | 2503.02862 | null |
2025-03-04 | (How) Do Language Models Track State? | Belinda Z. Li et.al. | 2503.02854 | null |
2025-03-04 | Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers | Zicong He et.al. | 2503.02851 | link |
2025-03-04 | Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs | Yuzhe Gu et.al. | 2503.02846 | link |
2025-03-04 | Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training | Paul Janson et.al. | 2503.02844 | null |
2025-03-04 | AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation | Songming Zhang et.al. | 2503.02832 | null |
2025-03-04 | Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging | Yujin Oh et.al. | 2503.02824 | null |
2025-03-04 | “What If Smart Homes Could See Our Homes?”: Exploring DIY Smart Home Building Experiences with VLM-Based Camera Sensors | Sojeong Yun et.al. | 2503.02816 | null |
2025-03-04 | Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression | Nathan Godey et.al. | 2503.02812 | link |
2025-03-04 | RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration | Alicia Russell-Gilbert et.al. | 2503.02800 | null |
2025-03-04 | Multimodal AI predicts clinical outcomes of drug combinations from preclinical data | Yepeng Huang et.al. | 2503.02781 | link |
2025-03-04 | Implicit Bias in LLMs: A Survey | Xinru Lin et.al. | 2503.02776 | null |
2025-03-04 | InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training | Dingdong Wang et.al. | 2503.02769 | null |
2025-02-28 | LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Komal Kumar et.al. | 2502.21321 | link |
2025-02-28 | Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos | Zhiyu Tan et.al. | 2502.21314 | null |
2025-02-28 | FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Yihong Dong et.al. | 2502.21309 | link |
2025-02-28 | Contextualizing biological perturbation experiments through language | Menghua Wu et.al. | 2502.21290 | link |
2025-02-28 | Adaptive Keyframe Sampling for Long Video Understanding | Xi Tang et.al. | 2502.21271 | null |
2025-03-03 | Foundation Models – A Panacea for Artificial Intelligence in Pathology? | Nita Mulliqi et.al. | 2502.21264 | null |
2025-02-28 | Modeling Human Beliefs about AI Behavior for Scalable Oversight | Leon Lang et.al. | 2502.21262 | null |
2025-02-28 | PET Image Denoising via Text-Guided Diffusion: Integrating Anatomical Priors through Text Prompts | Boxiao Yu et.al. | 2502.21260 | null |
2025-02-28 | RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Yuheng Ji et.al. | 2502.21257 | null |
2025-02-28 | TimesBERT: A BERT-Style Foundation Model for Time Series Understanding | Haoran Zhang et.al. | 2502.21245 | null |
2025-02-28 | Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Xiaomin Li et.al. | 2502.21239 | null |
2025-02-28 | Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication | Daniil Filienko et.al. | 2502.21236 | null |
2025-02-28 | ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs | Hao Ge et.al. | 2502.21231 | null |
2025-03-03 | ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer | Omer Goldman et.al. | 2502.21228 | null |
2025-02-28 | Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought | Jianhao Huang et.al. | 2502.21212 | null |
2025-02-28 | Chronologically Consistent Large Language Models | Songrun He et.al. | 2502.21206 | null |
2025-02-28 | $Δ$ -model correction of Foundation Model based on the models own understanding | Mads-Peter Verner Christiansen et.al. | 2502.21179 | null |
2025-03-03 | Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models | Ruta Binkyte et.al. | 2502.21123 | null |
2025-02-28 | Optimizing Large Language Models for ESG Activity Detection in Financial Texts | Mattia Birti et.al. | 2502.21112 | link |
2025-02-28 | Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization | Lie Meng Pang et.al. | 2502.21108 | null |
2025-02-27 | R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts | Zhongyang Li et.al. | 2502.20395 | link |
2025-02-27 | Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis | Jeffrey Yang Fan Chiang et.al. | 2502.20383 | null |
2025-02-27 | Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers | Shalev Lifshitz et.al. | 2502.20379 | null |
2025-02-27 | PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation | Albert Gong et.al. | 2502.20377 | link |
2025-02-27 | Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization | Ryan C. Barron et.al. | 2502.20364 | link |
2025-02-27 | Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs | Kuan Lok Zhou et.al. | 2502.20356 | null |
2025-02-27 | KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model | Kai Zhang et.al. | 2502.20350 | null |
2025-02-27 | Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models | Yi Jing et.al. | 2502.20344 | null |
2025-02-27 | Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Daniele Paliotta et.al. | 2502.20339 | null |
2025-02-27 | Expertise Is What We Want | Alan Ashworth et.al. | 2502.20335 | null |
2025-02-27 | Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models | Yukang Yang et.al. | 2502.20332 | null |
2025-02-27 | Long-Context Inference with Retrieval-Augmented Speculative Decoding | Guanzheng Chen et.al. | 2502.20330 | link |
2025-02-27 | LangProBe: a Language Programs Benchmark | Shangyin Tan et.al. | 2502.20315 | null |
2025-02-27 | EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants | Franck Cappello et.al. | 2502.20309 | link |
2025-02-27 | M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging | Jinghao Feng et.al. | 2502.20301 | null |
2025-02-27 | An exploration of features to improve the generalisability of fake news detection models | Nathaniel Hoy et.al. | 2502.20299 | null |
2025-02-27 | Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Benjamin Gutteridge et.al. | 2502.20295 | link |
2025-02-27 | Visual Adaptive Prompting for Compositional Zero-Shot Learning | Kyle Stein et.al. | 2502.20292 | null |
2025-02-27 | Conformal Tail Risk Control for Large Language Model Alignment | Catherine Yu-Chi Chen et.al. | 2502.20285 | null |
2025-02-27 | Evaluating Human Trust in LLM-Based Planners: A Preliminary Study | Shenghui Chen et.al. | 2502.20284 | null |
2025-02-26 | Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models | Lucy Xiaoyang Shi et.al. | 2502.19417 | null |
2025-02-26 | Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing | Akshat Gupta et.al. | 2502.19416 | null |
2025-02-26 | Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation | Shiven Sinha et.al. | 2502.19414 | link |
2025-02-26 | Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs | Christoph Schuhmann et.al. | 2502.19413 | null |
2025-02-26 | Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs | Dayu Yang et.al. | 2502.19411 | link |
2025-02-26 | Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices | Xinru Wang et.al. | 2502.19410 | null |
2025-02-26 | ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models | Danae Sánchez Villegas et.al. | 2502.19409 | null |
2025-02-26 | Learning Code-Edit Embedding to Model Student Debugging Behavior | Hasnain Heickal et.al. | 2502.19407 | null |
2025-02-26 | General Reasoning Requires Learning to Reason from the Get-go | Seungwook Han et.al. | 2502.19402 | null |
2025-02-26 | TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding | Max Ku et.al. | 2502.19400 | null |
2025-02-26 | LiDAR Registration with Visual Foundation Models | Niclas Vödisch et.al. | 2502.19374 | null |
2025-02-26 | Deep Learning For Time Series Analysis With Application On Human Motion | Ali Ismail-Fawaz et.al. | 2502.19364 | null |
2025-02-26 | DataMan: Data Manager for Pre-training Large Language Models | Ru Peng et.al. | 2502.19363 | null |
2025-02-26 | Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? | Yancheng He et.al. | 2502.19361 | link |
2025-02-26 | Controlled Diversity: Length-optimized Natural Language Generation | Diana Marie Schenke et.al. | 2502.19347 | null |
2025-02-26 | Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets | Tohida Rehman et.al. | 2502.19339 | null |
2025-02-26 | I Know What I Don’t Know: Improving Model Cascades Through Confidence Tuning | Stephan Rabanser et.al. | 2502.19335 | null |
2025-02-26 | Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems | Hao Peng et.al. | 2502.19328 | link |
2025-02-26 | Shh, don’t say that! Domain Certification in LLMs | Cornelius Emde et.al. | 2502.19320 | null |
2025-02-26 | Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond | Qizhou Wang et.al. | 2502.19301 | null |
2025-02-25 | DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers | Xueguang Ma et.al. | 2502.18460 | link |
2025-02-25 | LLM-Based Design Pattern Detection | Christian Schindler et.al. | 2502.18458 | null |
2025-02-25 | Evaluating the Effectiveness of Small Language Models in Detecting Refactoring Bugs | Rohit Gheyi et.al. | 2502.18454 | null |
2025-02-25 | FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response | Mollie Shichman et.al. | 2502.18452 | null |
2025-02-25 | SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution | Yuxiang Wei et.al. | 2502.18449 | null |
2025-02-25 | olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models | Jake Poznanski et.al. | 2502.18443 | link |
2025-02-25 | MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning | Chanwoo Park et.al. | 2502.18439 | null |
2025-02-25 | Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions | Yizhe Zhang et.al. | 2502.18435 | null |
2025-02-25 | Exploring Gender Disparities in Automatic Speech Recognition Technology | Hend ElGhazaly et.al. | 2502.18434 | null |
2025-02-25 | TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Frederikus Hudi et.al. | 2502.18431 | link |
2025-02-25 | PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback | Nils Wandel et.al. | 2502.18425 | null |
2025-02-25 | Compressing Language Models for Specialized Domains | Miles Williams et.al. | 2502.18424 | null |
2025-02-25 | Rank1: Test-Time Compute for Reranking in Information Retrieval | Orion Weller et.al. | 2502.18418 | link |
2025-02-25 | OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference | Xiangyu Zhao et.al. | 2502.18411 | link |
2025-02-25 | Enhancing DNA Foundation Models to Address Masking Inefficiencies | Monireh Safari et.al. | 2502.18405 | null |
2025-02-25 | Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods | Nicola Cecere et.al. | 2502.18389 | null |
2025-02-25 | How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities | Minhua Lin et.al. | 2502.18387 | null |
2025-02-25 | MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning | Sepehr Asgarian et.al. | 2502.18371 | null |
2025-02-25 | Responsible AI Agents | Deven R. Desai et.al. | 2502.18359 | null |
2025-02-25 | Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation | Jessica He et.al. | 2502.18357 | null |
2025-02-24 | Introducing Visual Perception Token into Multimodal Large Language Model | Runpeng Yu et.al. | 2502.17425 | link |
2025-02-24 | MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs | Jiarui Zhang et.al. | 2502.17422 | link |
2025-02-24 | LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Penghui Yang et.al. | 2502.17421 | link |
2025-02-24 | The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence | Tom Wollschläger et.al. | 2502.17420 | null |
2025-02-24 | From System 1 to System 2: A Survey of Reasoning Large Language Models | Zhong-Zhi Li et.al. | 2502.17419 | link |
2025-02-24 | Reasoning with Latent Thoughts: On the Power of Looped Transformers | Nikunj Saunshi et.al. | 2502.17416 | null |
2025-02-24 | COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs | Liming Liu et.al. | 2502.17410 | link |
2025-02-24 | Large Language Models are Powerful EHR Encoders | Stefan Hegselmann et.al. | 2502.17403 | link |
2025-02-24 | Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models | Alon Albalak et.al. | 2502.17387 | link |
2025-02-24 | Bridging Gaps in Natural Language Processing for Yorùbá: A Systematic Review of a Decade of Progress and Prospects | Toheeb A. Jimoh et.al. | 2502.17364 | null |
2025-02-24 | A Closer Look at TabPFN v2: Strength, Limitation, and Extension | Han-Jia Ye et.al. | 2502.17361 | null |
2025-02-24 | RELICT: A Replica Detection Framework for Medical Image Generation | Orhun Utku Aydin et.al. | 2502.17360 | link |
2025-02-24 | DIS-CO: Discovering Copyrighted Content in VLMs Training Data | André V. Duarte et.al. | 2502.17358 | link |
2025-02-24 | Distributional Scaling Laws for Emergent Capabilities | Rosie Zhao et.al. | 2502.17356 | null |
2025-02-24 | On Relation-Specific Neurons in Large Language Models | Yihong Liu et.al. | 2502.17355 | link |
2025-02-24 | How Scientists Use Large Language Models to Program | Gabrielle O’Brien et.al. | 2502.17348 | null |
2025-02-24 | Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators | João Pedro Matos-Carvalho et.al. | 2502.17341 | null |
2025-02-24 | Tokenized SAEs: Disentangling SAE Reconstructions | Thomas Dooms et.al. | 2502.17332 | null |
2025-02-24 | HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization | Zhenghao Liu et.al. | 2502.17315 | link |
2025-02-24 | `Generalization is hallucination’ through the lens of tensor completions | Liang Ze Wong et.al. | 2502.17305 | null |
2025-02-21 | ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval | Guanqi Zhan et.al. | 2502.15682 | null |
2025-02-21 | Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Jaydeep Borkar et.al. | 2502.15680 | link |
2025-02-21 | BOSS: Benchmark for Observation Space Shift in Long-Horizon Task | Yue Yang et.al. | 2502.15679 | null |
2025-02-21 | Testing the limits of fine-tuning to improve reasoning in vision language models | Luca M. Schulze Buschoff et.al. | 2502.15678 | null |
2025-02-21 | FLEKE: Federated Locate-then-Edit Knowledge Editing | Zongkai Zhao et.al. | 2502.15677 | link |
2025-02-21 | AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Zhining Zhang et.al. | 2502.15676 | link |
2025-02-21 | Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing | Shoumik Saha et.al. | 2502.15666 | link |
2025-02-21 | Machine-generated text detection prevents language model collapse | George Drayson et.al. | 2502.15654 | link |
2025-02-21 | Empowering LLMs with Logical Reasoning: A Comprehensive Survey | Fengxiang Cheng et.al. | 2502.15652 | null |
2025-02-21 | Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models | Anirudh Sundar et.al. | 2502.15639 | null |
2025-02-21 | Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification | Vasilii Feofanov et.al. | 2502.15637 | link |
2025-02-21 | The Relationship Between Reasoning and Performance in Large Language Models – o3 (mini) Thinks Harder, Not Longer | Marthe Ballon et.al. | 2502.15631 | link |
2025-02-21 | Extraction multi-étiquettes de relations en utilisant des couches de Transformer | Ngoc Luyen Le et.al. | 2502.15619 | null |
2025-02-21 | Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing | Qi Le et.al. | 2502.15618 | link |
2025-02-21 | PDeepPP:A Deep learning framework with Pretrained Protein language for peptide classification | Jixiu Zhai et.al. | 2502.15610 | link |
2025-02-21 | On the Robustness of Transformers against Context Hijacking for Linear Classification | Tianle Li et.al. | 2502.15609 | null |
2025-02-21 | Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance | Akos Nagy et.al. | 2502.15604 | null |
2025-02-21 | Do Multilingual LLMs Think In English? | Lisa Schut et.al. | 2502.15603 | null |
2025-02-21 | WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents | Xinhang Liu et.al. | 2502.15601 | null |
2025-02-21 | SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention | Jiaqi Wu et.al. | 2502.15594 | null |
2025-02-20 | LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang et.al. | 2502.14866 | link |
2025-02-20 | Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Shuyue Stella Li et.al. | 2502.14860 | link |
2025-02-20 | FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling | Weilin Zhao et.al. | 2502.14856 | null |
2025-02-20 | Prompt-to-Leaderboard | Evan Frick et.al. | 2502.14855 | link |
2025-02-20 | GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Jianwen Luo et.al. | 2502.14848 | link |
2025-02-20 | Red-Teaming LLM Multi-Agent Systems via Communication Attacks | Pengfei He et.al. | 2502.14847 | null |
2025-02-20 | Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation | Yue Yang et.al. | 2502.14846 | null |
2025-02-20 | Revealing and Mitigating Over-Attention in Knowledge Editing | Pinzheng Wang et.al. | 2502.14838 | link |
2025-02-20 | LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models | Shangqing Tu et.al. | 2502.14834 | link |
2025-02-20 | Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs | Danni Liu et.al. | 2502.14830 | link |
2025-02-20 | Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps | Martin Tutek et.al. | 2502.14829 | link |
2025-02-20 | Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison | Aiswarya Baby et.al. | 2502.14827 | null |
2025-02-20 | A Survey of Model Architectures in Information Retrieval | Zhichao Xu et.al. | 2502.14822 | null |
2025-02-20 | eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables | Luis Antonio Gutiérrez Guanilo et.al. | 2502.14820 | null |
2025-02-20 | Dynamic Low-Rank Sparse Adaptation for Large Language Models | Weizhong Huang et.al. | 2502.14816 | link |
2025-02-20 | FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis | Fadillah Maani et.al. | 2502.14807 | link |
2025-02-20 | From RAG to Memory: Non-Parametric Continual Learning for Large Language Models | Bernal Jiménez Gutiérrez et.al. | 2502.14802 | link |
2025-02-20 | A Multi-Agent Perspective on Modern Information Retrieval | Haya Nachimovsky et.al. | 2502.14796 | null |
2025-02-20 | Rapid Word Learning Through Meta In-Context Learning | Wentao Wang et.al. | 2502.14791 | null |
2025-02-20 | SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Michael Tschannen et.al. | 2502.14786 | link |
2025-02-19 | Where’s the Bug? Attention Probing for Scalable Fault Localization | Adam Stein et.al. | 2502.13966 | null |
2025-02-19 | Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Michael Luo et.al. | 2502.13965 | null |
2025-02-19 | MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads | Weihao Liu et.al. | 2502.13963 | link |
2025-02-19 | Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering | William Jurayj et.al. | 2502.13962 | null |
2025-02-19 | LIDDIA: Language-based Intelligent Drug Discovery Agent | Reza Averly et.al. | 2502.13959 | null |
2025-02-19 | Neurosymbolic artificial intelligence via large language models and coherence-driven inference | Steve Huntsman et.al. | 2502.13953 | null |
2025-02-19 | Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region | Chak Tou Leong et.al. | 2502.13946 | null |
2025-02-19 | A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models | Hao Huang et.al. | 2502.13942 | null |
2025-02-19 | Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images | Shengguang Wu et.al. | 2502.13928 | null |
2025-02-19 | Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences? | Xiaochen Wang et.al. | 2502.13925 | null |
2025-02-19 | LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization | Guanzheng Chen et.al. | 2502.13922 | link |
2025-02-19 | Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis | Jiahao Gai et.al. | 2502.13921 | null |
2025-02-19 | Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health | Xingbo Wang et.al. | 2502.13920 | link |
2025-02-19 | TESS 2: A Large-Scale Generalist Diffusion Language Model | Jaesung Tae et.al. | 2502.13917 | link |
2025-02-19 | How Do LLMs Perform Two-Hop Reasoning in Context? | Tianyu Guo et.al. | 2502.13913 | null |
2025-02-19 | Lost in Sequence: Do Large Language Models Understand Sequential Recommendation? | Sein Kim et.al. | 2502.13909 | link |
2025-02-19 | Judging the Judges: A Collection of LLM-Generated Relevance Judgements | Hossein A. Rahmani et.al. | 2502.13908 | link |
2025-02-19 | DataSciBench: An LLM Agent Benchmark for Data Science | Dan Zhang et.al. | 2502.13897 | link |
2025-02-19 | NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants | Yiran Qin et.al. | 2502.13894 | null |
2025-02-19 | Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models | Matthew P. Wilson et.al. | 2502.13886 | link |
2025-02-18 | Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization | Shuo Xing et.al. | 2502.13146 | link |
2025-02-18 | Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation | Bencheng Liao et.al. | 2502.13145 | link |
2025-02-18 | Pre-training Auto-regressive Robotic Models with 4D Representations | Dantong Niu et.al. | 2502.13142 | null |
2025-02-18 | UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models | Huawei Lin et.al. | 2502.13141 | link |
2025-02-18 | AIDE: AI-Driven Exploration in the Space of Code | Zhengyao Jiang et.al. | 2502.13138 | link |
2025-02-18 | Theorem Prover as a Judge for Synthetic Data Generation | Joshua Ong Jun Leang et.al. | 2502.13137 | null |
2025-02-18 | Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions | Taedong Yun et.al. | 2502.13135 | null |
2025-02-18 | Learning to Defer for Causal Discovery with Imperfect Experts | Oscar Clivio et.al. | 2502.13132 | null |
2025-02-18 | Rethinking Diverse Human Preference Learning through Principal Component Analysis | Feng Luo et.al. | 2502.13131 | null |
2025-02-18 | Magma: A Foundation Model for Multimodal AI Agents | Jianwei Yang et.al. | 2502.13130 | link |
2025-02-18 | Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Jingyang Lin et.al. | 2502.13127 | null |
2025-02-18 | RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises | Zenan Zhai et.al. | 2502.13125 | link |
2025-02-18 | Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context | Marion Bartl et.al. | 2502.13120 | null |
2025-02-18 | STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | Narun Raman et.al. | 2502.13119 | null |
2025-02-18 | Performance Evaluation of Large Language Models in Statistical Programming | Xinyi Song et.al. | 2502.13117 | link |
2025-02-18 | MatterChat: A Multi-Modal LLM for Material Science | Yingheng Tang et.al. | 2502.13107 | null |
2025-02-18 | Understanding and Rectifying Safety Perception Distortion in VLMs | Xiaohan Zou et.al. | 2502.13095 | null |
2025-02-18 | Text2World: Benchmarking Large Language Models for Symbolic World Model Generation | Mengkang Hu et.al. | 2502.13092 | null |
2025-02-18 | KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits | Xin Xia et.al. | 2502.13076 | null |
2025-02-18 | Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity | Yuri Kuratov et.al. | 2502.13063 | link |
2025-02-17 | Idiosyncrasies in Large Language Models | Mingjie Sun et.al. | 2502.12150 | link |
2025-02-17 | HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation | Ling Yang et.al. | 2502.12148 | link |
2025-02-17 | Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control | Jinyan Su et.al. | 2502.12145 | link |
2025-02-17 | Small Models Struggle to Learn from Strong Reasoners | Yuetai Li et.al. | 2502.12143 | null |
2025-02-17 | SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs | Yige Xu et.al. | 2502.12134 | link |
2025-02-17 | Transformer Dynamics: A neuroscientific approach to interpretability of large language models | Jesseba Fernando et.al. | 2502.12131 | null |
2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | null |
2025-02-17 | On the Query Complexity of Verifier-Assisted Language Generation | Edoardo Botta et.al. | 2502.12123 | null |
2025-02-17 | Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA | Patryk Marszałek et.al. | 2502.12122 | link |
2025-02-17 | LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws | Prasanna Mayilvahanan et.al. | 2502.12120 | null |
2025-02-17 | PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection | Jinhe Bi et.al. | 2502.12119 | null |
2025-02-17 | A-MEM: Agentic Memory for LLM Agents | Wujiang Xu et.al. | 2502.12110 | link |
2025-02-17 | Personality Structured Interview for Large Language Model Simulation in Personality Research | Pengda Wang et.al. | 2502.12109 | null |
2025-02-17 | Relational Norms for Human-AI Cooperation | Brian D. Earp et.al. | 2502.12102 | null |
2025-02-17 | Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications | Li Qiao et.al. | 2502.12096 | null |
2025-02-17 | Descriminative-Generative Custom Tokens for Vision-Language Models | Pramuditha Perera et.al. | 2502.12095 | null |
2025-02-17 | Meta-Statistical Learning: Supervised Learning of Statistical Inference | Maxime Peyrard et.al. | 2502.12088 | null |
2025-02-17 | APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs | Yuxiang Huang et.al. | 2502.12085 | link |
2025-02-17 | VLM $^2$ -Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues | Jianshu Zhang et.al. | 2502.12084 | null |
2025-02-17 | AdaSplash: Adaptive Sparse Flash Attention | Nuno Gonçalves et.al. | 2502.12082 | link |
2025-02-14 | MM-RLHF: The Next Step Forward in Multimodal LLM Alignment | Yi-Fan Zhang et.al. | 2502.10391 | null |
2025-02-14 | Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction | WonJin Yoon et.al. | 2502.10388 | null |
2025-02-14 | Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models | Jiexin Ding et.al. | 2502.10378 | null |
2025-02-14 | Robustness tests for biomedical foundation models should tailor to specification | R. Patrick Xian et.al. | 2502.10374 | link |
2025-02-14 | Enhancing Multilingual LLM Pretraining with Model-Based Data Selection | Bettina Messmer et.al. | 2502.10361 | null |
2025-02-14 | Organize the Web: Constructing Domains Enhances Pre-Training Data Curation | Alexander Wettig et.al. | 2502.10341 | null |
2025-02-14 | Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering | Nick Ferguson et.al. | 2502.10338 | null |
2025-02-14 | LLM-Powered Preference Elicitation in Combinatorial Assignment | Ermis Soumalias et.al. | 2502.10308 | null |
2025-02-14 | SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models | Aditya Mishra et.al. | 2502.10307 | null |
2025-02-14 | Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2 | Saem Hasan et.al. | 2502.10299 | null |
2025-02-14 | DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders | Julien Siems et.al. | 2502.10297 | link |
2025-02-14 | Probing Perceptual Constancy in Large Vision Language Models | Haoran Sun et.al. | 2502.10273 | null |
2025-02-14 | Are Large Language Models the future crowd workers of Linguistics? | Iris Ferrazzo et.al. | 2502.10266 | null |
2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | link |
2025-02-14 | VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models | Gokul Karthik Kumar et.al. | 2502.10250 | null |
2025-02-14 | Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Guoqing Ma et.al. | 2502.10248 | link |
2025-02-14 | Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices | Mohamed Aboelenien Ahmed et.al. | 2502.10239 | null |
2025-02-14 | AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting | Abdelhakim Benechehab et.al. | 2502.10235 | link |
2025-02-14 | Do Large Language Models Reason Causally Like Us? Even Better? | Hanna M. Dettki et.al. | 2502.10215 | null |
2025-02-14 | Can Post-Training Quantization Benefit from an Additional QLoRA Integration? | Xiliang Zhu et.al. | 2502.10202 | null |
2025-02-13 | Theoretical Benefit and Limitation of Diffusion Language Model | Guhao Feng et.al. | 2502.09622 | null |
2025-02-13 | MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency | Dongzhi Jiang et.al. | 2502.09621 | null |
2025-02-13 | Exploring the Potential of Encoder-free Architectures in 3D LMMs | Yiwen Tang et.al. | 2502.09620 | link |
2025-02-13 | Human-LLM Coevolution: Evidence from Academic Writing | Mingmeng Geng et.al. | 2502.09606 | null |
2025-02-13 | SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models | Yung-Sung Chuang et.al. | 2502.09604 | link |
2025-02-13 | GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis | Angelos Zavras et.al. | 2502.09598 | link |
2025-02-13 | Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Siyan Zhao et.al. | 2502.09597 | link |
2025-02-13 | KIMAs: A Configurable Knowledge Integrated Multi-Agent System | Zitao Li et.al. | 2502.09596 | null |
2025-02-13 | Logical forms complement probability in understanding language model (and human) performance | Yixuan Wang et.al. | 2502.09589 | null |
2025-02-13 | Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks | Qian Wan et.al. | 2502.09577 | null |
2025-02-13 | MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing | Vlad Andrei Negru et.al. | 2502.09567 | null |
2025-02-13 | Zero-shot generation of synthetic neurosurgical data with large language models | Austin A. Barr et.al. | 2502.09566 | link |
2025-02-13 | MDCrow: Automating Molecular Dynamics Workflows with Large Language Models | Quintina Campbell et.al. | 2502.09565 | link |
2025-02-13 | EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents | Rui Yang et.al. | 2502.09560 | null |
2025-02-13 | Explainable AI-assisted Optimization for Feynman Integral Reduction | Zhuo-Yang Song et.al. | 2502.09544 | null |
2025-02-13 | Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages | Shreyan Biswas et.al. | 2502.09532 | null |
2025-02-13 | When and How Does CLIP Enable Domain and Compositional Generalization? | Elias Kempf et.al. | 2502.09507 | null |
2025-02-13 | Improve LLM-based Automatic Essay Scoring with Linguistic Features | Zhaoyi Joey Hou et.al. | 2502.09497 | null |
2025-02-13 | Foundation Neural-Network Quantum States | Riccardo Rende et.al. | 2502.09488 | null |
2025-02-13 | Objective quantification of mood states using large language models | Jakub Onysk et.al. | 2502.09487 | null |
2025-02-12 | SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation | Ellie Arar et.al. | 2502.08642 | null |
2025-02-12 | Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Andrianos Michail et.al. | 2502.08638 | null |
2025-02-12 | Ensemble based approach to quantifying uncertainty of LLM based classifications | Srijith Rajamohan et.al. | 2502.08631 | null |
2025-02-12 | Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model | Saurabh Kataria et.al. | 2502.08612 | null |
2025-02-12 | Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | Vishwanath Pratap Singh et.al. | 2502.08587 | null |
2025-02-12 | Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks | Ang Li et.al. | 2502.08586 | null |
2025-02-12 | COAST: Intelligent Time-Adaptive Neural Operators | Zhikai Wu et.al. | 2502.08574 | null |
2025-02-12 | QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval | Wonduk Seo et.al. | 2502.08557 | null |
2025-02-12 | Human-Centric Foundation Models: Perception, Generation and Agentic Modeling | Shixiang Tang et.al. | 2502.08556 | link |
2025-02-12 | Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies | Sunnie S. Y. Kim et.al. | 2502.08554 | null |
2025-02-12 | LLMs can implicitly learn from mistakes in-context | Lisa Alazraki et.al. | 2502.08550 | null |
2025-02-12 | Representation Learning to Advance Multi-institutional Studies with Electronic Health Record Data | Doudou Zhou et.al. | 2502.08547 | null |
2025-02-12 | Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval | Kevin Flanagan et.al. | 2502.08544 | link |
2025-02-12 | LLM Pretraining with Continuous Concepts | Jihoon Tack et.al. | 2502.08524 | null |
2025-02-12 | The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data | Evgenii Evstafev et.al. | 2502.08515 | null |
2025-02-12 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | link |
2025-02-12 | Measuring Diversity in Synthetic Datasets | Yuchang Zhu et.al. | 2502.08512 | link |
2025-02-12 | Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction | Wei Li et.al. | 2502.08507 | link |
2025-02-12 | Salamandra Technical Report | Aitor Gonzalez-Agirre et.al. | 2502.08489 | link |
2025-02-12 | One-Shot Federated Learning with Classifier-Free Diffusion Models | Obaidullah Zaland et.al. | 2502.08488 | null |
2025-02-11 | DarwinLM: Evolutionary Structured Pruning of Large Language Models | Shengkun Tang et.al. | 2502.07780 | link |
2025-02-11 | Auditing Prompt Caching in Language Model APIs | Chenchen Gu et.al. | 2502.07776 | link |
2025-02-11 | Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming | Azizjon Kobilov et.al. | 2502.07772 | null |
2025-02-11 | Breaking Down Bias: On The Limits of Generalizable Pruning Strategies | Sibo Ma et.al. | 2502.07771 | null |
2025-02-11 | Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers | Italo Santos et.al. | 2502.07763 | null |
2025-02-11 | Scalable Fingerprinting of Large Language Models | Anshul Nasery et.al. | 2502.07760 | null |
2025-02-11 | Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension | Wenbo Gong et.al. | 2502.07752 | null |
2025-02-11 | WHODUNIT: Evaluation benchmark for culprit detection in mystery stories | Kshitij Gupta et.al. | 2502.07747 | link |
2025-02-11 | The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing | Dirk Bergemann et.al. | 2502.07736 | null |
2025-02-11 | Economics of Sourcing Human Data | Sebastin Santy et.al. | 2502.07732 | null |
2025-02-11 | Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK | Marcos Cramer et.al. | 2502.07728 | null |
2025-02-11 | Making Language Models Robust Against Negation | MohammadHossein Rezaei et.al. | 2502.07717 | link |
2025-02-11 | Magic 1-For-1: Generating One Minute Video Clips within One Minute | Hongwei Yi et.al. | 2502.07701 | link |
2025-02-11 | A Framework for LLM-powered Design Assistants | Swaroop Panda et.al. | 2502.07698 | null |
2025-02-11 | Large Language Models as Proxies for Theories of Human Linguistic Cognition | Imry Ziv et.al. | 2502.07687 | null |
2025-02-11 | SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models | Shihao Xia et.al. | 2502.07644 | null |
2025-02-11 | FoQA: A Faroese Question-Answering Dataset | Annika Simonsen et.al. | 2502.07642 | null |
2025-02-11 | Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Yong Lin et.al. | 2502.07640 | link |
2025-02-11 | Exploring Mobile Touch Interaction with Large Language Models | Tim Zindulka et.al. | 2502.07629 | null |
2025-02-11 | Scaling Pre-training to One Hundred Billion Data for Vision Language Models | Xiao Wang et.al. | 2502.07617 | null |
2025-02-10 | EVEv2: Improved Baselines for Encoder-Free Vision-Language Models | Haiwen Diao et.al. | 2502.06788 | link |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787 | null |
2025-02-10 | DeepCrossAttention: Supercharging Transformer Residual Connections | Mike Heddes et.al. | 2502.06785 | null |
2025-02-10 | Towards Internet-Scale Training For Agents | Brandon Trabucco et.al. | 2502.06776 | null |
2025-02-10 | Enhancing Trust in Language Model-Based Code Optimization through RLHF: A Research Design | Jingzhi Gong et.al. | 2502.06769 | null |
2025-02-10 | Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Ryan Synk et.al. | 2502.06766 | link |
2025-02-10 | Rationalization Models for Text-to-SQL | Gaetano Rossiello et.al. | 2502.06759 | null |
2025-02-10 | Accelerating Data Processing and Benchmarking of AI Models for Pathology | Andrew Zhang et.al. | 2502.06750 | link |
2025-02-10 | Gradient Multi-Normalization for Stateless and Scalable LLM Training | Meyer Scetbon et.al. | 2502.06742 | null |
2025-02-10 | VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data | Thomas Zeng et.al. | 2502.06737 | null |
2025-02-10 | Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining | Daouda Sow et.al. | 2502.06733 | null |
2025-02-10 | Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling | Runze Liu et.al. | 2502.06703 | link |
2025-02-10 | EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks | Michael Arbel et.al. | 2502.06684 | null |
2025-02-10 | Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations | Rui Chen et.al. | 2502.06669 | null |
2025-02-10 | Automatic Evaluation of Healthcare LLMs Beyond Question-Answering | Anna Arias-Duart et.al. | 2502.06666 | null |
2025-02-10 | Evaluation of Deep Audio Representations for Hearables | Fabian Gröger et.al. | 2502.06664 | null |
2025-02-10 | EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models | Xingrun Xing et.al. | 2502.06663 | null |
2025-02-10 | Unbiased Evaluation of Large Language Models from a Causal Perspective | Meilin Chen et.al. | 2502.06655 | null |
2025-02-10 | In-Context Learning (and Unlearning) of Length Biases | Stephanie Schoch et.al. | 2502.06653 | null |
2025-02-10 | Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A | Anna Leschanowsky et.al. | 2502.06652 | null |
2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Yunhang Shen et.al. | 2502.05177 | link |
2025-02-07 | Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach | Jonas Geiping et.al. | 2502.05171 | link |
2025-02-07 | NoLiMa: Long-Context Evaluation Beyond Literal Matching | Ali Modarressi et.al. | 2502.05167 | link |
2025-02-07 | Multitwine: Multi-Object Compositing with Text and Layout Control | Gemma Canet Tarrés et.al. | 2502.05165 | null |
2025-02-07 | DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails | Yihe Deng et.al. | 2502.05163 | link |
2025-02-07 | A Lightweight Method to Disrupt Memorized Sequences in LLM | Parjanya Prajakta Prashant et.al. | 2502.05159 | null |
2025-02-07 | Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation | Steffen Eger et.al. | 2502.05151 | link |
2025-02-07 | CodeSCM: Causal Analysis for Multi-Modal Code Generation | Mukur Gupta et.al. | 2502.05150 | link |
2025-02-07 | An Annotated Reading of ‘The Singer of Tales’ in the LLM Era | Kush R. Varshney et.al. | 2502.05148 | null |
2025-02-07 | Chest X-ray Foundation Model with Global and Local Representations Integration | Zefan Yang et.al. | 2502.05142 | link |
2025-02-07 | Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning | Matt von Hippel et.al. | 2502.05121 | null |
2025-02-07 | Flexible and Efficient Grammar-Constrained Decoding | Kanghee Park et.al. | 2502.05111 | null |
2025-02-07 | Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs | Rohit Saxena et.al. | 2502.05092 | null |
2025-02-07 | DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions | Gorkem Can Ates et.al. | 2502.05091 | null |
2025-02-07 | Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs | Thierry Bossy et.al. | 2502.05087 | link |
2025-02-07 | Causality can systematically address the monsters under the bench(marks) | Felix Leeb et.al. | 2502.05085 | null |
2025-02-07 | ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework | Xiaoyu Deng et.al. | 2502.05084 | null |
2025-02-07 | Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures | Tushar Pandey et.al. | 2502.05078 | link |
2025-02-07 | nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow | Geliang Ouyang et.al. | 2502.05036 | link |
2025-02-07 | EnseSmells: Deep ensemble and programming language models for automated code smells detection | Anh Ho et.al. | 2502.05012 | link |
2025-02-06 | Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment | Zuyan Liu et.al. | 2502.04328 | link |
2025-02-06 | Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions | Yik Siu Chan et.al. | 2502.04322 | link |
2025-02-06 | ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features | Alec Helbling et.al. | 2502.04320 | link |
2025-02-06 | sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | Eyvaz Najafli et.al. | 2502.04318 | null |
2025-02-06 | ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Kamer Ali Yuksel et.al. | 2502.04315 | link |
2025-02-06 | Great Models Think Alike and this Undermines AI Oversight | Shashwat Goel et.al. | 2502.04313 | link |
2025-02-06 | ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Yinjie Wang et.al. | 2502.04306 | link |
2025-02-06 | Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization | Yuanye Liu et.al. | 2502.04295 | link |
2025-02-06 | PILAF: Optimal Human Preference Sampling for Reward Modeling | Yunzhen Feng et.al. | 2502.04270 | null |
2025-02-06 | How does a Multilingual LM Handle Multiple Languages? | Santhosh Kakarla et.al. | 2502.04269 | null |
2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | link |
2025-02-06 | Efficient Randomized Experiments Using Foundation Models | Piersilvio De Bartolomeis et.al. | 2502.04262 | link |
2025-02-06 | MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion | Xintong Hao et.al. | 2502.04235 | null |
2025-02-06 | Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks | Andreas Happe et.al. | 2502.04227 | null |
2025-02-06 | Keep It Light! Simplifying Image Clustering Via Text-Free Adapters | Yicen Li et.al. | 2502.04226 | null |
2025-02-06 | Éclair – Extracting Content and Layout with Integrated Reading Order for Documents | Ilia Karmanov et.al. | 2502.04223 | null |
2025-02-06 | Sports and Women’s Sports: Gender Bias in Text Generation with Olympic Data | Laura Biester et.al. | 2502.04218 | null |
2025-02-06 | Algorithmic causal structure emerging through compression | Liang Wendong et.al. | 2502.04210 | null |
2025-02-06 | “Short-length” Adversarial Training Helps LLMs Defend “Long-length” Jailbreak Attacks: Theoretical and Empirical Evidence | Shaopeng Fu et.al. | 2502.04204 | link |
2025-02-06 | The Best Instruction-Tuning Data are Those That Fit | Dylan Zhang et.al. | 2502.04194 | null |
2025-02-05 | Do Large Language Model Benchmarks Test Reliability? | Joshua Vendrow et.al. | 2502.03461 | link |
2025-02-05 | Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Boyao Wang et.al. | 2502.03460 | null |
2025-02-05 | SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living | Arkaprava Sinha et.al. | 2502.03459 | null |
2025-02-05 | A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Yiye Chen et.al. | 2502.03450 | null |
2025-02-05 | BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving | Ran Xin et.al. | 2502.03438 | null |
2025-02-05 | On Fairness of Unified Multimodal Large Language Model for Image Generation | Ming Liu et.al. | 2502.03429 | null |
2025-02-05 | Harnessing Large Language Models for Curated Code Reviews | Oussama Ben Sghaier et.al. | 2502.03425 | link |
2025-02-05 | Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts | Nikta Gohari Sadr et.al. | 2502.03418 | null |
2025-02-05 | SPRI: Aligning Large Language Models with Context-Situated Principles | Hongli Zhan et.al. | 2502.03397 | null |
2025-02-05 | Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications | Issar Arab et.al. | 2502.03395 | null |
2025-02-05 | LIMO: Less is More for Reasoning | Yixin Ye et.al. | 2502.03387 | link |
2025-02-05 | Transformers and Their Roles as Time Series Foundation Models | Dennis Wu et.al. | 2502.03383 | null |
2025-02-05 | High-Fidelity Simultaneous Speech-To-Speech Translation | Tom Labiausse et.al. | 2502.03382 | link |
2025-02-05 | Demystifying Long Chain-of-Thought Reasoning in LLMs | Edward Yeo et.al. | 2502.03373 | link |
2025-02-05 | PalimpChat: Declarative and Interactive AI analytics | Chunwei Liu et.al. | 2502.03368 | null |
2025-02-05 | Minerva: A Programmable Memory Test Benchmark for Language Models | Menglin Xia et.al. | 2502.03358 | null |
2025-02-05 | RadVLM: A Multitask Conversational Vision-Language Model for Radiology | Nicolas Deperrois et.al. | 2502.03333 | null |
2025-02-05 | ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model | Qiguang Chen et.al. | 2502.03325 | null |
2025-02-05 | Out-of-Distribution Detection using Synthetic Data Generation | Momin Abbas et.al. | 2502.03323 | null |
2025-02-05 | Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques | Sangjun Han et.al. | 2502.03321 | null |
2025-02-04 | Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling | Xiaowen Qiu et.al. | 2502.02590 | null |
2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589 | null |
2025-02-04 | A comparison of translation performance between DeepL and Supertext | Alex Flückiger et.al. | 2502.02577 | link |
2025-02-04 | Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement | Soheil Abbasloo et.al. | 2502.02573 | null |
2025-02-04 | Learning the RoPEs: Better 2D and 3D Position Encodings with STRING | Connor Schenck et.al. | 2502.02562 | null |
2025-02-04 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation | Junha Lee et.al. | 2502.02548 | null |
2025-02-04 | LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World | Shrikara Arun et.al. | 2502.02539 | null |
2025-02-04 | Adaptive Self-improvement LLM Agentic System for ML Library Development | Genghan Zhang et.al. | 2502.02534 | link |
2025-02-04 | Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies | Han Zhou et.al. | 2502.02533 | null |
2025-02-04 | Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search | Maohao Shen et.al. | 2502.02508 | null |
2025-02-04 | Analyzing Similarity Metrics for Data Selection for Language Model Pretraining | Dylan Sam et.al. | 2502.02494 | null |
2025-02-04 | EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization | Yize Wu et.al. | 2502.02493 | null |
2025-02-04 | Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study | Menglong Cui et.al. | 2502.02481 | null |
2025-02-04 | Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.02471 | link |
2025-02-04 | Modular Training of Neural Networks aids Interpretability | Satvik Golechha et.al. | 2502.02470 | null |
2025-02-04 | SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency | Qianhao Yuan et.al. | 2502.02458 | link |
2025-02-04 | IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning | Quan Zhang et.al. | 2502.02454 | null |
2025-02-04 | Personalization Toolkit: Training Free Personalization of Large Vision Language Models | Soroush Seifi et.al. | 2502.02452 | null |
2025-02-04 | Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study | Calvin Yixiang Cheng et.al. | 2502.02451 | link |
2025-02-04 | Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models | Haoran Ye et.al. | 2502.02444 | null |
2025-01-31 | Low-Rank Adapting Models for Sparse Autoencoders | Matthew Chen et.al. | 2501.19406 | link |
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400 | link |
2025-01-31 | Scalable-Softmax Is Superior for Attention | Ken M. Nakanishi et.al. | 2501.19399 | null |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | link |
2025-02-03 | s1: Simple test-time scaling | Niklas Muennighoff et.al. | 2501.19393 | link |
2025-01-31 | Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models | Alina Shutova et.al. | 2501.19392 | link |
2025-01-31 | Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models | Wenzhi Fang et.al. | 2501.19389 | link |
2025-01-31 | Decoding-based Regression | Xingyou Song et.al. | 2501.19383 | link |
2025-01-31 | TableMaster: A Recipe to Advance Table Understanding with Language Models | Lang Cao et.al. | 2501.19378 | null |
2025-02-03 | SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Dominik Wagner et.al. | 2501.19377 | null |
2025-01-31 | We’re Different, We’re the Same: Creative Homogeneity Across LLMs | Emily Wenger et.al. | 2501.19361 | null |
2025-01-31 | Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies | Brandon P. Chelstrom et.al. | 2501.19359 | null |
2025-01-31 | The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking | Yuchun Miao et.al. | 2501.19358 | null |
2025-01-31 | Towards Adaptive Self-Improvement for Smarter Energy Systems | Alexander Sommer et.al. | 2501.19340 | null |
2025-01-31 | PixelWorld: Towards Perceiving Everything as Pixels | Zhiheng Lyu et.al. | 2501.19339 | null |
2025-01-31 | Homogeneity Bias as Differential Sampling Uncertainty in Language Models | Messi H. J. Lee et.al. | 2501.19337 | null |
2025-01-31 | Reward-Guided Speculative Decoding for Efficient LLM Reasoning | Baohao Liao et.al. | 2501.19324 | null |
2025-01-31 | MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems | Anirudh Chari et.al. | 2501.19318 | null |
2025-01-31 | LLM-based Affective Text Generation Quality Based on Different Quantization Values | Yarik Menchaca Resendiz et.al. | 2501.19317 | null |
2025-01-31 | An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese | Tran Ngoc Son et.al. | 2501.19314 | null |
2025-01-30 | Foundational Models for 3D Point Clouds: A Survey and Outlook | Vishal Thengane et.al. | 2501.18594 | null |
2025-01-30 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592 | link |
2025-01-30 | Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs | Yue Wang et.al. | 2501.18585 | null |
2025-01-30 | Prediction-Powered Inference with Imputed Covariates and Nonuniform Sampling | Dan M. Kluger et.al. | 2501.18577 | link |
2025-01-30 | Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | Evgenii Evstafev et.al. | 2501.18576 | null |
2025-01-30 | BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos | Lehao Lin et.al. | 2501.18565 | null |
2025-01-30 | SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation | Haoquan Fang et.al. | 2501.18564 | link |
2025-01-30 | Semantic Web and Creative AI – A Technical Report from ISWS 2023 | Raia Abu Ahmad et.al. | 2501.18542 | null |
2025-01-30 | Loss Functions and Operators Generated by f-Divergences | Vincent Roulet et.al. | 2501.18537 | null |
2025-01-30 | Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges | Manveer Singh Tamber et.al. | 2501.18536 | link |
2025-01-30 | Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models | Yi Ding et.al. | 2501.18533 | null |
2025-01-30 | Differentially Private Steering for Large Language Model Alignment | Anmol Goel et.al. | 2501.18532 | link |
2025-01-30 | Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models | Guanqun Cao et.al. | 2501.18516 | null |
2025-01-30 | Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch | Arthur Douillard et.al. | 2501.18512 | null |
2025-01-30 | WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training | Benjamin Feuer et.al. | 2501.18511 | link |
2025-01-30 | CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction | Peter J. Bentley et.al. | 2501.18504 | null |
2025-01-30 | A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models | Changshu Liu et.al. | 2501.18482 | null |
2025-01-30 | CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization | Yanxia Deng et.al. | 2501.18475 | null |
2025-01-30 | Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations | Chengxi Zeng et.al. | 2501.18474 | null |
2025-01-30 | A Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models | Shiho Noda et.al. | 2501.18463 | link |
2025-01-29 | Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? | Pouya Pezeshkpour et.al. | 2501.17840 | link |
2025-01-29 | Matrix Product Sketching via Coordinated Sampling | Majid Daliri et.al. | 2501.17836 | null |
2025-01-29 | Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology | Sobhan Hemati et.al. | 2501.17822 | null |
2025-01-29 | Leveraging Multimodal LLM for Inspirational User Interface Search | Seokhyeon Park et.al. | 2501.17799 | link |
2025-01-29 | BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights | Chan-Jan Hsu et.al. | 2501.17790 | null |
2025-01-29 | Reasoning Over the Glyphs: Evaluation of LLM’s Decipherment of Rare Scripts | Yu-Fei Shih et.al. | 2501.17785 | null |
2025-01-29 | AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing | Peter Pak et.al. | 2501.17784 | null |
2025-01-29 | 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Fabrizio Sandri et.al. | 2501.17771 | link |
2025-01-29 | Hybrid Graphs for Table-and-Text based Question Answering using LLMs | Ankush Agarwal et.al. | 2501.17767 | null |
2025-01-29 | On the Partitioning of GPU Power among Multi-Instances | Tirth Vamja et.al. | 2501.17752 | null |
2025-01-29 | Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation | Aitor Arrieta et.al. | 2501.17749 | null |
2025-01-29 | A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches | Ana R. Baião et.al. | 2501.17729 | null |
2025-01-29 | Using Code Generation to Solve Open Instances of Combinatorial Design Problems | Christopher D. Rosin et.al. | 2501.17725 | link |
2025-01-29 | RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts | Eujeong Choi et.al. | 2501.17715 | link |
2025-01-29 | Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate | Yubo Wang et.al. | 2501.17703 | null |
2025-01-29 | Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching | Xuzhe Dang et.al. | 2501.17665 | null |
2025-01-29 | Exploring Vision Language Models for Multimodal and Multilingual Stance Detection | Jake Vasilakes et.al. | 2501.17654 | null |
2025-01-29 | Tonguescape: Exploring Language Models Understanding of Vowel Articulation | Haruki Sakajo et.al. | 2501.17643 | link |
2025-01-29 | Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation | Lin Chen et.al. | 2501.17642 | null |
2025-01-29 | In-Context Meta LoRA Generation | Yihua Shao et.al. | 2501.17635 | null |
2025-01-28 | SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training | Tianzhe Chu et.al. | 2501.17161 | null |
2025-01-28 | AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Zhengxuan Wu et.al. | 2501.17148 | link |
2025-01-28 | FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data | Deren Lei et.al. | 2501.17144 | link |
2025-01-28 | ASTRAL: Automated Safety Testing of Large Language Models | Miriam Ugarte et.al. | 2501.17132 | null |
2025-01-28 | Scenario Understanding of Traffic Scenes Through Large Visual Language Models | Rivera Esteban et.al. | 2501.17131 | null |
2025-01-28 | Histoires Morales: A French Dataset for Assessing Moral Alignment | Thibaud Leteno et.al. | 2501.17117 | link |
2025-01-28 | Optimizing Large Language Model Training Using FP4 Quantization | Ruizhe Wang et.al. | 2501.17116 | null |
2025-01-28 | Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction | Carl-Leander Henneking et.al. | 2501.17112 | null |
2025-01-28 | COS(M+O)S: Curiosity and RL-Enhanced MCTS for Exploring Story Space via Language Models | Tobias Materzok et.al. | 2501.17104 | null |
2025-01-28 | Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | Evgenii Evstafev et.al. | 2501.17084 | null |
2025-01-28 | Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | Akash Kumar et.al. | 2501.17053 | null |
2025-01-28 | How Linguistics Learned to Stop Worrying and Love the Language Models | Richard Futrell et.al. | 2501.17047 | null |
2025-01-28 | Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models | Minghan Li et.al. | 2501.17039 | null |
2025-01-28 | Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies | Manojkumar Parmar et.al. | 2501.17030 | null |
2025-01-28 | Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs | Alessandro Midolo et.al. | 2501.17024 | link |
2025-01-28 | Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement | Kei Katsumata et.al. | 2501.17022 | link |
2025-01-28 | Large Language Models for Code Generation: The Practitioners Perspective | Zeeshan Rasheed et.al. | 2501.16998 | link |
2025-01-28 | Artificial Intelligence Clones | Annie Liang et.al. | 2501.16996 | null |
2025-01-28 | FedEFM: Federated Endovascular Foundation Model with Unseen Data | Tuong Do et.al. | 2501.16992 | null |
2025-01-28 | Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection | Xiangyu Gao et.al. | 2501.16981 | null |
2025-01-27 | LUCY: Linguistic Understanding and Control Yielding Early Stage of Her | Heting Gao et.al. | 2501.16327 | link |
2025-01-27 | Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology | Meiyun Cao et.al. | 2501.16309 | null |
2025-01-27 | RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval | Long Nguyen et.al. | 2501.16303 | null |
2025-01-27 | Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width | Zheng Liu et.al. | 2501.16302 | null |
2025-01-27 | Large Models in Dialogue for Active Perception and Anomaly Detection | Tzoulio Chamiti et.al. | 2501.16300 | link |
2025-01-27 | FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers | Renshan Zhang et.al. | 2501.16297 | null |
2025-01-27 | Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models | Jing Zhang et.al. | 2501.16282 | null |
2025-01-27 | Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation | Jiayi Hong et.al. | 2501.16277 | link |
2025-01-27 | URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT | Long Nguyen et.al. | 2501.16276 | null |
2025-01-27 | Return of the Encoder: Maximizing Parameter Efficiency for SLMs | Mohamed Elfeki et.al. | 2501.16273 | link |
2025-01-27 | A foundation model for human-AI collaboration in medical literature mining | Zifeng Wang et.al. | 2501.16255 | null |
2025-01-27 | Multi-Agent Geospatial Copilots for Remote Sensing Workflows | Chaehong Lee et.al. | 2501.16254 | null |
2025-01-27 | Zero-Shot Decision Tree Construction via Large Language Models | Lucas Carrasco et.al. | 2501.16247 | null |
2025-01-27 | CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation | Xiaochuan Ma et.al. | 2501.16246 | null |
2025-01-27 | Phase Transitions in Large Language Models and the $O(N)$ Model | Youran Sun et.al. | 2501.16241 | null |
2025-01-27 | AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses | Runze Cai et.al. | 2501.16240 | link |
2025-01-27 | Distilling foundation models for robust and efficient models in digital pathology | Alexandre Filiot et.al. | 2501.16239 | null |
2025-01-27 | Language-Based Bayesian Optimization Research Assistant (BORA) | Abdoulatif Cissé et.al. | 2501.16224 | null |
2025-01-27 | Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models | Huayu Li et.al. | 2501.16215 | link |
2025-01-27 | Provence: efficient and robust context pruning for retrieval-augmented generation | Nadezhda Chirkova et.al. | 2501.16214 | null |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? | Ipek Baris Schlicht et.al. | 2501.14719 | null |
2025-01-24 | Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models | Naihao Deng et.al. | 2501.14717 | null |
2025-01-24 | FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing | James Seale Smith et.al. | 2501.14713 | null |
2025-01-24 | The Karp Dataset | Mason DiCicco et.al. | 2501.14705 | null |
2025-01-24 | Rethinking Table Instruction Tuning | Naihao Deng et.al. | 2501.14693 | null |
2025-01-24 | Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST | Fuping Wu et.al. | 2501.14685 | null |
2025-01-24 | An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations | Shabnam Hassani et.al. | 2501.14683 | null |
2025-01-24 | Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning | Jisi Zhang et.al. | 2501.14680 | null |
2025-01-24 | MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications | Yixing Jiang et.al. | 2501.14654 | link |
2025-01-24 | Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion | Ziyao Xu et.al. | 2501.14649 | link |
2025-01-24 | Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics | Renato Ghisellini et.al. | 2501.14634 | null |
2025-01-24 | Extracting Problem Structure with LLMs for Optimized SAT Local Search | André Schilder et.al. | 2501.14630 | null |
2025-01-24 | ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | Tianming Liang et.al. | 2501.14607 | null |
2025-01-24 | Knowledge Graphs Construction from Criminal Court Appeals: Insights from the French Cassation Court | Alexander V. Belikov et.al. | 2501.14579 | null |
2025-01-24 | ZETA: Leveraging Z-order Curves for Efficient Top-k Attention | Qiuhao Zeng et.al. | 2501.14577 | null |
2025-01-24 | Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding | Zhongyi Shui et.al. | 2501.14548 | link |
2025-01-24 | Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research | Hamid Sarmadi et.al. | 2501.14546 | null |
2025-01-24 | VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning | Benjamin Callewaert et.al. | 2501.14540 | null |
2025-01-24 | Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models | Zhenguang Zhong et.al. | 2501.14530 | link |
2025-01-23 | CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation | Guofeng Cui et.al. | 2501.13927 | null |
2025-01-23 | The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities | Chan-Jan Hsu et.al. | 2501.13921 | link |
2025-01-23 | Analysis of Indic Language Capabilities in LLMs | Aatman Vaidya et.al. | 2501.13912 | null |
2025-01-23 | Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models | Linh Tran et.al. | 2501.13904 | null |
2025-01-23 | Exploring Finetuned Audio-LLM on Heart Murmur Features | Adrian Florea et.al. | 2501.13884 | null |
2025-01-23 | The machine learning platform for developers of large systems | Alexey Naikov et.al. | 2501.13881 | null |
2025-01-23 | A RAG-Based Institutional Assistant | Gustavo Kuratomi et.al. | 2501.13880 | null |
2025-01-23 | Dual-Modal Prototype Joint Learning for Compositional Zero-Shot Learning | Shiyu Zhang et.al. | 2501.13859 | null |
2025-01-23 | Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes | Shiling Deng et.al. | 2501.13851 | link |
2025-01-23 | Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages | Farhana Shahid et.al. | 2501.13836 | null |
2025-01-23 | On the Reasoning Capacity of AI Models and How to Quantify It | Santosh Kumar Radha et.al. | 2501.13833 | null |
2025-01-23 | Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing | Hao Zhang et.al. | 2501.13831 | null |
2025-01-23 | Hallucinations Can Improve Large Language Models in Drug Discovery | Shuzhou Yuan et.al. | 2501.13824 | null |
2025-01-23 | Large Language Model driven Policy Exploration for Recommender Systems | Jie Wang et.al. | 2501.13816 | null |
2025-01-23 | Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change | Mowafak Allaham et.al. | 2501.13802 | null |
2025-01-23 | PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments | Changhao Wang et.al. | 2501.13796 | null |
2025-01-23 | Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models | Chaolei Han et.al. | 2501.13795 | link |
2025-01-23 | Parameter-Efficient Fine-Tuning for Foundation Models | Dan Zhang et.al. | 2501.13787 | link |
2025-01-23 | Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling | Tanya Rodchenko et.al. | 2501.13779 | null |
2025-01-23 | Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework | Yoonsang Kim et.al. | 2501.13778 | link |
2025-01-22 | VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding | Boqiang Zhang et.al. | 2501.13106 | link |
2025-01-22 | Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment | Melissa Kazemi Rad et.al. | 2501.13080 | null |
2025-01-22 | Autonomy-of-Experts Models | Ang Lv et.al. | 2501.13074 | null |
2025-01-22 | Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning | Bohao Yang et.al. | 2501.13042 | link |
2025-01-22 | Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament | Yantao Liu et.al. | 2501.13007 | link |
2025-01-22 | Large Language Model-Based Semantic Communication System for Image Transmission | Soheyb Ribouh et.al. | 2501.12988 | null |
2025-01-22 | LLM4WM: Adapting LLM for Wireless Multi-Tasking | Xuanyu Liu et.al. | 2501.12983 | null |
2025-01-22 | OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models | Chongren Sun et.al. | 2501.12975 | link |
2025-01-22 | Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs | Jan Corazza et.al. | 2501.12972 | link |
2025-01-22 | It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act | Kristof Meding et.al. | 2501.12962 | null |
2025-01-22 | Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference | Weizhi Fei et.al. | 2501.12959 | null |
2025-01-22 | GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models | Pengxiang Zhao et.al. | 2501.12956 | null |
2025-01-22 | Correctness Assessment of Code Generated by Large Language Models Using Internal Representations | Tuan-Dung Bui et.al. | 2501.12934 | link |
2025-01-22 | DynamicEarth: How Far are We from Open-Vocabulary Change Detection? | Kaiyu Li et.al. | 2501.12931 | null |
2025-01-22 | A Functional Software Reference Architecture for LLM-Integrated Systems | Alessio Bucaioni et.al. | 2501.12904 | null |
2025-01-22 | Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration | Offa Kingsleigh et.al. | 2501.12901 | null |
2025-01-22 | Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback | Yafu Li et.al. | 2501.12895 | link |
2025-01-22 | Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program | Carlton Shepherd et.al. | 2501.12883 | null |
2025-01-22 | WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge | Jingyuan Chen et.al. | 2501.12877 | null |
2025-01-22 | HierPromptLM: A Pure PLM-based Framework for Representation Learning on Heterogeneous Text-rich Networks | Qiuyu Zhu et.al. | 2501.12857 | null |
2025-01-21 | InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Yi Wang et.al. | 2501.12386 | link |
2025-01-21 | MMVU: Measuring Expert-Level Multi-Discipline Video Understanding | Yilun Zhao et.al. | 2501.12380 | link |
2025-01-21 | Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Thomas F. Eisenmann et.al. | 2501.12374 | link |
2025-01-21 | Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL | Yeounoh Chung et.al. | 2501.12372 | link |
2025-01-21 | Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models | Samira Abnar et.al. | 2501.12370 | null |
2025-01-21 | InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model | Yuhang Zang et.al. | 2501.12368 | link |
2025-01-21 | Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2 | Md. Rakibul Islam et.al. | 2501.12356 | null |
2025-01-21 | Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration | Thomas Walshe et.al. | 2501.12332 | null |
2025-01-21 | Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops | Mohamed Harmanani et.al. | 2501.12331 | link |
2025-01-21 | VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Xianwei Zhuang et.al. | 2501.12327 | link |
2025-01-21 | LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations | Hasan Abu-Rasheed et.al. | 2501.12300 | null |
2025-01-21 | MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks | Qishen Zhou et.al. | 2501.12281 | link |
2025-01-21 | Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Maosong Cao et.al. | 2501.12273 | link |
2025-01-21 | CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification | Cristiano Patrício et.al. | 2501.12266 | null |
2025-01-21 | FOCUS: First Order Concentrated Updating Scheme | Yizhou Liu et.al. | 2501.12243 | null |
2025-01-21 | InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models | Pha Nguyen et.al. | 2501.12231 | null |
2025-01-21 | CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning | Yuanheng Fang et.al. | 2501.12226 | null |
2025-01-21 | Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces | Allard Oelen et.al. | 2501.12221 | null |
2025-01-21 | You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense | Wuyuao Mai et.al. | 2501.12210 | null |
2025-01-21 | Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model | Kazi Hasan Ibn Arif et.al. | 2501.12206 | link |
2025-01-17 | FaceXBench: Evaluating Multimodal LLMs on Face Understanding | Kartik Narayan et.al. | 2501.10360 | link |
2025-01-17 | Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Weibo Gao et.al. | 2501.10332 | link |
2025-01-17 | BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response Generation | Suvodip Dey et.al. | 2501.10328 | link |
2025-01-17 | Large language models for automated scholarly paper review: A survey | Zhenzhen Zhuang et.al. | 2501.10326 | null |
2025-01-17 | Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models | Pit Neitemeier et.al. | 2501.10322 | null |
2025-01-17 | HiMix: Reducing Computational Complexity in Large Vision-Language Models | Xuange Zhang et.al. | 2501.10318 | null |
2025-01-17 | Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs | Claudio Di Sipio et.al. | 2501.10313 | null |
2025-01-17 | Computational Protein Science in the Era of Large Language Models (LLMs) | Wenqi Fan et.al. | 2501.10282 | null |
2025-01-17 | Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation | Azat Abdullin et.al. | 2501.10200 | null |
2025-01-17 | Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education | William Hersh et.al. | 2501.10186 | null |
2025-01-17 | Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval | Vera Pavlova et.al. | 2501.10175 | null |
2025-01-17 | Dual Debiasing: Remove Stereotypes and Keep Factual Gender for Fair Language Modeling and Translation | Tomasz Limisiewicz et.al. | 2501.10150 | null |
2025-01-17 | A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features | Enes Karanfil et.al. | 2501.10144 | null |
2025-01-17 | Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis | Abhishek Kaushik et.al. | 2501.10134 | null |
2025-01-17 | ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario | Lucen Zhong et.al. | 2501.10132 | link |
2025-01-17 | PaSa: An LLM Agent for Comprehensive Academic Paper Search | Yichen He et.al. | 2501.10120 | link |
2025-01-17 | LLM Reasoner and Automated Planner: A new NPC approach | Israel Puerta-Merino et.al. | 2501.10106 | null |
2025-01-17 | Universal Actions for Enhanced Embodied Foundation Models | Jinliang Zheng et.al. | 2501.10105 | link |
2025-01-17 | Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Michael Schwingshackl et.al. | 2501.10080 | link |
2025-01-17 | SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning | Yuecheng Liu et.al. | 2501.10074 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues | Youngjoon Jang et.al. | 2501.09754 | null |
2025-01-16 | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | Zekun Xi et.al. | 2501.09751 | link |
2025-01-16 | Enhancing Lexicon-Based Text Embeddings with Large Language Models | Yibin Lei et.al. | 2501.09749 | null |
2025-01-16 | Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models | Bihui Jin et.al. | 2501.09745 | null |
2025-01-16 | Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps | Nanye Ma et.al. | 2501.09732 | null |
2025-01-16 | A Simple Aerial Detection Baseline of Multimodal Language Models | Qingyun Li et.al. | 2501.09720 | link |
2025-01-16 | CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education | Tianyu Wang et.al. | 2501.09709 | link |
2025-01-16 | Domain Adaptation of Foundation LLMs for e-Commerce | Christian Herold et.al. | 2501.09706 | null |
2025-01-16 | Cueless EEG imagined speech for subject identification: dataset and benchmarks | Ali Derakhshesh et.al. | 2501.09700 | link |
2025-01-16 | Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Zhihe Yang et.al. | 2501.09695 | link |
2025-01-16 | Simulated Interactive Debugging | Yannic Noller et.al. | 2501.09694 | null |
2025-01-16 | Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models | Fengli Xu et.al. | 2501.09686 | null |
2025-01-16 | Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review | Masatoshi Uehara et.al. | 2501.09685 | null |
2025-01-16 | Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark | Alexis Roger et.al. | 2501.09672 | null |
2025-01-16 | A Survey of Research in Large Language Models for Electronic Design Automation | Jingyu Pan et.al. | 2501.09655 | null |
2025-01-16 | The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models | Jonathan Katzy et.al. | 2501.09653 | null |
2025-01-16 | CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding | Johannes Kirmayr et.al. | 2501.09645 | link |
2025-01-16 | LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Kuan-Ming Liu et.al. | 2501.09636 | null |
2025-01-16 | Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework | Yushen Lin et.al. | 2501.09631 | null |
2025-01-15 | Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians | Ishan Amin et.al. | 2501.09009 | link |
2025-01-15 | Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails | Shaona Ghosh et.al. | 2501.09004 | null |
2025-01-15 | Vision Foundation Models for Computed Tomography | Suraj Pai et.al. | 2501.09001 | link |
2025-01-15 | CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation | Qi Ma et.al. | 2501.08982 | null |
2025-01-15 | Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models | Emma Croxford et.al. | 2501.08977 | null |
2025-01-15 | Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models | Karukriti Kaushik Ghosh et.al. | 2501.08974 | null |
2025-01-15 | Analyzing the Ethical Logic of Six Large Language Models | W. Russell Neuman et.al. | 2501.08951 | null |
2025-01-15 | Applying General Turn-taking Models to Conversational Human-Robot Interaction | Gabriel Skantze et.al. | 2501.08946 | null |
2025-01-15 | Disentangling Exploration of Large Language Models by Optimal Exploitation | Tim Grams et.al. | 2501.08925 | null |
2025-01-15 | GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge | Liam Dugan et.al. | 2501.08913 | link |
2025-01-15 | Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning | Qinyu Ma et.al. | 2501.08897 | link |
2025-01-15 | Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving | Tengpeng Li et.al. | 2501.08861 | link |
2025-01-15 | Exploring Task-Level Optimal Prompts for Visual In-Context Learning | Yan Zhu et.al. | 2501.08841 | null |
2025-01-15 | IDEA: Image Description Enhanced CLIP-Adapter | Zhipeng Ye et.al. | 2501.08816 | link |
2025-01-15 | How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering | Christoph Treude et.al. | 2501.08774 | null |
2025-01-15 | Admitting Ignorance Helps the Video Question Answering Models to Answer | Haopeng Li et.al. | 2501.08771 | null |
2025-01-15 | Enhanced Large Language Models for Effective Screening of Depression and Anxiety | June M. Liu et.al. | 2501.08769 | null |
2025-01-15 | Leveraging LLM Agents for Translating Network Configurations | Yunze Wei et.al. | 2501.08760 | null |
2025-01-15 | Expanding Vietnamese SentiWordNet to Improve Performance of Vietnamese Sentiment Analysis Models | Hong-Viet Tran et.al. | 2501.08758 | null |
2025-01-15 | The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities | Irina Bigoulaeva et.al. | 2501.08716 | link |
2025-01-14 | PokerBench: Training Large Language Models to become Professional Poker Players | Richard Zhuang et.al. | 2501.08328 | link |
2025-01-14 | Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks | Miran Heo et.al. | 2501.08326 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324 | null |
2025-01-14 | Exploring Robustness of Multilingual LLMs on Real-World Noisy Data | Amirhossein Aliakbarzadeh et.al. | 2501.08322 | link |
2025-01-14 | Enhancing Automated Interpretability with Output-Centric Feature Descriptions | Yoav Gur-Arieh et.al. | 2501.08319 | link |
2025-01-14 | MiniMax-01: Scaling Foundation Models with Lightning Attention | MiniMax et.al. | 2501.08313 | null |
2025-01-14 | HALoGEN: Fantastic LLM Hallucinations and Where to Find Them | Abhilasha Ravichander et.al. | 2501.08292 | null |
2025-01-14 | LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding | Hongyu Li et.al. | 2501.08282 | link |
2025-01-14 | Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing | Pulkit Arora et.al. | 2501.08276 | null |
2025-01-14 | Addressing the sustainable AI trilemma: a case study on LLM agents and RAG | Hui Wu et.al. | 2501.08262 | link |
2025-01-14 | Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models | Yifu Qiu et.al. | 2501.08248 | null |
2025-01-14 | Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints | Jonathan Nöther et.al. | 2501.08246 | null |
2025-01-14 | Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings | Paul Joe Maliakel et.al. | 2501.08219 | null |
2025-01-14 | ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems | Mohita Chowdhury et.al. | 2501.08208 | null |
2025-01-14 | ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving | Zain Ul Abedin et.al. | 2501.08203 | null |
2025-01-14 | CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation | Jinjun Peng et.al. | 2501.08200 | link |
2025-01-14 | OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training | Yijiong Yu et.al. | 2501.08197 | link |
2025-01-14 | PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving | Ahmet Caner Yüzügüler et.al. | 2501.08192 | null |
2025-01-14 | A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation | Steven Landgraf et.al. | 2501.08188 | null |
2025-01-14 | A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following | Yin Fang et.al. | 2501.08187 | link |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | link |
2025-01-13 | Imagine while Reasoning in Space: Multimodal Visualization-of-Thought | Chengzu Li et.al. | 2501.07542 | null |
2025-01-13 | ML Mule: Mobile-Driven Context-Aware Collaborative Learning | Haoxiang Yu et.al. | 2501.07536 | null |
2025-01-13 | Investigating Large Language Models in Inferring Personality Traits from User Conversations | Jianfeng Zhu et.al. | 2501.07532 | null |
2025-01-13 | RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment | Difei Gu et.al. | 2501.07525 | link |
2025-01-13 | Parallel Key-Value Cache Fusion for Position Invariant RAG | Philhoon Oh et.al. | 2501.07523 | null |
2025-01-13 | Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards | Yangsibo Huang et.al. | 2501.07493 | null |
2025-01-13 | TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models | Thales Sales Almeida et.al. | 2501.07482 | link |
2025-01-13 | A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities | Yihao Liu et.al. | 2501.07468 | null |
2025-01-13 | Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI | Rolf Pfister et.al. | 2501.07458 | null |
2025-01-13 | Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection | Xin Yin et.al. | 2501.07425 | null |
2025-01-13 | Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion | Lala Shakti Swarup Ray et.al. | 2501.07408 | null |
2025-01-13 | Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models | Yasiru Ranasinghe et.al. | 2501.07396 | null |
2025-01-13 | Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Siran Li et.al. | 2501.07391 | link |
2025-01-13 | Extracting Participation in Collective Action from Social Media | Arianna Pera et.al. | 2501.07368 | null |
2025-01-13 | Emergent effects of scaling on the functional hierarchies within large language models | Paul C. Bogdan et.al. | 2501.07359 | null |
2025-01-13 | Evaluating Pre-Trained Models for Multi-Language Vulnerability Patching | Zanis Ali Khan et.al. | 2501.07339 | null |
2025-01-13 | TempoGPT: Enhancing Temporal Reasoning via Quantizing Embedding | Haochuan Zhang et.al. | 2501.07335 | null |
2025-01-13 | Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring | Buse Sibel Korkmaz et.al. | 2501.07324 | link |
2025-01-10 | LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Omkar Thawakar et.al. | 2501.06186 | link |
2025-01-10 | PEACE: Empowering Geologic Map Holistic Understanding with MLLMs | Yangyu Huang et.al. | 2501.06184 | null |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173 | null |
2025-01-10 | Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories | Gerd Kortemeyer et.al. | 2501.06143 | null |
2025-01-10 | Supervision policies can shape long-term risk management in general-purpose AI models | Manuel Cebrian et.al. | 2501.06137 | link |
2025-01-10 | CoDriveVLM: VLM-Enhanced Urban Cooperative Dispatching and Motion Planning for Future Autonomous Mobility on Demand Systems | Haichao Liu et.al. | 2501.06132 | link |
2025-01-10 | Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Yuya Asano et.al. | 2501.06129 | null |
2025-01-10 | Merging Feed-Forward Sublayers for Compressed Transformers | Neha Verma et.al. | 2501.06126 | link |
2025-01-10 | Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Fabian David Schmidt et.al. | 2501.06117 | link |
2025-01-10 | From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy | Elham Aghakhani et.al. | 2501.06101 | null |
2025-01-10 | Personalized Language Model Learning on Text Data Without User Identifiers | Yucheng Ding et.al. | 2501.06062 | link |
2025-01-10 | AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery | Johann Wenckstern et.al. | 2501.06039 | link |
2025-01-10 | Generate, Transduct, Adapt: Iterative Transduction with VLMs | Oindrila Saha et.al. | 2501.06031 | null |
2025-01-10 | Addressing speaker gender bias in large scale speech translation systems | Shubham Bansal et.al. | 2501.05989 | null |
2025-01-10 | Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Eklavya Sarkar et.al. | 2501.05987 | link |
2025-01-10 | Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys | Divya Mani Adhikari et.al. | 2501.05985 | null |
2025-01-10 | Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea | Eunjung Cho et.al. | 2501.05981 | null |
2025-01-10 | Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory | Yunmeng Shu et.al. | 2501.05965 | null |
2025-01-10 | Effective faking of verbal deception detection with target-aligned adversarial attacks | Bennett Kleinberg et.al. | 2501.05962 | null |
2025-01-10 | Scalable Vision Language Model Training via High Quality Data Curation | Hongyuan Dong et.al. | 2501.05952 | null |
2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453 | null |
2025-01-09 | ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding | Xingyu Fu et.al. | 2501.05452 | null |
2025-01-09 | Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Yifan Yu et.al. | 2501.05446 | link |
2025-01-09 | Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark | Yunzhuo Hao et.al. | 2501.05444 | link |
2025-01-09 | A survey of textual cyber abuse detection using cutting-edge language models and large language models | Jose A. Diaz-Garcia et.al. | 2501.05443 | null |
2025-01-09 | Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jerry Chongyi Hu et.al. | 2501.05423 | null |
2025-01-09 | LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Xi Ye et.al. | 2501.05414 | null |
2025-01-09 | Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation | Darius Petermann et.al. | 2501.05413 | null |
2025-01-09 | A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics | Maximilian Alber et.al. | 2501.05409 | null |
2025-01-09 | Mechanistic understanding and validation of large AI models with SemanticLens | Maximilian Dreyer et.al. | 2501.05398 | link |
2025-01-09 | FairCode: Evaluating Social Bias of LLMs in Code Generation | Yongkang Du et.al. | 2501.05396 | link |
2025-01-09 | Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models | Kristian G. Barman et.al. | 2501.05382 | null |
2025-01-09 | Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance | Dimitrios Gerogiannis et.al. | 2501.05379 | null |
2025-01-09 | Accelerated Diffusion Models via Speculative Sampling | Valentin De Bortoli et.al. | 2501.05370 | null |
2025-01-09 | Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Hantao Lou et.al. | 2501.05336 | link |
2025-01-09 | “What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles | Xuewen Luo et.al. | 2501.05322 | null |
2025-01-09 | Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning | Nora Gourmelon et.al. | 2501.05281 | link |
2025-01-09 | CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Fabian Hörst et.al. | 2501.05269 | link |
2025-01-09 | Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing | Atharva Mutsaddi et.al. | 2501.05260 | link |
2025-01-09 | CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models | Yewei Song et.al. | 2501.05255 | null |
2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699 | null |
2025-01-08 | Re-ranking the Context for Multimodal Retrieval Augmented Generation | Matin Mortaheb et.al. | 2501.04695 | null |
2025-01-08 | URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics | Ruilin Luo et.al. | 2501.04686 | link |
2025-01-08 | Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations | Archita Srivastava et.al. | 2501.04675 | null |
2025-01-08 | DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests | Charles Corbière et.al. | 2501.04671 | null |
2025-01-08 | On The Origin of Cultural Biases in Language Models: From Pre-training Data to Linguistic Phenomena | Tarek Naous et.al. | 2501.04662 | link |
2025-01-08 | Assessing Language Comprehension in Large Language Models Using Construction Grammar | Wesley Scivetti et.al. | 2501.04661 | null |
2025-01-08 | Multi-task retriever fine-tuning for domain-specific and efficient RAG | Patrice Béchard et.al. | 2501.04652 | null |
2025-01-08 | FlairGPT: Repurposing LLMs for Interior Designs | Gabrielle Littlefair et.al. | 2501.04648 | null |
2025-01-08 | A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI | Kazusato Oko et.al. | 2501.04641 | link |
2025-01-08 | Knowledge Retrieval Based on Generative AI | Te-Lun Yang et.al. | 2501.04635 | null |
2025-01-08 | “Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era | Giulio Antonio Abbo et.al. | 2501.04633 | null |
2025-01-08 | MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation | Daniele Molino et.al. | 2501.04614 | null |
2025-01-08 | Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning | Ivan Kankeu et.al. | 2501.04591 | link |
2025-01-08 | Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models | Miaoyang He et.al. | 2501.04582 | null |
2025-01-08 | InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection | Yuhang Liu et.al. | 2501.04575 | link |
2025-01-08 | Supervision-free Vision-Language Alignment | Giorgio Giannone et.al. | 2501.04568 | null |
2025-01-08 | OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Run Luo et.al. | 2501.04561 | link |
2025-01-08 | The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? | Christopher Lazik et.al. | 2501.04543 | null |
2025-01-08 | rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | Xinyu Guan et.al. | 2501.04519 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Shaoyuan Xie et.al. | 2501.04003 | link |
2025-01-07 | Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Haobo Yuan et.al. | 2501.04001 | link |
2025-01-07 | RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance | Matin Mortaheb et.al. | 2501.03995 | null |
2025-01-07 | Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles | Yuxi Xia et.al. | 2501.03991 | null |
2025-01-07 | (De)-Indexing and the Right to be Forgotten | Salvatore Vilella et.al. | 2501.03989 | null |
2025-01-07 | VLM-driven Behavior Tree for Context-aware Task Planning | Naoki Wake et.al. | 2501.03968 | link |
2025-01-07 | Vision Language Models as Values Detectors | Giulio Antonio Abbo et.al. | 2501.03957 | null |
2025-01-07 | Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States | Jurgita Kapočiūtė-Dzikienė et.al. | 2501.03952 | null |
2025-01-07 | Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection | Pablo Miralles-González et.al. | 2501.03940 | null |
2025-01-07 | Visual question answering: from early developments to recent advances – a survey | Ngoc Dung Huynh et.al. | 2501.03939 | null |
2025-01-07 | Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study | Ramya Jonnala et.al. | 2501.03904 | null |
2025-01-07 | LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token | Shaolei Zhang et.al. | 2501.03895 | link |
2025-01-07 | AlphaPO – Reward shape matters for LLM alignment | Aman Gupta et.al. | 2501.03884 | null |
2025-01-07 | CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds | Keonwoo Kim et.al. | 2501.03879 | null |
2025-01-07 | Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study | Xaver Maria Krückl et.al. | 2501.03863 | link |
2025-01-07 | Progressive Document-level Text Simplification via Large Language Models | Dengzhao Fang et.al. | 2501.03857 | null |
2025-01-07 | BabyLMs for isiXhosa: Data-Efficient Language Modelling in a Low-Resource Context | Alexis Matzopoulos et.al. | 2501.03855 | null |
2025-01-07 | OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints | Mingjie Pan et.al. | 2501.03841 | null |
2025-01-07 | MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | Aadya Arora et.al. | 2501.03839 | null |
2025-01-06 | BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Beichen Zhang et.al. | 2501.03226 | link |
2025-01-06 | Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Yuhui Zhang et.al. | 2501.03225 | link |
2025-01-06 | Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text | Ayat Najjar et.al. | 2501.03212 | null |
2025-01-06 | Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity | Ayat A. Najjar et.al. | 2501.03203 | null |
2025-01-06 | The FACTS Grounding Leaderboard: Benchmarking LLMs’ Ability to Ground Responses to Long-Form Input | Alon Jacovi et.al. | 2501.03200 | null |
2025-01-06 | CLIX: Cross-Lingual Explanations of Idiomatic Expressions | Aaron Gluck et.al. | 2501.03191 | null |
2025-01-06 | Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text | Ali Al-Lawati et.al. | 2501.03166 | link |
2025-01-06 | Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | Risha Goel et.al. | 2501.03153 | link |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity | Yerong Li et.al. | 2501.03139 | null |
2025-01-06 | PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Mingyang Song et.al. | 2501.03124 | link |
2025-01-06 | CAT: Content-Adaptive Image Tokenization | Junhong Shen et.al. | 2501.03120 | null |
2025-01-06 | LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Dylan Bouchard et.al. | 2501.03112 | link |
2025-01-06 | Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling | Aseem Srivastava et.al. | 2501.03088 | null |
2025-01-06 | Retrieval-Augmented TLAPS Proof Generation with Large Language Models | Yuhao Zhou et.al. | 2501.03073 | null |
2025-01-06 | Trust Modeling in Counseling Conversations: A Benchmark Study | Aseem Srivastava et.al. | 2501.03064 | null |
2025-01-06 | ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events | Duygu Sezen Islakoglu et.al. | 2501.03040 | null |
2025-01-06 | Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders | Dichucheng Li et.al. | 2501.03038 | null |
2025-01-06 | Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | Zhen Li et.al. | 2501.03035 | null |
2025-01-06 | CALM: Curiosity-Driven Auditing for Large Language Models | Xiang Zheng et.al. | 2501.02997 | link |
2025-01-03 | VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction | Chaoyou Fu et.al. | 2501.01957 | link |
2025-01-03 | Metadata Conditioning Accelerates Language Model Pre-training | Tianyu Gao et.al. | 2501.01956 | link |
2025-01-03 | Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Weizhi Zhang et.al. | 2501.01945 | link |
2025-01-03 | Abstractive Text Summarization for Contemporary Sanskrit Prose: Issues and Challenges | Shagun Sinha et.al. | 2501.01933 | null |
2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | link |
2025-01-03 | Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding | Jiaming Li et.al. | 2501.01926 | link |
2025-01-03 | Virgo: A Preliminary Exploration on Reproducing o1-like MLLM | Yifan Du et.al. | 2501.01904 | link |
2025-01-03 | QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture | Shvetank Prakash et.al. | 2501.01892 | null |
2025-01-03 | Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions | Rachneet Sachdeva et.al. | 2501.01872 | link |
2025-01-03 | Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification | Xiangxiang Dai et.al. | 2501.01849 | link |
2025-01-03 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-03 | Time Series Language Model for Descriptive Caption Generation | Mohamed Trabelsi et.al. | 2501.01832 | null |
2025-01-03 | Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models | Yanjiang Liu et.al. | 2501.01830 | null |
2025-01-03 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Aobo Kong et.al. | 2501.01821 | link |
2025-01-03 | BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction | Ferhat Ozgur Catak et.al. | 2501.01802 | link |
2025-01-03 | Reading Between the Lines: A dataset and a study on why some texts are tougher than others | Nouran Khallaf et.al. | 2501.01796 | link |
2025-01-03 | Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation | Mohammad Khalil et.al. | 2501.01793 | link |
2025-01-03 | Efficient LLM Inference with Activation Checkpointing and Hybrid Caching | Sanghyeon Lee et.al. | 2501.01792 | null |
2025-01-03 | LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction | Er Jin et.al. | 2501.01767 | null |
2025-01-03 | SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation | Mingjie Li et.al. | 2501.01765 | null |
2025-01-02 | GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models | Zhangyang Qi et.al. | 2501.01428 | link |
2025-01-02 | Unifying Specialized Visual Encoders for Video Language Models | Jihoon Chung et.al. | 2501.01426 | link |
2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
2025-01-02 | OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios | Xize Cheng et.al. | 2501.01384 | null |
2025-01-02 | Training Medical Large Vision-Language Models with Abnormal-Aware Feedback | Yucheng Zhou et.al. | 2501.01377 | null |
2025-01-02 | ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI | Neda Tavakoli et.al. | 2501.01372 | link |
2025-01-02 | CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering | Ben Vardi et.al. | 2501.01371 | null |
2025-01-02 | Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability | Dong Shu et.al. | 2501.01346 | null |
2025-01-02 | Aligning Large Language Models for Faithful Integrity Against Opposing Argument | Yong Zhao et.al. | 2501.01336 | link |
2025-01-02 | CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models | Johan Wahréus et.al. | 2501.01335 | link |
2025-01-02 | Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension | Yanbo Fang et.al. | 2501.01332 | null |
2025-01-02 | The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation | Shuzheng Gao et.al. | 2501.01329 | null |
2025-01-02 | Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking | Xiaoxue Cheng et.al. | 2501.01306 | null |
2025-01-02 | Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case | Kaushik Roy et.al. | 2501.01305 | null |
2025-01-02 | NeutraSum: A Language Model can help a Balanced Media Diet by Neutralizing News Summaries | Xi Luo et.al. | 2501.01284 | null |
2025-01-02 | CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries | Shudong Liu et.al. | 2501.01282 | null |
2025-01-02 | Language Models for Code Optimization: Survey, Challenges and Future Directions | Jingzhi Gong et.al. | 2501.01277 | link |
2025-01-02 | Does a Large Language Model Really Speak in Human-Like Language? | Mose Park et.al. | 2501.01273 | null |
2025-01-02 | ProgCo: Program Helps Self-Correction of Large Language Models | Xiaoshuai Song et.al. | 2501.01264 | link |
2025-01-02 | CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings | Shanghaoran Quan et.al. | 2501.01257 | null |
2024-12-30 | Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Purbesh Mitra et.al. | 2412.21200 | link |
2024-12-31 | HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation | Zhaojian Yu et.al. | 2412.21199 | link |
2024-12-30 | Aviary: training language agents on challenging scientific tasks | Siddharth Narayanan et.al. | 2412.21154 | link |
2024-12-30 | Facilitating large language model Russian adaptation with Learned Embedding Propagation | Mikhail Tikhomirov et.al. | 2412.21140 | link |
2024-12-30 | Training Software Engineering Agents and Verifiers with SWE-Gym | Jiayi Pan et.al. | 2412.21139 | link |
2024-12-30 | Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism | Tim Tsz-Kit Lau et.al. | 2412.21124 | null |
2024-12-30 | ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation | Ruixuan Liu et.al. | 2412.21123 | null |
2024-12-30 | Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model | Yifei Huang et.al. | 2412.21080 | link |
2024-12-30 | Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring | Ehsan Latif et.al. | 2412.21065 | null |
2024-12-30 | Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Yuyang Zhou et.al. | 2412.21051 | link |
2024-12-30 | Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Wanglong Lu et.al. | 2412.21042 | link |
2024-12-30 | TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Chia-Yu Hung et.al. | 2412.21037 | link |
2024-12-30 | GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models | Shangyu Xing et.al. | 2412.21036 | null |
2024-12-30 | MapQaTor: A System for Efficient Annotation of Map Query Datasets | Mahir Labib Dihan et.al. | 2412.21015 | link |
2024-12-31 | Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria | Joonwon Jang et.al. | 2412.21006 | null |
2024-12-30 | Plug-and-Play Training Framework for Preference Optimization | Jingyuan Ma et.al. | 2412.20996 | null |
2024-12-30 | KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation | Siyuan Fang et.al. | 2412.20995 | null |
2024-12-30 | Efficiently Serving LLM Reasoning Programs with Certaindex | Yichao Fu et.al. | 2412.20993 | null |
2024-12-30 | AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies | Yibo Wen et.al. | 2412.20984 | null |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-27 | MVTamperBench: Evaluating Robustness of Vision-Language Models | Amit Agarwal et.al. | 2412.19794 | null |
2024-12-27 | InfAlign: Inference-aware language model alignment | Ananth Balashankar et.al. | 2412.19792 | null |
2024-12-27 | Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Kumud Tripathi et.al. | 2412.19785 | null |
2024-12-27 | Can AI Help with Your Personal Finances? | Oudom Hean et.al. | 2412.19784 | null |
2024-12-27 | Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Le Chen et.al. | 2412.19770 | link |
2024-12-27 | On dual-projectively equivalent connections associated to second order superintegrable systems | Andreas Vollmer et.al. | 2412.19739 | null |
2024-12-27 | Can Large Language Models Adapt to Other Agents In-Context? | Matthew Riemer et.al. | 2412.19726 | null |
2024-12-27 | OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis | Qiushi Sun et.al. | 2412.19723 | null |
2024-12-27 | Toward Adaptive Reasoning in Large Language Models with Thought Rollback | Sijia Chen et.al. | 2412.19707 | link |
2024-12-27 | A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization | Jingchun Lian et.al. | 2412.19685 | null |
2024-12-27 | Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework | Jiang Liu et.al. | 2412.19684 | null |
2024-12-27 | CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | Siyu Wang et.al. | 2412.19663 | null |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | link |
2024-12-27 | FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios | Kaiyi Pang et.al. | 2412.19652 | null |
2024-12-27 | Xmodel-2 Technical Report | Wang Qun et.al. | 2412.19638 | null |
2024-12-27 | IMTP: Search-based Code Generation for In-memory Tensor Programs | Yongwon Shin et.al. | 2412.19630 | null |
2024-12-27 | Signatures of prediction during natural listening in MEG data? | Sahel Azizpour et.al. | 2412.19622 | null |
2024-12-27 | Gradient Weight-normalized Low-rank Projection for Efficient LLM Training | Jia-Hong Huang et.al. | 2412.19616 | link |
2024-12-27 | Let Watermarks Speak: A Robust and Unforgeable Watermark for Language Models | Minhao Bai et.al. | 2412.19603 | null |
2024-12-27 | SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms | Shashank Rao Marpally et.al. | 2412.19595 | null |
2024-12-24 | Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Jinhui Yi et.al. | 2412.18609 | link |
2024-12-24 | Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models | Zehan Wang et.al. | 2412.18605 | link |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
2024-12-24 | Long-Form Speech Generation with Spoken Language Models | Se Jin Park et.al. | 2412.18603 | link |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601 | link |
2024-12-24 | A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs | OpenMind et.al. | 2412.18588 | null |
2024-12-24 | Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control | Sergey Sedov et.al. | 2412.18582 | null |
2024-12-24 | Zero-resource Speech Translation and Recognition with LLMs | Karel Mundnich et.al. | 2412.18566 | null |
2024-12-24 | Distilling Fine-grained Sentiment Understanding from Large Language Models | Yice Zhang et.al. | 2412.18552 | link |
2024-12-24 | Token-Budget-Aware LLM Reasoning | Tingxu Han et.al. | 2412.18547 | link |
2024-12-24 | Consistency Checks for Language Model Forecasters | Daniel Paleka et.al. | 2412.18544 | null |
2024-12-24 | PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction | Xingjian Xu et.al. | 2412.18541 | null |
2024-12-24 | Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Derong Xu Xinhang Li et.al. | 2412.18537 | link |
2024-12-24 | Automated Code Review In Practice | Umut Cihan et.al. | 2412.18531 | null |
2024-12-24 | The Key of Understanding Vision Tasks: Explanatory Instructions | Yang Shen et.al. | 2412.18525 | link |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization | Yi-Fu Fu et.al. | 2412.18497 | null |
2024-12-24 | Generating event descriptions under syntactic and semantic constraints | Angela Cao et.al. | 2412.18496 | link |
2024-12-24 | Segment-Based Attention Masking for GPTs | Shahar Katz et.al. | 2412.18487 | link |
2024-12-24 | 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Tatiana Zemskova et.al. | 2412.18450 | link |
2024-12-23 | ChatGarment: Garment Estimation, Generation and Editing via Large Language Models | Siyuan Bian et.al. | 2412.17811 | null |
2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806 | link |
2024-12-23 | Examining Imbalance Effects on Performance and Demographic Fairness of Clinical Language Models | Precious Jones et.al. | 2412.17803 | null |
2024-12-23 | Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection | Yitong Chen et.al. | 2412.17800 | link |
2024-12-23 | Automating the Search for Artificial Life with Foundation Models | Akarsh Kumar et.al. | 2412.17799 | link |
2024-12-23 | Memory makes computation universal, remember? | Erik Garrison et.al. | 2412.17794 | null |
2024-12-23 | Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective | Xinmiao Yu et.al. | 2412.17787 | null |
2024-12-23 | PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Sophia Tang et.al. | 2412.17780 | null |
2024-12-23 | ResearchTown: Simulator of Human Research Community | Haofei Yu et.al. | 2412.17767 | link |
2024-12-23 | Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy | Priyaranjan Pattnayak et.al. | 2412.17759 | null |
2024-12-23 | ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback | Wei Zhang et.al. | 2412.17754 | null |
2024-12-23 | Deliberation in Latent Space via Differentiable Cache Augmentation | Luyang Liu et.al. | 2412.17747 | null |
2024-12-23 | YuLan-Mini: An Open Data-efficient Language Model | Yiwen Hu et.al. | 2412.17743 | link |
2024-12-23 | **Reasoning to Attend: Try to Understand How |
Rui Qian et.al. | 2412.17741 | link |
2024-12-23 | Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization | Ermo Hua et.al. | 2412.17739 | link |
2024-12-23 | Knowledge Editing through Chain-of-Thought | Changyue Wang et.al. | 2412.17727 | link |
2024-12-23 | From Models to Microtheories: Distilling a Model’s Topical Knowledge for Grounded Question Answering | Nathaniel Weir et.al. | 2412.17701 | link |
2024-12-23 | Understanding the Logic of Direct Preference Alignment through Logic | Kyle Richardson et.al. | 2412.17696 | null |
2024-12-23 | FedTLU: Federated Learning with Targeted Layer Updates | Jong-Ik Park et.al. | 2412.17692 | null |
2024-12-23 | Large Language Model Safety: A Holistic Survey | Dan Shi et.al. | 2412.17686 | link |
2024-12-20 | HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding | Chenxin Tao et.al. | 2412.16158 | null |
2024-12-20 | Frequency Is What You Need: Word-frequency Masking Benefits Vision-Language Model Pre-training | Mingliang Liang et.al. | 2412.16148 | link |
2024-12-20 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | Huaijie Wang et.al. | 2412.16145 | link |
2024-12-20 | Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation | Seyedreza Mohseni et.al. | 2412.16135 | null |
2024-12-20 | Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information | Dirk Bergemann et.al. | 2412.16132 | null |
2024-12-20 | PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics | Daniil Larionov et.al. | 2412.16120 | null |
2024-12-20 | Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts | Muhammad Abdullah Sohail et.al. | 2412.16119 | link |
2024-12-20 | PruneVid: Visual Token Pruning for Efficient Video Large Language Models | Xiaohu Huang et.al. | 2412.16117 | link |
2024-12-20 | The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse | Mahyar Habibi et.al. | 2412.16114 | null |
2024-12-20 | Demystifying the Potential of ChatGPT-4 Vision for Construction Progress Monitoring | Ahmet Bahaddin Ersoz et.al. | 2412.16108 | null |
2024-12-20 | Interleaved Speech-Text Language Models are Simple Streaming Text to Speech Synthesizers | Yifan Yang et.al. | 2412.16102 | null |
2024-12-20 | Logical Consistency of Large Language Models in Fact-checking | Bishwamittra Ghosh et.al. | 2412.16100 | null |
2024-12-20 | The Evolution of LLM Adoption in Industry Data Curation Practices | Crystal Qian et.al. | 2412.16089 | null |
2024-12-20 | Efficient MedSAMs: Segment Anything in Medical Images on Laptop | Jun Ma et.al. | 2412.16085 | link |
2024-12-20 | Formal Mathematical Reasoning: A New Frontier in AI | Kaiyu Yang et.al. | 2412.16075 | null |
2024-12-20 | A Framework for Streaming Event-Log Prediction in Business Processes | Benedikt Bollig et.al. | 2412.16032 | null |
2024-12-20 | The Only Way is Ethics: A Guide to Ethical Research with Large Language Models | Eddie L. Ungless et.al. | 2412.16022 | link |
2024-12-20 | Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs | Lynn Greschner et.al. | 2412.15993 | null |
2024-12-20 | BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models | Patrick Haller et.al. | 2412.15978 | null |
2024-12-20 | Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support | Qijiong Liu et.al. | 2412.15973 | link |
2024-12-19 | PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation | Muntasir Wahed et.al. | 2412.15209 | null |
2024-12-19 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-19 | MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark | Qihao Zhao et.al. | 2412.15194 | link |
2024-12-19 | EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues | Sagar Soni et.al. | 2412.15190 | null |
2024-12-19 | LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation | Weijia Shi et.al. | 2412.15188 | null |
2024-12-19 | Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning | Simon Frieder et.al. | 2412.15184 | null |
2024-12-19 | STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning | Marius Memmel et.al. | 2412.15182 | null |
2024-12-19 | HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages | Aman Chaturvedi et.al. | 2412.15178 | null |
2024-12-19 | Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Federico Castagna et.al. | 2412.15177 | link |
2024-12-19 | Rethinking Uncertainty Estimation in Natural Language Generation | Lukas Aichberger et.al. | 2412.15176 | null |
2024-12-19 | Language Models as Continuous Self-Evolving Data Engineers | Peidong Wang et.al. | 2412.15151 | null |
2024-12-19 | Adaptive Pruning for Large Language Models with Structural Importance Awareness | Haotian Zheng et.al. | 2412.15127 | null |
2024-12-19 | Outcome-Refining Process Supervision for Code Generation | Zhuohao Yu et.al. | 2412.15118 | link |
2024-12-19 | Qwen2.5 Technical Report | Qwen et.al. | 2412.15115 | link |
2024-12-19 | Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture | Thomas F Burns et.al. | 2412.15113 | link |
2024-12-19 | Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Lei Tan et.al. | 2412.15106 | null |
2024-12-19 | Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability | Xiangsen Chen et.al. | 2412.15101 | null |
2024-12-19 | Nano-ESG: Extracting Corporate Sustainability Information from News Articles | Fabian Billert et.al. | 2412.15093 | link |
2024-12-19 | ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots | Bhupendra Acharya et.al. | 2412.15072 | null |
2024-12-18 | Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces | Jihan Yang et.al. | 2412.14171 | link |
2024-12-18 | TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks | Frank F. Xu et.al. | 2412.14161 | link |
2024-12-18 | Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models | Atin Sakkeer Hussain et.al. | 2412.14146 | null |
2024-12-18 | Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation | Jianyu Zhang et.al. | 2412.14145 | null |
2024-12-18 | LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research | Tianyang Gu et.al. | 2412.14141 | null |
2024-12-18 | Design choices made by LLM-based test generators prevent them from finding bugs | Noble Saji Mathews et.al. | 2412.14137 | null |
2024-12-18 | Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models | Ido Cohen et.al. | 2412.14133 | link |
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-18 | Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts | Jihye Choi et.al. | 2412.14097 | null |
2024-12-18 | Alignment faking in large language models | Ryan Greenblatt et.al. | 2412.14093 | link |
2024-12-18 | Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report | Markus Dablander et.al. | 2412.14085 | null |
2024-12-18 | Rango: Adaptive Retrieval-Augmented Proving for Automated Software Verification | Kyle Thompson et.al. | 2412.14063 | link |
2024-12-18 | Understanding and Evaluating Trust in Generative AI and Large Language Models for Spreadsheets | Simon Thorne et.al. | 2412.14062 | null |
2024-12-18 | Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Xinghang Li et.al. | 2412.14058 | null |
2024-12-18 | A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future | Shilin Sun et.al. | 2412.14056 | link |
2024-12-18 | Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment | Kevin You et.al. | 2412.14054 | link |
2024-12-18 | Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation | Vera Neplenbroek et.al. | 2412.14050 | link |
2024-12-18 | CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Danila Rukhovich et.al. | 2412.14042 | link |
2024-12-18 | Hansel: Output Length Controlling Framework for Large Language Models | Seoha Song et.al. | 2412.14033 | null |
2024-12-18 | Discovering maximally consistent distribution of causal tournaments with Large Language Models | Federico Baldo et.al. | 2412.14019 | null |
2024-12-17 | Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents | Yifei Zhou et.al. | 2412.13194 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Chen Bao et.al. | 2412.13187 | null |
2024-12-17 | Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration | Mark Endo et.al. | 2412.13180 | null |
2024-12-17 | SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Sheng Yin et.al. | 2412.13178 | link |
2024-12-17 | DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation | Miriam Wanner et.al. | 2412.13175 | null |
2024-12-17 | Locate n’ Rotate: Two-stage Openable Part Detection with Foundation Model Priors | Siqi Li et.al. | 2412.13173 | link |
2024-12-17 | Compressed Chain of Thought: Efficient Reasoning Through Dense Representations | Jeffrey Cheng et.al. | 2412.13171 | null |
2024-12-17 | Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study | Bolei Ma et.al. | 2412.13169 | link |
2024-12-17 | C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Parker Addison et.al. | 2412.13163 | null |
2024-12-17 | SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction | Chao Ma et.al. | 2412.13148 | null |
2024-12-17 | Are Your LLMs Capable of Stable Reasoning? | Junnan Liu et.al. | 2412.13147 | link |
2024-12-17 | A Knowledge-enhanced Pathology Vision-language Foundation Model for Cancer Diagnosis | Xiao Zhou et.al. | 2412.13126 | null |
2024-12-17 | AI PERSONA: Towards Life-long Personalization of LLMs | Tiannan Wang et.al. | 2412.13103 | null |
2024-12-17 | AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark | Jianlyu Chen et.al. | 2412.13102 | link |
2024-12-17 | Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election | Roberto Mondini et.al. | 2412.13098 | null |
2024-12-17 | LMUnit: Fine-grained Evaluation with Natural Language Unit Tests | Jon Saad-Falcon et.al. | 2412.13091 | null |
2024-12-17 | Taming Multi-Domain, -Fidelity Data: Towards Foundation Models for Atomistic Scale Simulations | Tomoya Shiota et.al. | 2412.13088 | link |
2024-12-17 | Modality-Inconsistent Continual Learning of Multimodal Large Language Models | Weiguo Pian et.al. | 2412.13050 | null |
2024-12-17 | Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach | Hugo Math et.al. | 2412.13041 | link |
2024-12-16 | SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Guoxuan Chen et.al. | 2412.12094 | link |
2024-12-16 | Instruction-based Image Manipulation by Watching How Things Move | Mingdeng Cao et.al. | 2412.12087 | null |
2024-12-16 | CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology | Yuxuan Sun et.al. | 2412.12077 | null |
2024-12-16 | CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | Guo Chen et.al. | 2412.12075 | null |
2024-12-16 | Making FETCH! Happen: Finding Emergent Dog Whistles Through Common Habitats | Kuleen Sasse et.al. | 2412.12072 | link |
2024-12-16 | How Private are Language Models in Abstractive Summarization? | Anthony Hughes et.al. | 2412.12040 | null |
2024-12-16 | Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection | Ira Ceka et.al. | 2412.12039 | null |
2024-12-16 | FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning | Gaojian Wang et.al. | 2412.12032 | link |
2024-12-16 | SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval | Yueqian Lin et.al. | 2412.12009 | link |
2024-12-16 | The Open Source Advantage in Large Language Models (LLMs) | Jiya Manchanda et.al. | 2412.12004 | null |
2024-12-16 | LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts | Zhuhao Wang et.al. | 2412.12001 | link |
2024-12-16 | SAMIC: Segment Anything with In-Context Spatial Prompt Engineering | Savinay Nagendra et.al. | 2412.11998 | null |
2024-12-16 | Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support | Devika Venugopalan et.al. | 2412.11995 | link |
2024-12-16 | ExecRepoBench: Multi-level Executable Code Completion Evaluation | Jian Yang et.al. | 2412.11990 | null |
2024-12-16 | SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset Generation | Debarshi Kundu et.al. | 2412.11988 | link |
2024-12-16 | Cost-Effective Label-free Node Classification with LLMs | Taiyan Zhang et.al. | 2412.11983 | link |
2024-12-16 | AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws | Oren Neumann et.al. | 2412.11979 | link |
2024-12-16 | Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection | Beomseok Lee et.al. | 2412.11978 | null |
2024-12-16 | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | Qi Sun et.al. | 2412.11974 | link |
2024-12-16 | DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Tong Xie et.al. | 2412.11970 | link |
2024-12-13 | UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities | Muhammad Uzair Khattak et.al. | 2412.10372 | link |
2024-12-13 | A Grounded Typology of Word Classes | Coleman Haley et.al. | 2412.10369 | null |
2024-12-13 | Robust image classification with multi-modal large language models | Francesco Villani et.al. | 2412.10353 | null |
2024-12-13 | Towards a foundation model for heavy-ion collision experiments through point cloud diffusion | Manjunath Omana Kuttan et.al. | 2412.10352 | null |
2024-12-13 | A dual contrastive framework | Yuan Sun et.al. | 2412.10348 | null |
2024-12-13 | COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models | Yuchen Ren et.al. | 2412.10347 | null |
2024-12-13 | Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining | Zhiqi Ge et.al. | 2412.10342 | null |
2024-12-13 | AdvPrefix: An Objective for Nuanced LLM Jailbreaks | Sicheng Zhu et.al. | 2412.10321 | link |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding | Zhiyu Wu et.al. | 2412.10302 | link |
2024-12-13 | Still “Talking About Large Language Models”: Some Clarifications | Murray Shanahan et.al. | 2412.10291 | null |
2024-12-13 | One world, one opinion? The superstar effect in LLM responses | Sofie Goethals et.al. | 2412.10281 | null |
2024-12-13 | Benchmarking Linguistic Diversity of Large Language Models | Yanzhu Guo et.al. | 2412.10271 | link |
2024-12-13 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder et.al. | 2412.10270 | null |
2024-12-13 | Does Multiple Choice Have a Future in the Age of Generative AI? A Posttest-only RCT | Danielle R. Thomas et.al. | 2412.10267 | link |
2024-12-13 | Reasoner Outperforms: Generative Stance Detection with Rationalization for Social Media | Jiaqing Yuan et.al. | 2412.10266 | null |
2024-12-13 | Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models | Harry J. Davies et.al. | 2412.10257 | null |
2024-12-13 | Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts | Hazel Kim et.al. | 2412.10246 | null |
2024-12-13 | Efficient Continual Pre-training of LLMs for Low-resource Languages | Arijit Nag et.al. | 2412.10244 | null |
2024-12-13 | Retrieval-Augmented Semantic Parsing: Using Large Language Models to Improve Generalization | Xiao Zhang et.al. | 2412.10207 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding | Junqi Ge et.al. | 2412.09616 | link |
2024-12-12 | PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models | Chenyu Yang et.al. | 2412.09613 | null |
2024-12-12 | Olympus: A Universal Task Router for Computer Vision Tasks | Yuanze Lin et.al. | 2412.09612 | link |
2024-12-12 | Feat2GS: Probing Visual Foundation Models with Gaussian Splatting | Yue Chen et.al. | 2412.09606 | null |
2024-12-12 | AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials | Yiheng Xu et.al. | 2412.09605 | null |
2024-12-12 | SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding | Hao Li et.al. | 2412.09604 | null |
2024-12-12 | Do Multimodal Large Language Models See Like Humans? | Jiaying Lin et.al. | 2412.09603 | null |
2024-12-12 | Hidden Biases of End-to-End Driving Datasets | Julian Zimmerlin et.al. | 2412.09602 | link |
2024-12-12 | InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions | Pan Zhang et.al. | 2412.09596 | link |
2024-12-12 | OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages | Chester Palen-Michel et.al. | 2412.09587 | null |
2024-12-12 | DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction | Yu Feng et.al. | 2412.09572 | null |
2024-12-12 | Does Representation Matter? Exploring Intermediate Layers in Large Language Models | Oscar Skean et.al. | 2412.09563 | null |
2024-12-12 | Foundational Large Language Models for Materials Research | Vaibhav Mishra et.al. | 2412.09560 | link |
2024-12-12 | Video Creation by Demonstration | Yihong Sun et.al. | 2412.09551 | null |
2024-12-12 | Exemplar Masking for Multimodal Incremental Learning | Yi-Lun Lee et.al. | 2412.09549 | link |
2024-12-12 | Capturing the Temporal Dependence of Training Data Influence | Jiachen T. Wang et.al. | 2412.09538 | null |
2024-12-12 | Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM | Han Wang et.al. | 2412.09530 | link |
2024-12-12 | Can Modern LLMs Act as Agent Cores in Radiology~Environments? | Qiaoyu Zheng et.al. | 2412.09529 | link |
2024-12-12 | Efficient and Comprehensive Feature Extraction in Large Vision-Language Model for Clinical Pathology Analysis | Shengxuming Zhang et.al. | 2412.09521 | null |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | Fast Prompt Alignment for Text-to-Image Generation | Khalil Mrini et.al. | 2412.08639 | link |
2024-12-11 | Multimodal Latent Language Modeling with Next-Token Diffusion | Yutao Sun et.al. | 2412.08635 | link |
2024-12-11 | Synthetic Vision: Training Vision-Language Models to Understand Physics | Vahid Balazadeh et.al. | 2412.08619 | null |
2024-12-11 | Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models | Jiahui Li et.al. | 2412.08615 | link |
2024-12-11 | Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Fan Lu et.al. | 2412.08614 | link |
2024-12-11 | Competition and Diversity in Generative AI | Manish Raghavan et.al. | 2412.08610 | link |
2024-12-11 | AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models | Mintong Kang et.al. | 2412.08608 | null |
2024-12-11 | Preference Discerning with LLM-Enhanced Generative Retrieval | Fabian Paischer et.al. | 2412.08604 | null |
2024-12-11 | Empirical Measurements of AI Training Power Demand on a GPU-Accelerated Node | Imran Latif et.al. | 2412.08602 | null |
2024-12-11 | Leveraging Graph-RAG and Prompt Engineering to Enhance LLM-Based Automated Requirement Traceability and Compliance Checks | Arsalan Masoudifard et.al. | 2412.08593 | null |
2024-12-11 | Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning | Hang Zhao et.al. | 2412.08587 | null |
2024-12-11 | TURBOATTENTION: Efficient Attention Approximation For High Throughputs LLMs | Hao Kang et.al. | 2412.08585 | null |
2024-12-11 | LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations | Zejian Li et.al. | 2412.08580 | link |
2024-12-11 | Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning | Rongzhe Wei et.al. | 2412.08559 | null |
2024-12-11 | MaestroMotif: Skill Design from Artificial Intelligence Feedback | Martin Klissarov et.al. | 2412.08542 | null |
2024-12-11 | SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting | Pallavi Jain et.al. | 2412.08536 | link |
2024-12-11 | Continual Learning for Encoder-only Language Models via a Discrete Key-Value Bottleneck | Andor Diera et.al. | 2412.08528 | null |
2024-12-11 | EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance | Yingxin Li et.al. | 2412.08521 | null |
2024-12-11 | Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation | Pengyue Jia et.al. | 2412.08519 | null |
2024-12-10 | Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Alan Nawzad Amin et.al. | 2412.07763 | link |
2024-12-10 | SAT: Spatial Aptitude Training for Multimodal Language Models | Arijit Ray et.al. | 2412.07755 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-10 | Zero-Shot ATC Coding with Large Language Models for Clinical Assessments | Zijian Chen et.al. | 2412.07743 | null |
2024-12-10 | AI Expands Scientists’ Impact but Contracts Science’s Focus | Qianyue Hao et.al. | 2412.07727 | link |
2024-12-10 | Granite Guardian | Inkit Padhi et.al. | 2412.07724 | link |
2024-12-10 | Leveraging Content and Context Cues for Low-Light Image Enhancement | Igor Morawski et.al. | 2412.07693 | link |
2024-12-10 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-10 | Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions | Anant Prakash Awasthi et.al. | 2412.07687 | null |
2024-12-10 | TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation | Alfredo Garrachón Ruiz et.al. | 2412.07682 | null |
2024-12-10 | RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models | Greg Heinrich et.al. | 2412.07679 | link |
2024-12-10 | Ask Humans or AI? Exploring Their Roles in Visualization Troubleshooting | Shuyu Shen et.al. | 2412.07673 | link |
2024-12-10 | FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks | Bocheng Chen et.al. | 2412.07672 | null |
2024-12-10 | Automating Business Intelligence Requirements with Generative AI and Semantic Search | Nimrod Busany et.al. | 2412.07668 | null |
2024-12-10 | Searching for Structure: Investigating Emergent Communication with Large Language Models | Tom Kouwenhoven et.al. | 2412.07646 | null |
2024-12-10 | TrojanWhisper: Evaluating Pre-trained LLMs to Detect and Localize Hardware Trojans | Md Omar Faruque et.al. | 2412.07636 | null |
2024-12-10 | ChocoLlama: Lessons Learned From Teaching Llamas Dutch | Matthieu Meeus et.al. | 2412.07633 | null |
2024-12-10 | Piece of Table: A Divide-and-Conquer Approach for Selecting Sub-Tables in Table Question Answering | Wonjin Lee et.al. | 2412.07629 | null |
2024-12-10 | OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Linke Ouyang et.al. | 2412.07626 | link |
2024-12-10 | DRUM: Learning Demonstration Retriever for Large MUlti-modal Models | Ellen Yi-Ge et.al. | 2412.07619 | null |
2024-12-09 | Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Yi-Lun Lee et.al. | 2412.06775 | link |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | Training Large Language Models to Reason in a Continuous Latent Space | Shibo Hao et.al. | 2412.06769 | link |
2024-12-09 | Ranking-aware adapter for text-driven image ordering with CLIP | Wei-Hsiang Yu et.al. | 2412.06760 | link |
2024-12-09 | Why Do Developers Engage with ChatGPT in Issue-Tracker? Investigating Usage and Reliance on ChatGPT-Generated Code | Joy Krishan Das et.al. | 2412.06757 | null |
2024-12-09 | Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models | Neel Jain et.al. | 2412.06748 | null |
2024-12-09 | ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities | Adhiraj Ghosh et.al. | 2412.06745 | null |
2024-12-09 | JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM | Takuro Fujii et.al. | 2412.06738 | link |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | link |
2024-12-09 | How to Merge Your Multimodal Models Over Time? | Sebastian Dziadzio et.al. | 2412.06712 | link |
2024-12-09 | OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions | Yi-Kai Zhang et.al. | 2412.06693 | null |
2024-12-09 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | I Don’t Know: Explicit Modeling of Uncertainty with an [IDK] Token | Roi Cohen et.al. | 2412.06676 | null |
2024-12-09 | ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Chunwei Wang et.al. | 2412.06673 | null |
2024-12-09 | MuMu-LLaMA: Multi-modal Music Understanding and Generation via Large Language Models | Shansong Liu et.al. | 2412.06660 | link |
2024-12-09 | Chatbots im Schulunterricht: Wir testen das Fobizz-Tool zur automatischen Bewertung von Hausaufgaben | Rainer Mühlhoff et.al. | 2412.06651 | null |
2024-12-09 | The Narrow Gate: Localized Image-Text Communication in Vision-Language Models | Alessandro Serra et.al. | 2412.06646 | null |
2024-12-09 | MAVias: Mitigate any Visual Bias | Ioannis Sarridis et.al. | 2412.06632 | null |
2024-12-09 | Copyright-Protected Language Generation via Adaptive Model Fusion | Javier Abad et.al. | 2412.06619 | link |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link |
2024-12-06 | Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Zhe Chen et.al. | 2412.05271 | link |
2024-12-06 | APOLLO: SGD-like Memory, AdamW-level Performance | Hanqing Zhu et.al. | 2412.05270 | link |
2024-12-06 | Uncertainty Quantification for Transformer Models for Dark-Pattern Detection | Javier Muñoz et.al. | 2412.05251 | null |
2024-12-06 | Enhancing Foundation Models for Time Series Forecasting via Wavelet-based Tokenization | Luca Masserano et.al. | 2412.05244 | null |
2024-12-06 | CompCap: Improving Multimodal Large Language Models with Composite Captions | Xiaohui Chen et.al. | 2412.05243 | null |
2024-12-06 | MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale | Jarvis Guo et.al. | 2412.05237 | null |
2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null |
2024-12-06 | 100% Hallucination Elimination Using Acurai | Michael C. Wood et.al. | 2412.05223 | link |
2024-12-06 | Evaluating and Aligning CodeLLMs on Human Preference | Jian Yang et.al. | 2412.05210 | null |
2024-12-06 | A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges | Aditi Singh et.al. | 2412.05208 | null |
2024-12-06 | Are Frontier Large Language Models Suitable for Q&A in Science Centres? | Jacob Watson et.al. | 2412.05200 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | LinVT: Empower Your Image-level Large Language Model to Understand Videos | Lishuai Gao et.al. | 2412.05185 | link |
2024-12-06 | QueEn: A Large Language Model for Quechua-English Translation | Junhao Chen et.al. | 2412.05184 | null |
2024-12-06 | Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models | Kuofeng Gao et.al. | 2412.05167 | null |
2024-12-06 | Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation | Manish Bhattarai et.al. | 2412.05159 | null |
2024-12-06 | Multimodal Fact-Checking with Vision Language Models: A Probing Classifier based Solution with Embedding Strategies | Recep Firat Cekinel et.al. | 2412.05155 | link |
2024-12-06 | A text-to-tabular approach to generate synthetic patient data using LLMs | Margaux Tornqvist et.al. | 2412.05153 | link |
2024-12-05 | Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Luca Bartolomei et.al. | 2412.04472 | link |
2024-12-05 | NVILA: Efficient Frontier Visual Language Models | Zhijian Liu et.al. | 2412.04468 | null |
2024-12-05 | VisionZip: Longer is Better but Not Necessary in Vision Language Models | Senqiao Yang et.al. | 2412.04467 | link |
2024-12-05 | Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection | Enshen Zhou et.al. | 2412.04455 | null |
2024-12-05 | p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Jun Zhang et.al. | 2412.04449 | link |
2024-12-05 | EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios | Lu Qiu et.al. | 2412.04447 | null |
2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
2024-12-05 | Moto: Latent Motion Token as the Bridging Language for Robot Manipulation | Yi Chen et.al. | 2412.04445 | link |
2024-12-05 | Towards Real-Time Open-Vocabulary Video Instance Segmentation | Bin Yan et.al. | 2412.04434 | null |
2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-05 | Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Shaunak Halbe et.al. | 2412.04429 | link |
2024-12-05 | Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion | Jiuhai Chen et.al. | 2412.04424 | link |
2024-12-05 | Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation | Xuying Li et.al. | 2412.04415 | null |
2024-12-05 | Establishing Task Scaling Laws via Compute-Efficient Model Ladders | Akshita Bhagia et.al. | 2412.04403 | null |
2024-12-05 | SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Rong Li et.al. | 2412.04383 | null |
2024-12-05 | Discriminative Fine-tuning of LVLMs | Yassine Ouali et.al. | 2412.04378 | null |
2024-12-05 | Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Edoardo Cetin et.al. | 2412.04368 | null |
2024-12-05 | Approximate Top- $k$ for Increased Parallelism | Oscar Key et.al. | 2412.04358 | null |
2024-12-05 | Retrieval-Augmented Machine Translation with Unstructured Knowledge | Jiaan Wang et.al. | 2412.04342 | link |
2024-12-05 | Liquid: Language Models are Scalable Multi-modal Generators | Junfeng Wu et.al. | 2412.04332 | link |
2024-12-04 | From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Xinyi Mou et.al. | 2412.03563 | link |
2024-12-04 | FLAIR: VLM with Fine-grained Language-informed Image Representations | Rui Xiao et.al. | 2412.03561 | link |
2024-12-04 | Best-of-N Jailbreaking | John Hughes et.al. | 2412.03556 | link |
2024-12-04 | PaliGemma 2: A Family of Versatile VLMs for Transfer | Andreas Steiner et.al. | 2412.03555 | null |
2024-12-04 | SPICE: Smart Projection Interface for Cooking Enhancement | Vera Prohaska et.al. | 2412.03551 | link |
2024-12-04 | Perception Tokens Enhance Visual Reasoning in Multimodal Language Models | Mahtab Bigverdi et.al. | 2412.03548 | null |
2024-12-04 | Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models | Natalie Mackraz et.al. | 2412.03537 | null |
2024-12-04 | A Review on Scientific Knowledge Extraction using Large Language Models in Biomedical Sciences | Gabriel Lino Garcia et.al. | 2412.03531 | null |
2024-12-04 | FANAL – Financial Activity News Alerting Language Modeling Framework | Urjitkumar Patel et.al. | 2412.03527 | null |
2024-12-04 | You’re (Not) My Type – Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks? | Dominic Lohr et.al. | 2412.03516 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Tight PAC-Bayesian Risk Certificates for Contrastive Learning | Anna van Elst et.al. | 2412.03486 | link |
2024-12-04 | Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning | Neale Ratzlaff et.al. | 2412.03467 | null |
2024-12-04 | Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks | Dario Serez et.al. | 2412.03453 | link |
2024-12-04 | From Words to Workflows: Automating Business Processes | Laura Minkova et.al. | 2412.03446 | null |
2024-12-04 | Assessing Foundation Models’ Transferability to Physiological Signals in Precision Medicine | Matthias Christenson et.al. | 2412.03427 | null |
2024-12-04 | PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation | Ao Wang et.al. | 2412.03409 | link |
2024-12-04 | RedStone: Curating General, Code, Math, and QA Data for Large Language Models | Yaoyao Chang et.al. | 2412.03398 | null |
2024-12-04 | Enhancing Supply Chain Visibility with Generative AI: An Exploratory Case Study on Relationship Prediction in Knowledge Graphs | Ge Zheng et.al. | 2412.03390 | null |
2024-12-04 | WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis | Chengwei Hu et.al. | 2412.03359 | null |
2024-12-03 | T-REG: Preference Optimization with Token-Level Reward Regularization | Wenxuan Zhou et.al. | 2412.02685 | null |
2024-12-03 | Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models | Yuda Song et.al. | 2412.02674 | null |
2024-12-03 | LLM-Enhanced Path Planning: Safe and Efficient Autonomous Navigation with Instructional Inputs | Pranav Doma et.al. | 2412.02655 | null |
2024-12-03 | Time-Reversal Provides Unsupervised Feedback to LLMs | Yerram Varun et.al. | 2412.02626 | null |
2024-12-03 | Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions | Kai Sun et.al. | 2412.02621 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-03 | GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot | Aohan Zeng et.al. | 2412.02612 | link |
2024-12-03 | AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? | Kaixiong Gong et.al. | 2412.02611 | null |
2024-12-03 | Interpretable Company Similarity with Sparse Autoencoders | Marco Molinari et.al. | 2412.02605 | null |
2024-12-03 | CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs | Abhas Kumar et.al. | 2412.02602 | null |
2024-12-03 | PrefixLLM: LLM-aided Prefix Circuit Design | Weihua Xiao et.al. | 2412.02594 | link |
2024-12-03 | OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Junyuan Zhang et.al. | 2412.02592 | link |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | Chenyang Liu et.al. | 2412.02573 | link |
2024-12-03 | SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Joongwon Chae et.al. | 2412.02565 | link |
2024-12-03 | Semantic Tokens in Retrieval Augmented Generation | Joel Suro et.al. | 2412.02563 | null |
2024-12-03 | Patent-CR: A Dataset for Patent Claim Revision | Lekang Jiang et.al. | 2412.02549 | null |
2024-12-03 | Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Jinjin Cai et.al. | 2412.02531 | null |
2024-12-03 | LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data | Hanyu Zhang et.al. | 2412.02525 | null |
2024-12-03 | OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations | Caixin Kang et.al. | 2412.02479 | null |
2024-12-02 | T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Shukang Yin et.al. | 2411.19951 | link |
2024-12-02 | Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability | Zicheng Lin et.al. | 2411.19943 | link |
2024-11-29 | VLSBench: Unveiling Visual Leakage in Multimodal Safety | Xuhao Hu et.al. | 2411.19939 | link |
2024-11-29 | On Domain-Specific Post-Training for Multimodal Large Language Models | Daixuan Cheng et.al. | 2411.19930 | null |
2024-11-29 | SIMS: Simulating Human-Scene Interactions with Real World Script Planning | Wenjia Wang et.al. | 2411.19921 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | PDDLFuse: A Tool for Generating Diverse Planning Domains | Vedant Khandelwal et.al. | 2411.19886 | null |
2024-12-02 | LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states | Luis Ibanez-Lissen et.al. | 2411.19876 | null |
2024-11-29 | DeMo: Decoupled Momentum Optimization | Bowen Peng et.al. | 2411.19870 | link |
2024-11-29 | AIDetx: a compression-based method for identification of machine-learning generated text | Leonardo Almeida et.al. | 2411.19869 | link |
2024-11-29 | Reverse Thinking Makes LLMs Stronger Reasoners | Justin Chih-Yao Chen et.al. | 2411.19865 | null |
2024-11-29 | Cross-Domain Recommendation Meets Large Language Models | Ajay Krishna Vajjala et.al. | 2411.19862 | link |
2024-11-29 | What fifty-one years of Linguistics and Artificial Intelligence research tell us about their correlation: A scientometric review | Mohammed Q. Shormani et.al. | 2411.19858 | null |
2024-11-29 | Sensitive Content Classification in Social Media: A Holistic Resource and Evaluation | Dimosthenis Antypas et.al. | 2411.19832 | null |
2024-11-29 | Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | Robin D. Pesl et.al. | 2411.19804 | null |
2024-11-29 | INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge | Angelika Romanou et.al. | 2411.19799 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-11-29 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et.al. | 2411.19774 | null |
2024-11-29 | LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos | Tiantian Geng et.al. | 2411.19772 | link |
2024-11-29 | Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models | Kaican Li et.al. | 2411.19757 | link |
2024-11-27 | Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation | Yueru Jia et.al. | 2411.18623 | null |
2024-11-27 | Cross-modal Information Flow in Multimodal Large Language Models | Zhi Zhang et.al. | 2411.18620 | link |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | Automated Literature Review Using NLP Techniques and LLM-Based Retrieval-Augmented Generation | Nurshat Fateh Ali et.al. | 2411.18583 | null |
2024-11-27 | Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning | Omkar Khade et.al. | 2411.18571 | null |
2024-11-27 | A Pipeline of Neural-Symbolic Integration to Enhance Spatial Reasoning in Large Language Models | Rong Wang et.al. | 2411.18564 | null |
2024-11-27 | DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation | Zhixuan Liang et.al. | 2411.18562 | null |
2024-11-27 | Retrofitting (Large) Language Models with Dynamic Tokenization | Darius Feher et.al. | 2411.18553 | null |
2024-11-27 | AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Dillon Loh et.al. | 2411.18539 | link |
2024-11-27 | Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models | Minhyeok Lee et.al. | 2411.18530 | link |
2024-11-27 | LLM-ABBA: Understand time series via symbolic approximation | Erin Carson et.al. | 2411.18506 | null |
2024-11-27 | GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation | Pengfei Zhou et.al. | 2411.18499 | null |
2024-11-27 | Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS | Jinyang Wu et.al. | 2411.18478 | null |
2024-11-27 | Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding | Ziyin Zhang et.al. | 2411.18462 | link |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2024-11-27 | An AI-Assisted Multi-Agent Dual Dialogue System to Support Mental Health Care Providers | Onno P. Kampman et.al. | 2411.18429 | null |
2024-11-27 | FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Ao Shen et.al. | 2411.18424 | null |
2024-11-27 | Politicians vs ChatGPT. A study of presuppositions in French and Italian political communication | Davide Garassino et.al. | 2411.18403 | null |
2024-11-27 | Topic Modeling and Sentiment Analysis on Japanese Online Media’s Coverage of Nuclear Energy | Yifan Sun et.al. | 2411.18383 | null |
2024-11-27 | ChatGPT as speechwriter for the French presidents | Dominique Labbé et.al. | 2411.18382 | null |
2024-11-26 | Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats | Jiaxin Wen et.al. | 2411.17693 | null |
2024-11-26 | Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens | Xu Ouyang et.al. | 2411.17691 | null |
2024-11-26 | Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration | Yuhang Han et.al. | 2411.17686 | null |
2024-11-26 | Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning | Zhu Xu et.al. | 2411.17679 | link |
2024-11-26 | Instance-Aware Graph Prompt Learning | Jiazheng Li et.al. | 2411.17676 | null |
2024-11-26 | Push the Limit of Multi-modal Emotion Recognition by Prompting LLMs with Receptive-Field-Aware Attention Weighting | Liyun Zhang et.al. | 2411.17674 | null |
2024-11-26 | SketchAgent: Language-Driven Sequential Sketch Generation | Yael Vinker et.al. | 2411.17673 | null |
2024-11-26 | Synthetic Data Generation with LLM for Improved Depression Prediction | Andrea Kang et.al. | 2411.17672 | null |
2024-11-26 | How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations | Hyunji Lee et.al. | 2411.17666 | null |
2024-11-26 | Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism | Yi-Chien Lin et.al. | 2411.17651 | link |
2024-11-26 | On Limitations of LLM as Annotator for Low Resource Languages | Suramya Jadhav et.al. | 2411.17637 | null |
2024-11-26 | MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Harsh Singh et.al. | 2411.17636 | null |
2024-11-26 | Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining | Jaewoong Lee et.al. | 2411.17625 | null |
2024-11-26 | Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Aohan Zeng et.al. | 2411.17607 | null |
2024-11-26 | HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Cong Wei et.al. | 2411.17606 | link |
2024-11-26 | Making History Readable | Bipasha Banerjee et.al. | 2411.17600 | null |
2024-11-26 | Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals | William A. Ingram et.al. | 2411.17598 | null |
2024-11-26 | Can artificial intelligence predict clinical trial outcomes? | Shuyi Jin et.al. | 2411.17595 | null |
2024-11-26 | RTL-Breaker: Assessing the Security of LLMs against Backdoor Attacks on HDL Code Generation | Lakshmi Likhitha Mankali et.al. | 2411.17569 | null |
2024-11-26 | Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey | Jiayi Kuang et.al. | 2411.17558 | null |
2024-11-25 | Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts? | Sohee Yang et.al. | 2411.16679 | null |
2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
2024-11-25 | DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Zun Wang et.al. | 2411.16657 | null |
2024-11-25 | Self-Generated Critiques Boost Reward Modeling for Language Models | Yue Yu et.al. | 2411.16646 | null |
2024-11-25 | Preventing Jailbreak Prompts as Malicious Tools for Cybercriminals: A Cyber Defense Perspective | Jean Marie Tshimula et.al. | 2411.16642 | null |
2024-11-25 | StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training | Kaustubh Ponkshe et.al. | 2411.16618 | null |
2024-11-25 | Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models | Ronghuan Wu et.al. | 2411.16602 | null |
2024-11-25 | From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge | Dawei Li et.al. | 2411.16594 | link |
2024-11-25 | Large Language Model-based Decision-making for COLREGs and the Control of Autonomous Surface Vehicles | Klinsmann Agyei et.al. | 2411.16587 | link |
2024-11-25 | MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series | Aaron Wheeler et.al. | 2411.16585 | link |
2024-11-25 | Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision | Zhiheng Xi et.al. | 2411.16579 | null |
2024-11-25 | Predictive Power of LLMs in Financial Markets | Jerick Shi et.al. | 2411.16569 | null |
2024-11-25 | EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source Code | Shahriyar Zaman Ridoy et.al. | 2411.16561 | null |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Representation Collapsing Problems in Vector Quantization | Wenhao Zhao et.al. | 2411.16550 | null |
2024-11-25 | RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Chan Hee Song et.al. | 2411.16537 | null |
2024-11-25 | Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings | Carolin M. Schuster et.al. | 2411.16527 | link |
2024-11-25 | Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency | Jerry Yao-Chieh Hu et.al. | 2411.16525 | null |
2024-11-25 | LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation | Steven Song et.al. | 2411.16523 | link |
2024-11-25 | Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis | Boming Miao et.al. | 2411.16503 | null |
2024-11-22 | Measuring Bullshit in the Language Games played by ChatGPT | Alessandro Trevisan et.al. | 2411.15129 | null |
2024-11-22 | Health AI Developer Foundations | Atilla P. Kiraly et.al. | 2411.15128 | null |
2024-11-22 | TÜLU 3: Pushing Frontiers in Open Language Model Post-Training | Nathan Lambert et.al. | 2411.15124 | link |
2024-11-22 | RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Hjalmar Wijk et.al. | 2411.15114 | link |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-11-22 | AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution | Fengyuan Liu et.al. | 2411.15102 | link |
2024-11-22 | What You See is Not What You Get: Neural Partial Differential Equations and The Illusion of Learning | Arvind Mohan et.al. | 2411.15101 | null |
2024-11-22 | XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models | Yixin Dong et.al. | 2411.15100 | null |
2024-11-22 | Context-Aware Multimodal Pretraining | Karsten Roth et.al. | 2411.15099 | null |
2024-11-22 | mR $^2$ AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA | Tao Zhang et.al. | 2411.15041 | null |
2024-11-22 | One to rule them all: natural language to bind communication, perception and action | Simone Colombani et.al. | 2411.15033 | null |
2024-11-22 | Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot | Simone Colombani et.al. | 2411.15027 | null |
2024-11-22 | DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models | Keda Tao et.al. | 2411.15024 | link |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Junhong Shen et.al. | 2411.15004 | link |
2024-11-22 | Generative AI may backfire for counterspeech | Dominik Bär et.al. | 2411.14986 | null |
2024-11-22 | Exploring Foundation Models Fine-Tuning for Cytology Classification | Manon Dausort et.al. | 2411.14975 | link |
2024-11-22 | Open-Amp: Synthetic Data Framework for Audio Effect Foundation Models | Alec Wright et.al. | 2411.14972 | link |
2024-11-22 | SwissADT: An Audio Description Translation System for Swiss Languages | Lukas Fischer et.al. | 2411.14967 | null |
2024-11-22 | LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement | Jieming Bian et.al. | 2411.14961 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | From RNNs to Foundation Models: An Empirical Study on Commercial Building Energy Consumption | Shourya Bose et.al. | 2411.14421 | null |
2024-11-21 | Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding | Yiming Zhang et.al. | 2411.14401 | null |
2024-11-21 | Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings | Aaron Zheng et.al. | 2411.14398 | null |
2024-11-21 | UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages | Bethel Melesse Tessema et.al. | 2411.14343 | link |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | link |
2024-11-21 | Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training | Zheheng Luo et.al. | 2411.14318 | null |
2024-11-21 | Automated Generation of Code Debugging Exercises | Victor-Alexandru Pădurean et.al. | 2411.14303 | null |
2024-11-21 | Auto-SPICE: Leveraging LLMs for Dataset Creation via Automated SPICE Netlist Extraction from Analog Circuit Diagrams | Jitendra Bhandari et.al. | 2411.14299 | link |
2024-11-21 | EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Yumeng Liu et.al. | 2411.14280 | link |
2024-11-21 | Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance | Haozhe Zhao et.al. | 2411.14279 | null |
2024-11-21 | Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models | Iacopo Ghinassi et.al. | 2411.14272 | link |
2024-11-21 | Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective | Ernests Lavrinovics et.al. | 2411.14258 | null |
2024-11-21 | Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models | Javier Ferrando et.al. | 2411.14257 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification | Junhua Liu et.al. | 2411.14252 | null |
2024-11-21 | Natural Language Reinforcement Learning | Xidong Feng et.al. | 2411.14251 | link |
2024-11-21 | FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression | Yuke Zhu et.al. | 2411.14228 | null |
2024-11-21 | Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data | Paul Fergus et.al. | 2411.14219 | null |
2024-11-20 | Find Any Part in 3D | Ziqi Ma et.al. | 2411.13550 | null |
2024-11-20 | SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs | Shirley Kokane et.al. | 2411.13547 | null |
2024-11-20 | Promoting User Data Autonomy During the Dissolution of a Monopolistic Firm | Rushabh Solanki et.al. | 2411.13546 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Metacognition for Unknown Situations and Environments (MUSE) | Rodolfo Valiente et.al. | 2411.13537 | null |
2024-11-20 | Predictive Insights into LGBTQ+ Minority Stress: A Transductive Exploration of Social Media Discourse | S. Chapagain et.al. | 2411.13534 | link |
2024-11-20 | Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models | Chanseo Lee et.al. | 2411.13518 | null |
2024-11-20 | Disentangling Memory and Reasoning Ability in Large Language Models | Mingyu Jin et.al. | 2411.13504 | link |
2024-11-20 | Neural machine translation of seismic waves for petrophysical inversion | José Cunha Teixeira et.al. | 2411.13491 | null |
2024-11-20 | Utilizing Large Language Models to Synthesize Product Desirability Datasets | John D. Hastings et.al. | 2411.13485 | null |
2024-11-20 | PatentEdits: Framing Patent Novelty as Textual Entailment | Ryan Lee et.al. | 2411.13477 | null |
2024-11-20 | When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training | Haonan Wang et.al. | 2411.13476 | link |
2024-11-20 | SoK: A Systems Perspective on Compound AI Threats and Countermeasures | Sarbartha Banerjee et.al. | 2411.13459 | null |
2024-11-20 | LIMBA: An Open-Source Framework for the Preservation and Valorization of Low-Resource Languages using Generative Models | Salvatore Mario Carta et.al. | 2411.13453 | null |
2024-11-20 | AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations | Gaurav Verma et.al. | 2411.13451 | null |
2024-11-20 | WaterPark: A Robustness Assessment of Language Model Watermarking | Jiacheng Liang et.al. | 2411.13425 | link |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Unification of Balti and trans-border sister dialects in the essence of LLMs and AI Technology | Muhammad Sharif et.al. | 2411.13409 | null |
2024-11-20 | Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese | Dat Van-Thanh Nguyen et.al. | 2411.13407 | null |
2024-11-19 | ACING: Actor-Critic for Instruction Learning in Black-Box Large Language Models | Salma Kharrat et.al. | 2411.12736 | link |
2024-11-19 | Information Theory of Meaningful Communication | Doron Sivan et.al. | 2411.12728 | link |
2024-11-19 | CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs | Zhehan Kan et.al. | 2411.12713 | null |
2024-11-19 | Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs | Ahmed Akib Jawad Karim et.al. | 2411.12712 | null |
2024-11-19 | Strengthening Fake News Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques. Defying BERT? | Ahmed Akib Jawad Karim et.al. | 2411.12703 | null |
2024-11-19 | When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations | Huaizhi Ge et.al. | 2411.12701 | null |
2024-11-19 | SparseInfer: Training-free Prediction of Activation Sparsity for Fast LLM Inference | Jiho Shin et.al. | 2411.12692 | null |
2024-11-19 | Neurosymbolic Graph Enrichment for Grounded World Models | Stefano De Giorgis et.al. | 2411.12671 | null |
2024-11-19 | DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Vinay Kumar Sankarapu et.al. | 2411.12643 | link |
2024-11-19 | Improving Controllability and Editability for Pretrained Text-to-Music Generation Models | Yixiao Zhang et.al. | 2411.12641 | null |
2024-11-19 | Provable unlearning in topic modeling and downstream tasks | Stanley Wei et.al. | 2411.12600 | null |
2024-11-19 | AdaCM $^2$ : On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Yuanbin Man et.al. | 2411.12593 | null |
2024-11-19 | Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models | Laura Ruis et.al. | 2411.12580 | link |
2024-11-19 | Large Language Models for Combinatorial Optimization of Design Structure Matrix | Shuo Jiang et.al. | 2411.12571 | null |
2024-11-19 | Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Riccardo Grazzi et.al. | 2411.12537 | link |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-19 | Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Terufumi Morishita et.al. | 2411.12498 | link |
2024-11-19 | AI Flow at the Network Edge | Jiawei Shao et.al. | 2411.12469 | null |
2024-11-19 | Guide-to-Explain for Controllable Summarization | Sangwon Ryu et.al. | 2411.12460 | null |
2024-11-19 | \textsc{Neon}: News Entity-Interaction Extraction for Enhanced Question Answering | Sneha Singhania et.al. | 2411.12449 | null |
2024-11-18 | Bi-Mamba: Towards Accurate 1-Bit State Space Models | Shengkun Tang et.al. | 2411.11843 | null |
2024-11-18 | Tackling prediction tasks in relational databases with LLMs | Marek Wydmuch et.al. | 2411.11829 | null |
2024-11-18 | Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev et.al. | 2411.11795 | null |
2024-11-18 | LLM-IE: A Python Package for Generative Information Extraction with Large Language Models | Enshuo Hsu et.al. | 2411.11779 | null |
2024-11-18 | sMoRe: Enhancing Object Manipulation and Organization in Mixed Reality Spaces with LLMs and Generative AI | Yunhao Xing et.al. | 2411.11752 | null |
2024-11-18 | BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration | Yuzong Chen et.al. | 2411.11745 | link |
2024-11-18 | Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment | Allison Huang et.al. | 2411.11731 | link |
2024-11-18 | Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Mingchao Qi et.al. | 2411.11714 | link |
2024-11-18 | FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models | Tao Fan et.al. | 2411.11707 | null |
2024-11-18 | MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Ruichuan An et.al. | 2411.11706 | link |
2024-11-18 | Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search | Jinhao Jiang et.al. | 2411.11694 | null |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment | Jiawei Li et.al. | 2411.11681 | link |
2024-11-18 | Dissecting Misalignment of Multimodal Large Language Models via Influence Function | Lijie Hu et.al. | 2411.11667 | null |
2024-11-18 | TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Mengxuan Li et.al. | 2411.11641 | link |
2024-11-18 | Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare | Leon Kopitar et.al. | 2411.11635 | null |
2024-11-18 | Signaling and Social Learning in Swarms of Robots | Leo Cazenille et.al. | 2411.11616 | null |
2024-11-18 | Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining | Danny Barash et.al. | 2411.11613 | null |
2024-11-18 | VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation | Bangguo Yu et.al. | 2411.11609 | null |
2024-11-18 | Exploring LLMs for Verifying Technical System Specifications Against Requirements | Lasse M. Reinpold et.al. | 2411.11582 | null |
2024-11-15 | VeriGraph: Scene Graphs for Execution Verifiable Robot Planning | Daniel Ekpo et.al. | 2411.10446 | null |
2024-11-15 | Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization | Weiyun Wang et.al. | 2411.10442 | null |
2024-11-15 | LLaVA-o1: Let Vision Language Models Reason Step-by-Step | Guowei Xu et.al. | 2411.10440 | link |
2024-11-15 | MARS: Unleashing the Power of Variance Reduction for Training Large Models | Huizhuo Yuan et.al. | 2411.10438 | link |
2024-11-15 | Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Yuhan Fu et.al. | 2411.10436 | null |
2024-11-15 | Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Parsa Hejabi et.al. | 2411.10422 | link |
2024-11-15 | On the Foundation Model for Cardiac MRI Reconstruction | Chi Zhang et.al. | 2411.10403 | null |
2024-11-15 | Interactive Cycle Model – The Linkage Combination among Automatic Speech Recognition, Large Language Models and Smart Glasses | Libo Wang et.al. | 2411.10362 | link |
2024-11-15 | Bias Unveiled: Investigating Social Bias in LLM-Generated Code | Lin Ling et.al. | 2411.10351 | null |
2024-11-15 | Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images | Ammar Qammaz et.al. | 2411.10334 | null |
2024-11-15 | Number it: Temporal Grounding Videos like Flipping Manga | Yongliang Wu et.al. | 2411.10332 | link |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | Static network structure cannot stabilize cooperation among Large Language Model agents | Jin Han et.al. | 2411.10294 | null |
2024-11-15 | Scaling Law for Post-training after Model Pruning | Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-15 | Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Jingru Yang et.al. | 2411.10252 | null |
2024-11-15 | Measuring Non-Adversarial Reproduction of Training Data in Large Language Models | Michael Aerni et.al. | 2411.10242 | null |
2024-11-15 | Generative AI in Multimodal User Interfaces: Trends, Challenges, and Cross-Platform Adaptability | J. Bieniek et.al. | 2411.10234 | null |
2024-11-15 | An Empirical Study on LLM-based Agents for Automated Bug Fixing | Xiangxin Meng et.al. | 2411.10213 | null |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | CART: Compositional Auto-Regressive Transformer for Image Generation | Siddharth Roheda et.al. | 2411.10180 | null |
2024-11-14 | MagicQuill: An Intelligent Interactive Image Editing System | Zichen Liu et.al. | 2411.09703 | link |
2024-11-14 | Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models | Wei Wang et.al. | 2411.09691 | null |
2024-11-14 | Squeezed Attention: Accelerating Long Context Length LLM Inference | Coleman Hooper et.al. | 2411.09688 | link |
2024-11-14 | Adaptive Decoding via Latent Preference Optimization | Shehzaad Dhuliawala et.al. | 2411.09661 | null |
2024-11-14 | On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse | Alkis Kalavasis et.al. | 2411.09642 | null |
2024-11-14 | Local deployment of large-scale music AI models on commodity hardware | Xun Zhou et.al. | 2411.09625 | null |
2024-11-14 | PTR: Precision-Driven Tool Recommendation for Large Language Models | Hang Gao et.al. | 2411.09613 | null |
2024-11-14 | The Moral Foundations Weibo Corpus | Renjie Cao et.al. | 2411.09612 | null |
2024-11-14 | Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework | Ronak Pradeep et.al. | 2411.09607 | null |
2024-11-14 | Accelerating Knowledge Graph and Ontology Engineering with Large Language Models | Cogan Shimizu et.al. | 2411.09601 | null |
2024-11-14 | Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images | Bipasha Kundu et.al. | 2411.09598 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2024-11-14 | Adopting RAG for LLM-Aided Future Vehicle Design | Vahid Zolfaghari et.al. | 2411.09590 | null |
2024-11-14 | BabyLM Challenge: Exploring the Effect of Variation Sets on Language Model Training Efficiency | Akari Haga et.al. | 2411.09587 | null |
2024-11-14 | Software Performance Engineering for Foundation Model-Powered Software (FMware) | Haoxiang Zhang et.al. | 2411.09580 | null |
2024-11-14 | Piecing It All Together: Verifying Multi-Hop Multimodal Claims | Haoran Wang et.al. | 2411.09547 | null |
2024-11-14 | A Practical Guide to Fine-tuning Language Models with Limited Data | Márton Szép et.al. | 2411.09539 | null |
2024-11-14 | Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents | Yuyou Gan et.al. | 2411.09523 | null |
2024-11-14 | Communication Compression for Tensor Parallel LLM Inference | Jan Hansen-Palmus et.al. | 2411.09510 | null |
2024-11-14 | Spider: Any-to-Many Multimodal LLM | Jinxiang Lai et.al. | 2411.09439 | link |
2024-11-13 | Large Wireless Model (LWM): A Foundation Model for Wireless Channels | Sadjad Alikhani et.al. | 2411.08872 | link |
2024-11-13 | The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Daniel P. Jeong et.al. | 2411.08870 | link |
2024-11-13 | CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Wissam Antoun et.al. | 2411.08868 | null |
2024-11-13 | LLMStinger: Jailbreaking LLMs using RL fine-tuned LLMs | Piyush Jha et.al. | 2411.08862 | null |
2024-11-13 | Multimodal Instruction Tuning with Hybrid State Space Models | Jianing Zhou et.al. | 2411.08840 | null |
2024-11-13 | FinRobot: AI Agent for Equity Research and Valuation with Large Language Models | Tianyu Zhou et.al. | 2411.08804 | link |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | Can sparse autoencoders be used to decompose and interpret steering vectors? | Harry Mayne et.al. | 2411.08790 | link |
2024-11-13 | Sharingan: Extract User Action Sequence from Desktop Recordings | Yanting Chen et.al. | 2411.08768 | null |
2024-11-13 | Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers | Clément Dumas et.al. | 2411.08745 | link |
2024-11-13 | A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models | Dingdong Wang et.al. | 2411.08742 | null |
2024-11-13 | Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla et.al. | 2411.08733 | link |
2024-11-13 | Polymetis:Large Language Modeling for Multiple Material Domains | Chao Huang et.al. | 2411.08728 | null |
2024-11-13 | Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification | Jose-Luis Matez-Bandera et.al. | 2411.08727 | link |
2024-11-13 | Theoretical Analysis of Byte-Pair Encoding | László Kozma et.al. | 2411.08671 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-13 | UniMat: Unifying Materials Embeddings through Multi-modal Learning | Janghoon Ock et.al. | 2411.08664 | null |
2024-11-13 | Accelerating Quasi-Static Time Series Simulations with Foundation Models | Alban Puech et.al. | 2411.08652 | null |
2024-11-13 | A System Level Performance Evaluation for Superconducting Digital Systems | Joyjit Kundu et.al. | 2411.08645 | null |
2024-11-13 | Towards Secure Intelligent O-RAN Architecture: Vulnerabilities, Threats and Promising Technical Solutions using LLMs | Mojdeh Karbalaee Motalleb et.al. | 2411.08640 | null |
2024-11-12 | Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data | Juanhui Li et.al. | 2411.08028 | null |
2024-11-12 | LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models | Anoop Cherian et.al. | 2411.08027 | null |
2024-11-12 | Language Models as Causal Effect Generators | Lucius E. J. Bynum et.al. | 2411.08019 | link |
2024-11-12 | ExpressivityArena: Can LLMs Express Information Implicitly? | Joshua Tint et.al. | 2411.08010 | null |
2024-11-12 | Can adversarial attacks by large language models be attributed? | Manuel Cebrian et.al. | 2411.08003 | null |
2024-11-12 | Derivational Morphology Reveals Analogical Generalization in Large Language Models | Valentin Hofmann et.al. | 2411.07990 | null |
2024-11-12 | JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation | Yiyang Ma et.al. | 2411.07975 | link |
2024-11-12 | From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents | Chuyi Kong et.al. | 2411.07965 | null |
2024-11-12 | Towards Low-bit Communication for Tensor Parallel LLM Inference | Harry Dong et.al. | 2411.07942 | null |
2024-11-12 | Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer’s Disease | Francesco Chiumento et.al. | 2411.07871 | null |
2024-11-12 | Trustful LLMs: Customizing and Grounding Text Generation with Knowledge Bases and Dual Decoders | Xiaofeng Zhu et.al. | 2411.07870 | null |
2024-11-12 | Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models | Yusen Zhang et.al. | 2411.07858 | link |
2024-11-12 | Tucano: Advancing Neural Text Generation for Portuguese | Nicholas Kluge Corrêa et.al. | 2411.07854 | link |
2024-11-12 | NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN | Sonia Raychaudhuri et.al. | 2411.07848 | null |
2024-11-12 | Chain Association-based Attacking and Shielding Natural Language Processing Systems | Jiacheng Huang et.al. | 2411.07843 | null |
2024-11-12 | FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Philip Zmushko et.al. | 2411.07837 | link |
2024-11-12 | Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices | Kilian Pfeiffer et.al. | 2411.07826 | null |
2024-11-12 | Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models | Youan Cong et.al. | 2411.07820 | null |
2024-11-12 | Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks | Tianqu Kang et.al. | 2411.07806 | null |
2024-11-12 | Likelihood as a Performance Gauge for Retrieval-Augmented Generation | Tianyu Liu et.al. | 2411.07773 | link |
2024-11-11 | UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts | Bo Yang et.al. | 2411.07240 | link |
2024-11-11 | OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Sumeth Yuenyong et.al. | 2411.07238 | null |
2024-11-11 | Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations | Chaitanya Malaviya et.al. | 2411.07237 | null |
2024-11-11 | Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving | Botao Yu et.al. | 2411.07228 | null |
2024-11-11 | TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models | Matheus Simão et.al. | 2411.07224 | null |
2024-11-11 | Comparing Bottom-Up and Top-Down Steering Approaches on In-Context Learning Tasks | Madeline Brumley et.al. | 2411.07213 | null |
2024-11-11 | General Geospatial Inference with a Population Dynamics Foundation Model | Mohit Agarwal et.al. | 2411.07207 | link |
2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | link |
2024-11-11 | The Super Weight in Large Language Models | Mengxia Yu et.al. | 2411.07191 | link |
2024-11-11 | NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics | David Robinson et.al. | 2411.07186 | null |
2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
2024-11-11 | Counterfactual Generation from Language Models | Shauli Ravfogel et.al. | 2411.07180 | link |
2024-11-11 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | link |
2024-11-11 | Continual Memorization of Factoids in Large Language Models | Howard Chen et.al. | 2411.07175 | link |
2024-11-11 | A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19 | Vedant Khandelwal et.al. | 2411.07163 | null |
2024-11-11 | Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models | Yancheng He et.al. | 2411.07140 | null |
2024-11-11 | Stronger Models are NOT Stronger Teachers for Instruction Tuning | Zhangchen Xu et.al. | 2411.07133 | null |
2024-11-11 | Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis | Taihang Hu et.al. | 2411.07132 | link |
2024-11-11 | Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation | Kaijian Zou et.al. | 2411.07130 | link |
2024-11-11 | Benchmarking LLMs’ Judgments with No Gold Standard | Shengwei Xu et.al. | 2411.07127 | link |
2024-11-08 | Recycled Attention: Efficient inference for long-context language models | Fangyuan Xu et.al. | 2411.05787 | link |
2024-11-08 | Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua et.al. | 2411.05781 | link |
2024-11-08 | Fact or Fiction? Can LLMs be Reliable Annotators for Political Truths? | Veronica Chatrath et.al. | 2411.05775 | null |
2024-11-08 | Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024 | Christopher Malon et.al. | 2411.05762 | null |
2024-11-08 | End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Dylan Goetting et.al. | 2411.05755 | link |
2024-11-08 | Aioli: A Unified Optimization Framework for Language Model Data Mixing | Mayee F. Chen et.al. | 2411.05735 | link |
2024-11-08 | Poze: Sports Technique Feedback under Data Constraints | Agamdeep Singh et.al. | 2411.05734 | null |
2024-11-08 | STARS: Sensor-agnostic Transformer Architecture for Remote Sensing | Ethan King et.al. | 2411.05714 | null |
2024-11-08 | Unmasking the Limits of Large Language Models: A Systematic Evaluation of Masked Text Processing Ability through MskQA and MskCal | Fuka Matsuzaki et.al. | 2411.05665 | link |
2024-11-08 | The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent | Leon O. H. Kroczek et.al. | 2411.05653 | null |
2024-11-08 | LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution | Yuheng Zhao et.al. | 2411.05651 | null |
2024-11-08 | Harnessing High-Level Song Descriptors towards Natural Language-Based Music Recommendation | Elena V. Epure et.al. | 2411.05649 | link |
2024-11-08 | Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Long Truong To et.al. | 2411.05641 | null |
2024-11-08 | Assessing Open-Source Large Language Models on Argumentation Mining Subtasks | Mohammad Yeghaneh Abkenar et.al. | 2411.05639 | null |
2024-11-08 | A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion Diagnosis | Cristiano Patrício et.al. | 2411.05609 | link |
2024-11-08 | Evaluating and Adapting Large Language Models to Represent Folktales in Low-Resource Languages | JA Meaney et.al. | 2411.05593 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | Training objective drives the consistency of representational similarity across datasets | Laure Ciernik et.al. | 2411.05561 | link |
2024-11-08 | AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality | Ilias Bournias et.al. | 2411.05555 | null |
2024-11-08 | Assessing the Answerability of Queries in Retrieval-Augmented Code Generation | Geonmin Kim et.al. | 2411.05547 | null |
2024-11-07 | SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | Analyzing The Language of Visual Tokens | David M. Chan et.al. | 2411.05001 | null |
2024-11-07 | Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? | Jonathan Roberts et.al. | 2411.05000 | null |
2024-11-07 | DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation | Peiqi Liu et.al. | 2411.04999 | link |
2024-11-07 | LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation | Weiquan Huang et.al. | 2411.04997 | link |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives | Hao Sun et.al. | 2411.04991 | link |
2024-11-07 | The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities | Zhaofeng Wu et.al. | 2411.04986 | link |
2024-11-07 | Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries | Dylan Manuel et.al. | 2411.04981 | null |
2024-11-07 | SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference | Gabriele Oliaro et.al. | 2411.04975 | null |
2024-11-07 | BitNet a4.8: 4-bit Activations for 1-bit LLMs | Hongyu Wang et.al. | 2411.04965 | null |
2024-11-07 | Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | Yanjun Gao et.al. | 2411.04962 | null |
2024-11-07 | CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM | Jingwei Xu et.al. | 2411.04954 | null |
2024-11-07 | M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding | Jaemin Cho et.al. | 2411.04952 | null |
2024-11-07 | A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Panwen Hu et.al. | 2411.04942 | null |
2024-11-07 | VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Shehan Munasinghe et.al. | 2411.04923 | null |
2024-11-07 | GPTKB: Building Very Large Knowledge Bases from Language Models | Yujia Hu et.al. | 2411.04920 | link |
2024-11-07 | OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models | Siming Huang et.al. | 2411.04905 | null |
2024-11-07 | In the Era of Prompt Learning with Vision-Language Models | Ankit Jha et.al. | 2411.04892 | null |
2024-11-07 | GUI Agents with Foundation Models: A Comprehensive Survey | Shuai Wang et.al. | 2411.04890 | null |
2024-11-06 | Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P. Jeong et.al. | 2411.04118 | link |
2024-11-06 | How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis | Guan Zhe Hong et.al. | 2411.04105 | null |
2024-11-06 | RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Maya Varma et.al. | 2411.04097 | link |
2024-11-06 | Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation | Ke Fan et.al. | 2411.04079 | null |
2024-11-06 | H-POPE: Hierarchical Polling-based Probing Evaluation of Hallucinations in Large Vision-Language Models | Nhi Pham et.al. | 2411.04077 | null |
2024-11-06 | M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models | Chuhan Li et.al. | 2411.04075 | null |
2024-11-06 | Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning | Ping Li et.al. | 2411.04059 | link |
2024-11-06 | Beemo: Benchmark of Expert-edited Machine-generated Outputs | Ekaterina Artemova et.al. | 2411.04032 | link |
2024-11-06 | Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages | Aniket Deroy et.al. | 2411.04025 | null |
2024-11-06 | Select2Plan: Training-Free ICL-Based Planning through VQA and Memory Retrieval | Davide Buoso et.al. | 2411.04006 | null |
2024-11-06 | Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning | Jiawei Yao et.al. | 2411.03978 | link |
2024-11-06 | What Really is Commonsense Knowledge? | Quyet V. Do et.al. | 2411.03964 | null |
2024-11-06 | How Does A Text Preprocessing Pipeline Affect Ontology Syntactic Matching? | Zhangcheng Qiang et.al. | 2411.03962 | link |
2024-11-06 | Face Reconstruction from Face Embeddings using Adapter to a Face Foundation Model | Hatef Otroshi Shahreza et.al. | 2411.03960 | null |
2024-11-06 | Fine-Grained Guidance for Retrievers: Leveraging LLMs’ Feedback in Retrieval-Augmented Generation | Yuhang Liu et.al. | 2411.03957 | null |
2024-11-06 | Long-Form Text-to-Music Generation with Adaptive Prompts: A Case of Study in Tabletop Role-Playing Games Soundtracks | Felipe Marra et.al. | 2411.03948 | link |
2024-11-06 | Interactions Across Blocks in Post-Training Quantization of Large Language Models | Khasmamad Shabanovi et.al. | 2411.03934 | null |
2024-11-06 | Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models | Minh Duc Bui et.al. | 2411.03888 | link |
2024-11-06 | Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models | Zhijian Zhuo et.al. | 2411.03884 | link |
2024-11-06 | MEG: Medical Knowledge-Augmented Large Language Models for Question Answering | Laura Cabello et.al. | 2411.03883 | link |
2024-11-05 | Inference Optimal VLMs Need Only One Visual Token but Larger Models | Kevin Y. Li et.al. | 2411.03312 | link |
2024-11-05 | LLMs for Domain Generation Algorithm Detection | Reynier Leyva La O et.al. | 2411.03307 | null |
2024-11-05 | VERITAS: A Unified Approach to Reliability Evaluation | Rajkumar Ramamurthy et.al. | 2411.03300 | null |
2024-11-05 | Examining Human-AI Collaboration for Co-Writing Constructive Comments Online | Farhana Shahid et.al. | 2411.03295 | null |
2024-11-05 | Interaction2Code: How Far Are We From Automatic Interactive Webpage Generation? | Jingyu Xiao et.al. | 2411.03292 | link |
2024-11-05 | The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare | Souren Pashangpour et.al. | 2411.03287 | null |
2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-05 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou et.al. | 2411.03250 | null |
2024-11-05 | From Pen to Prompt: How Creative Writers Integrate AI into their Writing Practice | Alicia Guo et.al. | 2411.03137 | null |
2024-11-05 | “Create a Fear of Missing Out” – ChatGPT Implements Unsolicited Deceptive Designs in Generated Websites Without Warning | Veronika Krauß et.al. | 2411.03108 | null |
2024-11-05 | Utilizing Precise and Complete Code Context to Guide LLM in Automatic False Positive Mitigation | Jinbao Chen et.al. | 2411.03079 | null |
2024-11-05 | Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Bei Li et.al. | 2411.03042 | null |
2024-11-05 | HumanVLM: Foundation for Human-Scene Vision-Language Model | Dawei Dai et.al. | 2411.03034 | null |
2024-11-05 | Leveraging Large Language Models in Code Question Answering: Baselines and Issues | Georgy Andryushchenko et.al. | 2411.03012 | link |
2024-11-05 | Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Samuel Lee et.al. | 2411.03004 | null |
2024-11-05 | Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation | Junchen Fu et.al. | 2411.02992 | null |
2024-11-05 | Growing a Tail: Increasing Output Diversity in Large Language Models | Michal Shur-Ofry et.al. | 2411.02989 | null |
2024-11-05 | [Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Maren Pielka et.al. | 2411.02973 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | link |
2024-11-04 | Adaptive Length Image Tokenization via Recurrent Allocation | Shivam Duggal et.al. | 2411.02393 | link |
2024-11-04 | Attacking Vision-Language Computer Agents via Pop-ups | Yanzhe Zhang et.al. | 2411.02391 | link |
2024-11-04 | Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models | Guangzhi Xiong et.al. | 2411.02382 | null |
2024-11-04 | Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI | Ramneet Kaur et.al. | 2411.02381 | null |
2024-11-04 | Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis | Neel Dey et.al. | 2411.02372 | link |
2024-11-04 | DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Yang Yue et.al. | 2411.02359 | link |
2024-11-04 | “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization | Eldar Kurtic et.al. | 2411.02355 | null |
2024-11-04 | Machine learning identification of maternal inflammatory response and histologic choroamnionitis from placental membrane whole slide images | Abhishek Sharma et.al. | 2411.02354 | null |
2024-11-04 | Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences | Ruotong Wang et.al. | 2411.02353 | null |
2024-11-04 | Can Large Language Models generalize analogy solving like people can? | Claire E. Stevenson et.al. | 2411.02348 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | Sparsing Law: Towards Large Language Models with Greater Activation Sparsity | Yuqi Luo et.al. | 2411.02335 | link |
2024-11-04 | Disrupting Test Development with AI Assistants | Vijay Joshi et.al. | 2411.02328 | null |
2024-11-04 | PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance | Ruyang Liu et.al. | 2411.02327 | link |
2024-11-04 | An Empirical Study on the Code Refactoring Capability of Large Language Models | Jonathan Cordeiro et.al. | 2411.02320 | null |
2024-11-04 | Evaluating the Ability of Large Language Models to Generate Verifiable Specifications in VeriFast | Marilyn Rego et.al. | 2411.02318 | null |
2024-11-04 | Defining and Evaluating Physical Safety for Large Language Models | Yung-Chen Tang et.al. | 2411.02317 | null |
2024-11-04 | Evaluating Creative Short Story Generation in Humans and Large Language Models | Mete Ismayilzada et.al. | 2411.02316 | link |
2024-11-04 | Taking AI Welfare Seriously | Robert Long et.al. | 2411.00986 | null |
2024-10-31 | P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation | Mohamed Elgaar et.al. | 2410.24201 | null |
2024-11-01 | SelfCodeAlign: Self-Alignment for Code Generation | Yuxiang Wei et.al. | 2410.24198 | link |
2024-10-31 | DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | Heng-Jui Chang et.al. | 2410.24177 | null |
2024-10-31 | Constraint Back-translation Improves Complex Instruction Following of Large Language Models | Yunjia Qi et.al. | 2410.24175 | link |
2024-10-31 | $π_0$ : A Vision-Language-Action Flow Model for General Robot Control | Kevin Black et.al. | 2410.24164 | null |
2024-10-31 | GPT or BERT: why not both? | Lucas Georges Gabriel Charpentier et.al. | 2410.24159 | link |
2024-10-31 | Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | Jinghan Zhang et.al. | 2410.24155 | null |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | Exploring Vision Language Models for Facial Attribute Recognition: Emotion, Race, Gender, and Age | Nouar AlDahoul et.al. | 2410.24148 | null |
2024-10-31 | Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing | Akash Dhruv et.al. | 2410.24119 | link |
2024-10-31 | Repository-Level Compositional Code Translation and Validation | Ali Reza Ibrahimzada et.al. | 2410.24117 | link |
2024-10-31 | Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | Nabeel Seedat et.al. | 2410.24105 | null |
2024-10-31 | Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning | Nabil Omi et.al. | 2410.24096 | null |
2024-10-31 | In-Context Fine-Tuning for Time-Series Foundation Models | Abhimanyu Das et.al. | 2410.24087 | null |
2024-10-31 | Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs | Muhammed Saeed et.al. | 2410.24049 | null |
2024-10-31 | Handwriting Recognition in Historical Documents with Multimodal LLM | Lucian Li et.al. | 2410.24034 | null |
2024-10-31 | Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks | Yingzhe Peng et.al. | 2410.24032 | null |
2024-10-31 | AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Yifan Xu et.al. | 2410.24024 | link |
2024-10-31 | SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation | Liang He et.al. | 2410.24022 | null |
2024-10-31 | Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Ioannis Tsiamas et.al. | 2410.24019 | null |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287 | null |
2024-10-30 | A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction | Qidong Yang et.al. | 2410.23272 | null |
2024-10-30 | TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models | Ziyao Shangguan et.al. | 2410.23266 | link |
2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | Keypoint Abstraction using Large Models for Object-Relative Imitation Learning | Xiaolin Fang et.al. | 2410.23254 | null |
2024-10-30 | Evaluating Cultural and Social Awareness of LLM Web Agents | Haoyi Qiu et.al. | 2410.23252 | null |
2024-10-30 | Carrot and Stick: Eliciting Comparison Data and Beyond | Yiling Chen et.al. | 2410.23243 | null |
2024-10-30 | A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf et.al. | 2410.23242 | link |
2024-10-30 | EMOTION: Expressive Motion Sequence Generation for Humanoid Robots with In-Context Learning | Peide Huang et.al. | 2410.23234 | null |
2024-10-30 | COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences | Yixin Liu et.al. | 2410.23223 | link |
2024-10-30 | Partial Channel Dependence with Channel Masks for Time Series Foundation Models | Seunghan Lee et.al. | 2410.23222 | null |
2024-10-30 | OS-ATLAS: A Foundation Action Model for Generalist GUI Agents | Zhiyong Wu et.al. | 2410.23218 | link |
2024-10-31 | Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval | Sheryl Hsu et.al. | 2410.23214 | null |
2024-10-30 | ProTransformer: Robustify Transformers via Plug-and-Play Paradigm | Zhichao Hou et.al. | 2410.23182 | link |
2024-10-30 | ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning | Millennium Bismay et.al. | 2410.23180 | link |
2024-10-30 | TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Haiyang Wang et.al. | 2410.23168 | link |
2024-10-30 | SciPIP: An LLM-based Scientific Paper Idea Proposer | Wenxiao Wang et.al. | 2410.23166 | link |
2024-10-30 | FlexTSF: A Universal Forecasting Model for Time Series with Variable Regularities | Jingge Xiao et.al. | 2410.23160 | link |
2024-10-30 | VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | Yichao Liang et.al. | 2410.23156 | null |
2024-10-30 | Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms | Jordan Meyer et.al. | 2410.23144 | null |
2024-10-29 | Local Policies Enable Zero-shot Long-horizon Manipulation | Murtaza Dalal et.al. | 2410.22332 | null |
2024-10-29 | Task Vectors are Cross-Modal | Grace Luo et.al. | 2410.22330 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI’s Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323 | null |
2024-10-29 | Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Can Chen et.al. | 2410.22318 | link |
2024-10-29 | Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier | Kai Wang et.al. | 2410.22317 | link |
2024-10-29 | Natural Language Inference Improves Compositionality in Vision-Language Models | Paola Cascante-Bonilla et.al. | 2410.22315 | null |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-30 | GPT-4o reads the mind in the eyes | James W. A. Strachan et.al. | 2410.22309 | null |
2024-10-29 | SVIP: Towards Verifiable Inference of Open-source Large Language Models | Yifan Sun et.al. | 2410.22307 | null |
2024-10-29 | Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning | Yihe Deng et.al. | 2410.22304 | null |
2024-10-29 | LLMs are Highly-Constrained Biophysical Sequence Optimizers | Angelica Chen et.al. | 2410.22296 | null |
2024-10-29 | Fine-Tuning LLMs for Code Mutation: A New Era of Cyber Threats | Mohammad Setak et.al. | 2410.22293 | null |
2024-10-29 | From melodic note sequences to pitches using word2vec | Daniel Defays et.al. | 2410.22285 | null |
2024-10-29 | Embedding-based classifiers can detect prompt injection attacks | Md. Ahsan Ayub et.al. | 2410.22284 | link |
2024-10-29 | Whose ChatGPT? Unveiling Real-World Educational Inequalities Introduced by Large Language Models | Renzhe Yu et.al. | 2410.22282 | null |
2024-10-29 | Fourier Head: Helping Large Language Models Learn Complex Probability Distributions | Nate Gillman et.al. | 2410.22269 | null |
2024-10-29 | Meta-Learning Adaptable Foundation Models | Jacob L. Block et.al. | 2410.22264 | null |
2024-10-29 | FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | Farima Fatahi Bayat et.al. | 2410.22257 | null |
2024-10-29 | Abrupt Learning in Transformers: A Case Study on Matrix Completion | Pulkit Gopalani et.al. | 2410.22244 | null |
2024-10-29 | Are Decoder-Only Large Language Models the Silver Bullet for Code Search? | Yuxuan Chen et.al. | 2410.22240 | link |
2024-10-28 | Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Yaniv Nikankin et.al. | 2410.21272 | link |
2024-10-28 | LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior | Hanyu Wang et.al. | 2410.21264 | null |
2024-10-28 | BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference | Changwoo Lee et.al. | 2410.21262 | link |
2024-10-29 | AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Han Bao et.al. | 2410.21259 | link |
2024-10-28 | Multi-modal AI for comprehensive breast cancer prognostication | Jan Witowski et.al. | 2410.21256 | null |
2024-10-28 | LongReward: Improving Long-context Large Language Models with AI Feedback | Jiajie Zhang et.al. | 2410.21252 | link |
2024-10-28 | Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback | Nour Jedidi et.al. | 2410.21242 | null |
2024-10-28 | Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Zhantao Yang et.al. | 2410.21237 | null |
2024-10-28 | Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Weizhe Chen et.al. | 2410.21236 | null |
2024-10-28 | LoRA vs Full Fine-tuning: An Illusion of Equivalence | Reece Shuttleworth et.al. | 2410.21228 | null |
2024-10-28 | Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines | Zhixin Zhang et.al. | 2410.21220 | link |
2024-10-28 | Lifting the Veil on the Large Language Model Supply Chain: Composition, Risks, and Mitigations | Kaifeng Huang et.al. | 2410.21218 | null |
2024-10-28 | BongLLaMA: LLaMA for Bangla Language | Abdullah Khan Zehady et.al. | 2410.21200 | null |
2024-10-28 | Belief in the Machine: Investigating Epistemological Blind Spots of Language Models | Mirac Suzgun et.al. | 2410.21195 | link |
2024-10-29 | Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction | Qintong Zhang et.al. | 2410.21169 | null |
2024-10-28 | M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation | Jiaheng Liu et.al. | 2410.21157 | null |
2024-10-28 | Palisade – Prompt Injection Detection Framework | Sahasra Kokkula et.al. | 2410.21146 | null |
2024-10-28 | LLM-initialized Differentiable Causal Discovery | Shiv Kampani et.al. | 2410.21141 | null |
2024-10-28 | Do LLMs generate test oracles that capture the actual or the expected program behaviour? | Michael Konstantinou et.al. | 2410.21136 | null |
2024-10-28 | Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments | Marharyta Domnich et.al. | 2410.21131 | link |
2024-10-25 | The Potential and Value of AI Chatbot in Personalized Cognitive Training | Zilong Wang et.al. | 2410.19733 | null |
2024-10-25 | Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models | Yucheng Zhou et.al. | 2410.19732 | null |
2024-10-25 | Counting Ability of Large Language Models and Impact of Tokenization | Xiang Zhang et.al. | 2410.19730 | link |
2024-10-25 | FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Nicole Cho et.al. | 2410.19727 | null |
2024-10-25 | 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision | Shilong Li et.al. | 2410.19720 | null |
2024-10-25 | Multi-view biomedical foundation models for molecule-target and property prediction | Parthasarathy Suryanarayanan et.al. | 2410.19704 | link |
2024-10-25 | TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning | Xiangyu Zeng et.al. | 2410.19702 | null |
2024-10-25 | IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Kaixian Qu et.al. | 2410.19697 | null |
2024-10-25 | Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs | Yifei Zhang et.al. | 2410.19694 | null |
2024-10-25 | APRICOT: Active Preference Learning and Constraint-Aware Task Planning with LLMs | Huaxiaoyue Wang et.al. | 2410.19656 | null |
2024-10-25 | Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models | Shenghao Fu et.al. | 2410.19635 | null |
2024-10-25 | Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina | Yuan Gao et.al. | 2410.19599 | null |
2024-10-25 | Diverse Sign Language Translation | Xin Shen et.al. | 2410.19586 | link |
2024-10-25 | ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems | Ritvik Aggarwal Ishneet Sukhvinder Singh Ibrahim Allahverdiyev et.al. | 2410.19572 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-25 | Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad? | Antonia Wüst et.al. | 2410.19546 | link |
2024-10-25 | Brain-like Functional Organization within Large Language Models | H. Sun et.al. | 2410.19542 | null |
2024-10-25 | Detection of Human and Machine-Authored Fake News in Urdu | Muhammad Zain Ali et.al. | 2410.19517 | link |
2024-10-25 | SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models | Jahyun Koo et.al. | 2410.19503 | null |
2024-10-25 | Introducing MAPO: Momentum-Aided Gradient Descent Prompt Optimization | Anthony Cui et.al. | 2410.19499 | null |
2024-10-24 | Unbounded: A Generative Infinite Game of Character Life Simulation | Jialu Li et.al. | 2410.18975 | null |
2024-10-24 | Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques | David Ortiz-Perez et.al. | 2410.18972 | null |
2024-10-24 | ConceptDrift: Uncovering Biases through the Lens of Foundational Models | Cristian Daniel Păduraru et.al. | 2410.18970 | null |
2024-10-24 | Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Zhangheng Li et.al. | 2410.18967 | null |
2024-10-24 | Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions | Yujuan Fu et.al. | 2410.18966 | null |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning | Xiaoqiang Wang et.al. | 2410.18963 | null |
2024-10-24 | Context is Key: A Benchmark for Forecasting with Essential Textual Information | Andrew Robert Williams et.al. | 2410.18959 | link |
2024-10-24 | Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code | Jipeng Zhang et.al. | 2410.18957 | null |
2024-10-24 | BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning | Yujuan Velvin Fu et.al. | 2410.18955 | null |
2024-10-24 | Dynamic Vocabulary Pruning in Early-Exit LLMs | Jort Vincenti et.al. | 2410.18952 | link |
2024-10-24 | SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models | Zonghao Ying et.al. | 2410.18927 | null |
2024-10-24 | From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity on Faulty Mathematical Problems | A M Muntasir Rahman et.al. | 2410.18921 | null |
2024-10-25 | A Survey on Speech Large Language Models | Jing Peng et.al. | 2410.18908 | null |
2024-10-24 | PRISM: A Methodology for Auditing Biases in Large Language Models | Leif Azzopardi et.al. | 2410.18906 | link |
2024-10-24 | LLMs for Extremely Low-Resource Finno-Ugric Languages | Taido Purason et.al. | 2410.18902 | link |
2024-10-24 | Creating and Repairing Robot Programs in Open-World Domains | Claire Schlesinger et.al. | 2410.18893 | null |
2024-10-24 | Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Graziano A. Manduzio et.al. | 2410.18890 | null |
2024-10-24 | Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance | Omer Nahum et.al. | 2410.18889 | null |
2024-10-24 | Provably Robust Watermarks for Open-Source Language Models | Miranda Christ et.al. | 2410.18861 | null |
2024-10-23 | TP-Eval: Tap Multimodal LLMs’ Potential in Evaluation by Customizing Prompts | Yuxuan Xie et.al. | 2410.18071 | null |
2024-10-23 | CLEAR: Character Unlearning in Textual and Visual Modalities | Alexey Dontsov et.al. | 2410.18057 | null |
2024-10-23 | LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao et.al. | 2410.18050 | link |
2024-10-23 | Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases | Anna Glazkova et.al. | 2410.18040 | null |
2024-10-23 | MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Jingfan Zhang et.al. | 2410.18035 | null |
2024-10-23 | GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Xin Li et.al. | 2410.18032 | link |
2024-10-23 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-23 | Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and Validation | Suho Kang et.al. | 2410.18001 | link |
2024-10-23 | MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers | Zebin Yang et.al. | 2410.17957 | null |
2024-10-23 | ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Xin He et.al. | 2410.17954 | null |
2024-10-23 | SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains | Ran Xu et.al. | 2410.17952 | null |
2024-10-23 | Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling | Nirav Bhan et.al. | 2410.17950 | null |
2024-10-23 | Toward path-invariant embeddings for local distance source characterization | Lisa Linville et.al. | 2410.17937 | null |
2024-10-23 | Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models | He Cao et.al. | 2410.17922 | link |
2024-10-23 | Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Shansan Gong et.al. | 2410.17891 | link |
2024-10-23 | R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models | Linger Deng et.al. | 2410.17885 | link |
2024-10-23 | Lightweight Neural App Control | Filippos Christianos et.al. | 2410.17883 | null |
2024-10-23 | AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning | Yehonathan Refael et.al. | 2410.17881 | null |
2024-10-23 | Understanding Layer Significance in LLM Alignment | Guangyuan Shi et.al. | 2410.17875 | null |
2024-10-23 | DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang et.al. | 2410.17859 | link |
2024-10-22 | PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction | Long Xing et.al. | 2410.17247 | link |
2024-10-22 | Towards Reliable Evaluation of Behavior Steering Interventions in LLMs | Itamar Pres et.al. | 2410.17245 | null |
2024-10-22 | Frontiers in Intelligent Colonoscopy | Ge-Peng Ji et.al. | 2410.17241 | link |
2024-10-22 | Large Language Models Empowered Personalized Web Agents | Hongru Cai et.al. | 2410.17236 | null |
2024-10-22 | Automated Spinal MRI Labelling from Reports Using a Large Language Model | Robin Y. Park et.al. | 2410.17235 | link |
2024-10-22 | Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy | Benedict Aaron Tjandra et.al. | 2410.17234 | null |
2024-10-22 | Few-shot In-Context Preference Learning Using Large Language Models | Chao Yu et.al. | 2410.17233 | null |
2024-10-22 | Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods | Tsachi Blau et.al. | 2410.17222 | null |
2024-10-22 | MiniPLM: Knowledge Distillation for Pre-Training Language Models | Yuxian Gu et.al. | 2410.17215 | link |
2024-10-22 | Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling | Azmine Toushik Wasi et.al. | 2410.17210 | link |
2024-10-22 | VoiceBench: Benchmarking LLM-Based Voice Assistants | Yiming Chen et.al. | 2410.17196 | link |
2024-10-23 | Non-myopic Generation of Language Model for Reasoning and Planning | Chang Ma et.al. | 2410.17195 | link |
2024-10-22 | Remote Timing Attacks on Efficient Language Model Inference | Nicholas Carlini et.al. | 2410.17175 | null |
2024-10-22 | From Attention to Activation: Unravelling the Enigmas of Large Language Models | Prannay Kaul et.al. | 2410.17174 | null |
2024-10-22 | Self-calibration for Language Model Quantization and Pruning | Miles Williams et.al. | 2410.17170 | null |
2024-10-22 | Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence | İlker Işık et.al. | 2410.17161 | null |
2024-10-22 | Improving Pinterest Search Relevance Using Large Language Models | Han Wang et.al. | 2410.17152 | null |
2024-10-22 | Are Visual-Language Models Effective in Action Recognition? A Comparative Study | Mahmoud Ali et.al. | 2410.17149 | null |
2024-10-22 | Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ? | Jirat Chiaranaipanich et.al. | 2410.17145 | null |
2024-10-22 | Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements | Isamu Isozaki et.al. | 2410.17141 | link |
2024-10-21 | Reflection-Bench: probing AI intelligence with reflection | Lingyu Li et.al. | 2410.16270 | link |
2024-10-21 | SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree | Shuangrui Ding et.al. | 2410.16268 | link |
2024-10-21 | xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs | Michael S. Ryoo et.al. | 2410.16267 | null |
2024-10-22 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Elucidating the design space of language models for image generation | Xuantong Liu et.al. | 2410.16257 | link |
2024-10-21 | CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution | Maosong Cao et.al. | 2410.16256 | link |
2024-10-21 | Can Knowledge Editing Really Correct Hallucinations? | Baixiang Huang et.al. | 2410.16251 | link |
2024-10-21 | Analyzing Context Contributions in LLM-based Machine Translation | Emmanouil Zaranis et.al. | 2410.16246 | null |
2024-10-21 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao et.al. | 2410.16237 | null |
2024-10-21 | LLaVA-KD: A Framework of Distilling Multimodal Large Language Models | Yuxuan Cai et.al. | 2410.16236 | link |
2024-10-21 | ToW: Thoughts of Words Improve Reasoning in Large Language Models | Zhikun Xu et.al. | 2410.16235 | link |
2024-10-21 | Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | Ryan Li et.al. | 2410.16232 | null |
2024-10-21 | Building A Coding Assistant via the Retrieval-Augmented Language Model | Xinze Li et.al. | 2410.16229 | link |
2024-10-21 | A Realistic Threat Model for Large Language Model Jailbreaks | Valentyn Boreiko et.al. | 2410.16222 | link |
2024-10-21 | Pre-training Distillation for Large Language Models: A Design Space Exploration | Hao Peng et.al. | 2410.16215 | null |
2024-10-21 | Comprehensive benchmarking of large language models for RNA secondary structure prediction | L. I. Zablocki et.al. | 2410.16212 | link |
2024-10-21 | CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning | Kumar Manas et.al. | 2410.16207 | null |
2024-10-21 | Improve Vision Language Model Chain-of-thought Reasoning | Ruohong Zhang et.al. | 2410.16198 | link |
2024-10-22 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Contamination Report for Multilingual Benchmarks | Sanchit Ahuja et.al. | 2410.16186 | null |
2024-10-18 | Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts | German Gritsai et.al. | 2410.14677 | link |
2024-10-18 | SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment | Qin Liu et.al. | 2410.14676 | null |
2024-10-18 | Enhancing Large Language Models’ Situated Faithfulness to External Contexts | Yukun Huang et.al. | 2410.14675 | link |
2024-10-18 | Decomposing The Dark Matter of Sparse Autoencoders | Joshua Engels et.al. | 2410.14670 | link |
2024-10-18 | NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Baiqi Li et.al. | 2410.14669 | null |
2024-10-18 | MiCEval: Unveiling Multimodal Chain of Thought’s Quality via Image Description and Reasoning Steps | Xiongtao Zhou et.al. | 2410.14668 | link |
2024-10-18 | A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning | Shengjie Sun et.al. | 2410.14660 | null |
2024-10-18 | Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Zhepeng Cen et.al. | 2410.14655 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-10-18 | Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs | Runchu Tian et.al. | 2410.14641 | link |
2024-10-18 | GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Raghuveer Thirukovalluru et.al. | 2410.14635 | link |
2024-10-18 | Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning | Yuxiang Lu et.al. | 2410.14633 | null |
2024-10-18 | On the Regularization of Learnable Embeddings for Time Series Processing | Luca Butera et.al. | 2410.14630 | null |
2024-10-18 | CELI: Controller-Embedded Language Model Interactions | Jan-Samuel Wagner et.al. | 2410.14627 | null |
2024-10-18 | DiSCo Meets LLMs: A Unified Approach for Sparse Retrieval and Contextual Distillation in Conversational Search | Simon Lupart et.al. | 2410.14609 | link |
2024-10-18 | Teaching Models to Balance Resisting and Accepting Persuasion | Elias Stengel-Eskin et.al. | 2410.14596 | link |
2024-10-18 | Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets | Namid R. Stillman et.al. | 2410.14587 | null |
2024-10-18 | Do LLMs estimate uncertainty well in instruction-following? | Juyeon Heo et.al. | 2410.14582 | link |
2024-10-18 | Large Language Models Are Overparameterized Text Encoders | Thennal D K et.al. | 2410.14578 | null |
2024-10-18 | MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts | Rachel S. Y. Teo et.al. | 2410.14574 | link |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-17 | PUMA: Empowering Unified MLLM with Multi-granular Visual Generation | Rongyao Fang et.al. | 2410.13861 | link |
2024-10-17 | VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Runsen Xu et.al. | 2410.13860 | link |
2024-10-17 | $γ-$ MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models | Yaxin Luo et.al. | 2410.13859 | null |
2024-10-17 | How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs | Guhao Feng et.al. | 2410.13857 | null |
2024-10-17 | Can MLLMs Understand the Deep Implication Behind Chinese Images? | Chenhao Zhang et.al. | 2410.13854 | link |
2024-10-17 | Retrospective Learning from Interactions | Zizhao Chen et.al. | 2410.13852 | null |
2024-10-17 | Differentiable Robot Rendering | Ruoshi Liu et.al. | 2410.13851 | null |
2024-10-17 | SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction | Xuan Zhang et.al. | 2410.13846 | link |
2024-10-17 | A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models | Qiaoyu Tang et.al. | 2410.13841 | null |
2024-10-17 | Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs | Tianyu Guo et.al. | 2410.13835 | link |
2024-10-17 | A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement | Hui Yuan et.al. | 2410.13828 | link |
2024-10-17 | Unearthing Skill-Level Insights for Understanding Trade-Offs of Foundation Models | Mazda Moayeri et.al. | 2410.13826 | null |
2024-10-17 | AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents | Ke Yang et.al. | 2410.13825 | null |
2024-10-18 | Harnessing Webpage UIs for Text-Rich Visual Understanding | Junpeng Liu et.al. | 2410.13824 | null |
2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | link |
2024-10-17 | Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance | Mitsuhiko Nakamoto et.al. | 2410.13816 | null |
2024-10-17 | De-mark: Watermark Removal in Large Language Models | Ruibo Chen et.al. | 2410.13808 | null |
2024-10-17 | A Watermark for Order-Agnostic Language Models | Ruibo Chen et.al. | 2410.13805 | null |
2024-10-18 | BenTo: Benchmark Task Reduction with In-Context Transferability | Hongyu Zhao et.al. | 2410.13804 | link |
2024-10-16 | Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models | Ce Zhang et.al. | 2410.12790 | link |
2024-10-16 | Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception | Jihao Zhao et.al. | 2410.12788 | link |
2024-10-16 | In-Context Learning Enables Robot Action Prediction in LLMs | Yida Yin et.al. | 2410.12782 | null |
2024-10-16 | Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information | Yingya Li et.al. | 2410.12774 | null |
2024-10-16 | Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions | Zhenyu Jiang et.al. | 2410.12773 | null |
2024-10-16 | Towards Zero-Shot Camera Trap Image Categorization | Jiří Vyskočil et.al. | 2410.12769 | null |
2024-10-16 | The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse | Ekansh Sharma et.al. | 2410.12766 | null |
2024-10-16 | StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples | Ajay Patel et.al. | 2410.12757 | null |
2024-10-17 | CREAM: Consistency Regularized Self-Rewarding Language Models | Zhaoyang Wang et.al. | 2410.12735 | link |
2024-10-16 | WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation | João Matos et.al. | 2410.12722 | link |
2024-10-16 | FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression | Zhenheng Tang et.al. | 2410.12707 | null |
2024-10-16 | WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines | Genta Indra Winata et.al. | 2410.12705 | link |
2024-10-16 | Sarcasm Detection in a Less-Resourced Language | Lazar Đoković et.al. | 2410.12704 | link |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Lingxiao Luo et.al. | 2410.12694 | link |
2024-10-16 | Automatic Mapping of Anatomical Landmarks from Free-Text Using Large Language Models: Insights from Llama-2 | Mohamad Abdi et.al. | 2410.12686 | null |
2024-10-16 | 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation | Dewei Zhou et.al. | 2410.12669 | link |
2024-10-16 | Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models | Shicheng Xu et.al. | 2410.12662 | null |
2024-10-16 | Evaluating Morphological Compositional Generalization in Large Language Models | Mete Ismayilzada et.al. | 2410.12656 | link |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-15 | GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Fei Tang et.al. | 2410.11841 | link |
2024-10-15 | A Hitchhiker’s Guide to Scaling Law Estimation | Leshem Choshen et.al. | 2410.11840 | link |
2024-10-15 | MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding | Yue Cao et.al. | 2410.11829 | link |
2024-10-15 | Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws | Yiding Jiang et.al. | 2410.11820 | link |
2024-10-15 | Improving Long-Text Alignment for Text-to-Image Diffusion Models | Luping Liu et.al. | 2410.11817 | link |
2024-10-15 | SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Zhiyuan Zhang et.al. | 2410.11815 | null |
2024-10-15 | NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models | Han Han et.al. | 2410.11805 | link |
2024-10-15 | FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting | Zhe Li et.al. | 2410.11802 | null |
2024-10-15 | Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability | Tsz Ting Chung et.al. | 2410.11786 | null |
2024-10-15 | Latent BKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty | Joey Wilson et.al. | 2410.11783 | link |
2024-10-15 | G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks | Guibin Zhang et.al. | 2410.11782 | null |
2024-10-15 | Language Models Encode Numbers Using Digit Representations in Base 10 | Amit Arnold Levy et.al. | 2410.11781 | link |
2024-10-15 | MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Chenxi Wang et.al. | 2410.11779 | link |
2024-10-15 | Time-Series Foundation Model for Value-at-Risk | Anubha Goel et.al. | 2410.11773 | link |
2024-10-15 | Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models | Kai Yao et.al. | 2410.11772 | link |
2024-10-15 | SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding | Ying Chen et.al. | 2410.11761 | null |
2024-10-15 | Latent Action Pretraining from Videos | Seonghyeon Ye et.al. | 2410.11758 | null |
2024-10-15 | Personas with Attitudes: Controlling LLMs for Diverse Data Annotation | Leon Fröhling et.al. | 2410.11745 | link |
2024-10-15 | DySpec: Faster Speculative Decoding with Dynamic Token Tree Structure | Yunfan Xiong et.al. | 2410.11744 | null |
2024-10-16 | Light-Weight Fault Tolerant Attention for Large Language Model Training | Yuhang Liang et.al. | 2410.11720 | null |
2024-10-14 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Guangxuan Xiao et.al. | 2410.10819 | link |
2024-10-14 | Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Ziyue Li et.al. | 2410.10814 | link |
2024-10-14 | LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Di Wu et.al. | 2410.10813 | link |
2024-10-14 | Local and Global Decoding in Text Generation | Daniel Gareev et.al. | 2410.10810 | link |
2024-10-14 | Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning | Aakanksha et.al. | 2410.10801 | null |
2024-10-14 | Towards Foundation Models for 3D Vision: How Close Are We? | Yiming Zuo et.al. | 2410.10799 | link |
2024-10-15 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling | Jian Yang et.al. | 2410.10798 | null |
2024-10-14 | Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance | Sachin Goyal et.al. | 2410.10796 | link |
2024-10-15 | LiveXiv – A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Nimrod Shabtay et.al. | 2410.10783 | link |
2024-10-14 | When Attention Sink Emerges in Language Models: An Empirical View | Xiangming Gu et.al. | 2410.10781 | link |
2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null |
2024-10-14 | AFlow: Automating Agentic Workflow Generation | Jiayi Zhang et.al. | 2410.10762 | link |
2024-10-14 | Denial-of-Service Poisoning Attacks against Large Language Models | Kuofeng Gao et.al. | 2410.10760 | link |
2024-10-14 | SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | Akrit Mudvari et.al. | 2410.10759 | null |
2024-10-14 | Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification | Jan Cegin et.al. | 2410.10756 | link |
2024-10-14 | NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models | Yanbiao Ji et.al. | 2410.10743 | null |
2024-10-14 | SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing | Pengrui Quan et.al. | 2410.10741 | link |
2024-10-14 | Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs | Ishan Jindal et.al. | 2410.10739 | null |
2024-10-14 | Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | Kuofeng Gao et.al. | 2410.10735 | null |
2024-10-14 | Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection | Giorgos Iacovides et.al. | 2410.10728 | null |
2024-10-11 | Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models | Qin Liu et.al. | 2410.09047 | null |
2024-10-11 | AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation | Zijun Wang et.al. | 2410.09040 | link |
2024-10-11 | Semi-Supervised Learning of Noisy Mixture of Experts Models | Oh-Ran Kwon et.al. | 2410.09039 | null |
2024-10-11 | SimpleStrat: Diversifying Language Model Generation with Stratification | Justin Wong et.al. | 2410.09038 | null |
2024-10-11 | Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee et.al. | 2410.09037 | link |
2024-10-11 | PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Xiangyu Yin et.al. | 2410.09034 | link |
2024-10-11 | MedMobile: A mobile-sized language model with expert-level clinical capabilities | Krithik Vishwanath et.al. | 2410.09019 | link |
2024-10-11 | Parameter-Efficient Fine-Tuning of State Space Models | Kevin Galim et.al. | 2410.09016 | link |
2024-10-11 | The Impact of Visual Information in Chinese Characters: Evaluating Large Models’ Ability to Recognize and Utilize Radicals | Xiaofeng Wu et.al. | 2410.09013 | null |
2024-10-11 | Software Engineering and Foundation Models: Insights from Industry Blogs Using a Jury of Foundation Models | Hao Li et.al. | 2410.09012 | link |
2024-10-11 | SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights | Ling Yang et.al. | 2410.09008 | link |
2024-10-11 | From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts | Zhuohao Jerry Zhang et.al. | 2410.09006 | null |
2024-10-11 | DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Haochen Li et.al. | 2410.09004 | link |
2024-10-11 | Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference | Grace Proebsting et.al. | 2410.08996 | null |
2024-10-11 | The structure of the token space for large language models | Michael Robinson et.al. | 2410.08993 | null |
2024-10-11 | Science is Exploration: Computational Frontiers for Conceptual Metaphor Theory | Rebecca M. M. Hicke et.al. | 2410.08991 | link |
2024-10-11 | SubZero: Random Subspace Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning | Ziming Yu et.al. | 2410.08989 | link |
2024-10-11 | Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective | Bo Ni et.al. | 2410.08985 | null |
2024-10-11 | NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models | Zheng Yi Ho et.al. | 2410.08970 | null |
2024-10-11 | Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements | Jingyu Zhang et.al. | 2410.08968 | null |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | null |
2024-10-10 | Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training | Gen Luo et.al. | 2410.08202 | null |
2024-10-10 | Adam Exploits $\ell_\infty$ -geometry of Loss Landscape via Coordinate-wise Adaptivity | Shuo Xie et.al. | 2410.08198 | link |
2024-10-10 | From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Changle Qu et.al. | 2410.08197 | link |
2024-10-10 | MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code | Zimu Lu et.al. | 2410.08196 | link |
2024-10-10 | Features are fate: a theory of transfer learning in high-dimensional regression | Javan Tahir et.al. | 2410.08194 | null |
2024-10-10 | GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment | Yuancheng Xu et.al. | 2410.08193 | null |
2024-10-10 | MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models | Wenbo Hu et.al. | 2410.08182 | null |
2024-10-10 | Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models | Qingni Wang et.al. | 2410.08174 | null |
2024-10-10 | On the Evaluation of Generative Robotic Simulations | Feng Chen et.al. | 2410.08172 | null |
2024-10-10 | Visual Scratchpads: Enabling Global Reasoning in Vision | Aryo Lotfi et.al. | 2410.08165 | null |
2024-10-10 | Agent S: An Open Agentic Framework that Uses Computers Like a Human | Saaket Agashe et.al. | 2410.08164 | link |
2024-10-10 | The Effect of Surprisal on Reading Times in Information Seeking and Repeated Reading | Keren Gruteke Klein et.al. | 2410.08162 | link |
2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | null |
2024-10-10 | Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning | Amrith Setlur et.al. | 2410.08146 | null |
2024-10-10 | Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs | Xiaoyuan Liu et.al. | 2410.08145 | link |
2024-10-10 | DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Yutong Wang et.al. | 2410.08143 | link |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | Think Beyond Size: Dynamic Prompting for More Effective Reasoning | Kamesh R et.al. | 2410.08130 | null |
2024-10-10 | Mars: Situated Inductive Reasoning in an Open-World Environment | Xiaojuan Tang et.al. | 2410.08126 | null |
2024-10-09 | MM-Ego: Towards Building Egocentric Multimodal LLMs | Hanrong Ye et.al. | 2410.07177 | null |
2024-10-09 | Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models | Fei Wang et.al. | 2410.07176 | null |
2024-10-09 | Do better language models have crisper vision? | Jona Ruthardt et.al. | 2410.07173 | null |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Cheol Jun Cho et.al. | 2410.07168 | link |
2024-10-09 | Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate | Qidong Huang et.al. | 2410.07167 | link |
2024-10-09 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-09 | Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning | Chongyu Fan et.al. | 2410.07163 | link |
2024-10-09 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Bohan Zeng et.al. | 2410.07155 | link |
2024-10-09 | Towards Interpreting Visual Information Processing in Vision-Language Models | Clement Neo et.al. | 2410.07149 | link |
2024-10-09 | Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling | Yingfa Chen et.al. | 2410.07145 | null |
2024-10-09 | Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates | Xiaosen Zheng et.al. | 2410.07137 | link |
2024-10-10 | EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Rui Zhao et.al. | 2410.07133 | link |
2024-10-09 | Mental Disorders Detection in the Era of Large Language Models | Gleb Kuzmin et.al. | 2410.07129 | null |
2024-10-09 | Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy | Tagore Rao Kosireddy et.al. | 2410.07118 | link |
2024-10-09 | Personalized Visual Instruction Tuning | Renjie Pi et.al. | 2410.07113 | link |
2024-10-09 | VHELM: A Holistic Evaluation of Vision Language Models | Tony Lee et.al. | 2410.07112 | link |
2024-10-09 | I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy | Gian Maria Campedelli et.al. | 2410.07109 | link |
2024-10-09 | Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context | Sangwon Yu et.al. | 2410.07103 | null |
2024-10-09 | MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering | Jun Shern Chan et.al. | 2410.07095 | link |
2024-10-07 | Fine-Tuning CLIP’s Last Visual Projector: A Few-Shot Cornucopia | Mohammad Fahes et.al. | 2410.05270 | link |
2024-10-07 | Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models | Fei Wang et.al. | 2410.05269 | link |
2024-10-07 | PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs | Mengzhao Chen et.al. | 2410.05265 | link |
2024-10-07 | TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles | Qingchen Yu et.al. | 2410.05262 | link |
2024-10-07 | TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens | Ya-Qi Yu et.al. | 2410.05261 | null |
2024-10-07 | Differential Transformer | Tianzhu Ye et.al. | 2410.05258 | link |
2024-10-07 | GLEE: A Unified Framework and Benchmark for Language-based Economic Environments | Eilam Shapira et.al. | 2410.05254 | link |
2024-10-07 | Causal Micro-Narratives | Mourad Heddaya et.al. | 2410.05252 | null |
2024-10-07 | SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe | Yuxin Xiao et.al. | 2410.05248 | null |
2024-10-07 | Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Boyu Gou et.al. | 2410.05243 | link |
2024-10-08 | TuneVLSeg: Prompt Tuning Benchmark for Vision-Language Segmentation Models | Rabin Adhikari et.al. | 2410.05239 | link |
2024-10-07 | GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Iman Mirzadeh et.al. | 2410.05229 | null |
2024-10-07 | Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates | Avanika Narayan et.al. | 2410.05224 | null |
2024-10-07 | Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato et.al. | 2410.05222 | null |
2024-10-07 | Density estimation with LLMs: a geometric investigation of in-context learning trajectories | Toni J. B. Liu et.al. | 2410.05218 | null |
2024-10-07 | Organizing Unstructured Image Collections using Natural Language | Mingxuan Liu et.al. | 2410.05217 | null |
2024-10-07 | Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh et.al. | 2410.05210 | link |
2024-10-07 | RevisEval: Improving LLM-as-a-Judge via Response-Adapted References | Qiyuan Zhang et.al. | 2410.05193 | null |
2024-10-07 | Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspective | Kaiyue Wen et.al. | 2410.05192 | null |
2024-10-07 | LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation | Zhijie Wang et.al. | 2410.05191 | null |
2024-10-04 | Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models | Zhuochun Li et.al. | 2410.03663 | link |
2024-10-04 | Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models | Tinghui Zhu et.al. | 2410.03659 | link |
2024-10-04 | RAFT: Realistic Attacks to Fool Text Detectors | James Wang et.al. | 2410.03658 | link |
2024-10-04 | Aligning LLMs with Individual Preferences via Interaction | Shujin Wu et.al. | 2410.03642 | link |
2024-10-04 | Conditional Enzyme Generation Using Protein Language Models with Adapters | Jason Yang et.al. | 2410.03634 | null |
2024-10-04 | Large Language Model Performance Benchmarking on Mobile Platforms: A Thorough Evaluation | Jie Xiao et.al. | 2410.03613 | null |
2024-10-04 | TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation | Jonathan Cook et.al. | 2410.03608 | null |
2024-10-04 | LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos | Noriaki Hirose et.al. | 2410.03603 | null |
2024-10-04 | Efficiently Identifying Watermarked Segments in Mixed-Source Texts | Xuandong Zhao et.al. | 2410.03600 | null |
2024-10-04 | Understanding Reasoning in Chain-of-Thought from the Hopfieldian View | Lijie Hu et.al. | 2410.03595 | null |
2024-10-04 | Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models | Xin Zou et.al. | 2410.03577 | link |
2024-10-04 | Towards Linguistically-Aware and Language-Independent Tokenization for Large Language Models (LLMs) | Abrar Rahman et.al. | 2410.03568 | null |
2024-10-04 | Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding | Wei Wu et.al. | 2410.03553 | null |
2024-10-04 | Re-examining Sexism and Misogyny Classification with Annotator Attitudes | Aiqi Jiang et.al. | 2410.03543 | null |
2024-10-04 | No Need to Talk: Asynchronous Mixture of Language Models | Anastasiia Filippova et.al. | 2410.03529 | null |
2024-10-04 | Steering Large Language Models between Code Execution and Textual Reasoning | Yongchao Chen et.al. | 2410.03524 | null |
2024-10-04 | A Probabilistic Perspective on Unlearning and Alignment for Large Language Models | Yan Scholten et.al. | 2410.03523 | link |
2024-10-04 | CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang et.al. | 2410.03502 | link |
2024-10-04 | FedStein: Enhancing Multi-Domain Federated Learning Through James-Stein Estimator | Sunny Gupta et.al. | 2410.03499 | link |
2024-10-04 | Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores | Robert E. Blackwell et.al. | 2410.03492 | null |
2024-10-03 | Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | Nick Jiang et.al. | 2410.02762 | link |
2024-10-03 | FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models | Zhipei Xu et.al. | 2410.02761 | link |
2024-10-03 | Erasing Conceptual Knowledge from Language Models | Rohit Gandikota et.al. | 2410.02760 | link |
2024-10-03 | Loong: Generating Minute-level Long Videos with Autoregressive Language Models | Yuqing Wang et.al. | 2410.02757 | null |
2024-10-03 | SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost | Jifan Zhang et.al. | 2410.02755 | null |
2024-10-03 | Training Language Models on Synthetic Edit Sequences Improves Code Synthesis | Ulyana Piterbarg et.al. | 2410.02749 | link |
2024-10-03 | CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation | Han He et.al. | 2410.02748 | link |
2024-10-03 | Contrastive Localized Language-Image Pre-Training | Hong-You Chen et.al. | 2410.02746 | null |
2024-10-03 | Neutral residues: revisiting adapters for model extension | Franck Signe Talla et.al. | 2410.02744 | null |
2024-10-03 | MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions | Yekun Chai et.al. | 2410.02743 | link |
2024-10-03 | Grounding Large Language Models In Embodied Environment With Imperfect World Models | Haolan Liu et.al. | 2410.02742 | null |
2024-10-03 | Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization | Lei Xu et.al. | 2410.02741 | link |
2024-10-03 | Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Zhengfeng Lai et.al. | 2410.02740 | null |
2024-10-04 | Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge | Jiayi Ye et.al. | 2410.02736 | null |
2024-10-03 | DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects | Zhaowei Wang et.al. | 2410.02730 | link |
2024-10-03 | Unified Multi-Modal Interleaved Document Representation for Information Retrieval | Jaewoo Lee et.al. | 2410.02729 | null |
2024-10-03 | Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation | Rohin Manvi et.al. | 2410.02725 | null |
2024-10-03 | Large Language Models as Markov Chains | Oussama Zekri et.al. | 2410.02724 | null |
2024-10-03 | Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization | Ryan C. Barron et.al. | 2410.02721 | null |
2024-10-03 | UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation | Zixuan Li et.al. | 2410.02719 | null |
2024-10-02 | Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads | Yuxiang Huang et.al. | 2410.01805 | link |
2024-10-02 | Efficient $1$ -bit tensor approximations | Alex W. Neal Riasanovsky et.al. | 2410.01799 | null |
2024-10-02 | Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models | Joseph Lee et.al. | 2410.01795 | link |
2024-10-02 | When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | R. Thomas McCoy et.al. | 2410.01792 | null |
2024-10-02 | Investigating on RLHF methodology | Alexey Kutalev et.al. | 2410.01789 | null |
2024-10-02 | OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models | Heng Yang et.al. | 2410.01784 | link |
2024-10-02 | Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Shayekh Bin Islam et.al. | 2410.01782 | link |
2024-10-03 | Quantifying Generalization Complexity for Large Language Models | Zhenting Qi et.al. | 2410.01769 | link |
2024-10-02 | Integrating Protein Sequence and Expression Level to Analysis Molecular Characterization of Breast Cancer Subtypes | Hossein Sholehrasa et.al. | 2410.01755 | null |
2024-10-02 | LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks | Mengzhao Jia et.al. | 2410.01744 | link |
2024-10-02 | VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models | Kailai Feng et.al. | 2410.01738 | link |
2024-10-02 | Visual Perception in Text Strings | Qi Jia et.al. | 2410.01733 | link |
2024-10-02 | Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing | Yilmazcan Ozyurt et.al. | 2410.01727 | link |
2024-10-02 | Auto-Demo Prompting: Leveraging Generated Outputs as Demonstrations for Enhanced Batch Prompting | Longyu Feng et.al. | 2410.01724 | null |
2024-10-02 | Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective | Zeyu Gan et.al. | 2410.01720 | link |
2024-10-02 | Examining the Role of Relationship Alignment in Large Language Models | Kristen M. Altenburger et.al. | 2410.01708 | null |
2024-10-02 | Interpretable Contrastive Monte Carlo Tree Search Reasoning | Zitian Gao et.al. | 2410.01707 | link |
2024-10-02 | An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings | Soham Govande et.al. | 2410.01704 | link |
2024-10-02 | CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs | Kangsheng Wang et.al. | 2410.01696 | null |
2024-10-02 | U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models | Tung-Yu Wu et.al. | 2410.01692 | null |
2024-09-30 | MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning | Haotian Zhang et.al. | 2409.20566 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos | Md Mohaiminul Islam et.al. | 2409.20557 | null |
2024-09-30 | UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models | Qiaojun Yu et.al. | 2409.20551 | null |
2024-09-30 | LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Ziyao Zhang et.al. | 2409.20550 | null |
2024-09-30 | Robi Butler: Remote Multimodal Interactions with Household Robot Assistant | Anxing Xiao et.al. | 2409.20548 | null |
2024-09-30 | Uncertainty-Informed Screening for Safer Solvents Used in the Synthesis of Perovskite via Language Models | Arpan Mukherjee et.al. | 2409.20512 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | A Weakly Supervised Data Labeling Framework for Machine Lexical Normalization in Vietnamese Social Media | Dung Ha Nguyen et.al. | 2409.20467 | null |
2024-09-30 | Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments | Mohamed Elnoor et.al. | 2409.20445 | null |
2024-10-01 | Instance-adaptive Zero-shot Chain-of-Thought Prompting | Xiaosong Yuan et.al. | 2409.20441 | null |
2024-09-30 | HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan et.al. | 2409.20429 | link |
2024-09-30 | World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang et.al. | 2409.20424 | link |
2024-09-30 | Anti-stereotypical Predictive Text Suggestions Do Not Reliably Yield Anti-stereotypical Writing | Connor Baumler et.al. | 2409.20390 | null |
2024-09-30 | Wait, but Tylenol is Acetaminophen… Investigating and Improving Language Models’ Ability to Resist Requests for Misinformation | Shan Chen et.al. | 2409.20385 | null |
2024-09-30 | Word-wise intonation model for cross-language TTS systems | Tomilov A. A. et.al. | 2409.20374 | null |
2024-09-30 | The Perfect Blend: Redefining RLHF with Mixture of Judges | Tengyu Xu et.al. | 2409.20370 | null |
2024-09-30 | VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs | Ruotong Liao et.al. | 2409.20365 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference | Ke Yi et.al. | 2409.20361 | null |
2024-09-27 | Exploring Token Pruning in Vision State Space Models | Zheng Zhan et.al. | 2409.18962 | null |
2024-09-27 | LML: Language Model Learning a Dataset for Data-Augmented Prediction | Praneeth Vadlapati et.al. | 2409.18957 | link |
2024-09-27 | Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models | Jiaming Li et.al. | 2409.18943 | link |
2024-09-27 | From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding | Heqing Zou et.al. | 2409.18938 | link |
2024-09-27 | Social Media Bot Policies: Evaluating Passive and Active Enforcement | Kristina Radivojevic et.al. | 2409.18931 | null |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Soft Measures for Extracting Causal Collective Intelligence | Maryam Berijanian et.al. | 2409.18911 | link |
2024-09-27 | Improving Visual Object Tracking through Visual Prompting | Shih-Fang Chen et.al. | 2409.18901 | link |
2024-09-27 | IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation | Fan Lin et.al. | 2409.18892 | link |
2024-09-27 | Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models | Zehan Li et.al. | 2409.18878 | null |
2024-09-27 | Predicting and analyzing memorization within fine-tuned Large Language Models | Jérémie Dentan et.al. | 2409.18858 | null |
2024-09-27 | Mitigating Selection Bias with Node Pruning and Auxiliary Options | Hyeong Kyu Choi et.al. | 2409.18857 | null |
2024-09-27 | LLMs4Synthesis: Leveraging Large Language Models for Scientific Synthesis | Hamed Babaei Giglou et.al. | 2409.18812 | link |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | A Survey on the Honesty of Large Language Models | Siheng Li et.al. | 2409.18786 | link |
2024-09-27 | Enhancing Explainability in Multimodal Large Language Models Using Ontological Context | Jihen Amara et.al. | 2409.18753 | null |
2024-09-27 | OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph | Yujie Tang et.al. | 2409.18743 | null |
2024-09-27 | Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Gleb Mezentsev et.al. | 2409.18721 | link |
2024-09-27 | Read Over the Lines: Attacking LLMs and Toxicity Detection Systems with ASCII Art to Mask Profanity | Sergey Berezin et.al. | 2409.18708 | link |
2024-09-27 | Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models | Yiming Chen et.al. | 2409.18680 | link |
2024-09-26 | EgoLM: Multi-Modal Language Model of Egocentric Motions | Fangzhou Hong et.al. | 2409.18127 | null |
2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography | Yuexi Du et.al. | 2409.18119 | link |
2024-09-26 | E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding | Ye Liu et.al. | 2409.18111 | link |
2024-09-26 | Open-World Evaluation for Retrieving Diverse Perspectives | Hung-Ting Chen et.al. | 2409.18110 | null |
2024-09-26 | MALPOLON: A Framework for Deep Species Distribution Modeling | Theo Larcher et.al. | 2409.18102 | link |
2024-09-26 | SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation | Xin Li et.al. | 2409.18082 | null |
2024-09-26 | Infer Human’s Intentions Before Following Natural Language Instructions | Yanming Wan et.al. | 2409.18073 | link |
2024-09-26 | Infering Alt-text For UI Icons With Large Language Models During App Development | Sabrina Haque et.al. | 2409.18060 | null |
2024-09-26 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions | Kai Chen et.al. | 2409.18042 | null |
2024-09-26 | Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective | Yotam Wolf et.al. | 2409.18028 | null |
2024-09-26 | An Adversarial Perspective on Machine Unlearning for AI Safety | Jakub Łucki et.al. | 2409.18025 | link |
2024-09-26 | DARE: Diverse Visual Question Answering with Robustness Evaluation | Hannah Sterz et.al. | 2409.18023 | null |
2024-09-26 | Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles | Lewei He et.al. | 2409.18014 | null |
2024-09-26 | Control Industrial Automation System with Large Language Models | Yuchen Xia et.al. | 2409.18009 | link |
2024-09-26 | Multilingual Evaluation of Long Context Retrieval and Reasoning | Ameeta Agrawal et.al. | 2409.18006 | link |
2024-09-26 | Enhancing Tourism Recommender Systems for Sustainable City Trips Using Retrieval-Augmented Generation | Ashmi Banerjee et.al. | 2409.18003 | null |
2024-09-26 | Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models | Georg Ahnert et.al. | 2409.17990 | link |
2024-09-26 | LLM4Brain: Training a Large Language Model for Brain Video Understanding | Ruizhe Zheng et.al. | 2409.17987 | null |
2024-09-25 | Attention Prompting on Image for Large Vision-Language Models | Runpeng Yu et.al. | 2409.17143 | link |
2024-09-25 | FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression | Fazal Mittu et.al. | 2409.17141 | link |
2024-09-25 | Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents | Junting Lu et.al. | 2409.17140 | null |
2024-09-25 | Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset | Andrew Goldberg et.al. | 2409.17126 | null |
2024-09-25 | Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale | Fan Zhou et.al. | 2409.17115 | link |
2024-09-25 | Unveiling Ontological Commitment in Multi-Modal Foundation Models | Mert Keser et.al. | 2409.17109 | null |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-25 | Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Bowen Zhao et.al. | 2409.17080 | link |
2024-09-25 | VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu et.al. | 2409.17066 | link |
2024-09-25 | Benchmarking Domain Generalization Algorithms in Computational Pathology | Neda Zamanitajeddin et.al. | 2409.17063 | link |
2024-09-25 | Using LLM for Real-Time Transcription and Summarization of Doctor-Patient Interactions into ePuskesmas in Indonesia | Azmul Asmar Irfan et.al. | 2409.17054 | null |
2024-09-25 | GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design | Phillip Mueller et.al. | 2409.17045 | null |
2024-09-25 | How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Francesco Verdini et.al. | 2409.17044 | null |
2024-09-25 | Counterfactual Token Generation in Large Language Models | Ivi Chatzi et.al. | 2409.17027 | link |
2024-09-25 | LLM-CARD: Towards a Description and Landscape of Large Language Models | Shengwei Tian et.al. | 2409.17011 | link |
2024-09-25 | Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sasha Boguraev et.al. | 2409.17005 | null |
2024-09-26 | INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Shimao Chen et.al. | 2409.16997 | link |
2024-09-25 | Harnessing Diversity for Important Data Selection in Pretraining Large Language Models | Chi Zhang et.al. | 2409.16986 | null |
2024-09-25 | AXCEL: Automated eXplainable Consistency Evaluation using LLMs | P Aditya Sreekar et.al. | 2409.16984 | null |
2024-09-25 | Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions | Zeyneb N. Kaya et.al. | 2409.16974 | null |
2024-09-20 | Gender Representation and Bias in Indian Civil Service Mock Interviews | Somonnoy Banerjee et.al. | 2409.12194 | null |
2024-09-18 | Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution | Peng Wang et.al. | 2409.12191 | link |
2024-09-18 | To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | Zayne Sprague et.al. | 2409.12183 | link |
2024-09-23 | A Controlled Study on Long Context Extension and Generalization in LLMs | Yi Lu et.al. | 2409.12181 | link |
2024-09-18 | Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | Arslan Chaudhry et.al. | 2409.12180 | null |
2024-09-18 | Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference | Najmeh Forouzandehmehr et.al. | 2409.12150 | null |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-09-18 | MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | link |
2024-09-24 | Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models | Sijing Chen et.al. | 2409.12139 | null |
2024-09-18 | GRIN: GRadient-INformed MoE | Liyuan Liu et.al. | 2409.12136 | null |
2024-09-18 | Linguini: A benchmark for language-agnostic linguistic reasoning | Eduardo Sánchez et.al. | 2409.12126 | link |
2024-09-18 | Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement | An Yang et.al. | 2409.12122 | null |
2024-09-18 | Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference | Edresson Casanova et.al. | 2409.12117 | null |
2024-09-18 | Measuring Human and AI Values based on Generative Psychometrics with Large Language Models | Haoran Ye et.al. | 2409.12106 | link |
2024-09-19 | Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval | Warren Jouanneau et.al. | 2409.12097 | null |
2024-09-19 | The Impact of Element Ordering on LM Agent Performance | Wayne Chi et.al. | 2409.12089 | link |
2024-09-18 | Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking | Ningyuan Xi et.al. | 2409.12059 | null |
2024-09-19 | Using Large Language Models to Generate Clinical Trial Tables and Figures | Yumeng Yang et.al. | 2409.12046 | null |
2024-09-18 | All-in-one foundational models learning across quantum chemical levels | Yuxinxin Chen et.al. | 2409.12015 | link |
2024-09-18 | Mixture of Prompt Learning for Vision Language Models | Yu Du et.al. | 2409.12011 | null |
2024-09-17 | AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs | Basel Mousi et.al. | 2409.11404 | null |
2024-09-17 | NVLM: Open Frontier-Class Multimodal LLMs | Wenliang Dai et.al. | 2409.11402 | null |
2024-09-17 | Says Who? Effective Zero-Shot Annotation of Focalization | Rebecca M. M. Hicke et.al. | 2409.11390 | null |
2024-09-17 | Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Simon Yu et.al. | 2409.11378 | link |
2024-09-17 | Towards Time Series Reasoning with LLMs | Winnie Chow et.al. | 2409.11376 | null |
2024-09-17 | Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification | Fatema-E- Jannat et.al. | 2409.11375 | null |
2024-09-17 | Learning Spatially-Aware Language and Audio Embedding | Bhavika Devnani et.al. | 2409.11369 | null |
2024-09-17 | CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration | Jiahui Gao et.al. | 2409.11365 | null |
2024-09-17 | CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark | Zachary S. Siegel et.al. | 2409.11363 | link |
2024-09-17 | AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances | Dhruv Agarwal et.al. | 2409.11360 | null |
2024-09-17 | THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Mengfei Liang et.al. | 2409.11353 | link |
2024-09-17 | LPT++: Efficient Training on Mixture of Long-tailed Experts | Bowen Dong et.al. | 2409.11323 | null |
2024-09-17 | SOAP: Improving and Stabilizing Shampoo using Adam | Nikhil Vyas et.al. | 2409.11321 | link |
2024-09-17 | Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models | Divij Gupta et.al. | 2409.11302 | null |
2024-09-17 | Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5 | Marcel Lamott et.al. | 2409.11282 | null |
2024-09-17 | P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task | Weiye Xu et.al. | 2409.11279 | null |
2024-09-17 | Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments | Maria Rigaki et.al. | 2409.11276 | null |
2024-09-17 | Task Arithmetic for Language Expansion in Speech Translation | Yao-Fei Cheng et.al. | 2409.11274 | null |
2024-09-18 | LOLA – An Open-Source Massively Multilingual Large Language Model | Nikit Srivastava et.al. | 2409.11272 | link |
2024-09-17 | Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models | Jiahao Qin et.al. | 2409.11263 | null |
2024-09-16 | RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval | Di Liu et.al. | 2409.10516 | link |
2024-09-16 | Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models | Momoko Shiraishi et.al. | 2409.10506 | null |
2024-09-16 | DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction | John Wu et.al. | 2409.10504 | null |
2024-09-16 | Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Kulin Shah et.al. | 2409.10502 | link |
2024-09-16 | Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models | Shaznin Sultana et.al. | 2409.10490 | null |
2024-09-16 | Do Pre-trained Vision-Language Models Encode Object States? | Kaleb Newman et.al. | 2409.10488 | link |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-17 | Schrodinger’s Memory: Large Language Models | Wei Wang et.al. | 2409.10482 | null |
2024-09-16 | Towards Semantic Versioning of Open Pre-trained Language Model Releases on Hugging Face | Adekunle Ajibode et.al. | 2409.10472 | link |
2024-09-16 | LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning | Jicong Ao et.al. | 2409.10444 | link |
2024-09-16 | CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera | Jingpei Lu et.al. | 2409.10441 | null |
2024-09-16 | HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models | Vineet Bhat et.al. | 2409.10419 | link |
2024-09-16 | A Large-Scale Privacy Assessment of Android Third-Party SDKs | Mark Huasong Meng et.al. | 2409.10411 | null |
2024-09-16 | A Knowledge-Enhanced Disease Diagnosis Method Based on Prompt Learning and BERT Integration | Zhang Zheng et.al. | 2409.10403 | null |
2024-09-17 | Learnings from a Large-Scale Deployment of an LLM-Powered Expert-in-the-Loop Healthcare Chatbot | Bhuvan Sachdeva et.al. | 2409.10354 | null |
2024-09-16 | Large Language Model Enhanced Hard Sample Identification for Denoising Recommendation | Tianrui Song et.al. | 2409.10343 | null |
2024-09-16 | The 20 questions game to distinguish large language models | Gurvan Richardeau et.al. | 2409.10338 | null |
2024-09-16 | MGSA: Multi-granularity Graph Structure Attention for Knowledge Graph-to-Text Generation | Shanshan Wang et.al. | 2409.10294 | null |
2024-09-16 | ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | Jiahao Yuan et.al. | 2409.10289 | link |
2024-09-16 | ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code | Jia Feng et.al. | 2409.10280 | link |
2024-09-13 | Agents in Software Engineering: Survey, Landscape, and Vision | Yanxian Huang et.al. | 2409.09030 | link |
2024-09-13 | Contri(e)ve: Context + Retrieve for Scholarly Question Answering | Kanchan Shivashankar et.al. | 2409.09010 | null |
2024-09-13 | Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance | Lucio La Cava et.al. | 2409.08963 | null |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | SynSUM – Synthetic Benchmark with Structured and Unstructured Medical Records | Paloma Rabaey et.al. | 2409.08936 | link |
2024-09-13 | LLM-based Weak Supervision Framework for Query Intent Classification in Video Search | Farnoosh Javadi et.al. | 2409.08931 | null |
2024-09-13 | Affective Computing Has Changed: The Foundation Model Disruption | Björn Schuller et.al. | 2409.08907 | null |
2024-09-13 | AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models | Yifei Yao et.al. | 2409.08904 | link |
2024-09-13 | A Market for Lemons? Strategic Directions for a Vigilant Application of Artificial Intelligence in Entrepreneurship Research | Martin Obschonka et.al. | 2409.08890 | null |
2024-09-13 | Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Xuchen Li et.al. | 2409.08887 | null |
2024-09-13 | Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies | Zhiqiang Zhong et.al. | 2409.08864 | null |
2024-09-13 | FP-VEC: Fingerprinting Large Language Models via Efficient Vector Addition | Zhenhua Xu et.al. | 2409.08846 | null |
2024-09-13 | AIPO: Improving Training Objective for Iterative Preference Optimization | Yaojie Shen et.al. | 2409.08845 | link |
2024-09-13 | A RAG Approach for Generating Competency Questions in Ontology Engineering | Xueli Pan et.al. | 2409.08820 | null |
2024-09-13 | Your Weak LLM is Secretly a Strong Teacher for Alignment | Leitian Tao et.al. | 2409.08813 | null |
2024-09-13 | Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task | Shao Zhang et.al. | 2409.08811 | null |
2024-09-13 | LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment | Huan Zhang et.al. | 2409.08795 | link |
2024-09-13 | Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes | Luis Rita et.al. | 2409.08792 | null |
2024-09-13 | Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Jialu Tang et.al. | 2409.08788 | null |
2024-09-13 | Uncertainty and Generalizability in Foundation Models for Earth Observation | Raul Ramos-Pollan et.al. | 2409.08744 | null |
2024-09-12 | Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale | Rogerio Bonatti et.al. | 2409.08264 | link |
2024-09-12 | OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering | Jiahao Nick Li et.al. | 2409.08250 | null |
2024-09-12 | Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources | Alisia Lupidi et.al. | 2409.08239 | null |
2024-09-12 | LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems | Hakan T. Otal et.al. | 2409.08234 | link |
2024-09-12 | Adaptive Language-Guided Abstraction from Contrastive Explanations | Andi Peng et.al. | 2409.08212 | null |
2024-09-12 | ComAlign: Compositional Alignment in Vision-Language Models | Ali Abdollah et.al. | 2409.08206 | null |
2024-09-12 | What Makes a Maze Look Like a Maze? | Joy Hsu et.al. | 2409.08202 | null |
2024-09-12 | AudioBERT: Audio Knowledge Augmented Language Model | Hyunjong Ok et.al. | 2409.08199 | link |
2024-09-12 | Fine-tuning Large Language Models for Entity Matching | Aaron Steiner et.al. | 2409.08185 | link |
2024-09-12 | On the Role of Context in Reading Time Prediction | Andreas Opedal et.al. | 2409.08160 | link |
2024-09-12 | Faster Speech-LLaMA Inference with Multi-token Prediction | Desh Raj et.al. | 2409.08148 | null |
2024-09-12 | LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models | Zhengliang Liu et.al. | 2409.08147 | null |
2024-09-12 | Towards a graph-based foundation model for network traffic analysis | Louis Van Langendonck et.al. | 2409.08111 | null |
2024-09-12 | The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language | Michael Ong et.al. | 2409.08103 | null |
2024-09-12 | The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal | Huiyuan Xie et.al. | 2409.08098 | null |
2024-09-12 | Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks | Benji Peng et.al. | 2409.08087 | null |
2024-09-12 | SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality | Chenyang Lei et.al. | 2409.08083 | link |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | link |
2024-09-12 | TravelAgent: An AI Assistant for Personalized Travel Planning | Aili Chen et.al. | 2409.08069 | null |
2024-09-12 | An Evaluation Framework for Attributed Information Retrieval using Large Language Models | Hanane Djeddal et.al. | 2409.08014 | link |
2024-09-11 | “My Grade is Wrong!”: A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays | Shengxin Hong et.al. | 2409.07453 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin et.al. | 2409.07440 | link |
2024-09-11 | A Suite for Acoustic Language Model Evaluation | Gallil Maimon et.al. | 2409.07437 | link |
2024-09-11 | Synthetic continued pretraining | Zitong Yang et.al. | 2409.07431 | link |
2024-09-11 | Agent Workflow Memory | Zora Zhiruo Wang et.al. | 2409.07429 | link |
2024-09-11 | CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification | Zeqing Qin et.al. | 2409.07407 | null |
2024-09-11 | AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Han Wang et.al. | 2409.07394 | link |
2024-09-11 | Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination | Daniel Zhang-Li et.al. | 2409.07372 | null |
2024-09-11 | Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code | Khiem Ton et.al. | 2409.07368 | null |
2024-09-11 | Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation | SeongYeub Chu et.al. | 2409.07355 | link |
2024-09-11 | Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks | Md Zarif Hossain et.al. | 2409.07353 | link |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering | Weixi Weng et.al. | 2409.07331 | null |
2024-09-11 | MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Praveen K Kanithi et.al. | 2409.07314 | null |
2024-09-11 | Exploring User-level Gradient Inversion with a Diffusion Prior | Zhuohang Li et.al. | 2409.07291 | null |
2024-09-11 | STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM | Qijiong Liu et.al. | 2409.07276 | null |
2024-09-11 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-12 | Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Buhua Liu et.al. | 2409.07253 | link |
2024-09-11 | PiTe: Pixel-Temporal Alignment for Large Video-Language Model | Yang Liu et.al. | 2409.07239 | link |
2024-09-10 | Benchmarking Sub-Genre Classification For Mainstage Dance Music | Hongzhi Shu et.al. | 2409.06690 | null |
2024-09-10 | E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning | Zihan Liao et.al. | 2409.06679 | null |
2024-09-10 | LLaMA-Omni: Seamless Speech Interaction with Large Language Models | Qingkai Fang et.al. | 2409.06666 | link |
2024-09-10 | Human Perception of LLM-generated Text Content in Social Media Environments | Kristina Radivojevic et.al. | 2409.06653 | null |
2024-09-10 | Optimal Workload Placement on Multi-Instance GPUs | Bekir Turkkan et.al. | 2409.06646 | null |
2024-09-11 | EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis | Danli Shi et.al. | 2409.06644 | link |
2024-09-11 | Segmenting sea ice floes in close-range optical imagery with active contour and foundation models | Giulio Passerotti et.al. | 2409.06641 | null |
2024-09-10 | TeXBLEU: Automatic Metric for Evaluate LaTeX Format | Kyudan Jung et.al. | 2409.06639 | link |
2024-09-10 | MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders | Wenyu Zhang et.al. | 2409.06635 | null |
2024-09-10 | A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | Ningyuan Xi et.al. | 2409.06624 | null |
2024-09-10 | Exploring Italian sentence embeddings properties through multi-tasking | Vivi Nastase et.al. | 2409.06622 | link |
2024-09-10 | Alleviating Hallucinations in Large Language Models with Scepticism Modeling | Yetao Wu et.al. | 2409.06601 | null |
2024-09-10 | GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering | Sacha Muller et.al. | 2409.06595 | link |
2024-09-10 | Quantifying and Enabling the Interpretability of CLIP-like Models | Avinash Madasu et.al. | 2409.06579 | null |
2024-09-10 | Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement | Vivi Nastase et.al. | 2409.06567 | null |
2024-09-10 | MAPS: Energy-Reliability Tradeoff Management in Autonomous Vehicles Through LLMs Penetrated Science | Mahdieh Aliazam et.al. | 2409.06558 | null |
2024-09-10 | Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games | Juhwan Choi et.al. | 2409.06518 | link |
2024-09-10 | Aligning Machine and Human Visual Representations across Abstraction Levels | Lukas Muttenthaler et.al. | 2409.06509 | null |
2024-09-10 | Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding | Xiaoyu Liang et.al. | 2409.06485 | null |
2024-09-10 | Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles | Qiujing Lu et.al. | 2409.06450 | null |
2024-09-09 | MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct | Run Luo et.al. | 2409.05840 | null |
2024-09-09 | Are Large Language Models a Threat to Programming Platforms? An Exploratory Study | Md Mustakim Billah et.al. | 2409.05824 | null |
2024-09-09 | VFA: Vision Frequency Analysis of Foundation Models and Human | Mohammad-Javad Darvishi-Bayazi et.al. | 2409.05817 | null |
2024-09-09 | Improving Pretraining Data Using Perplexity Correlations | Tristan Thrush et.al. | 2409.05816 | null |
2024-09-09 | Benchmarking Chinese Knowledge Rectification in Large Language Models | Tianhe Lu et.al. | 2409.05806 | link |
2024-09-09 | Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models | Emily Cheng et.al. | 2409.05771 | null |
2024-09-09 | Model Input Verification of Large Scale Simulations | Rumyana Neykova et.al. | 2409.05768 | null |
2024-09-09 | A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System | B. Sankar et.al. | 2409.05747 | null |
2024-09-09 | LLMs Will Always Hallucinate, and We Need to Live With This | Sourav Banerjee et.al. | 2409.05746 | null |
2024-09-09 | A System and Benchmark for LLM-based Q\&A on Heterogeneous Data | Achille Fokoue et.al. | 2409.05735 | null |
2024-09-09 | Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach | Meng Zhou et.al. | 2409.05732 | null |
2024-09-09 | The Influence of Task and Group Disparities over Users’ Attitudes Toward Using Large Language Models for Psychotherapy | Qihang He et.al. | 2409.05703 | null |
2024-09-09 | Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features | Jacob Gildenblat et.al. | 2409.05697 | null |
2024-09-09 | Zero-shot Outlier Detection via Prior-data Fitted Networks: Model Selection Bygone! | Yuchen Shen et.al. | 2409.05672 | null |
2024-09-09 | Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case | Vagrant Gautam et.al. | 2409.05653 | link |
2024-09-10 | MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery | Hongjin Qian et.al. | 2409.05591 | link |
2024-09-09 | Leveraging Content and Acoustic Representations for Efficient Speech Emotion Recognition | Soumya Dutta et.al. | 2409.05566 | link |
2024-09-09 | CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning | Jinwei He et.al. | 2409.05559 | null |
2024-09-09 | SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Alireza Ghafarollahi et.al. | 2409.05556 | link |
2024-09-09 | Harmonic Reasoning in Large Language Models | Anna Kruspe et.al. | 2409.05521 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | link |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424 | null |
2024-09-06 | RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs | Jiaxing Wu et.al. | 2409.04421 | null |
2024-09-06 | Question-Answering Dense Video Events | Hangyu Qin et.al. | 2409.04388 | link |
2024-09-06 | Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs | Aliakbar Nafar et.al. | 2409.04318 | link |
2024-09-06 | An optically accelerated extreme learning machine using hot atomic vapors | Pierre Azam et.al. | 2409.04312 | null |
2024-09-06 | Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Desiree Heim et.al. | 2409.04286 | null |
2024-09-06 | Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models | Yuxiao Huang et.al. | 2409.04270 | null |
2024-09-06 | An overview of domain-specific foundation model: key technologies, applications and challenges | Haolong Chen et.al. | 2409.04267 | null |
2024-09-06 | UniDet3D: Multi-dataset Indoor 3D Object Detection | Maksim Kolodiazhnyi et.al. | 2409.04234 | link |
2024-09-06 | Fast Forwarding Low-Rank Training | Adir Rahamim et.al. | 2409.04206 | null |
2024-09-06 | Residual Stream Analysis with Multi-Layer SAEs | Tim Lawson et.al. | 2409.04185 | link |
2024-09-06 | GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding | Ziyin Zhang et.al. | 2409.04183 | null |
2024-09-06 | Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering | Larissa Pusch et.al. | 2409.04181 | null |
2024-09-06 | From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks | Andreas Stephan et.al. | 2409.04168 | null |
2024-09-06 | Can OpenSource beat ChatGPT? – A Comparative Study of Large Language Models for Text-to-Code Generation | Luis Mayer et.al. | 2409.04164 | null |
2024-09-06 | Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering | Jan Hofmann et.al. | 2409.04122 | null |
2024-09-06 | Multi-Programming Language Ensemble for Code Generation in Large Language Model | Tengfei Xue et.al. | 2409.04114 | link |
2024-09-06 | Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Chenglei Si et.al. | 2409.04109 | link |
2024-09-06 | UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity | Yicheng Fu et.al. | 2409.04081 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution | Marga Don et.al. | 2409.03754 | link |
2024-09-05 | Attention Heads of Large Language Models: A Survey | Zifan Zheng et.al. | 2409.03752 | link |
2024-09-05 | LLM-CI: Assessing Contextual Integrity Norms in Language Models | Yan Shvartzshnaider et.al. | 2409.03735 | null |
2024-09-05 | Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry | Meena Jagadeesan et.al. | 2409.03734 | null |
2024-09-05 | Planning In Natural Language Improves LLM Search For Code Generation | Evan Wang et.al. | 2409.03733 | link |
2024-09-06 | RAG based Question-Answering for Contextual Response Prediction System | Sriram Veturi et.al. | 2409.03708 | null |
2024-09-05 | LAST: Language Model Aware Speech Tokenization | Arnon Turetzky et.al. | 2409.03701 | null |
2024-09-05 | TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems | Stylianos Loukas Vasileiou et.al. | 2409.03671 | link |
2024-09-05 | A Fused Large Language Model for Predicting Startup Success | Abdurahman Maarouf et.al. | 2409.03668 | null |
2024-09-05 | The representation landscape of few-shot learning and fine-tuning in large language models | Diego Doimo et.al. | 2409.03662 | link |
2024-09-06 | LLM-based multi-agent poetry generation in non-cooperative environments | Ran Zhang et.al. | 2409.03659 | link |
2024-09-05 | On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization | Yong Lin et.al. | 2409.03650 | null |
2024-09-05 | Text-Guided Mixup Towards Long-Tailed Image Categorization | Richard Franklin et.al. | 2409.03583 | link |
2024-09-05 | FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation | Xi Chen et.al. | 2409.03525 | null |
2024-09-05 | Have Large Vision-Language Models Mastered Art History? | Ombretta Strafforello et.al. | 2409.03521 | null |
2024-09-05 | Tissue Concepts: supervised foundation models in computational pathology | Till Nicke et.al. | 2409.03519 | link |
2024-09-05 | From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents | Jifan Yu et.al. | 2409.03512 | null |
2024-09-05 | LLM-based event abstraction and integration for IoT-sourced logs | Mohsen Shirali et.al. | 2409.03478 | link |
2024-09-05 | How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes | Inacio Vieira et.al. | 2409.03454 | null |
2024-09-04 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) | Yao Mu et.al. | 2409.02920 | null |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-04 | Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng et.al. | 2409.02908 | null |
2024-09-05 | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Jiajie Zhang et.al. | 2409.02897 | link |
2024-09-04 | LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture | Xidong Wang et.al. | 2409.02889 | link |
2024-09-04 | CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently | Jonathan Zalach et.al. | 2409.02885 | null |
2024-09-04 | Benchmarking Spurious Bias in Few-Shot Image Classifiers | Guangtao Zheng et.al. | 2409.02882 | link |
2024-09-04 | Configurable Foundation Models: Building LLMs from a Modular Perspective | Chaojun Xiao et.al. | 2409.02877 | null |
2024-09-04 | Historical German Text Normalization Using Type- and Token-Based Language Modeling | Anton Ehrmanntraut et.al. | 2409.02841 | null |
2024-09-04 | Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Moein Shahiki Tash et.al. | 2409.02836 | null |
2024-09-04 | CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models | Wentao Liu et.al. | 2409.02834 | link |
2024-09-04 | ExpLLM: Towards Chain of Thought for Facial Expression Recognition | Xing Lan et.al. | 2409.02828 | null |
2024-09-04 | Design Contradictions: Help or Hindrance? | Aron E. Owen et.al. | 2409.02823 | null |
2024-09-04 | Language Understanding as a Constraint on Consensus Size in LLM Societies | Giordano De Marzo et.al. | 2409.02822 | null |
2024-09-04 | Towards a Unified View of Preference Learning for Large Language Models: A Survey | Bofei Gao et.al. | 2409.02795 | link |
2024-09-05 | Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Yixuan Tang et.al. | 2409.02727 | link |
2024-09-04 | Pre-training data selection for biomedical domain adaptation using journal impact metrics | Mathieu Laï-king et.al. | 2409.02725 | null |
2024-09-04 | Alignment-Aware Model Extraction Attacks on Large Language Models | Zi Liang et.al. | 2409.02718 | link |
2024-09-04 | Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL | Mohammad Reshadati et.al. | 2409.02711 | null |
2024-09-04 | LLM-Assisted Visual Analytics: Opportunities and Challenges | Maeve Hutchinson et.al. | 2409.02691 | null |
2024-08-30 | SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists | Raoyuan Zhao et.al. | 2408.17437 | link |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-08-30 | Advancing Multi-talker ASR Performance with Large Language Models | Mohan Shi et.al. | 2408.17431 | null |
2024-08-30 | CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Jonathan Bourne et.al. | 2408.17428 | link |
2024-09-03 | Open-vocabulary Temporal Action Localization using VLMs | Naoki Wake et.al. | 2408.17422 | null |
2024-08-30 | Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach | Jialiang Wei et.al. | 2408.17404 | link |
2024-08-30 | EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution | Francesco Argenziano et.al. | 2408.17379 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Francesca Grasso et.al. | 2408.17362 | link |
2024-08-30 | Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage | Md Rafi Ur Rashid et.al. | 2408.17354 | null |
2024-09-02 | LSMS: Language-guided Scale-aware MedSegmentor for Medical Image Referring Segmentation | Shuyi Ouyang et.al. | 2408.17347 | null |
2024-08-30 | Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering | Nicholas Pochinkov et.al. | 2408.17322 | link |
2024-08-30 | Bridging Domain Knowledge and Process Discovery Using Large Language Models | Ali Norouzifar et.al. | 2408.17316 | link |
2024-08-30 | Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts | Rhui Dih Lee et.al. | 2408.17280 | null |
2024-08-30 | Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach | Tong Nie et.al. | 2408.17258 | null |
2024-08-30 | VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Mouxiang Chen et.al. | 2408.17253 | link |
2024-08-30 | Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study | Shubham Agarwal et.al. | 2408.17181 | null |
2024-08-30 | Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Zhen Ye et.al. | 2408.17175 | link |
2024-08-30 | Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning | Xiaoye Qu et.al. | 2408.17150 | link |
2024-08-30 | Reasoning AI Performance Degradation in 6G Networks with Large Language Models | Liming Huang et.al. | 2408.17097 | null |
2024-08-29 | PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning | Noor Hussein et.al. | 2408.16769 | link |
2024-08-29 | How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models | Jiyue Jiang et.al. | 2408.16756 | link |
2024-08-29 | Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models | Alec Solway et.al. | 2408.16753 | null |
2024-08-29 | A Gradient Analysis Framework for Rewarding Good and Penalizing Bad Examples in Language Models | Yi-Lin Tuan et.al. | 2408.16751 | null |
2024-08-29 | Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge | Beidi Dong et.al. | 2408.16749 | null |
2024-08-29 | Theoretical and Methodological Framework for Studying Texts Produced by Large Language Models | Jiří Milička et.al. | 2408.16740 | null |
2024-08-29 | Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling | Hritik Bansal et.al. | 2408.16737 | null |
2024-08-29 | VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation | Shiwei Wu et.al. | 2408.16730 | null |
2024-08-30 | Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming | Zhifei Xie et.al. | 2408.16725 | link |
2024-08-29 | GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2408.16700 | link |
2024-08-29 | Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | Ziniu Li et.al. | 2408.16673 | null |
2024-08-29 | Space3D-Bench: Spatial 3D Question Answering Benchmark | Emilia Szymanska et.al. | 2408.16662 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-29 | Examination of Code generated by Large Language Models | Robin Beer et.al. | 2408.16601 | link |
2024-08-29 | Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies | Zhiyang Qi et.al. | 2408.16586 | null |
2024-08-29 | WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling | Shengpeng Ji et.al. | 2408.16532 | link |
2024-08-29 | CNIMA: A Universal Evaluation Framework and Automated Approach for Assessing Second Language Dialogues | Rena Gao et.al. | 2408.16518 | link |
2024-08-29 | LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs? | Jan Cegin et.al. | 2408.16502 | null |
2024-08-29 | CogVLM2: Visual Language Models for Image and Video Understanding | Wenyi Hong et.al. | 2408.16500 | link |
2024-08-29 | A Survey on Evaluating Large Language Models in Code Generation Tasks | Liguo Chen et.al. | 2408.16498 | null |
2024-08-28 | Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | Min Shi et.al. | 2408.15998 | link |
2024-08-29 | Spatio-Temporal Context Prompting for Zero-Shot Action Detection | Wei-Jhe Huang et.al. | 2408.15996 | null |
2024-08-28 | Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration | Xu Zhang et.al. | 2408.15994 | null |
2024-08-28 | BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems | Wei Wang et.al. | 2408.15971 | null |
2024-08-28 | More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding | Yuan Tang et.al. | 2408.15966 | link |
2024-08-28 | Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games | Nicholas R. Waytowich et.al. | 2408.15950 | null |
2024-08-28 | DeMoBot: Deformable Mobile Manipulation with Vision-based Sub-goal Retrieval | Yuying Zhang et.al. | 2408.15919 | null |
2024-08-28 | Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models | Yuncheng Yang et.al. | 2408.15915 | link |
2024-08-28 | Decentralized LLM Inference over Edge Networks with Energy Harvesting | Aria Khoshsirat et.al. | 2408.15907 | null |
2024-08-28 | LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments | Ruirui Chen et.al. | 2408.15903 | null |
2024-08-28 | Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts | Nikolas Gritsch et.al. | 2408.15901 | null |
2024-08-28 | Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models | Sebastian Vallejo Vera et.al. | 2408.15895 | null |
2024-08-28 | LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Fangxun Shu et.al. | 2408.15881 | link |
2024-08-28 | Persuasion Games using Large Language Models | Ganesh Prasath Ramani et.al. | 2408.15879 | null |
2024-08-28 | Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection | Sagar Srinivas Sakhinana et.al. | 2408.15866 | null |
2024-08-28 | Benchmarking foundation models as feature extractors for weakly-supervised computational pathology | Peter Neidlinger et.al. | 2408.15823 | null |
2024-08-28 | Visual Prompt Engineering for Medical Vision Language Models in Radiology | Stefan Denner et.al. | 2408.15802 | null |
2024-08-28 | Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization | Léo Hemamou et.al. | 2408.15801 | null |
2024-08-28 | Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models | Hédi Zhegidi et.al. | 2408.15796 | link |
2024-08-28 | Efficient LLM Scheduling by Learning to Rank | Yichao Fu et.al. | 2408.15792 | link |
2024-08-27 | Generative Verifiers: Reward Modeling as Next-Token Prediction | Lunjun Zhang et.al. | 2408.15240 | null |
2024-08-27 | The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Junxiong Wang et.al. | 2408.15237 | link |
2024-08-27 | Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang et.al. | 2408.15232 | null |
2024-08-27 | LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Nathaniel Li et.al. | 2408.15221 | null |
2024-08-27 | Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks | Shide Zhou et.al. | 2408.15207 | null |
2024-08-27 | Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation | Jian Hu et.al. | 2408.15205 | link |
2024-08-27 | Can Unconfident LLM Annotations Be Used for Confident Conclusions? | Kristina Gligorić et.al. | 2408.15204 | link |
2024-08-27 | Infusing Acoustic Pause Context into Text-Based Dementia Assessment | Franziska Braun et.al. | 2408.15188 | null |
2024-08-27 | Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement | Longshen Ou et.al. | 2408.15176 | null |
2024-08-27 | X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation | Hanjia Lyu et.al. | 2408.15172 | null |
2024-08-27 | Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation | N. E. Kriman et.al. | 2408.15171 | null |
2024-08-27 | How transformers learn structured data: insights from hierarchical filtering | Jerome Garnier-Brun et.al. | 2408.15138 | link |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models | Xiyu Liu et.al. | 2408.15091 | null |
2024-08-27 | BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | Guosheng Dong et.al. | 2408.15079 | null |
2024-08-27 | Constraining Participation: Affordances of Feedback Features in Interfaces to Large Language Models | Ned Cooper et.al. | 2408.15066 | null |
2024-08-27 | The Benefits of Balance: From Information Projections to Variance Reduction | Lang Liu et.al. | 2408.15065 | null |
2024-08-28 | DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding | Wenhui Liao et.al. | 2408.15045 | link |
2024-08-28 | A Survey of Large Language Models for European Languages | Wazir Ali et.al. | 2408.15040 | null |
2024-08-27 | Speech Recognition Transformers: Topological-lingualism Perspective | Shruti Singh et.al. | 2408.14991 | null |
2024-08-26 | A Practitioner’s Guide to Continual Multimodal Pretraining | Karsten Roth et.al. | 2408.14471 | link |
2024-08-27 | Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models | Aradhye Agarwal et.al. | 2408.14470 | link |
2024-08-26 | Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Qirui Chen et.al. | 2408.14469 | null |
2024-08-26 | Explicit Inductive Inference using Large Language Models | Tianyang Liu et.al. | 2408.14467 | null |
2024-08-26 | Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study | Liuchang Xu Shuo Zhao et.al. | 2408.14438 | null |
2024-08-26 | Social perception of faces in a vision-language model | Carina I. Hausladen et.al. | 2408.14435 | link |
2024-08-26 | CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models | Shubham Bharti et.al. | 2408.14419 | null |
2024-08-26 | MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | Kuluhan Binici et.al. | 2408.14418 | null |
2024-08-26 | Hyperdimensional Computing Empowered Federated Foundation Model over Wireless Networks for Metaverse | Yahao Ding et.al. | 2408.14416 | null |
2024-08-26 | Language-specific Calibration for Pruning Multilingual Language Models | Simon Kurz et.al. | 2408.14398 | null |
2024-08-26 | Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning | Sakhinana Sagar Srinivas et.al. | 2408.14387 | null |
2024-08-26 | Probing Causality Manipulation of Large Language Models | Chenyang Zhang et.al. | 2408.14380 | link |
2024-08-26 | An Embedding is Worth a Thousand Noisy Labels | Francesco Di Salvo et.al. | 2408.14358 | link |
2024-08-26 | SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Daoguang Zan et.al. | 2408.14354 | link |
2024-08-26 | Assessing Contamination in Large Language Models: Introducing the LogProber method | Nicolas Yax et.al. | 2408.14352 | null |
2024-08-27 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | Claim Verification in the Age of Large Language Models: A Survey | Alphaeus Dmonte et.al. | 2408.14317 | null |
2024-08-26 | LLM-3D Print: Large Language Models To Monitor and Control 3D Printing | Yayati Jadhav et.al. | 2408.14307 | null |
2024-08-26 | Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails | Malte Josten et.al. | 2408.14293 | link |
2024-08-26 | Predictability and Causality in Spanish and English Natural Language Generation | Andrea Busto-Castiñeira et.al. | 2408.14283 | null |
2024-08-23 | MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? | Yi-Fan Zhang et.al. | 2408.13257 | null |
2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D’Cruz et.al. | 2408.13253 | null |
2024-08-23 | Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption | Sakhinana Sagar Srinivas et.al. | 2408.13248 | null |
2024-08-23 | Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time | Yingyu Liang et.al. | 2408.13233 | null |
2024-08-23 | EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods | Hongcheng Ding et.al. | 2408.13214 | null |
2024-08-23 | DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation | Qiming Zhu et.al. | 2408.13204 | null |
2024-08-23 | Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Hourui Deng et.al. | 2408.13184 | null |
2024-08-23 | IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models | Zhihao Yu et.al. | 2408.13073 | link |
2024-08-23 | Guiding IoT-Based Healthcare Alert Systems with Large Language Models | Yulan Gao et.al. | 2408.13071 | null |
2024-08-23 | SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks | Kai-Wei Chang et.al. | 2408.13040 | null |
2024-08-23 | VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models | Wentao Wu et.al. | 2408.13031 | link |
2024-08-23 | In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting | Haowei Du et.al. | 2408.13028 | null |
2024-08-23 | A Web-Based Solution for Federated Learning with LLM-Based Automation | Chamith Mawela et.al. | 2408.13010 | null |
2024-08-23 | Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates | Hui Wei et.al. | 2408.13006 | link |
2024-08-23 | CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution | Ruiyang Xu et.al. | 2408.13001 | null |
2024-08-23 | Open Llama2 Model for the Lithuanian Language | Artūras Nakvosas et.al. | 2408.12963 | null |
2024-08-23 | Multimodal Contrastive In-Context Learning | Yosuke Miyanishi et.al. | 2408.12959 | null |
2024-08-23 | Image Segmentation in Foundation Model Era: A Survey | Tianfei Zhou et.al. | 2408.12957 | link |
2024-08-23 | E-code: Mastering Efficient Code Generation through Pretrained Models and Expert Encoder Group | Yue Pan et.al. | 2408.12948 | null |
2024-08-23 | Causal-Guided Active Learning for Debiasing Large Language Models | Zhouhao Sun et.al. | 2408.12942 | link |
2024-08-22 | Controllable Text Generation for Large Language Models: A Survey | Xun Liang et.al. | 2408.12599 | link |
2024-08-23 | Non-Homophilic Graph Pre-Training and Prompt Learning | Xingtong Yu et.al. | 2408.12594 | link |
2024-08-22 | RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment | Xiaohan Wang et.al. | 2408.12579 | null |
2024-08-22 | MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Haojun Shi et.al. | 2408.12574 | link |
2024-08-22 | Jamba-1.5: Hybrid Transformer-Mamba Models at Scale | Jamba Team et.al. | 2408.12570 | null |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-22 | Towards Evaluating and Building Versatile Large Language Models for Medicine | Chaoyi Wu et.al. | 2408.12547 | link |
2024-08-22 | Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Jinheng Xie et.al. | 2408.12528 | null |
2024-08-22 | MEDCO: Medical Education Copilots Based on A Multi-Agent Framework | Hao Wei et.al. | 2408.12496 | null |
2024-08-22 | GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models | Kunsheng Tang et.al. | 2408.12494 | link |
2024-08-23 | Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Khang T. Doan et.al. | 2408.12480 | null |
2024-08-22 | Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition | Bozheng Li et.al. | 2408.12475 | null |
2024-08-22 | DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems | Jiaju Chen et.al. | 2408.12470 | link |
2024-08-22 | Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning | Mushui Liu et.al. | 2408.12469 | null |
2024-08-22 | Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing | Mengqi Zhang et.al. | 2408.12456 | null |
2024-08-22 | Positional Description for Numerical Normalization | Deepanshu Gupta et.al. | 2408.12430 | null |
2024-08-22 | FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing | Jue Wang et.al. | 2408.12429 | link |
2024-08-22 | Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification | Sudi Murindanyi et.al. | 2408.12426 | null |
2024-08-22 | Unlearning Trojans in Large Language Models: A Comparison Between Natural Language and Source Code | Mahdi Kazemi et.al. | 2408.12416 | null |
2024-08-22 | Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes | Sota Kato et.al. | 2408.12406 | link |
2024-08-21 | Great Memory, Shallow Reasoning: Limits of $k$ NN-LMs | Shangyi Geng et.al. | 2408.11815 | link |
2024-08-21 | SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs | Yuanyang Yin et.al. | 2408.11813 | null |
2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | null |
2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
2024-08-21 | Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Yuzhou Huang et.al. | 2408.11801 | null |
2024-08-21 | PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain | Rounak Meyur et.al. | 2408.11800 | null |
2024-08-21 | Practical token pruning for foundation models in few-shot conversational virtual assistant systems | Haode Qi et.al. | 2408.11799 | null |
2024-08-21 | EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model | Feipeng Ma et.al. | 2408.11795 | null |
2024-08-21 | Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design | Nathaniel H. Park et.al. | 2408.11793 | null |
2024-08-21 | Critique-out-Loud Reward Models | Zachary Ankner et.al. | 2408.11791 | link |
2024-08-21 | DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Zhifei Xie et.al. | 2408.11788 | null |
2024-08-21 | Personality Alignment of Large Language Models | Minjun Zhu et.al. | 2408.11779 | link |
2024-08-21 | Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Omar Erak et.al. | 2408.11775 | link |
2024-08-21 | Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks | Yiyi Chen et.al. | 2408.11749 | link |
2024-08-21 | DH-Bench: Probing Depth and Height Perception of Large Visual-Language Models | Shehreen Azad et.al. | 2408.11748 | link |
2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | null |
2024-08-21 | Mixed Sparsity Training: Achieving 4 $\times$ FLOP Reduction for Transformer Pretraining | Pihe Hu et.al. | 2408.11746 | null |
2024-08-21 | FocusLLM: Scaling LLM’s Context by Parallel Decoding | Zhenyu Li et.al. | 2408.11745 | link |
2024-08-21 | MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models | Elias Frantar et.al. | 2408.11743 | link |
2024-08-21 | CluMo: Cluster-based Modality Fusion Prompt for Continual Learning in Visual Question Answering | Yuliang Cai et.al. | 2408.11742 | link |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks | Nathaniel Pinckney et.al. | 2408.11053 | link |
2024-08-20 | FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Yunzhe Xu et.al. | 2408.11051 | link |
2024-08-21 | MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Jian Chen et.al. | 2408.11049 | link |
2024-08-20 | Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders | Yuan Xin et.al. | 2408.11046 | null |
2024-08-20 | Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research | Sreyoshi Bhaduri et.al. | 2408.11043 | null |
2024-08-20 | Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Chunting Zhou et.al. | 2408.11039 | null |
2024-08-20 | Scaling Law with Learning Rate Annealing | Howe Tissue et.al. | 2408.11029 | null |
2024-08-20 | Athena: Safe Autonomous Agents with Verbal Contrastive Learning | Tanmana Sadhu et.al. | 2408.11021 | null |
2024-08-20 | While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output? | Wen Cheng et.al. | 2408.11006 | link |
2024-08-20 | SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining | Jonathan Prexl et.al. | 2408.11000 | link |
2024-08-20 | CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models | Michael Reinisch et.al. | 2408.10995 | null |
2024-08-20 | Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models | Yuyan Chen et.al. | 2408.10947 | null |
2024-08-20 | Large Language Model Driven Recommendation | Anton Korikov et.al. | 2408.10946 | null |
2024-08-20 | HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments | Kazi Hasan Ibn Arif et.al. | 2408.10945 | link |
2024-08-20 | SysBench: Can Large Language Models Follow System Messages? | Yanzhao Qin et.al. | 2408.10943 | link |
2024-08-20 | Proxona: Leveraging LLM-Driven Personas to Enhance Creators’ Understanding of Their Audience | Yoonseo Choi et.al. | 2408.10937 | null |
2024-08-21 | LBC: Language-Based-Classifier for Out-Of-Variable Generalization | Kangjun Noh et.al. | 2408.10923 | link |
2024-08-21 | BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Yeyong Yu et.al. | 2408.10903 | link |
2024-08-20 | Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs | John Mendonça et.al. | 2408.10902 | link |
2024-08-19 | SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP | Yusuke Hirota et.al. | 2408.10202 | null |
2024-08-19 | Demystifying the Communication Characteristics for Distributed Transformer Models | Quentin Anthony et.al. | 2408.10197 | null |
2024-08-19 | Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models | Aviv Bick et.al. | 2408.10189 | null |
2024-08-19 | LongVILA: Scaling Long-Context Visual Language Models for Long Videos | Fuzhao Xue et.al. | 2408.10188 | link |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Xiaoyu Kong et.al. | 2408.10159 | link |
2024-08-19 | Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models | Amey Hengle et.al. | 2408.10151 | link |
2024-08-19 | In-Context Learning with Representations: Contextual Generalization of Trained Transformers | Tong Yang et.al. | 2408.10147 | null |
2024-08-19 | Instruction Finetuning for Leaderboard Generation from Empirical AI Research | Salomon Kabongo et.al. | 2408.10141 | null |
2024-08-19 | Rhyme-aware Chinese lyric generator based on GPT | Yixiao Yuan et.al. | 2408.10130 | null |
2024-08-19 | Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | Feiyu Pan et.al. | 2408.10125 | null |
2024-08-19 | Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models | Tianyu Zhang et.al. | 2408.10124 | link |
2024-08-19 | Geometry Informed Tokenization of Molecules for Language Model Generation | Xiner Li et.al. | 2408.10120 | null |
2024-08-19 | GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization | Ran Liu et.al. | 2408.10115 | link |
2024-08-20 | PLUTUS: A Well Pre-trained Large Unified Transformer can Unveil Financial Time Series Regularities | Yuanjian Xu et.al. | 2408.10111 | null |
2024-08-19 | ARMADA: Attribute-Based Multimodal Data Augmentation | Xiaomeng Jin et.al. | 2408.10086 | null |
2024-08-19 | Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning | Sriyash Poddar et.al. | 2408.10075 | null |
2024-08-19 | FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Zhengchao Huang et.al. | 2408.10072 | link |
2024-08-19 | Privacy Checklist: Privacy Violation Detection Grounding on Contextual Integrity Theory | Haoran Li et.al. | 2408.10053 | null |
2024-08-19 | Defense Priorities in the Open-Source AI Debate: A Preliminary Assessment | Masao Dahlgren et.al. | 2408.10026 | null |
2024-08-16 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation | Xinyu Xiong et.al. | 2408.08870 | link |
2024-08-16 | PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars | Sumanth Prabhu et.al. | 2408.08869 | null |
2024-08-16 | A Hassle-free Algorithm for Private Learning in Practice: Don’t Use Tree Aggregation, Use BLTs | H. Brendan McMahan et.al. | 2408.08868 | null |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862 | link |
2024-08-16 | DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models | Eman Ali et.al. | 2408.08855 | link |
2024-08-16 | GeoTransformer: Enhancing Urban Forecasting with Geospatial Attention Mechanisms | Yuhao Jia et.al. | 2408.08852 | null |
2024-08-16 | ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis | Yubao Zhao et.al. | 2408.08849 | link |
2024-08-16 | PsychoLex: Unveiling the Psychological Mind of Large Language Models | Mohammad Amin Abbasi et.al. | 2408.08848 | null |
2024-08-16 | FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats | Xuanliang Zhang et.al. | 2408.08841 | link |
2024-08-16 | EasyRec: Simple yet Effective Language Models for Recommendation | Xubin Ren et.al. | 2408.08821 | link |
2024-08-16 | Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models | Lin Zhao et.al. | 2408.08813 | null |
2024-08-16 | Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors | Felipe A. Csaszar et.al. | 2408.08811 | null |
2024-08-16 | Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge | Ravi Raju et.al. | 2408.08808 | null |
2024-08-16 | CIKMar: A Dual-Encoder Approach to Prompt-Based Reranking in Educational Dialogue Systems | Joanito Agili Lopo et.al. | 2408.08805 | null |
2024-08-16 | A Disease-Specific Foundation Model Using Over 100K Fundus Images: Release and Validation for Abnormality and Multi-Disease Classification on Downstream Tasks | Boa Jang et.al. | 2408.08790 | link |
2024-08-16 | EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics | Chenwei Wan et.al. | 2408.08782 | link |
2024-08-16 | Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Chenming Tang et.al. | 2408.08780 | null |
2024-08-16 | DAC: Decomposed Automation Correction for Text-to-SQL | Dingzirui Wang et.al. | 2408.08779 | link |
2024-08-16 | Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused | Dingwei Chen et.al. | 2408.08769 | null |
2024-08-16 | Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM | Wanting Yang et.al. | 2408.08765 | null |
2024-08-15 | Can Large Language Models Understand Symbolic Graphics Programs? | Zeju Qiu et.al. | 2408.08313 | null |
2024-08-15 | ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li et.al. | 2408.08310 | null |
2024-08-15 | Towards Flexible Visual Relationship Segmentation | Fangrui Zhu et.al. | 2408.08305 | null |
2024-08-15 | Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors | Usman Syed et.al. | 2408.08302 | null |
2024-08-15 | VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps | Senthil Hariharan Arul et.al. | 2408.08301 | null |
2024-08-15 | HELP: Hierarchical Embeddings-based Log Parsing | Andy Xu et.al. | 2408.08300 | null |
2024-08-15 | The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community | Shachar Don-Yehiya et.al. | 2408.08291 | null |
2024-08-15 | Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model | Jin Wang et.al. | 2408.08282 | null |
2024-08-15 | BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts | Qizhen Zhang et.al. | 2408.08274 | null |
2024-08-15 | DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System | Xihong Yang et.al. | 2408.08231 | null |
2024-08-15 | RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Classifiers for Computational Social Science | David Farr et.al. | 2408.08217 | null |
2024-08-15 | Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models | Javier González et.al. | 2408.08210 | null |
2024-08-15 | LLM4DSR: Leveraing Large Language Model for Denoising Sequential Recommendation | Bohao Wang et.al. | 2408.08208 | null |
2024-08-15 | Heavy Labels Out! Dataset Distillation with Label Space Lightening | Ruonan Yu et.al. | 2408.08201 | null |
2024-08-15 | Scaling Up Natural Language Understanding for Multi-Robots Through the Lens of Hierarchy | Shaojun Xu et.al. | 2408.08188 | null |
2024-08-15 | General-purpose Clothes Manipulation with Semantic Keypoints | Yuhong Deng et.al. | 2408.08160 | null |
2024-08-15 | EmBARDiment: an Embodied AI Agent for Productivity in XR | Riccardo Bovo et.al. | 2408.08158 | null |
2024-08-15 | DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search | Huajian Xin et.al. | 2408.08152 | link |
2024-08-15 | P/D-Serve: Serving Disaggregated Large Language Model at Scale | Yibo Jin et.al. | 2408.08147 | null |
2024-08-15 | KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning | Kaiqi Zhang et.al. | 2408.08146 | null |
2024-08-14 | The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models | Karime Maamari et.al. | 2408.07702 | null |
2024-08-15 | Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities | Enneng Yang et.al. | 2408.07666 | link |
2024-08-14 | Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models | Yi-Cheng Lin et.al. | 2408.07665 | link |
2024-08-14 | Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu et.al. | 2408.07663 | link |
2024-08-14 | WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs | Weijian Xie et.al. | 2408.07611 | null |
2024-08-14 | Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey | Hamza Kheddar et.al. | 2408.07583 | null |
2024-08-15 | MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Minxuan Zhou et.al. | 2408.07543 | link |
2024-08-15 | Usefulness of data flow diagrams and large language models for security threat validation: a registered report | Winnie Bahati Mbaka et.al. | 2408.07537 | null |
2024-08-14 | Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments | Seungjun Han et.al. | 2408.07531 | null |
2024-08-14 | Large Language Models Know What Makes Exemplary Contexts | Quanyu Long et.al. | 2408.07505 | null |
2024-08-14 | Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Shizhou Zhang et.al. | 2408.07500 | link |
2024-08-14 | QirK: Question Answering via Intermediate Representation on Knowledge Graphs | Jan Luca Scheerer et.al. | 2408.07494 | null |
2024-08-14 | Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Ning Lu et.al. | 2408.07482 | null |
2024-08-14 | Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization | Yuxin Jiang et.al. | 2408.07471 | link |
2024-08-14 | Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification | Yongcheng Li et.al. | 2408.07467 | link |
2024-08-14 | Large Language Models Prompting With Episodic Memory | Dai Do et.al. | 2408.07465 | null |
2024-08-14 | From Brazilian Portuguese to European Portuguese | João Sanches et.al. | 2408.07457 | null |
2024-08-14 | Fact or Fiction? Improving Fact Verification with Knowledge Graphs through Simplified Subgraph Retrievals | Tobias A. Opsahl et.al. | 2408.07453 | link |
2024-08-15 | BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning | Asif Hanif et.al. | 2408.07440 | link |
2024-08-14 | Beyond Inter-Item Relations: Dynamic Adaptive Mixture-of-Experts for LLM-Based Sequential Recommendation | CanYi Liu et.al. | 2408.07427 | null |
2024-08-13 | Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents | Kexun Zhang et.al. | 2408.07060 | null |
2024-08-13 | LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | Yushi Bai et.al. | 2408.07055 | link |
2024-08-13 | Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models | Chun Jie Chong et.al. | 2408.07004 | null |
2024-08-13 | LLMs can Schedule | Henrik Abgaryan et.al. | 2408.06993 | link |
2024-08-13 | DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs | Dongyuan Li et.al. | 2408.06966 | null |
2024-08-13 | Towards Holistic Disease Risk Prediction using Small Language Models | Liv Björkdahl et.al. | 2408.06943 | null |
2024-08-13 | OpenResearcher: Unleashing AI for Accelerated Scientific Research | Yuxiang Zheng et.al. | 2408.06941 | link |
2024-08-13 | The advantages of context specific language models: the case of the Erasmian Language Model | João Gonçalves et.al. | 2408.06931 | link |
2024-08-13 | Evaluating Cultural Adaptability of a Large Language Model via Simulation of Synthetic Personas | Louis Kwok et.al. | 2408.06929 | link |
2024-08-13 | SceneGPT: A Language Model for 3D Scene Understanding | Shivam Chandhok et.al. | 2408.06926 | null |
2024-08-13 | Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives | Zhihu Wang et.al. | 2408.06904 | null |
2024-08-13 | Leveraging Language Models for Emotion and Behavior Analysis in Education | Kaito Tanaka et.al. | 2408.06874 | null |
2024-08-13 | LoRA $^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models | Jia-Chen Zhang et.al. | 2408.06854 | null |
2024-08-13 | Causal Agent based on Large Language Model | Kairong Han et.al. | 2408.06849 | link |
2024-08-13 | DracoGPT: Extracting Visualization Design Preferences from Large Language Models | Huichen Will Wang et.al. | 2408.06845 | null |
2024-08-13 | How Aligned are Human Chart Takeaways and LLM Predictions? A Case Study on Bar Charts with Varying Layouts | Huichen Will Wang et.al. | 2408.06837 | null |
2024-08-13 | Efficient Search for Customized Activation Functions with Gradient Descent | Lukas Strack et.al. | 2408.06820 | link |
2024-08-13 | MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Yongjin Yang et.al. | 2408.06816 | link |
2024-08-13 | HLSPilot: LLM-based High-Level Synthesis | Chenwei Xiong et.al. | 2408.06810 | link |
2024-08-13 | Layerwise Recurrent Router for Mixture-of-Experts | Zihan Qiu et.al. | 2408.06793 | link |
2024-08-12 | FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Yufei Huang et.al. | 2408.06333 | link |
2024-08-12 | Animate, or Inanimate, That is the Question for Large Language Models | Leonardo Ranaldi et.al. | 2408.06332 | null |
2024-08-12 | Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let’s Take TravelPlanner as an Example | Yanan Chen et.al. | 2408.06318 | null |
2024-08-12 | Long-Form Answers to Visual Questions from Blind and Low Vision People | Mina Huh et.al. | 2408.06303 | null |
2024-08-12 | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Chris Lu et.al. | 2408.06292 | link |
2024-08-12 | MovieSum: An Abstractive Summarization Dataset for Movie Screenplays | Rohit Saxena et.al. | 2408.06281 | link |
2024-08-13 | Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation | Jieyong Kim et.al. | 2408.06276 | link |
2024-08-13 | FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Haoran Sun et.al. | 2408.06273 | link |
2024-08-12 | A RAG-Based Question-Answering Solution for Cyber-Attack Investigation and Attribution | Sampath Rajapaksha et.al. | 2408.06272 | null |
2024-08-12 | Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment | Karel D’Oosterlinck et.al. | 2408.06266 | link |
2024-08-12 | Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning | Yingjin Song et.al. | 2408.06259 | null |
2024-08-12 | On Effects of Steering Latent Representation for Large Language Model Unlearning | Dang Huu-Tien et.al. | 2408.06223 | link |
2024-08-12 | Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Zhenting Qi et.al. | 2408.06195 | link |
2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-12 | Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting | Halley Young et.al. | 2408.06186 | null |
2024-08-12 | OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning | Mushui Liu et.al. | 2408.06158 | link |
2024-08-12 | LipidBERT: A Lipid Language Model Pre-trained on METiS de novo Lipid Library | Tianhao Yu et.al. | 2408.06150 | null |
2024-08-12 | Self-Supervised Learning on MeerKAT Wide-Field Continuum Images | Erica Lastufka et.al. | 2408.06147 | link |
2024-08-12 | Med42-v2: A Suite of Clinical LLMs | Clément Christophe et.al. | 2408.06142 | null |
2024-08-12 | Utilize Transformers for translating Wikipedia category names | Hoang-Thang Ta et.al. | 2408.06124 | null |
2024-08-10 | Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions | Michele Miranda et.al. | 2408.05212 | link |
2024-08-09 | VITA: Towards Open-Source Interactive Omni Multimodal LLM | Chaoyou Fu et.al. | 2408.05211 | link |
2024-08-09 | Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners | Michael Vaccaro Jr et.al. | 2408.05204 | null |
2024-08-09 | TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning | Yujie Feng et.al. | 2408.05200 | link |
2024-08-09 | ECG-FM: An Open Electrocardiogram Foundation Model | Kaden McKeen et.al. | 2408.05178 | link |
2024-08-09 | Weak-Annotation of HAR Datasets using Vision Foundation Models | Marius Bock et.al. | 2408.05169 | link |
2024-08-09 | AttackER: Towards Enhancing Cyber-Attack Attribution with a Named Entity Recognition Dataset | Pritam Deka et.al. | 2408.05149 | null |
2024-08-09 | A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning | Ye Yuan et.al. | 2408.05141 | null |
2024-08-09 | Is ChatGPT a Good Software Librarian? An Exploratory Study on the Use of ChatGPT for Software Library Recommendations | Jasmine Latendresse et.al. | 2408.05128 | null |
2024-08-09 | Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media | Petre Breazu et.al. | 2408.05126 | null |
2024-08-09 | Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video | Chunggi Lee et.al. | 2408.05123 | null |
2024-08-09 | A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? | Xinyu Liu et.al. | 2408.05109 | link |
2024-08-09 | Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection | Xincheng Pang et.al. | 2408.05107 | null |
2024-08-09 | How Well Do LLMs Identify Cultural Unity in Diversity? | Jialin Li et.al. | 2408.05102 | link |
2024-08-09 | Hyperbolic Learning with Multimodal Large Language Models | Paolo Mandica et.al. | 2408.05097 | null |
2024-08-09 | Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts | Tingchen Fu et.al. | 2408.05094 | null |
2024-08-09 | Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models | Zikai Xie et.al. | 2408.05093 | link |
2024-08-09 | Generating novel experimental hypotheses from language models: A case study on cross-dative generalization | Kanishka Misra et.al. | 2408.05086 | link |
2024-08-09 | RT-Surv: Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring of Large-Scale Unstructured Electronic Health Records | Sangjoon Park et.al. | 2408.05074 | null |
2024-08-09 | Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil | Marcelo Sartori Locatelli et.al. | 2408.05035 | null |
2024-08-08 | Better Alignment with Instruction Back-and-Forth Translation | Thao Nguyen et.al. | 2408.04614 | null |
2024-08-08 | Code-switching in text and speech reveals information-theoretic audience design | Debasmita Bhattacharya et.al. | 2408.04596 | null |
2024-08-09 | Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Qirui Jiao et.al. | 2408.04594 | link |
2024-08-08 | Towards Resilient and Efficient LLMs: A Comparative Study of Efficiency, Performance, and Adversarial Robustness | Xiaojing Fan et.al. | 2408.04585 | null |
2024-08-08 | SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Tianrun Chen et.al. | 2408.04579 | null |
2024-08-08 | SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals | Haoran Zheng et.al. | 2408.04575 | null |
2024-08-08 | Learning Fine-Grained Grounded Citations for Attributed Large Language Models | Lei Huang et.al. | 2408.04568 | link |
2024-08-08 | Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models | Yupeng Chang et.al. | 2408.04556 | link |
2024-08-08 | Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Daniele Rege Cambrin et.al. | 2408.04523 | link |
2024-08-08 | Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models | Fabio Pernisi et.al. | 2408.04522 | null |
2024-08-08 | What You Need is What You Get: Theory of Mind for an LLM-Based Code Understanding Assistant | Jonan Richards et.al. | 2408.04477 | null |
2024-08-08 | Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate | Yiqun Zhang et.al. | 2408.04472 | link |
2024-08-08 | RiskAwareBench: Towards Evaluating Physical Risk Awareness for High-level Planning of LLM-based Embodied Agents | Zihao Zhu et.al. | 2408.04449 | link |
2024-08-08 | Large Language Models for cross-language code clone detection | Micheline Bénédicte Moumoula et.al. | 2408.04430 | link |
2024-08-08 | Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models | Philipp Müller et.al. | 2408.04420 | null |
2024-08-08 | Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning | Seong-Il Park et.al. | 2408.04414 | null |
2024-08-08 | Deeploy: Enabling Energy-Efficient Deployment of Small Language Models On Heterogeneous Microcontrollers | Moritz Scherer et.al. | 2408.04413 | null |
2024-08-08 | Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset | Kentaro Ozeki et.al. | 2408.04403 | link |
2024-08-08 | Automated Educational Question Generation at Different Bloom’s Skill Levels using Large Language Models: Strategies and Evaluation | Nicy Scaria et.al. | 2408.04394 | link |
2024-08-08 | Open-domain Implicit Format Control for Large Language Model Generation | Yiqun Yao et.al. | 2408.04392 | link |
2024-08-07 | How Well Can Vision Language Models See Image Details? | Chenhui Gou et.al. | 2408.03940 | null |
2024-08-07 | SLIM-RAFT: A Novel Fine-Tuning Approach to Improve Cross-Linguistic Performance for Mercosur Common Nomenclature | Vinícius Di Oliveira et.al. | 2408.03936 | null |
2024-08-07 | CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases | Xiangyan Liu et.al. | 2408.03910 | link |
2024-08-07 | Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models | Shachi H Kumar et.al. | 2408.03907 | null |
2024-08-07 | Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Beomseok Lee et.al. | 2408.03900 | link |
2024-08-07 | Simplifying Scholarly Abstracts for Accessible Digital Libraries | Haining Wang et.al. | 2408.03899 | link |
2024-08-07 | From Data to Story: Towards Automatic Animated Data Video Creation with LLM-based Multi-Agent Systems | Leixian Shen et.al. | 2408.03876 | null |
2024-08-07 | PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | Haoran Xu et.al. | 2408.03865 | null |
2024-08-07 | GAIA – A Large Language Model for Advanced Power Dispatch | Yuheng Cheng et.al. | 2408.03847 | null |
2024-08-07 | MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models | Yuchen Dong et.al. | 2408.03841 | null |
2024-08-07 | WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models | Prannaya Gupta et.al. | 2408.03837 | link |
2024-08-07 | Target Prompting for Information Extraction with Vision Language Model | Dipankar Medhi et.al. | 2408.03834 | null |
2024-08-07 | Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning | Simret Araya Gebreegziabher et.al. | 2408.03819 | null |
2024-08-07 | Generative Language Models with Retrieval Augmented Generation for Automated Short Answer Scoring | Zifan Wang et.al. | 2408.03811 | null |
2024-08-07 | ‘Finance Wizard’ at the FinLLM Challenge Task: Financial Text Summarization | Meisin Lee et.al. | 2408.03762 | null |
2024-08-07 | MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video | Xiaoqing Guo et.al. | 2408.03761 | null |
2024-08-07 | Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation | Jingjing Xie et.al. | 2408.03735 | link |
2024-08-07 | Question Rephrasing for Quantifying Uncertainty in Large Language Models: Applications in Molecular Chemistry Tasks | Zizhang Chen et.al. | 2408.03732 | null |
2024-08-07 | A Convex-optimization-based Layer-wise Post-training Pruner for Large Language Models | Pengxiang Zhao et.al. | 2408.03728 | null |
2024-08-07 | Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction | Benjamin Matthias Ruppik et.al. | 2408.03706 | null |
2024-08-06 | CoverBench: A Challenging Benchmark for Complex Claim Verification | Alon Jacovi et.al. | 2408.03325 | null |
2024-08-06 | Segment Anything in Medical Images and Videos: Benchmark and Deployment | Jun Ma et.al. | 2408.03322 | link |
2024-08-06 | TextIM: Part-aware Interactive Motion Synthesis from Text | Siyuan Fan et.al. | 2408.03302 | null |
2024-08-06 | KaPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models | Ruizhe Zhang et.al. | 2408.03297 | null |
2024-08-06 | Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Zhiling Yan et.al. | 2408.03286 | link |
2024-08-07 | StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation | Boxi Cao et.al. | 2408.03281 | link |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | Synthesizing Text-to-SQL Data from Weak and Strong LLMs | Jiaxi Yang et.al. | 2408.03256 | null |
2024-08-06 | Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang et.al. | 2408.03247 | link |
2024-08-06 | Making Long-Context Language Models Better Multi-Hop Reasoners | Yanyang Li et.al. | 2408.03246 | link |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | Conditioning LLMs with Emotion in Neural Machine Translation | Charles Brazier et.al. | 2408.03150 | null |
2024-08-06 | Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization | Yanghai Zhang et.al. | 2408.03149 | link |
2024-08-06 | Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations | Leo Donisch et.al. | 2408.03130 | null |
2024-08-06 | Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation | Artur Guimarães et.al. | 2408.03127 | link |
2024-08-06 | Evaluating the Translation Performance of Large Language Models Based on Euas-20 | Yan Huang et.al. | 2408.03119 | null |
2024-08-06 | Topic Modeling with Fine-tuning LLMs and Bag of Sentences | Johannes Schneider et.al. | 2408.03099 | link |
2024-08-07 | TestART: Improving LLM-based Unit Test via Co-evolution of Automated Generation and Repair Iteration | Siqi Gu et.al. | 2408.03095 | null |
2024-08-06 | 500xCompressor: Generalized Prompt Compression for Large Language Models | Zongqian Li et.al. | 2408.03094 | link |
2024-08-06 | Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement | Le Yu et.al. | 2408.03092 | link |
2024-08-05 | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Dongyang Liu et.al. | 2408.02657 | link |
2024-08-05 | Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models? | Mohammad Bahrami Karkevandi et.al. | 2408.02651 | null |
2024-08-05 | Command-line Obfuscation Detection using Small Language Models | Vojtech Outrata et.al. | 2408.02637 | null |
2024-08-05 | SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models | Muxi Diao et.al. | 2408.02632 | null |
2024-08-05 | Language Model Can Listen While Speaking | Ziyang Ma et.al. | 2408.02622 | null |
2024-08-05 | Progressively Selective Label Enhancement for Language Model Alignment | Biao Liu et.al. | 2408.02599 | null |
2024-08-05 | Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection | Sajal Aggarwal et.al. | 2408.02595 | null |
2024-08-05 | Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization | Ankan Mullick et.al. | 2408.02584 | null |
2024-08-05 | DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions | Siying Hu et.al. | 2408.02574 | null |
2024-08-05 | Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information | Yauwai Yim et.al. | 2408.02559 | null |
2024-08-05 | Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning | Hao Zhou et.al. | 2408.02549 | null |
2024-08-05 | RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation | Daniel Fleischer et.al. | 2408.02545 | link |
2024-08-05 | Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Xinbei Ma et.al. | 2408.02544 | link |
2024-08-05 | Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph | Zhao Kaichen et.al. | 2408.02535 | null |
2024-08-05 | Practical Attacks against Black-box Code Completion Engines | Slobodan Jenko et.al. | 2408.02509 | null |
2024-08-05 | UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Zhaowei Li et.al. | 2408.02503 | link |
2024-08-05 | Context Conquers Parameters: Outperforming Proprietary LLM in Commit Message Generation | Aaron Imani et.al. | 2408.02502 | link |
2024-08-05 | A First Look at License Compliance Capability of LLMs in Code Generation | Weiwei Xu et.al. | 2408.02487 | link |
2024-08-05 | Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection | Ting Lei et.al. | 2408.02484 | link |
2024-08-05 | From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Haolin Jin et.al. | 2408.02479 | null |
2024-08-02 | Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting | Xiangyu Zhao et.al. | 2408.01423 | null |
2024-08-02 | Mission Impossible: A Statistical Perspective on Jailbreaking LLMs | Jingtong Su et.al. | 2408.01420 | null |
2024-08-02 | DebateQA: Evaluating Question Answering on Debatable Knowledge | Rongwu Xu et.al. | 2408.01419 | link |
2024-08-02 | Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs | Yilun Hua et.al. | 2408.01417 | null |
2024-08-02 | Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer | Yu Yang et.al. | 2408.01402 | null |
2024-08-02 | Coalitions of Large Language Models Increase the Robustness of AI Agents | Prattyush Mangal et.al. | 2408.01380 | null |
2024-08-02 | Toward Automatic Relevance Judgment using Vision–Language Models for Image–Text Retrieval Evaluation | Jheng-Hong Yang et.al. | 2408.01363 | null |
2024-08-02 | Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Peng Ding et.al. | 2408.01355 | link |
2024-08-02 | MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code | Kaiwen Ning et.al. | 2408.01354 | link |
2024-08-02 | Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks | Anders Giovanni Møller et.al. | 2408.01346 | null |
2024-08-02 | MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models | Benno Weck et.al. | 2408.01337 | link |
2024-08-02 | A Backbone for Long-Horizon Robot Task Understanding | Xiaoshuai Chen et.al. | 2408.01334 | null |
2024-08-02 | FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only | He Zhu et.al. | 2408.01323 | null |
2024-08-02 | A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks | Jiaqi Wang et.al. | 2408.01319 | null |
2024-08-02 | Reconsidering Token Embeddings with the Definitions for Pre-trained Language Models | Ying Zhang et.al. | 2408.01308 | null |
2024-08-02 | The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | Hannah Chen et.al. | 2408.01285 | null |
2024-08-02 | RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Kunlun Zhu et.al. | 2408.01262 | link |
2024-08-02 | The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models | Simone Caldarella et.al. | 2408.01228 | null |
2024-08-02 | High-Throughput Phenotyping of Clinical Text Using Large Language Models | Daniel B. Hier et.al. | 2408.01214 | null |
2024-08-02 | Misinforming LLMs: vulnerabilities, challenges and opportunities | Bo Zhou et.al. | 2408.01168 | null |
2024-08-01 | AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Mengkang Hu et.al. | 2408.00764 | link |
2024-08-01 | UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Xiangyu Fan et.al. | 2408.00762 | null |
2024-08-01 | Tamper-Resistant Safeguards for Open-Weight LLMs | Rishub Tamirisa et.al. | 2408.00761 | link |
2024-08-01 | Thermal Conductivity Predictions with Foundation Atomistic Models | Balázs Póta et.al. | 2408.00755 | link |
2024-08-01 | Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model | Benlin Liu et.al. | 2408.00754 | null |
2024-08-01 | Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao et.al. | 2408.00744 | link |
2024-08-01 | DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency | Jovan Stojkovic et.al. | 2408.00741 | null |
2024-08-01 | Virchow 2: Scaling Self-Supervised Mixed Magnification Models in Pathology | Eric Zimmermann et.al. | 2408.00738 | null |
2024-08-01 | Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions | Guangzhi Xiong et.al. | 2408.00727 | link |
2024-08-01 | An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models | Yangzhen Wu et.al. | 2408.00724 | null |
2024-08-01 | Pathway to Secure and Trustworthy 6G for LLMs: Attacks, Defense, and Opportunities | Sunder Ali Khowaja et.al. | 2408.00722 | null |
2024-08-01 | SAM 2: Segment Anything in Images and Videos | Nikhila Ravi et.al. | 2408.00714 | link |
2024-08-01 | Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM | Xiaofeng Liu et.al. | 2408.00706 | null |
2024-08-02 | Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning | Trapoom Ukarapol et.al. | 2408.00690 | link |
2024-08-01 | Can Developers Prompt? A Controlled Experiment for Code Documentation Generation | Hans-Alexander Kruse et.al. | 2408.00686 | null |
2024-08-01 | ExpertAF: Expert Actionable Feedback from Video | Kumar Ashutosh et.al. | 2408.00672 | null |
2024-08-01 | AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models | Daqin Luo et.al. | 2408.00665 | link |
2024-08-01 | Disentangling Dense Embeddings with Sparse Autoencoders | Charles O’Neill et.al. | 2408.00657 | null |
2024-08-02 | SentenceVAE: Faster, Longer and More Accurate Inference with Next-sentence Prediction for Large Language Models | Hongjun An et.al. | 2408.00655 | link |
2024-08-01 | Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning | Xuri Ge et.al. | 2408.00644 | null |
2024-07-31 | Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey | Atsuyuki Miyai et.al. | 2407.21794 | null |
2024-07-31 | Vision-Language Model Based Handwriting Verification | Mihir Chauhan et.al. | 2407.21788 | null |
2024-07-31 | Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Bradley Brown et.al. | 2407.21787 | link |
2024-07-31 | The Llama 3 Herd of Models | Abhimanyu Dubey et.al. | 2407.21783 | null |
2024-07-31 | Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Shi Liu et.al. | 2407.21771 | null |
2024-07-31 | MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts | Xi Victoria Lin et.al. | 2407.21770 | null |
2024-07-31 | ReplanVLM: Replanning Robotic Tasks with Visual Language Models | Aoran Mei et.al. | 2407.21762 | null |
2024-07-31 | Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin et.al. | 2407.21757 | link |
2024-07-31 | A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation | Mothilal Asokan et.al. | 2407.21739 | null |
2024-07-31 | Open-Vocabulary Audio-Visual Semantic Segmentation | Ruohao Guo et.al. | 2407.21721 | null |
2024-07-31 | Adaptive Retrieval-Augmented Generation for Conversational Systems | Xi Wang et.al. | 2407.21712 | null |
2024-07-31 | CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature | Stefan Langer et.al. | 2407.21708 | null |
2024-07-31 | TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang et.al. | 2407.21693 | link |
2024-07-31 | Synth-Empathy: Towards High-Quality Synthetic Empathy Data | Hao Liang et.al. | 2407.21669 | link |
2024-08-01 | Defending Jailbreak Attack in VLMs via Cross-modality Information Detector | Yue Xu et.al. | 2407.21659 | link |
2024-07-31 | MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Anurag Das et.al. | 2407.21654 | null |
2024-07-31 | Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo et.al. | 2407.21633 | link |
2024-07-31 | TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods | Gabriel Loiseau et.al. | 2407.21630 | link |
2024-07-31 | LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows | Lukas Teufelberger et.al. | 2407.21593 | null |
2024-07-31 | A Performance Study of LLM-Generated Code on Leetcode | Tristan Coignion et.al. | 2407.21579 | null |
2024-07-30 | ThinK: Thinner Key Cache by Query-Driven Pruning | Yuhui Xu et.al. | 2407.21018 | null |
2024-07-30 | CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Yuexi Du et.al. | 2407.21011 | link |
2024-07-30 | GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models | Ali Abdollahi et.al. | 2407.21001 | link |
2024-07-31 | MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning | Yupeng Chen et.al. | 2407.20999 | null |
2024-07-30 | From Feature Importance to Natural Language Explanations Using LLMs with RAG | Sule Tekkesinoglu et.al. | 2407.20990 | link |
2024-07-30 | Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks | Alakesh Kalita et.al. | 2407.20970 | null |
2024-07-30 | MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions | Xiaowei Chi et.al. | 2407.20962 | link |
2024-07-30 | UniProcessor: A Text-induced Unified Low-level Image Processor | Huiyu Duan et.al. | 2407.20928 | link |
2024-07-30 | SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition | Hao Tan et.al. | 2407.20920 | null |
2024-07-30 | Automated Review Generation Method Based on Large Language Models | Shican Wu et.al. | 2407.20906 | link |
2024-07-30 | Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Adam Wojciechowski et.al. | 2407.20899 | link |
2024-07-30 | ThinkRepair: Self-Directed Automated Program Repair | Xin Yin et.al. | 2407.20898 | link |
2024-07-30 | Effective Black Box Testing of Sentiment Analysis Classification Networks | Parsa Karbasizadeh et.al. | 2407.20884 | null |
2024-07-30 | Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification | Boyang Zhang et.al. | 2407.20859 | null |
2024-07-30 | Learn by Selling: Equipping Large Language Models with Product Knowledge for Context-Driven Recommendations | Sarthak Anand et.al. | 2407.20856 | null |
2024-07-30 | Large Language Model (LLM)-enabled Graphs in Dynamic Networking | Geng Sun et.al. | 2407.20840 | null |
2024-07-30 | How to Measure the Intelligence of Large Language Models? | Nils Körber et.al. | 2407.20828 | null |
2024-07-30 | Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Norman Di Palo et.al. | 2407.20798 | null |
2024-07-30 | Interpretable Pre-Trained Transformers for Heart Time-Series Data | Harry J. Davies et.al. | 2407.20775 | link |
2024-07-30 | OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance | Yongqiang Yao et.al. | 2407.20761 | link |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | null |
2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229 | null |
2024-07-29 | FlexAttention for Efficient High-Resolution Vision-Language Models | Junyan Li et.al. | 2407.20228 | null |
2024-07-29 | Can Editing LLMs Inject Harm? | Canyu Chen et.al. | 2407.20224 | null |
2024-07-29 | SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction | Çağhan Köksal et.al. | 2407.20214 | null |
2024-07-29 | QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval | Hongming Tan et.al. | 2407.20207 | null |
2024-07-29 | MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Zehui Chen et.al. | 2407.20183 | link |
2024-07-29 | Theia: Distilling Diverse Vision Foundation Models for Robot Learning | Jinghuan Shang et.al. | 2407.20179 | link |
2024-07-29 | AutoScale: Automatic Prediction of Compute-optimal Data Composition for Training LLMs | Feiyang Kang et.al. | 2407.20177 | link |
2024-07-29 | Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning | Xingchen Zeng et.al. | 2407.20174 | link |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
2024-07-29 | Language-Conditioned Offline RL for Multi-Robot Navigation | Steven Morad et.al. | 2407.20164 | null |
2024-07-29 | rLLM: Relational Table Learning with LLMs | Weichen Li et.al. | 2407.20157 | link |
2024-07-29 | ByteCheckpoint: A Unified Checkpointing System for LLM Development | Borui Wan et.al. | 2407.20143 | null |
2024-07-29 | Strong Copyright Protection for Language Models via Adaptive Model Fusion | Javier Abad et.al. | 2407.20105 | null |
2024-07-29 | Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models | Zhe Li et.al. | 2407.20053 | null |
2024-07-29 | Exploring Large Language Models to generate Easy to Read content | Paloma Martínez et.al. | 2407.20046 | null |
2024-07-29 | MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Walid Bousselham et.al. | 2407.20034 | null |
2024-07-29 | Efficient Training of Large Language Models on Distributed Infrastructures: A Survey | Jiangfei Duan et.al. | 2407.20018 | null |
2024-07-29 | Rosetta Statements: Lowering the Barrier for Semantic Parsing and Increasing the Cognitive Interoperability of Knowledge Graphs | Lars Vogt et.al. | 2407.20007 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-26 | A Flexible and Scalable Approach for Collecting Wildlife Advertisements on the Web | Juliana Barbosa et.al. | 2407.18898 | link |
2024-07-26 | Small Molecule Optimization with Large Language Models | Philipp Guevorguian et.al. | 2407.18897 | link |
2024-07-26 | Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models | Mutahar Safdar et.al. | 2407.18827 | null |
2024-07-26 | Automatic Detection of Moral Values in Music Lyrics | Vjosa Preniqi et.al. | 2407.18787 | link |
2024-07-26 | The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs | Aleix Sant et.al. | 2407.18786 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-07-26 | TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals | Kevin Kliimask et.al. | 2407.18764 | null |
2024-07-26 | Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery | Yuni Susanti et.al. | 2407.18752 | link |
2024-07-26 | Towards Effective and Efficient Continual Pre-training of Large Language Models | Jie Chen et.al. | 2407.18743 | null |
2024-07-26 | Towards Generalized Offensive Language Identification | Alphaeus Dmonte et.al. | 2407.18738 | null |
2024-07-26 | LLASP: Fine-tuning Large Language Models for Answer Set Programming | Erica Coppolillo et.al. | 2407.18723 | null |
2024-07-26 | Neurosymbolic AI for Enhancing Instructability in Generative AI | Amit Sheth et.al. | 2407.18722 | null |
2024-07-26 | Cluster-norm for Unsupervised Probing of Knowledge | Walter Laurito et.al. | 2407.18712 | link |
2024-07-26 | Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Esteban Garces Arias et.al. | 2407.18698 | link |
2024-07-26 | Collaborative Evolving Strategy for Automatic Data-Centric Development | Xu Yang et.al. | 2407.18690 | null |
2024-07-26 | The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European Languages | Alexandre Puttick et.al. | 2407.18689 | link |
2024-07-26 | Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift | Seongho Son et.al. | 2407.18676 | null |
2024-07-26 | Every Part Matters: Integrity Verification of Scientific Figures Based on Multimodal Large Language Models | Xiang Shi et.al. | 2407.18626 | link |
2024-07-25 | Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang et.al. | 2407.18248 | link |
2024-07-25 | LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Zhengbo Wang et.al. | 2407.18242 | link |
2024-07-26 | Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Yuxiao Qu et.al. | 2407.18219 | null |
2024-07-26 | Exploring Scaling Trends in LLM Robustness | Nikolaus Howe et.al. | 2407.18213 | link |
2024-07-25 | AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction | Chunan Liu et.al. | 2407.18184 | link |
2024-07-25 | Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning | Sindhura Kommu et.al. | 2407.18181 | null |
2024-07-25 | Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Sanae Lotfi et.al. | 2407.18158 | null |
2024-07-25 | $\mathbb{X}$ -Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs | Vlad Sobal et.al. | 2407.18134 | null |
2024-07-26 | Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Fakhraddin Alwajih et.al. | 2407.18129 | null |
2024-07-25 | Efficient Inference of Vision Instruction-Following Models with Elastic Cache | Zuyan Liu et.al. | 2407.18121 | link |
2024-07-25 | Multi-Resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping | Jack Breen et.al. | 2407.18105 | link |
2024-07-25 | Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow | Tian Guo et.al. | 2407.18103 | null |
2024-07-25 | PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization | Christopher Clarke et.al. | 2407.18078 | link |
2024-07-25 | C2P: Featuring Large Language Models with Causal Reasoning | Abdolmahdi Bagheri et.al. | 2407.18069 | null |
2024-07-25 | ComPeer: A Generative Conversational Agent for Proactive Peer Support | Tianjian Liu et.al. | 2407.18064 | link |
2024-07-25 | Audio Entailment: Assessing Deductive Reasoning for Audio Understanding | Soham Deshmukh et.al. | 2407.18062 | link |
2024-07-25 | Difficulty Estimation and Simplification of French Text Using LLMs | Henri Jamet et.al. | 2407.18061 | null |
2024-07-25 | The Geometry of Queries: Query-Based Innovations in Retrieval-Augmented Generation | Eric Yang et.al. | 2407.18044 | null |
2024-07-25 | RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models | Haoyu Chen et.al. | 2407.18035 | null |
2024-07-25 | GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy | Jan Batzner et.al. | 2407.18008 | null |
2024-07-24 | I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao et.al. | 2407.17469 | link |
2024-07-24 | WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Wenting Zhao et.al. | 2407.17468 | null |
2024-07-24 | CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu et.al. | 2407.17467 | null |
2024-07-24 | $VILA^2$ : VILA Augmented VILA | Yunhao Fang et.al. | 2407.17453 | null |
2024-07-24 | Fluent Student-Teacher Redteaming | T. Ben Thompson et.al. | 2407.17447 | link |
2024-07-24 | Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? | Michael-Andrei Panaitescu-Liess et.al. | 2407.17417 | null |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-24 | Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Yida Zhao et.al. | 2407.17406 | link |
2024-07-24 | Grammar-based Game Description Generation using Large Language Models | Tsunehiko Tanaka et.al. | 2407.17404 | link |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | PERSONA: A Reproducible Testbed for Pluralistic Alignment | Louis Castricato et.al. | 2407.17387 | null |
2024-07-24 | A Comprehensive Approach to Misspelling Correction with BERT and Levenshtein Distance | Amirreza Naziri et.al. | 2407.17383 | null |
2024-07-24 | MMRA: A Benchmark for Multi-granularity Multi-image Relational Association | Siwei Wu et.al. | 2407.17379 | link |
2024-07-24 | ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi et.al. | 2407.17365 | null |
2024-07-24 | Gradient-based inference of abstract task representations for generalization in neural networks | Ali Hummos et.al. | 2407.17356 | null |
2024-07-24 | Scalify: scale propagation for efficient low-precision LLM training | Paul Balança et.al. | 2407.17353 | link |
2024-07-24 | Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching | Yuyang Ding et.al. | 2407.17349 | link |
2024-07-24 | DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation | Qian Feng et.al. | 2407.17348 | null |
2024-07-24 | Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition | Ke Bao et.al. | 2407.17344 | null |
2024-07-24 | How Good (Or Bad) Are LLMs at Detecting Misleading Visualizations? | Leo Yu-Ho Lo et.al. | 2407.17291 | null |
2024-07-23 | PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Junyi Li et.al. | 2407.16696 | link |
2024-07-23 | Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack | Xiaoyue Xu et.al. | 2407.16695 | link |
2024-07-23 | Can Large Language Models Automatically Jailbreak GPT-4V? | Yuanwei Wu et.al. | 2407.16686 | null |
2024-07-23 | SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation | Pengfei Chen et.al. | 2407.16682 | null |
2024-07-23 | RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent | Huiyu Xu et.al. | 2407.16667 | null |
2024-07-23 | Course-Correction: Safety Alignment Using Synthetic Preferences | Rongwu Xu et.al. | 2407.16637 | link |
2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | null |
2024-07-23 | Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? | Jonathan Hayase et.al. | 2407.16607 | link |
2024-07-23 | Shared Imagination: LLMs Hallucinate Alike | Yilun Zhou et.al. | 2407.16604 | null |
2024-07-23 | A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions | Giorgos Lysandrou et.al. | 2407.16593 | null |
2024-07-23 | Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs | Yifan Xia et.al. | 2407.16576 | null |
2024-07-23 | TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback | Eunseop Yoon et.al. | 2407.16574 | link |
2024-07-23 | Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Ioana Buhnila et.al. | 2407.16565 | link |
2024-07-23 | Patched RTC: evaluating LLMs for diverse software development tasks | Asankhaya Sharma et.al. | 2407.16557 | link |
2024-07-24 | MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues | Liyun Zhang et.al. | 2407.16552 | null |
2024-07-23 | Quantifying the Role of Textual Predictability in Automatic Speech Recognition | Sean Robertson et.al. | 2407.16537 | null |
2024-07-23 | Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models | Aristeidis Panos et.al. | 2407.16526 | null |
2024-07-24 | AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game | Yizhou Chi et.al. | 2407.16521 | link |
2024-07-23 | Language-Based Security for Low-Level MPC | Christian Skalka et.al. | 2407.16504 | null |
2024-07-23 | Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Kenza Benkirane et.al. | 2407.16470 | link |
2024-07-22 | AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Junyu Xie et.al. | 2407.15850 | link |
2024-07-22 | LLMmap: Fingerprinting For Large Language Models | Dario Pasquini et.al. | 2407.15847 | link |
2024-07-22 | SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Mingze Xu et.al. | 2407.15841 | link |
2024-07-22 | MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Yangzhou Liu et.al. | 2407.15838 | link |
2024-07-22 | dMel: Speech Tokenization made Simple | He Bai et.al. | 2407.15835 | link |
2024-07-22 | J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling | Wataru Nakata et.al. | 2407.15828 | null |
2024-07-22 | Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight | Ziyuan Huang et.al. | 2407.15819 | null |
2024-07-22 | Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belem et.al. | 2407.15814 | link |
2024-07-22 | AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Yunkang Cao et.al. | 2407.15795 | link |
2024-07-22 | CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning | Emanuele Frascaroli et.al. | 2407.15793 | link |
2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | null |
2024-07-22 | Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels | Zhuorui Ye et.al. | 2407.15786 | null |
2024-07-22 | Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning | Kaiwen Wang et.al. | 2407.15762 | null |
2024-07-22 | MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation | Marco Simoni et.al. | 2407.15748 | null |
2024-07-22 | OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context | Steffen Kleinle et.al. | 2407.15736 | null |
2024-07-22 | TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON | John Chong Min Tan et.al. | 2407.15734 | link |
2024-07-22 | Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders | Laura Niss et.al. | 2407.15731 | null |
2024-07-22 | SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection | Dimitrios Kollias et.al. | 2407.15728 | null |
2024-07-22 | DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Zhi Hao Luo et.al. | 2407.15723 | link |
2024-07-22 | Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability | Zhuoyan Xu et.al. | 2407.15720 | link |
2024-07-19 | Internal Consistency and Self-Feedback in Large Language Models: A Survey | Xun Liang et.al. | 2407.14507 | link |
2024-07-19 | On Pre-training of Multimodal Language Models Customized for Chart Understanding | Wan-Cyuan Fan et.al. | 2407.14506 | null |
2024-07-19 | PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding | Chenshu Hou et.al. | 2407.14491 | null |
2024-07-19 | Evaluating the Reliability of Self-Explanations in Large Language Models | Korbinian Randl et.al. | 2407.14487 | link |
2024-07-19 | Data-Centric Human Preference Optimization with Rationales | Hoang Anh Just et.al. | 2407.14477 | link |
2024-07-19 | Contrastive Learning with Counterfactual Explanations for Radiology Report Generation | Mingjie Li et.al. | 2407.14474 | null |
2024-07-19 | Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira et.al. | 2407.14467 | null |
2024-07-19 | Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier | Zachary Wojtowicz et.al. | 2407.14452 | null |
2024-07-19 | Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding | Renshan Zhang et.al. | 2407.14439 | link |
2024-07-19 | Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders | Senthooran Rajamanoharan et.al. | 2407.14435 | null |
2024-07-19 | Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | HamidReza Imani et.al. | 2407.14417 | null |
2024-07-19 | System-1.x: Learning to Balance Fast and Slow Planning with Language Models | Swarnadeep Saha et.al. | 2407.14414 | link |
2024-07-19 | DEAL: Disentangle and Localize Concept-level Explanations for VLMs | Tang Li et.al. | 2407.14412 | link |
2024-07-19 | The Vision of Autonomic Computing: Can LLMs Make It a Reality? | Zhiyang Zhang et.al. | 2407.14402 | null |
2024-07-19 | Frontiers of Deep Learning: From Novel Application to Real-World Deployment | Rui Xie et.al. | 2407.14386 | null |
2024-07-19 | Open Artificial Knowledge | Vadim Borisov et.al. | 2407.14371 | null |
2024-07-19 | Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models | Xuenan Xu et.al. | 2407.14355 | link |
2024-07-19 | Improving Retrieval in Sponsored Search by Leveraging Query Context Signals | Akash Kumar Mohankumar et.al. | 2407.14346 | null |
2024-07-19 | LLMs left, right, and center: Assessing GPT’s capabilities to label political bias from web domains | Raphael Hernandes et.al. | 2407.14344 | null |
2024-07-19 | Multimodal Misinformation Detection using Large Vision-Language Models | Sahar Tahmasebi et.al. | 2407.14321 | null |
2024-07-18 | Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data | Charles Jin et.al. | 2407.13765 | null |
2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | null |
2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | null |
2024-07-18 | CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications | Mirza Masfiqur Rahman et.al. | 2407.13742 | null |
2024-07-18 | Baba Is AI: Break the Rules to Beat the Benchmark | Nathan Cloos et.al. | 2407.13729 | null |
2024-07-18 | CoDefeater: Using LLMs To Find Defeaters in Assurance Cases | Usman Gohar et.al. | 2407.13717 | link |
2024-07-18 | Understanding Reference Policies in Direct Preference Optimization | Yixin Liu et.al. | 2407.13709 | link |
2024-07-18 | A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice | Shaina Raza et.al. | 2407.13699 | link |
2024-07-18 | Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation | Yotam Perlitz et.al. | 2407.13696 | link |
2024-07-18 | Prover-Verifier Games improve legibility of LLM outputs | Jan Hendrik Kirchner et.al. | 2407.13692 | null |
2024-07-18 | Shaded Route Planning Using Active Segmentation and Identification of Satellite Images | Longchao Da et.al. | 2407.13689 | null |
2024-07-18 | FuLG: 150B Romanian Corpus for Language Model Pretraining | Vlad-Andrei Bădoiu et.al. | 2407.13657 | null |
2024-07-18 | COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization | Skyler Grandel et.al. | 2407.13648 | null |
2024-07-18 | Weak-to-Strong Reasoning | Yuqing Yang et.al. | 2407.13647 | link |
2024-07-18 | Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies | Chaofan Tao et.al. | 2407.13623 | link |
2024-07-18 | KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration | Youfu Yan et.al. | 2407.13598 | null |
2024-07-18 | PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks | Vishal Pallagani et.al. | 2407.13597 | null |
2024-07-18 | EarthMarker: A Visual Prompt Learning Framework for Region-level and Point-level Remote Sensing Imagery Comprehension | Wei Zhang et.al. | 2407.13596 | link |
2024-07-18 | Robust Calibration of Large Vision-Language Adapters | Balamurali Murugesan et.al. | 2407.13588 | link |
2024-07-18 | Towards Zero-Shot Multimodal Machine Translation | Matthieu Futeral et.al. | 2407.13579 | link |
2024-07-17 | LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | Kaichen Zhang et.al. | 2407.12772 | link |
2024-07-17 | EchoSight: Advancing Visual-Language Models with Wiki Knowledge | Yibin Yan et.al. | 2407.12735 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-17 | Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? | Ben Yao et.al. | 2407.12725 | null |
2024-07-17 | The Future of Learning: Large Language Models through the Lens of Students | He Zhang et.al. | 2407.12723 | null |
2024-07-17 | MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models | Leyang Shen et.al. | 2407.12709 | link |
2024-07-17 | Subgraph-Aware Training of Text-based Methods for Knowledge Graph Completion | Youmin Ko et.al. | 2407.12703 | link |
2024-07-17 | Patch-Level Training for Large Language Models | Chenze Shao et.al. | 2407.12665 | link |
2024-07-17 | Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon et.al. | 2407.12642 | null |
2024-07-17 | Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? | Aman Sinha et.al. | 2407.12626 | null |
2024-07-17 | Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences | Claudio Pinhanez et.al. | 2407.12620 | null |
2024-07-17 | AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism | William Brannon et.al. | 2407.12613 | link |
2024-07-17 | VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding | Ofir Abramovich et.al. | 2407.12594 | link |
2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
2024-07-17 | E5-V: Universal Embeddings with Multimodal Large Language Models | Ting Jiang et.al. | 2407.12580 | link |
2024-07-17 | Audio Conditioning for Music Generation via Discrete Bottleneck Features | Simon Rouard et.al. | 2407.12563 | null |
2024-07-17 | Conspiracy theories and where to find them on TikTok | Francesco Corso et.al. | 2407.12545 | null |
2024-07-17 | Abstraction Alignment: Comparing Model and Human Conceptual Relationships | Angie Boggust et.al. | 2407.12543 | link |
2024-07-17 | Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models | Xihe Qiu et.al. | 2407.12532 | null |
2024-07-17 | Crafting the Path: Robust Query Rewriting for Information Retrieval | Ingeol Baek et.al. | 2407.12529 | null |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-16 | NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Mo Li et.al. | 2407.11963 | link |
2024-07-16 | Code Documentation and Analysis to Secure Software Development | Paul Attie et.al. | 2407.11934 | null |
2024-07-16 | What’s Wrong? Refining Meeting Summaries with LLM Feedback | Frederic Kirstein et.al. | 2407.11919 | null |
2024-07-16 | GraphFM: A Scalable Framework for Multi-Graph Pretraining | Divyansha Lachi et.al. | 2407.11907 | null |
2024-07-16 | Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads | Aritra Dhar et.al. | 2407.11888 | null |
2024-07-16 | Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche et.al. | 2407.11854 | null |
2024-07-16 | Schema Matching with Large Language Models: an Experimental Study | Marcel Parciak et.al. | 2407.11852 | link |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text | Kyle Hamilton et.al. | 2407.11827 | null |
2024-07-16 | PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler et.al. | 2407.11798 | null |
2024-07-16 | Large Language Models as Misleading Assistants in Conversation | Betty Li Hou et.al. | 2407.11789 | null |
2024-07-16 | SwitchCIT: Switching for Continual Instruction Tuning of Large Language Models | Xinbo Wu et.al. | 2407.11780 | null |
2024-07-16 | Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text | Seyedeh Fatemeh Ebrahimi et.al. | 2407.11774 | null |
2024-07-16 | Educational Personalized Learning Path Planning with Large Language Models | Chee Ng et.al. | 2407.11773 | null |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-16 | Robust Utility-Preserving Text Anonymization Based on Large Language Models | Tianyu Yang et.al. | 2407.11770 | link |
2024-07-16 | Vectoring Languages | Joseph Chen et.al. | 2407.11766 | null |
2024-07-16 | Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Kamran Chitsaz et.al. | 2407.11722 | link |
2024-07-17 | Harnessing Large Language Models for Multimodal Product Bundling | Xiaohao Liu et.al. | 2407.11712 | link |
2024-07-15 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation | Bocheng Zou et.al. | 2407.10972 | link |
2024-07-15 | Q-Sparse: All Large Language Models can be Fully Sparsely-Activated | Hongyu Wang et.al. | 2407.10969 | null |
2024-07-15 | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Han Guo et.al. | 2407.10960 | link |
2024-07-15 | Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? | Ruisheng Cao et.al. | 2407.10956 | link |
2024-07-15 | MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Chengguang Gan et.al. | 2407.10953 | null |
2024-07-15 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Yaoting Wang et.al. | 2407.10947 | link |
2024-07-15 | Learning from Naturally Occurring Feedback | Shachar Don-Yehiya et.al. | 2407.10944 | link |
2024-07-15 | GRUtopia: Dream General Robots in a City at Scale | Hanqing Wang et.al. | 2407.10943 | link |
2024-07-15 | Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together | Dilara Soylu et.al. | 2407.10930 | null |
2024-07-15 | Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak et.al. | 2407.10920 | null |
2024-07-15 | FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets | Xiaohui Victor Li et.al. | 2407.10909 | link |
2024-07-15 | Hey, That’s My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique | Mark Russinovich et.al. | 2407.10887 | null |
2024-07-15 | SLIP: Securing LLMs IP Using Weights Decomposition | Yehonathan Refael et.al. | 2407.10886 | null |
2024-07-15 | Understanding the Importance of Evolutionary Search in Automated Heuristic Design with Large Language Models | Rui Zhang et.al. | 2407.10873 | null |
2024-07-15 | GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM | Keshav Bimbraw et.al. | 2407.10870 | null |
2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | null |
2024-07-15 | Weighted Grouped Query Attention in Transformers | Sai Sena Chinnakonduru et.al. | 2407.10855 | null |
2024-07-15 | An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Dylan Bouchard et.al. | 2407.10853 | link |
2024-07-15 | MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs | Quang H. Nguyen et.al. | 2407.10834 | link |
2024-07-15 | BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy | Tim Menzner et.al. | 2407.10829 | null |
2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | null |
2024-07-12 | Human-like Episodic Memory for Infinite Context LLMs | Zafeirios Fountas et.al. | 2407.09450 | link |
2024-07-12 | ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts | Amelia F. Hardy et.al. | 2407.09447 | link |
2024-07-12 | MUSCLE: A Model Update Strategy for Compatible LLM Evolution | Jessica Echterhoff et.al. | 2407.09435 | null |
2024-07-12 | A Perspective on Foundation Models for the Electric Power Grid | Hendrik F. Hamann et.al. | 2407.09434 | null |
2024-07-12 | Open (Clinical) LLMs are Sensitive to Instruction Phrasings | Alberto Mario Ceballos Arroyo et.al. | 2407.09429 | link |
2024-07-12 | TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | Hang Zou et.al. | 2407.09424 | null |
2024-07-12 | Mitigating Entity-Level Hallucination in Large Language Models | Weihang Su et.al. | 2407.09417 | link |
2024-07-12 | SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers | Shraman Pramanick et.al. | 2407.09413 | link |
2024-07-12 | Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce | Zhe Lin et.al. | 2407.09395 | null |
2024-07-12 | PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Saber Zerhoudi et.al. | 2407.09394 | link |
2024-07-12 | GAVEL: Generating Games Via Evolution and Language Models | Graham Todd et.al. | 2407.09388 | link |
2024-07-12 | Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text | Lucio La Cava et.al. | 2407.09364 | null |
2024-07-12 | Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses | Marios Constantinides et.al. | 2407.09322 | link |
2024-07-12 | Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis | Nikolay Babakov et.al. | 2407.09311 | null |
2024-07-12 | Transformer Layers as Painters | Qi Sun et.al. | 2407.09298 | link |
2024-07-12 | Security Matrix for Multimodal Agents on Mobile Devices: A Systematic and Proof of Concept Study | Yulong Yang et.al. | 2407.09295 | null |
2024-07-12 | CEIPA: Counterfactual Explainable Incremental Prompt Attack Analysis on Large Language Models | Dong Shu et.al. | 2407.09292 | null |
2024-07-12 | Structuring Authenticity Assessments on Historical Documents using LLMs | Andrea Schimmenti et.al. | 2407.09290 | null |
2024-07-12 | WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation | Robin Schön et.al. | 2407.09288 | link |
2024-07-11 | MAVIS: Mathematical Visual Instruction Tuning | Renrui Zhang et.al. | 2407.08739 | link |
2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735 | null |
2024-07-11 | Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Zihao Zhou et.al. | 2407.08733 | null |
2024-07-11 | A Taxonomy for Data Contamination in Large Language Models | Medha Palavalli et.al. | 2407.08716 | null |
2024-07-11 | GTA: A Benchmark for General Tool Agents | Jize Wang et.al. | 2407.08713 | link |
2024-07-11 | eyeballvul: a future-proof benchmark for vulnerability detection in the wild | Timothee Chauvin et.al. | 2407.08708 | link |
2024-07-11 | Extracting Training Data from Document-Based VQA Models | Francesco Pinto et.al. | 2407.08707 | null |
2024-07-11 | HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models | Runhui Huang et.al. | 2407.08706 | null |
2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null |
2024-07-11 | Mitigating Catastrophic Forgetting in Language Transfer via Model Merging | Anton Alexandrov et.al. | 2407.08699 | null |
2024-07-11 | Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight | Zhiqiang Xie et.al. | 2407.08694 | null |
2024-07-11 | Robotic Control via Embodied Chain-of-Thought Reasoning | Zawalski Michał et.al. | 2407.08693 | null |
2024-07-11 | SEED-Story: Multimodal Long Story Generation with Large Language Model | Shuai Yang et.al. | 2407.08683 | link |
2024-07-11 | NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning | Yi Zhang et.al. | 2407.08672 | null |
2024-07-11 | Uncertainty Estimation of Large Language Models in Medical Question Answering | Jiaxin Wu et.al. | 2407.08662 | null |
2024-07-11 | Towards Building Specialized Generalist AI with System 1 and System 2 Fusion | Kaiyan Zhang et.al. | 2407.08642 | null |
2024-07-11 | $β$-DPO: Direct Preference Optimization with Dynamic $β$ | Junkang Wu et.al. | 2407.08639 | link |
2024-07-11 | RoboMorph: Evolving Robot Morphology using Large Language Models | Kevin Qiu et.al. | 2407.08626 | null |
2024-07-11 | Tamil Language Computing: the Present and the Future | Kengatharaiyer Sarveswaran et.al. | 2407.08618 | null |
2024-07-11 | FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision | Jay Shah et.al. | 2407.08608 | link |
2024-07-10 | Training on the Test Task Confounds Evaluation and Emergence | Ricardo Dominguez-Olmedo et.al. | 2407.07890 | link |
2024-07-10 | Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization | Junkang Wu et.al. | 2407.07880 | link |
2024-07-11 | Toto: Time Series Optimized Transformer for Observability | Ben Cohen et.al. | 2407.07874 | null |
2024-07-10 | FACTS About Building Retrieval Augmented Generation-based Chatbots | Rama Akkiraju et.al. | 2407.07858 | null |
2024-07-10 | OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Sami Jaghouar et.al. | 2407.07852 | link |
2024-07-10 | Natural Language Mechanisms via Self-Resolution with Foundation Models | Nicolas Della Penna et.al. | 2407.07845 | null |
2024-07-10 | Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective | Shengjia Chen et.al. | 2407.07841 | link |
2024-07-10 | Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang et.al. | 2407.07840 | null |
2024-07-10 | Transformer Alignment in Large Language Models | Murdock Aubry et.al. | 2407.07810 | null |
2024-07-11 | AVCap: Leveraging Audio-Visual Features as Text Tokens for Captioning | Jongsuk Kim et.al. | 2407.07801 | link |
2024-07-10 | Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann et.al. | 2407.07799 | link |
2024-07-11 | Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard | Oguzhan Topsakal et.al. | 2407.07796 | link |
2024-07-10 | Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Tianjie Ju et.al. | 2407.07791 | link |
2024-07-10 | WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment | Jiefu Ou et.al. | 2407.07778 | null |
2024-07-10 | Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs | Hao-Tien Lewis Chiang et.al. | 2407.07775 | null |
2024-07-10 | Can ChatGPT Pass a Theory of Computing Course? | Matei A. Golesteanu et.al. | 2407.07757 | null |
2024-07-10 | Fine-Tuning Large Language Models with User-Level Differential Privacy | Zachary Charles et.al. | 2407.07737 | null |
2024-07-10 | PaliGemma: A versatile 3B VLM for transfer | Lucas Beyer et.al. | 2407.07726 | link |
2024-07-10 | Why should we ever automate moral decision making? | Vincent Conitzer et.al. | 2407.07671 | null |
2024-07-10 | A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability | Ting Fang Tan et.al. | 2407.07666 | null |
2024-07-09 | AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jiaxi Cui et.al. | 2407.07094 | link |
2024-07-09 | FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Liqun Ma et.al. | 2407.07093 | link |
2024-07-09 | CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen et.al. | 2407.07087 | link |
2024-07-09 | Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models | Logan Cross et.al. | 2407.07086 | link |
2024-07-09 | Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities | Shaltiel Shmidman et.al. | 2407.07080 | null |
2024-07-09 | Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang et.al. | 2407.07071 | link |
2024-07-09 | Prompting Techniques for Secure Code Generation: A Systematic Investigation | Catherine Tony et.al. | 2407.07064 | null |
2024-07-10 | Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Weize Chen et.al. | 2407.07061 | link |
2024-07-10 | Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang et.al. | 2407.07053 | link |
2024-07-09 | ProtoSAM – One Shot Medical Image Segmentation With Foundational Models | Lev Ayzenberg et.al. | 2407.07042 | link |
2024-07-09 | Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models | Yue Zhang et.al. | 2407.07035 | link |
2024-07-09 | Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization | Jeongseok Hyun et.al. | 2407.07024 | link |
2024-07-09 | Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies | Inwon Kang et.al. | 2407.07019 | null |
2024-07-09 | End-To-End Causal Effect Estimation from Unstructured Natural Language Data | Nikita Dhawan et.al. | 2407.07018 | null |
2024-07-09 | Is Large Language Model All You Need to Predict the Synthesizability and Precursors of Crystal Structures? | Zhilong Song et.al. | 2407.07016 | null |
2024-07-09 | Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning | J. Crosbie et.al. | 2407.07011 | null |
2024-07-09 | Metron: Holistic Performance Evaluation Framework for LLM Inference Systems | Amey Agrawal et.al. | 2407.07000 | link |
2024-07-09 | Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective | Yu-An Liu et.al. | 2407.06992 | link |
2024-07-09 | Segment-Based Interactive Machine Translation for Pre-trained Models | Angel Navarro et.al. | 2407.06990 | null |
2024-07-09 | Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Yi-Cheng Lin et.al. | 2407.06957 | link |
2024-07-08 | Multi-Object Hallucination in Vision-Language Models | Xuweiyi Chen et.al. | 2407.06192 | link |
2024-07-08 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188 | null |
2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187 | null |
2024-07-08 | Vision-Language Models under Cultural and Inclusive Considerations | Antonia Karamolegkou et.al. | 2407.06177 | null |
2024-07-08 | On Speeding Up Language Model Evaluation | Jin Peng Zhou et.al. | 2407.06172 | null |
2024-07-08 | What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study | Shihan Dou et.al. | 2407.06153 | null |
2024-07-08 | Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks | Lukas Netz et.al. | 2407.06146 | null |
2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | link |
2024-07-08 | Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization | Hannah K. Bako et.al. | 2407.06129 | link |
2024-07-08 | Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities | Avinash Anand et.al. | 2407.06125 | null |
2024-07-08 | Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning | Yadong Zhang et.al. | 2407.06112 | null |
2024-07-08 | Artificial Intuition: Efficient Classification of Scientific Abstracts | Harsh Sakhrani et.al. | 2407.06093 | null |
2024-07-08 | Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models | Jinliang Lu et.al. | 2407.06089 | null |
2024-07-08 | From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty | Maor Ivgi et.al. | 2407.06071 | link |
2024-07-08 | Variational Best-of-N Alignment | Afra Amini et.al. | 2407.06057 | null |
2024-07-08 | MST5 – Multilingual Question Answering over Knowledge Graphs | Nikit Srivastava et.al. | 2407.06041 | link |
2024-07-08 | PAS: Data-Efficient Plug-and-Play Prompt Augmentation System | Miao Zheng et.al. | 2407.06027 | null |
2024-07-08 | iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement | Aoyu Pang et.al. | 2407.06025 | link |
2024-07-05 | Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Rudolf Laine et.al. | 2407.04694 | link |
2024-07-05 | ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Yuzhe Gu et.al. | 2407.04693 | link |
2024-07-05 | Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Yuanze Lin et.al. | 2407.04681 | null |
2024-07-05 | Lost in Translation: The Algorithmic Gap Between LMs and the Brain | Tommaso Tosato et.al. | 2407.04680 | null |
2024-07-05 | Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Ye Bai et.al. | 2407.04675 | null |
2024-07-05 | Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Yongji Wu et.al. | 2407.04656 | null |
2024-07-05 | Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models | Bolaji Yusuf et.al. | 2407.04641 | null |
2024-07-05 | Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework | Reza Averly et.al. | 2407.04629 | null |
2024-07-05 | On scalable oversight with weak LLMs judging strong LLMs | Zachary Kenton et.al. | 2407.04622 | null |
2024-07-05 | CountGD: Multi-Modal Open-World Counting | Niki Amini-Naieni et.al. | 2407.04619 | null |
2024-07-05 | ARM: Efficient Guided Decoding with Autoregressive Reward Models | Sergey Troshin et.al. | 2407.04615 | null |
2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | link |
2024-07-05 | Written Term Detection Improves Spoken Term Detection | Bolaji Yusuf et.al. | 2407.04601 | link |
2024-07-05 | Testing learning hypotheses using neural networks by manipulating learning data | Cara Su-Yi Leong et.al. | 2407.04593 | null |
2024-07-05 | Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | Shumaila Javaid et.al. | 2407.04581 | null |
2024-07-05 | VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models | Hang Gao et.al. | 2407.04573 | null |
2024-07-05 | Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition | Aditya K Surikuchi et.al. | 2407.04559 | link |
2024-07-05 | Spontaneous Reward Hacking in Iterative Self-Refinement | Jane Pan et.al. | 2407.04549 | null |
2024-07-05 | PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts | Ana-Cristina Rogoz et.al. | 2407.04541 | link |
2024-07-05 | GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek et.al. | 2407.04528 | null |
2024-07-03 | Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Max Zuo et.al. | 2407.03321 | link |
2024-07-03 | InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | Pan Zhang et.al. | 2407.03320 | link |
2024-07-03 | BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations | Zhantao Yang et.al. | 2407.03314 | null |
2024-07-03 | Universal Length Generalization with Turing Programs | Kaiying Hou et.al. | 2407.03310 | null |
2024-07-03 | Large Language Models for JSON Schema Discovery | Michael J. Mior et.al. | 2407.03286 | null |
2024-07-03 | LLM Internal States Reveal Hallucination Risk Faced With a Query | Ziwei Ji et.al. | 2407.03282 | link |
2024-07-03 | STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data | Kheir Eddine Daouadi et.al. | 2407.03253 | null |
2024-07-03 | Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen et.al. | 2407.03227 | null |
2024-07-03 | How Does Quantization Affect Multilingual LLMs? | Kelly Marchisio et.al. | 2407.03211 | null |
2024-07-03 | TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang et.al. | 2407.03203 | link |
2024-07-03 | Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models | Haritz Puerto et.al. | 2407.03181 | link |
2024-07-03 | Investigating Decoder-only Large Language Models for Speech-to-text Translation | Chao-Wei Huang et.al. | 2407.03169 | null |
2024-07-03 | SOS! Soft Prompt Attack Against Open-Source Large Language Models | Ziqing Yang et.al. | 2407.03160 | null |
2024-07-03 | Let the Code LLM Edit Itself When You Edit the Code | Zhenyu He et.al. | 2407.03157 | null |
2024-07-03 | Reinforcement Learning for Sequence Design Leveraging Protein Language Models | Jithendaraa Subramanian et.al. | 2407.03154 | null |
2024-07-03 | Enhancing Translation Accuracy of Large Language Models through Continual Pre-Training on Parallel Data | Minato Kondo et.al. | 2407.03145 | null |
2024-07-03 | Social Bias Evaluation for Large Language Models Requires Prompt Variations | Rem Hida et.al. | 2407.03129 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-03 | Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory | Suyeon Lee et.al. | 2407.03103 | link |
2024-07-03 | ScreenTK: Seamless Detection of Time-Killing Moments Using Continuous Mobile Screen Text Monitoring | Le Fang et.al. | 2407.03063 | null |
2024-07-02 | MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Huiqiang Jiang et.al. | 2407.02490 | link |
2024-07-02 | Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Ali Safaya et.al. | 2407.02486 | link |
2024-07-02 | RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Yue Yu et.al. | 2407.02485 | null |
2024-07-02 | MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Binxu Li et.al. | 2407.02483 | link |
2024-07-02 | Understanding Alignment in Multimodal LLMs: A Comprehensive Study | Elmira Amirloo et.al. | 2407.02477 | null |
2024-07-02 | Open Scene Graphs for Open World Object-Goal Navigation | Joel Loo et.al. | 2407.02473 | null |
2024-07-02 | ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions | Chan Young Park et.al. | 2407.02472 | link |
2024-07-02 | Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I | Harrie Oosterhuis et.al. | 2407.02464 | null |
2024-07-02 | Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets | Kheir Eddine Daouadi et.al. | 2407.02448 | null |
2024-07-03 | Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs | Jinmin Li et.al. | 2407.02411 | null |
2024-07-02 | CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models | Song Wang et.al. | 2407.02408 | null |
2024-07-02 | Assessing the Code Clone Detection Capability of Large Language Models | Zixian Zhang et.al. | 2407.02402 | null |
2024-07-02 | Learning to Refine with Fine-Grained Natural Language Feedback | Manya Wadhwa et.al. | 2407.02397 | link |
2024-07-02 | Is Your AI-Generated Code Really Secure? Evaluating Large Language Models on Secure Code Generation with CodeSecEval | Jiexin Wang et.al. | 2407.02395 | null |
2024-07-02 | TokenPacker: Efficient Visual Projector for Multimodal LLM | Wentong Li et.al. | 2407.02392 | link |
2024-07-02 | Talking to Machines: do you read me? | Lina M. Rojas-Barahona et.al. | 2407.02354 | null |
2024-07-02 | Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu et.al. | 2407.02352 | null |
2024-07-02 | Generative Large Language Models in Automated Fact-Checking: A Survey | Ivan Vykopal et.al. | 2407.02351 | null |
2024-07-02 | Conceptual Codebook Learning for Vision-Language Models | Yi Zhang et.al. | 2407.02350 | null |
2024-07-02 | MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang et.al. | 2407.02345 | null |
2024-06-28 | Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs | Sukmin Yun et.al. | 2406.20098 | link |
2024-06-28 | LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Xiang Li et.al. | 2406.20095 | link |
2024-06-28 | Scaling Synthetic Data Creation with 1,000,000,000 Personas | Xin Chan et.al. | 2406.20094 | link |
2024-06-28 | LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression | Jieneng Chen et.al. | 2406.20092 | link |
2024-06-28 | ProgressGym: Alignment with a Millennium of Moral Progress | Tianyi Qiu et.al. | 2406.20087 | link |
2024-06-28 | Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language | Yicheng Chen et.al. | 2406.20085 | null |
2024-06-28 | Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification | Anisha Gunjal et.al. | 2406.20079 | link |
2024-06-28 | EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Yuxuan Zhang et.al. | 2406.20076 | link |
2024-06-28 | To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard et.al. | 2406.20054 | null |
2024-06-28 | Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation | Danny Halawi et.al. | 2406.20053 | null |
2024-07-02 | BMW Agents – A Framework For Task Automation Through Multi-Agent Collaboration | Noel Crawford et.al. | 2406.20041 | null |
2024-06-28 | BioMNER: A Dataset for Biomedical Method Entity Recognition | Chen Tang et.al. | 2406.20038 | null |
2024-06-28 | LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang et.al. | 2406.20030 | null |
2024-06-28 | ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang et.al. | 2406.20015 | link |
2024-06-28 | The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models | Xinyi Chen et.al. | 2406.19999 | link |
2024-06-28 | Single Parent Family: A Spectrum of Family Members from a Single Pre-Trained Foundation Model | Habib Hajimolahoseini et.al. | 2406.19995 | null |
2024-06-28 | ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting | Rui Pan et.al. | 2406.19976 | null |
2024-06-28 | STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical | Guohao Sun et.al. | 2406.19973 | link |
2024-06-28 | Into the Unknown: Generating Geospatial Descriptions for New Environments | Tzuf Paz-Argaman et.al. | 2406.19967 | null |
2024-06-28 | Simulating Financial Market via Large Language Model based Agents | Shen Gao et.al. | 2406.19966 | null |
2024-06-27 | ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos | Jr-Jen Chen et.al. | 2406.19392 | link |
2024-06-27 | The Remarkable Robustness of LLMs: Stages of Inference? | Vedang Lad et.al. | 2406.19384 | link |
2024-06-27 | The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models | Xiliang Zhu et.al. | 2406.19358 | null |
2024-06-27 | DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez et.al. | 2406.19356 | link |
2024-06-27 | Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? | Peter Hase et.al. | 2406.19354 | null |
2024-06-27 | IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language | Lucky Susanto et.al. | 2406.19349 | null |
2024-06-27 | Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari et.al. | 2406.19317 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Zheyang Xiong et.al. | 2406.19292 | link |
2024-06-27 | PhysioLLM: Supporting Personalized Health Insights with Wearables and Large Language Models | Cathy Mengying Fang et.al. | 2406.19283 | null |
2024-06-27 | HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen et.al. | 2406.19280 | link |
2024-06-27 | VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation | Yixiao Song et.al. | 2406.19276 | link |
2024-06-27 | AutoPureData: Automated Filtering of Web Data for LLM Fine-tuning | Praneeth Vadlapati et.al. | 2406.19271 | link |
2024-06-27 | Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan et.al. | 2406.19263 | link |
2024-06-27 | Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment | Hao Fei et.al. | 2406.19255 | null |
2024-06-27 | AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation | Jia Fu et.al. | 2406.19251 | null |
2024-06-27 | Revealing Fine-Grained Values and Opinions in Large Language Models | Dustin Wright et.al. | 2406.19238 | link |
2024-06-28 | FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Shubhankar Singh et.al. | 2406.19237 | null |
2024-06-27 | Seeing Is Believing: Black-Box Membership Inference Attacks Against Retrieval Augmented Generation | Yuying Li et.al. | 2406.19234 | null |
2024-06-28 | RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva et.al. | 2406.19232 | link |
2024-06-26 | Towards Compositionality in Concept Learning | Adam Stein et.al. | 2406.18534 | link |
2024-06-26 | Symbolic Learning Enables Self-Evolving Agents | Wangchunshu Zhou et.al. | 2406.18532 | link |
2024-06-26 | PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter et.al. | 2406.18528 | link |
2024-06-26 | CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs | Zirui Wang et.al. | 2406.18521 | link |
2024-06-26 | “Is ChatGPT a Better Explainer than My Professor?”: Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline | Grace Li et.al. | 2406.18512 | null |
2024-06-26 | WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models | Liwei Jiang et.al. | 2406.18510 | link |
2024-06-26 | Mental Modeling of Reinforcement Learning Agents by Language Models | Wenhao Lu et.al. | 2406.18505 | null |
2024-06-26 | Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming | Zhenghao Zhou et.al. | 2406.18501 | null |
2024-06-26 | Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation | Ahmed Njifenjou et.al. | 2406.18460 | null |
2024-06-26 | Cascading Large Language Models for Salient Event Graph Generation | Xingwei Tan et.al. | 2406.18449 | link |
2024-06-26 | New intelligent empowerment for digital transformation | Peng Yifeng et.al. | 2406.18440 | null |
2024-06-26 | IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons | Dan Shi et.al. | 2406.18406 | link |
2024-06-26 | Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers | Yibo Jiang et.al. | 2406.18400 | null |
2024-06-26 | Adversarial Search Engine Optimization for Large Language Models | Fredrik Nestaas et.al. | 2406.18382 | null |
2024-06-26 | MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization | Haolang Lu et.al. | 2406.18379 | null |
2024-06-26 | Themis: Towards Flexible and Interpretable NLG Evaluation | Xinyu Hu et.al. | 2406.18365 | link |
2024-06-26 | AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations | Adam Dahlgren Lindström et.al. | 2406.18346 | null |
2024-06-26 | PDFA Distillation via String Probability Queries {PDFA Distillation via String Probability Queries} | Robert Baumgartner et.al. | 2406.18328 | link |
2024-06-26 | PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models | Huixuan Zhang et.al. | 2406.18326 | null |
2024-06-26 | MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data | Meng Fang et.al. | 2406.18321 | null |
2024-06-25 | MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Xiangyu Zhao et.al. | 2406.17770 | link |
2024-06-25 | EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data | Jesse Zhang et.al. | 2406.17768 | null |
2024-06-25 | BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning | Ercong Nie et.al. | 2406.17764 | null |
2024-06-25 | CaLMQA: Exploring culturally specific long-form question answering across 23 languages | Shane Arora et.al. | 2406.17761 | link |
2024-06-25 | Accelerating Clinical Evidence Synthesis with Large Language Models | Zifeng Wang et.al. | 2406.17755 | null |
2024-06-25 | Measuring and Benchmarking Large Language Models’ Capabilities to Generate Persuasive Language | Amalie Brogaard Pauli et.al. | 2406.17753 | null |
2024-06-25 | Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon | USVSN Sai Prashanth et.al. | 2406.17746 | link |
2024-06-25 | Point-SAM: Promptable 3D Segmentation Model for Point Clouds | Yuchen Zhou et.al. | 2406.17741 | link |
2024-06-25 | Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model | Fei Xia et.al. | 2406.17739 | null |
2024-06-25 | LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users | Elinor Poole-Dayan et.al. | 2406.17737 | null |
2024-06-25 | FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model | Feijie Wu et.al. | 2406.17706 | link |
2024-06-25 | From Distributional to Overton Pluralism: Investigating Large Language Model Alignment | Thom Lake et.al. | 2406.17692 | link |
2024-06-26 | VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Kun Qian et.al. | 2406.17681 | link |
2024-06-25 | Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models | Yuan Li et.al. | 2406.17675 | null |
2024-06-25 | LaTable: Towards Large Tabular Models | Boris van Breugel et.al. | 2406.17673 | null |
2024-06-25 | LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic | Aditya Kalyanpur et.al. | 2406.17663 | null |
2024-06-25 | Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed et.al. | 2406.17660 | link |
2024-06-25 | DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning | Xiaohan Zhang et.al. | 2406.17659 | null |
2024-06-25 | Leveraging Large Language Models for Software Model Completion: Results from Industrial and Public Datasets | Christof Tinnes et.al. | 2406.17651 | link |
2024-06-25 | Variationist: Exploring Multifaceted Variation and Bias in Written Language Data | Alan Ramponi et.al. | 2406.17647 | link |
2024-06-24 | Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Shengbang Tong et.al. | 2406.16860 | link |
2024-06-24 | EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li et.al. | 2406.16858 | link |
2024-06-24 | Long Context Transfer from Language to Vision | Peiyuan Zhang et.al. | 2406.16852 | link |
2024-06-24 | Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts | Aditya Sharma et.al. | 2406.16851 | null |
2024-06-24 | RaTEScore: A Metric for Radiology Report Generation | Weike Zhao et.al. | 2406.16845 | link |
2024-06-24 | From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models | Sean Welleck et.al. | 2406.16838 | null |
2024-06-24 | USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$ onversations | Mounika Marreddy et.al. | 2406.16833 | null |
2024-06-24 | Understanding and Mitigating Tokenization Bias in Language Models | Buu Phan et.al. | 2406.16829 | null |
2024-06-24 | Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track | Ronak Pradeep et.al. | 2406.16828 | link |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-06-24 | RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Beck LaBash et.al. | 2406.16801 | link |
2024-06-25 | Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs | Ashwinee Panda et.al. | 2406.16797 | link |
2024-06-24 | Adam-mini: Use Fewer Learning Rates To Gain More | Yushun Zhang et.al. | 2406.16793 | link |
2024-06-24 | M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models | Rishabh Maheshwary et.al. | 2406.16783 | null |
2024-06-24 | It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension | Sagi Shaier et.al. | 2406.16779 | null |
2024-06-24 | Finding Transformer Circuits with Edge Pruning | Adithya Bhaskar et.al. | 2406.16778 | link |
2024-06-24 | Blending LLMs into Cascaded Speech Translation: KIT’s Offline Speech Translation System for IWSLT 2024 | Sai Koneru et.al. | 2406.16777 | null |
2024-06-24 | WARP: On the Benefits of Weight Averaged Rewarded Policies | Alexandre Ramé et.al. | 2406.16768 | null |
2024-06-24 | The GPT-WritingPrompts Dataset: A Comparative Analysis of Character Portrayal in Short Stories | Xi Yu Huang et.al. | 2406.16767 | link |
2024-06-24 | Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi et.al. | 2406.16758 | link |
2024-06-21 | GenoTEX: A Benchmark for Evaluating LLM-Based Exploration of Gene Expression Data in Alignment with Bioinformaticians | Haoyang Liu et.al. | 2406.15341 | link |
2024-06-21 | Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance | Haoling Li et.al. | 2406.15330 | null |
2024-06-21 | Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks | Hokyung Lee et.al. | 2406.15325 | link |
2024-06-21 | Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model | Doyoung Kim et.al. | 2406.15275 | link |
2024-06-21 | Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics | Weijia Zhang et.al. | 2406.15264 | null |
2024-06-21 | Unsupervised Morphological Tree Tokenizer | Qingyang Zhu et.al. | 2406.15245 | null |
2024-06-21 | Large Batch Analysis for Adagrad Under Anisotropic Smoothness | Yuxing Liu et.al. | 2406.15244 | null |
2024-06-21 | Detecting Synthetic Lyrics with Few-Shot Inference | Yanis Labrak et.al. | 2406.15231 | null |
2024-06-21 | A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation | Irune Zubiaga et.al. | 2406.15227 | link |
2024-06-21 | Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar et.al. | 2406.15214 | null |
2024-06-21 | Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding | Mohan Li et.al. | 2406.15209 | null |
2024-06-21 | Exploring the Efficacy of Robotic Assistants with ChatGPT and Claude in Enhancing ADHD Therapy: Innovating Treatment Paradigms | Santiago Berrezueta-Guzman et.al. | 2406.15198 | null |
2024-06-21 | UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis | Yulong Hui et.al. | 2406.15187 | link |
2024-06-21 | Hybrid Alignment Training for Large Language Models | Chenglong Wang et.al. | 2406.15178 | link |
2024-06-21 | EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot | Hao Fei et.al. | 2406.15177 | link |
2024-06-21 | Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss | Wei He et.al. | 2406.15175 | null |
2024-06-21 | Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d’historiens | Mathieu Chartier et.al. | 2406.15173 | null |
2024-06-21 | Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks | Victor Hugo Nascimento Rocha et.al. | 2406.15130 | link |
2024-06-21 | Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network | Badr AlKhamissi et.al. | 2406.15109 | link |
2024-06-21 | PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts et.al. | 2406.15053 | null |
2024-06-20 | Model Merging and Safety Alignment: One Bad Model Spoils the Bunch | Hasan Abed Al Kader Hammoud et.al. | 2406.14563 | null |
2024-06-20 | Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon et.al. | 2406.14562 | null |
2024-06-20 | How to Compute the Probability of a Word | Tiago Pimentel et.al. | 2406.14561 | link |
2024-06-21 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | Shilong Li et.al. | 2406.14550 | null |
2024-06-20 | Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models | Sunny Duan et.al. | 2406.14549 | null |
2024-06-20 | Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data | Johannes Treutlein et.al. | 2406.14546 | link |
2024-06-20 | Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems | Đorđe Klisura et.al. | 2406.14545 | null |
2024-06-20 | Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs | Yuxuan Qiao et.al. | 2406.14544 | link |
2024-06-21 | Are LLMs Naturally Good at Synthetic Tabular Data Generation? | Shengzhe Xu et.al. | 2406.14541 | link |
2024-06-20 | PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang et.al. | 2406.14517 | link |
2024-06-20 | MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding | Xinyu Fang et.al. | 2406.14515 | link |
2024-06-20 | Evidence of a log scaling law for political persuasion with large language models | Kobi Hackenburg et.al. | 2406.14508 | link |
2024-06-20 | Overview of the CAIL 2023 Argument Mining Track | Jingcong Liang et.al. | 2406.14503 | null |
2024-06-20 | Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary | Xingmeng Zhao et.al. | 2406.14500 | null |
2024-06-20 | LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors | Sheikh Asif Imran et.al. | 2406.14498 | link |
2024-06-20 | CodeRAG-Bench: Can Retrieval Augment Code Generation? | Zora Zhiruo Wang et.al. | 2406.14497 | link |
2024-06-20 | African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle et.al. | 2406.14496 | link |
2024-06-20 | Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle et.al. | 2406.14492 | null |
2024-06-20 | Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng et.al. | 2406.14491 | link |
2024-06-18 | DrVideo: Document Retrieval Based Long Video Understanding | Ziyu Ma et.al. | 2406.12846 | null |
2024-06-18 | Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts | Haoxiang Wang et.al. | 2406.12845 | link |
2024-06-18 | Synergizing Foundation Models and Federated Learning: A Survey | Shenghui Li et.al. | 2406.12844 | null |
2024-06-18 | GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation | Ci-Siang Lin et.al. | 2406.12834 | null |
2024-06-18 | LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Seyedarmin Azizi et.al. | 2406.12832 | link |
2024-06-18 | What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri et.al. | 2406.12830 | link |
2024-06-18 | From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries | Hitesh Wadhwa et.al. | 2406.12824 | null |
2024-06-18 | Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen et.al. | 2406.12822 | null |
2024-06-18 | Adversarial Attacks on Multimodal Agents | Chen Henry Wu et.al. | 2406.12814 | link |
2024-06-18 | Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang et.al. | 2406.12809 | link |
2024-06-18 | Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents | Zehao Wang et.al. | 2406.12806 | null |
2024-06-18 | Supporting Human Raters with the Detection of Harmful Content using Large Language Models | Kurt Thomas et.al. | 2406.12800 | null |
2024-06-18 | ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Team GLM et.al. | 2406.12793 | link |
2024-06-18 | In-Context Learning of Energy Functions | Rylan Schaeffer et.al. | 2406.12785 | null |
2024-06-18 | UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions | Xunzhi Wang et.al. | 2406.12784 | link |
2024-06-18 | Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran et.al. | 2406.12775 | link |
2024-06-18 | Towards Exact Gradient-based Training on Analog In-memory Computing | Zhaoxian Wu et.al. | 2406.12774 | null |
2024-06-18 | GFM4MPM: Towards Geospatial Foundation Models for Mineral Prospectivity Mapping | Angel Daruna et.al. | 2406.12756 | null |
2024-06-18 | OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI | Zhen Huang et.al. | 2406.12753 | link |
2024-06-18 | Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning | Bingchen Zhao et.al. | 2406.12742 | link |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang et.al. | 2406.11839 | link |
2024-06-17 | MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs | Ziyu Liu et.al. | 2406.11833 | link |
2024-06-17 | Unveiling Encoder-Free Vision-Language Models | Haiwen Diao et.al. | 2406.11832 | link |
2024-06-17 | Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Bingqi Ma et.al. | 2406.11831 | null |
2024-06-17 | Language Modeling with Editable External Knowledge | Belinda Z. Li et.al. | 2406.11830 | link |
2024-06-17 | WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou et.al. | 2406.11827 | link |
2024-06-17 | On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim et.al. | 2406.11823 | link |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-17 | Embodied Instruction Following in Unknown Environments | Zhenyu Wu et.al. | 2406.11818 | null |
2024-06-17 | Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level | Jie Liu et.al. | 2406.11817 | null |
2024-06-17 | VideoLLM-online: Online Video Large Language Model for Streaming Video | Joya Chen et.al. | 2406.11816 | null |
2024-06-17 | How Do Large Language Models Acquire Factual Knowledge During Pretraining? | Hoyeon Chang et.al. | 2406.11813 | link |
2024-06-17 | RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Joao Monteiro et.al. | 2406.11811 | link |
2024-06-17 | Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra et.al. | 2406.11801 | link |
2024-06-17 | DataComp-LM: In search of the next generation of training sets for language models | Jeffrey Li et.al. | 2406.11794 | null |
2024-06-17 | CELL your Model: Contrastive Explanation Methods for Large Language Models | Ronny Luss et.al. | 2406.11785 | null |
2024-06-17 | Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs | Swanand Ravindra Kadhe et.al. | 2406.11780 | null |
2024-06-17 | Improving Multi-Agent Debate with Sparse Communication Topology | Yunxuan Li et.al. | 2406.11776 | null |
2024-06-17 | Task Me Anything | Jieyu Zhang et.al. | 2406.11775 | link |
2024-06-14 | Quantifying Variance in Evaluation Benchmarks | Lovish Madaan et.al. | 2406.10229 | null |
2024-06-14 | EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Julian Straub et.al. | 2406.10224 | link |
2024-06-14 | Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding | Ridouane Ghermi et.al. | 2406.10221 | link |
2024-06-14 | Semantic Membership Inference Attack against Large Language Models | Hamid Mozaffari et.al. | 2406.10218 | null |
2024-06-14 | Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Rui Yang et.al. | 2406.10216 | link |
2024-06-14 | DevBench: A multimodal developmental benchmark for language learning | Alvin Wei Ming Tan et.al. | 2406.10215 | link |
2024-06-14 | Be like a Goldfish, Don’t Memorize! Mitigating Memorization in Generative LLMs | Abhimanyu Hans et.al. | 2406.10209 | link |
2024-06-14 | A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan et.al. | 2406.10203 | link |
2024-06-14 | TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Tomas de la Rosa et.al. | 2406.10196 | null |
2024-06-14 | Detecting and Evaluating Medical Hallucinations in Large Vision Language Models | Jiawei Chen et.al. | 2406.10185 | null |
2024-06-14 | Practical offloading for fine-tuning LLM on commodity GPU via learned subspace projectors | Siyuan Chen et.al. | 2406.10181 | link |
2024-06-14 | Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation | Mohamad Elzohbi et.al. | 2406.10174 | link |
2024-06-14 | IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce | Wenxuan Ding et.al. | 2406.10173 | link |
2024-06-14 | Datasets for Multilingual Answer Sentence Selection | Matteo Gabburo et.al. | 2406.10172 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models | Carson Denison et.al. | 2406.10162 | link |
2024-06-14 | RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model | Hantao Zhou et.al. | 2406.10157 | null |
2024-06-14 | BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack | Yuri Kuratov et.al. | 2406.10149 | link |
2024-06-14 | Evaluation of Large Language Models: STEM education and Gender Stereotypes | Smilla Due et.al. | 2406.10133 | null |
2024-06-14 | The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models | Yan Liu et.al. | 2406.10130 | link |
2024-06-13 | VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Muhammad Maaz et.al. | 2406.09418 | link |
2024-06-13 | Explore the Limits of Omni-modal Pretraining at Scale | Yiyuan Zhang et.al. | 2406.09412 | link |
2024-06-13 | 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Roman Bachmann et.al. | 2406.09406 | null |
2024-06-13 | Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Yushi Hu et.al. | 2406.09403 | null |
2024-06-13 | OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation | Junke Wang et.al. | 2406.09399 | link |
2024-06-13 | Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms | Miaosen Zhang et.al. | 2406.09397 | null |
2024-06-13 | Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA | Jongwoo Park et.al. | 2406.09396 | link |
2024-06-13 | Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Youngtaek Oh et.al. | 2406.09388 | link |
2024-06-13 | Towards Vision-Language Geo-Foundation Model: A Survey | Yue Zhou et.al. | 2406.09385 | link |
2024-06-13 | Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models | Lukas Thede et.al. | 2406.09384 | null |
2024-06-13 | Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs | Zijia Zhao et.al. | 2406.09367 | link |
2024-06-13 | ElicitationGPT: Text Elicitation Mechanisms via Language Models | Yifan Wu et.al. | 2406.09363 | null |
2024-06-13 | Enhancing Domain Adaptation through Prompt Gradient Alignment | Hoang Phan et.al. | 2406.09353 | link |
2024-06-13 | Separations in the Representational Capabilities of Transformers and Recurrent Architectures | Satwik Bhattamishra et.al. | 2406.09347 | null |
2024-06-13 | DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | Suwon Shon et.al. | 2406.09345 | null |
2024-06-13 | ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | David Anugraha et.al. | 2406.09334 | link |
2024-06-13 | REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space | Tomer Ashuach et.al. | 2406.09325 | null |
2024-06-13 | Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Zhao Xu et.al. | 2406.09324 | link |
2024-06-13 | JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models | Delong Ran et.al. | 2406.09321 | link |
2024-06-13 | Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases | Meng Wang et.al. | 2406.09317 | link |
2024-06-12 | What If We Recaption Billions of Web Images with LLaMA-3? | Xianhang Li et.al. | 2406.08478 | null |
2024-06-12 | Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens | Ting-Ji Huang et.al. | 2406.08477 | null |
2024-06-12 | Real2Code: Reconstruct Articulated Objects via Code Generation | Zhao Mandi et.al. | 2406.08474 | null |
2024-06-12 | PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences | Daiwei Chen et.al. | 2406.08469 | link |
2024-06-12 | Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing | Zhangchen Xu et.al. | 2406.08464 | link |
2024-06-12 | AToM-Bot: Embodied Fulfillment of Unspoken Human Needs with Affective Theory of Mind | Wei Ding et.al. | 2406.08455 | null |
2024-06-12 | OLMES: A Standard for Language Model Evaluations | Yuling Gu et.al. | 2406.08446 | null |
2024-06-12 | SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models | Chun Yin et.al. | 2406.08445 | null |
2024-06-12 | TasTe: Teaching Large Language Models to Translate through Self-Reflection | Yutong Wang et.al. | 2406.08434 | link |
2024-06-12 | Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | Zijin Hong et.al. | 2406.08426 | null |
2024-06-12 | OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text | Qingyun Li et.al. | 2406.08418 | link |
2024-06-12 | Discovering Preference Optimization Algorithms with and for Large Language Models | Chris Lu et.al. | 2406.08414 | link |
2024-06-12 | Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference | Christopher Wolters et.al. | 2406.08413 | null |
2024-06-13 | MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos | Xuehai He et.al. | 2406.08407 | link |
2024-06-12 | Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models | Chun-Yi Kuan et.al. | 2406.08402 | link |
2024-06-12 | cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers | Anirudh Sundar et.al. | 2406.08398 | null |
2024-06-12 | VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jiannan Wu et.al. | 2406.08394 | link |
2024-06-12 | Large Language Models Must Be Taught to Know What They Don’t Know | Sanyam Kapoor et.al. | 2406.08391 | link |
2024-06-12 | Banal Deception Human-AI Ecosystems: A Study of People’s Perceptions of LLM-generated Deceptive Behaviour | Xiao Zhan et.al. | 2406.08386 | null |
2024-06-13 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Weizhao He et.al. | 2406.08372 | null |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545 | link |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522 | link |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515 | null |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension – Technical Report | KBTG Labs et.al. | 2406.07505 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502 | link |
2024-06-11 | TextGrad: Automatic “Differentiation” via Text | Mert Yuksekgonul et.al. | 2406.07496 | link |
2024-06-12 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492 | null |
2024-06-11 | PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction | Adnan Abbas et.al. | 2406.07485 | null |
2024-06-11 | Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | Mao Li et.al. | 2406.07483 | null |
2024-06-11 | VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs | Zesen Cheng et.al. | 2406.07476 | link |
2024-06-11 | Anomaly Detection on Unstable Logs with GPT Models | Fatemeh Hadadi et.al. | 2406.07467 | null |
2024-06-11 | Estimating the Hallucination Rate of Generative AI | Andrew Jesson et.al. | 2406.07457 | null |
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455 | null |
2024-06-11 | On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations | Shiao Meng et.al. | 2406.07444 | link |
2024-06-11 | McEval: Massively Multilingual Code Evaluation | Linzheng Chai et.al. | 2406.07436 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | Shivani Upadhyay et.al. | 2406.06519 | link |
2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512 | null |
2024-06-10 | NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative | Asmar Nadeem et.al. | 2406.06499 | null |
2024-06-10 | Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation | Oishi Banerjee et.al. | 2406.06496 | null |
2024-06-10 | Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang et.al. | 2406.06485 | null |
2024-06-10 | Parallelizing Linear Transformers with the Delta Rule over Sequence Length | Songlin Yang et.al. | 2406.06484 | link |
2024-06-10 | Towards a Personal Health Large Language Model | Justin Cosentino et.al. | 2406.06474 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465 | null |
2024-06-10 | Transforming Wearable Data into Health Insights using Large Language Model Agents | Mike A. Merrill et.al. | 2406.06464 | null |
2024-06-10 | VCR: Visual Caption Restoration | Tianyu Zhang et.al. | 2406.06462 | link |
2024-06-11 | Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang et.al. | 2406.06461 | null |
2024-06-10 | Evaluating the Retrieval Component in LLM-Based Question Answering Systems | Ashkan Alinejad et.al. | 2406.06458 | null |
2024-06-10 | A Large Language Model Pipeline for Breast Cancer Oncology | Tristen Pool et.al. | 2406.06455 | null |
2024-06-10 | Insights from Social Shaping Theory: The Appropriation of Large Language Models in an Undergraduate Programming Course | Aadarsh Padiyath et.al. | 2406.06451 | null |
2024-06-10 | LLM Dataset Inference: Did you train on my dataset? | Pratyush Maini et.al. | 2406.06443 | link |
2024-06-10 | Interpretability of Language Models via Task Spaces | Lucas Weber et.al. | 2406.06441 | null |
2024-06-10 | Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain | Brian Hu et.al. | 2406.06435 | link |
2024-06-10 | Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking | Gabriel Rioux et.al. | 2406.06425 | null |
2024-06-10 | An Empirical Design Justice Approach to Identifying Ethical Considerations in the Intersection of Large Language Models and Social Robotics | Alva Markelius et.al. | 2406.06400 | null |
2024-06-07 | 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs | Jianing Yang et.al. | 2406.05132 | link |
2024-06-07 | An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Xiongtao Zhou et.al. | 2406.05130 | link |
2024-06-07 | Towards Semantic Equivalence of Tokenization in Multimodal LLM | Shengqiong Wu et.al. | 2406.05127 | null |
2024-06-07 | Large Generative Graph Models | Yu Wang et.al. | 2406.05109 | null |
2024-06-07 | LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration | Tavor Lipman et.al. | 2406.05107 | null |
2024-06-07 | Corpus Poisoning via Approximate Greedy Gradient Descent | Jinyan Su et.al. | 2406.05087 | link |
2024-06-07 | Multi-Head RAG: Solving Multi-Aspect Problems with LLMs | Maciej Besta et.al. | 2406.05085 | link |
2024-06-07 | SUMIE: A Synthetic Benchmark for Incremental Entity Summarization | Eunjeong Hwang et.al. | 2406.05079 | null |
2024-06-07 | Are Large Language Models More Empathetic than Humans? | Anuradha Welivita et.al. | 2406.05063 | null |
2024-06-07 | Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions | Shi-Yu Tian et.al. | 2406.05055 | null |
2024-06-07 | Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation | Nachiket Kotalwar et.al. | 2406.05053 | null |
2024-06-07 | Bootstrapping Referring Multi-Object Tracking | Yani Zhang et.al. | 2406.05039 | link |
2024-06-07 | Scenarios and Approaches for Situated Natural Language Explanations | Pengshuo Qiu et.al. | 2406.05035 | null |
2024-06-07 | CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo et.al. | 2406.05013 | link |
2024-06-07 | Compositional Generalization with Grounded Language Models | Sondre Wold et.al. | 2406.04989 | link |
2024-06-07 | Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences | Patrick Haller et.al. | 2406.04988 | link |
2024-06-07 | MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jitai Hao et.al. | 2406.04984 | link |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | Quantifying Geospatial in the Common Crawl Corpus | Ilya Ilyankou et.al. | 2406.04952 | null |
2024-06-07 | BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Baktash Ansari et.al. | 2406.04947 | link |
2024-06-06 | Verbalized Machine Learning: Revisiting Machine Learning with Language Models | Tim Z. Xiao et.al. | 2406.04344 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343 | link |
2024-06-06 | Learning 1D Causal Visual Representation with De-focus Attention Networks | Chenxin Tao et.al. | 2406.04342 | link |
2024-06-06 | RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation | Jiaming Liu et.al. | 2406.04339 | null |
2024-06-06 | Coherent Zero-Shot Visual Instruction Generation | Quynh Phung et.al. | 2406.04337 | null |
2024-06-06 | DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs | Lingchen Meng et.al. | 2406.04334 | null |
2024-06-06 | PaCE: Parsimonious Concept Engineering for Large Language Models | Jinqi Luo et.al. | 2406.04331 | link |
2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | link |
2024-06-06 | Simplified and Generalized Masked Diffusion for Discrete Data | Jiaxin Shi et.al. | 2406.04329 | link |
2024-06-06 | Causal Estimation of Memorisation Profiles | Pietro Lesci et.al. | 2406.04327 | link |
2024-06-06 | ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | Lin Chen et.al. | 2406.04325 | null |
2024-06-06 | Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step | Zhanhao Liang et.al. | 2406.04314 | link |
2024-06-06 | Improving Alignment and Robustness with Short Circuiting | Andy Zou et.al. | 2406.04313 | link |
2024-06-06 | Semantically Diverse Language Generation for Uncertainty Estimation in Language Models | Lukas Aichberger et.al. | 2406.04306 | link |
2024-06-06 | Quixer: A Quantum Transformer Model | Nikhil Khatri et.al. | 2406.04305 | null |
2024-06-06 | Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models | Phat Nguyen et.al. | 2406.04300 | null |
2024-06-06 | VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval | Junjie Zhou et.al. | 2406.04292 | link |
2024-06-06 | Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation | Adam Fisch et.al. | 2406.04291 | null |
2024-06-07 | What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Nadav Borenstein et.al. | 2406.04289 | null |
2024-06-06 | Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People | Dun-Ming Huang et.al. | 2406.04278 | link |
2024-06-05 | Wings: Learning Multimodal LLMs without Text-only Forgetting | Yi-Kai Zhang et.al. | 2406.03496 | null |
2024-06-06 | Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training | Ao Sun et.al. | 2406.03488 | link |
2024-06-05 | Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Sanjana Ramprasad et.al. | 2406.03487 | null |
2024-06-05 | BIPED: Pedagogically Informed Tutoring System for ESL Education | Soonwoo Kwon et.al. | 2406.03486 | null |
2024-06-05 | Does your data spark joy? Performance gains from domain upsampling at the end of training | Cody Blakeney et.al. | 2406.03476 | null |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-05 | What is the Best Way for ChatGPT to Translate Poetry? | Shanshan Wang et.al. | 2406.03450 | null |
2024-06-05 | Pre-trained Large Language Models Use Fourier Features to Compute Addition | Tianyi Zhou et.al. | 2406.03445 | null |
2024-06-05 | Are language models rational? The case of coherence norms and belief revision | Thomas Hofweber et.al. | 2406.03442 | null |
2024-06-05 | Cycles of Thought: Measuring LLM Confidence through Stable Explanations | Evan Becker et.al. | 2406.03441 | null |
2024-06-05 | Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Moein Heidari et.al. | 2406.03430 | link |
2024-06-05 | Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee et.al. | 2406.03411 | link |
2024-06-05 | Automating Turkish Educational Quiz Generation Using Large Language Models | Kamyar Zeinalipour et.al. | 2406.03397 | link |
2024-06-05 | Log Parsing with Self-Generated In-Context Learning and Self-Correction | Yifan Wu et.al. | 2406.03376 | null |
2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | null |
2024-06-05 | CLMASP: Coupling Large Language Models with Answer Set Programming for Robotic Task Planning | Xinrui Lin et.al. | 2406.03367 | null |
2024-06-05 | LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Timon Ziegenbein et.al. | 2406.03363 | null |
2024-06-05 | Save It for the “Hot” Day: An LLM-Empowered Visual Analytics System for Heat Risk Management | Haobo Li et.al. | 2406.03317 | null |
2024-06-05 | The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games | Mikhail Mozikov et.al. | 2406.03299 | null |
2024-06-05 | SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms | Xingrun Xing et.al. | 2406.03287 | link |
2024-06-04 | Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks | Tianyu He et.al. | 2406.02550 | link |
2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | link |
2024-06-04 | Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Alex Jinpeng Wang et.al. | 2406.02547 | link |
2024-06-04 | To Believe or Not to Believe Your LLM | Yasin Abbasi Yadkori et.al. | 2406.02543 | null |
2024-06-04 | Loki: Low-Rank Keys for Efficient Sparse Attention | Prajwal Singhania et.al. | 2406.02542 | link |
2024-06-04 | Parrot: Multilingual Visual Instruction Tuning | Hai-Long Sun et.al. | 2406.02539 | link |
2024-06-04 | TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li et.al. | 2406.02537 | link |
2024-06-04 | Mitigate Position Bias in Large Language Models via Scaling a Single Dimension | Yijiong Yu et.al. | 2406.02536 | link |
2024-06-04 | SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices | Ruslan Svirschevski et.al. | 2406.02532 | link |
2024-06-04 | Scalable MatMul-free Language Modeling | Rui-Jie Zhu et.al. | 2406.02528 | link |
2024-06-04 | CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks | Maciej Besta et.al. | 2406.02524 | link |
2024-06-04 | RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots | Soroush Nasiriany et.al. | 2406.02523 | null |
2024-06-04 | Demystifying the Compression of Mixture-of-Experts Through a Unified Framework | Shwai He et.al. | 2406.02500 | link |
2024-06-04 | Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion | Jakub Hoscilowicz et.al. | 2406.02481 | link |
2024-06-04 | Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding | Zhihan Zhang et.al. | 2406.02472 | link |
2024-06-04 | Meta-Designing Quantum Experiments with Language Models | Sören Arlt et.al. | 2406.02470 | null |
2024-06-04 | Seed-TTS: A Family of High-Quality Versatile Speech Generation Models | Philip Anastassiou et.al. | 2406.02430 | link |
2024-06-04 | Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion | Ruiqi Li et.al. | 2406.02429 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-04 | Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data | Maxime Griot et.al. | 2406.02394 | link |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Code Pretraining Improves Entity Tracking Abilities of Language Models | Najoung Kim et.al. | 2405.21068 | null |
2024-05-31 | Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality | Tri Dao et.al. | 2405.21060 | link |
2024-05-31 | RydbergGPT | David Fitzek et.al. | 2405.21052 | link |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Grammar-Aligned Decoding | Kanghee Park et.al. | 2405.21047 | null |
2024-05-31 | Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF | Tengyang Xie et.al. | 2405.21046 | null |
2024-05-31 | Direct Alignment of Language Models via Quality-Aware Self-Refinement | Runsheng Yu et.al. | 2405.21040 | null |
2024-05-31 | Standards for Belief Representations in LLMs | Daniel A. Herrmann et.al. | 2405.21030 | null |
2024-05-31 | LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | Elias Stengel-Eskin et.al. | 2405.21028 | link |
2024-05-31 | You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | Zhen Qin et.al. | 2405.21022 | null |
2024-05-31 | Improved Techniques for Optimization-Based Jailbreaking on Large Language Models | Xiaojun Jia et.al. | 2405.21018 | link |
2024-06-04 | StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond | Pengyuan Lyu et.al. | 2405.21013 | null |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Linli Yao et.al. | 2405.20985 | link |
2024-05-31 | Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Feiteng Fang et.al. | 2405.20978 | link |
2024-05-31 | SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu et.al. | 2405.20974 | link |
2024-05-31 | LCQ: Low-Rank Codebook based Quantization for Large Language Models | Wen-Pu Cai et.al. | 2405.20973 | null |
2024-06-03 | Large Language Models are Zero-Shot Next Location Predictors | Ciro Beneduce et.al. | 2405.20962 | link |
2024-06-03 | A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs’ Humour Alignment with Comedians | Piotr Wojciech Mirowski et.al. | 2405.20956 | null |
2024-05-30 | MotionLLM: Understanding Human Behaviors from Human Motions and Videos | Ling-Hao Chen et.al. | 2405.20340 | link |
2024-05-30 | Visual Perception by Large Language Model’s Weights | Feipeng Ma et.al. | 2405.20339 | link |
2024-05-30 | Xwin-LM: Strong and Scalable Alignment Practice for LLMs | Bolin Ni et.al. | 2405.20335 | link |
2024-05-31 | ParSEL: Parameterized Shape Editing with Language | Aditya Ganeshan et.al. | 2405.20319 | null |
2024-05-30 | CausalQuest: Collecting Natural Causal Questions for AI Agents | Roberto Ceraolo et.al. | 2405.20318 | link |
2024-05-30 | ANAH: Analytical Annotation of Hallucinations in Large Language Models | Ziwei Ji et.al. | 2405.20315 | link |
2024-05-30 | Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | Guillaume Huguet et.al. | 2405.20313 | link |
2024-05-30 | Large Language Models Can Self-Improve At Web Agent Tasks | Ajay Patel et.al. | 2405.20309 | link |
2024-05-30 | Can’t make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models | Himangi Mittal et.al. | 2405.20305 | null |
2024-05-30 | Group Robust Preference Optimization in Reward-free RLHF | Shyam Sundhar Ramesh et.al. | 2405.20304 | link |
2024-05-30 | Who Writes the Review, Human or AI? | Panagiotis C. Theocharopoulos et.al. | 2405.20285 | null |
2024-05-30 | ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections | Massimo Bini et.al. | 2405.20271 | link |
2024-05-30 | Evaluating Large Language Model Biases in Persona-Steered Generation | Andy Liu et.al. | 2405.20253 | link |
2024-05-30 | Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization | Yuchi Liu et.al. | 2405.20252 | link |
2024-05-30 | Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use | Franz Louis Cesista et.al. | 2405.20245 | null |
2024-05-30 | Context Injection Attacks on Large Language Models | Cheng’an Wei et.al. | 2405.20234 | null |
2024-05-30 | Data-efficient fine-tuning of foundational models for first-principles quality sublimation enthalpies | Harveen Kaur et.al. | 2405.20217 | null |
2024-05-30 | TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models | Chen Zhang et.al. | 2405.20215 | null |
2024-05-30 | One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments | Ke Yi et.al. | 2405.20202 | null |
2024-05-31 | Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations | Zilin Ma et.al. | 2405.20195 | null |
2024-05-29 | X-VILA: Cross-Modality Alignment for Large Language Model | Hanrong Ye et.al. | 2405.19335 | null |
2024-05-29 | LLMs Meet Multimodal Generation and Editing: A Survey | Yingqing He et.al. | 2405.19334 | link |
2024-05-29 | Multi-Modal Generative Embedding Model | Feipeng Ma et.al. | 2405.19333 | null |
2024-05-29 | Self-Exploring Language Models: Active Preference Elicitation for Online Alignment | Shenao Zhang et.al. | 2405.19332 | link |
2024-05-29 | Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation | Atrisha Sarkar et.al. | 2405.19328 | null |
2024-05-29 | MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | Ge Zhang et.al. | 2405.19327 | link |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Nearest Neighbor Speculative Decoding for LLM Generation and Attribution | Minghan Li et.al. | 2405.19325 | null |
2024-05-29 | Are Large Language Models Chameleons? | Mingmeng Geng et.al. | 2405.19323 | null |
2024-05-29 | Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF | Shicong Cen et.al. | 2405.19320 | null |
2024-05-29 | Robust Preference Optimization through Reward Model Distillation | Adam Fisch et.al. | 2405.19316 | null |
2024-05-29 | Matryoshka Query Transformer for Large Vision-Language Models | Wenbo Hu et.al. | 2405.19315 | link |
2024-05-29 | Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice | Jian-Qiao Zhu et.al. | 2405.19313 | null |
2024-05-29 | Expert-Guided Extinction of Toxic Tokens for Debiased Generation | Xueyao Sun et.al. | 2405.19299 | null |
2024-05-29 | MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection | Michael Regan et.al. | 2405.19285 | null |
2024-05-29 | Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform | Viviane Potocnik et.al. | 2405.19284 | null |
2024-05-29 | Programmable Motion Generation for Open-Set Motion Control Tasks | Hanchao Liu et.al. | 2405.19283 | null |
2024-05-29 | PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications | Dingkang Yang et.al. | 2405.19266 | link |
2024-05-29 | AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data | Zifan Song et.al. | 2405.19265 | link |
2024-05-29 | Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | Zhanhui Zhou et.al. | 2405.19262 | link |
2024-05-28 | Why are Visually-Grounded Language Models Bad at Image Classification? | Yuhui Zhang et.al. | 2405.18415 | link |
2024-05-28 | Don’t Forget to Connect! Improving RAG with Graph-based Reranking | Jialin Dong et.al. | 2405.18414 | null |
2024-05-28 | WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization | Jiawei Ma et.al. | 2405.18405 | null |
2024-05-29 | Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | Ethan Shen et.al. | 2405.18400 | link |
2024-05-28 | Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning | Yixiao Zhang et.al. | 2405.18386 | link |
2024-05-28 | OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning | Pengxiang Li et.al. | 2405.18380 | link |
2024-05-28 | LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models | Anthony Sarah et.al. | 2405.18377 | null |
2024-05-28 | Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning | Dongjie Chen et.al. | 2405.18376 | link |
2024-05-28 | Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning | Phakphum Artkaew et.al. | 2405.18375 | link |
2024-05-28 | PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework | Eshaan Agarwal et.al. | 2405.18369 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs | Somnath Kumar et.al. | 2405.18359 | null |
2024-05-28 | MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning | Somnath Kumar et.al. | 2405.18358 | null |
2024-05-28 | Faithful Logical Reasoning via Symbolic Chain-of-Thought | Jundong Xu et.al. | 2405.18357 | link |
2024-05-28 | Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography | Jie Liu et.al. | 2405.18356 | link |
2024-05-28 | Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation | Anjanava Biswas et.al. | 2405.18346 | null |
2024-05-28 | The Battle of LLMs: A Comparative Study in Conversational QA Tasks | Aryan Rangapur et.al. | 2405.18344 | null |
2024-05-28 | Frustratingly Easy Test-Time Adaptation of Vision-Language Models | Matteo Farina et.al. | 2405.18330 | link |
2024-05-28 | Multi-modal Generation via Cross-Modal In-Context Learning | Amandeep Kumar et.al. | 2405.18304 | link |
2024-05-28 | Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning | Renzhi Wang et.al. | 2405.18292 | null |
2024-05-27 | Matryoshka Multimodal Models | Mu Cai et.al. | 2405.17430 | null |
2024-05-27 | NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models | Chankyu Lee et.al. | 2405.17428 | null |
2024-05-27 | Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | Kuan-Chih Huang et.al. | 2405.17427 | link |
2024-05-27 | LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | Zhuoling Li et.al. | 2405.17424 | null |
2024-05-27 | Privacy-Aware Visual Language Models | Laurens Samson et.al. | 2405.17423 | null |
2024-05-27 | Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation | Jiaming Liu et.al. | 2405.17418 | null |
2024-05-27 | THREAD: Thinking Deeper with Recursive Spawning | Philip Schroeder et.al. | 2405.17402 | link |
2024-05-27 | The Expressive Capacity of State Space Models: A Formal Language Perspective | Yash Sarrof et.al. | 2405.17394 | null |
2024-05-27 | MindMerger: Efficient Boosting LLM Reasoning in non-English Languages | Zixian Huang et.al. | 2405.17386 | link |
2024-05-27 | Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective | Zhen Qin et.al. | 2405.17383 | null |
2024-05-27 | ReMoDetect: Reward Models Recognize Aligned LLM’s Generations | Hyunseok Lee et.al. | 2405.17382 | link |
2024-05-27 | Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | Zhen Qin et.al. | 2405.17381 | link |
2024-05-27 | RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects | Ahmed Allam et.al. | 2405.17378 | link |
2024-05-28 | Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models | ShengYun Peng et.al. | 2405.17374 | link |
2024-05-27 | Prompt Optimization with Human Feedback | Xiaoqiang Lin et.al. | 2405.17346 | link |
2024-05-27 | Exploring and steering the moral compass of Large Language Models | Alejandro Tlaie et.al. | 2405.17345 | link |
2024-05-27 | Cost-efficient Knowledge-based Question Answering with Large Language Models | Junnan Dong et.al. | 2405.17337 | null |
2024-05-27 | XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser | Xianfu Cheng et.al. | 2405.17336 | link |
2024-05-27 | FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation | Yuting Ma et.al. | 2405.17267 | null |
2024-05-27 | On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao et.al. | 2405.17264 | link |
2024-05-24 | Scaling Laws for Discriminative Classification in Large Language Models | Dean Wyatte et.al. | 2405.15765 | null |
2024-05-24 | Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence | Abhinav Patil et.al. | 2405.15750 | link |
2024-05-24 | Sparse maximal update parameterization: A holistic approach to sparse training dynamics | Nolan Dey et.al. | 2405.15743 | link |
2024-05-24 | Large Language Models Reflect Human Citation Patterns with a Heightened Citation Bias | Andres Algaba et.al. | 2405.15739 | link |
2024-05-24 | LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | Boyang Zheng et.al. | 2405.15734 | link |
2024-05-24 | Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks | Jerome Sieber et.al. | 2405.15731 | link |
2024-05-24 | Optimizing Large Language Models for OpenAPI Code Completion | Bohdan Petryshyn et.al. | 2405.15729 | link |
2024-05-24 | Disease-informed Adaptation of Vision-Language Models | Jiajin Zhang et.al. | 2405.15728 | link |
2024-05-24 | The Impact of Geometric Complexity on Neural Collapse in Transfer Learning | Michael Munn et.al. | 2405.15706 | null |
2024-05-24 | Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models | Yue Zhang et.al. | 2405.15684 | null |
2024-05-24 | VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap | Sreyan Ghosh et.al. | 2405.15683 | link |
2024-05-24 | What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | Abdelrahman Abdelhamed et.al. | 2405.15668 | null |
2024-05-24 | Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning | Wenhan Chang et.al. | 2405.15662 | null |
2024-05-24 | \(\mathbf{L^2\cdot M = C^2}\) Large Language Models as Covert Channels… a Systematic Analysis | Simen Gaure et.al. | 2405.15652 | null |
2024-05-24 | LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots | Ruoyu Wang et.al. | 2405.15646 | null |
2024-05-24 | GECKO: Generative Language Model for English, Code and Korean | Sungwoo Oh et.al. | 2405.15640 | null |
2024-05-24 | M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models | Hongyu Wang et.al. | 2405.15638 | link |
2024-05-24 | GPTZoo: A Large-scale Dataset of GPTs for the Research Community | Xinyi Hou et.al. | 2405.15630 | link |
2024-05-24 | A Comparative Analysis of Distributed Training Strategies for GPT-2 | Ishan Patwardhan et.al. | 2405.15628 | null |
2024-05-24 | Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment | Hao Sun et.al. | 2405.15624 | null |
2024-05-23 | PuzzleAvatar: Assembling 3D Avatars from Personal Albums | Yuliang Xiu et.al. | 2405.14869 | link |
2024-05-23 | A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns | Asaf Yehudai et.al. | 2405.14863 | null |
2024-05-23 | Bitune: Bidirectional Instruction-Tuning | Dawid J. Kopiczko et.al. | 2405.14862 | null |
2024-05-23 | Not All Language Model Features Are Linear | Joshua Engels et.al. | 2405.14860 | link |
2024-05-23 | PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression | Vladimir Malinovskii et.al. | 2405.14852 | link |
2024-05-23 | A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis | Yue Yang et.al. | 2405.14839 | null |
2024-05-23 | From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step | Yuntian Deng et.al. | 2405.14838 | link |
2024-05-23 | HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models | Bernal Jiménez Gutiérrez et.al. | 2405.14831 | link |
2024-05-23 | Designing A Sustainable Marine Debris Clean-up Framework without Human Labels | Raymond Wang et.al. | 2405.14815 | link |
2024-05-23 | As an AI Language Model, “Yes I Would Recommend Calling the Police’’: Norm Inconsistency in LLM Decision-Making | Shomik Jain et.al. | 2405.14812 | null |
2024-05-23 | Implicit Personalization in Language Models: A Systematic Study | Zhijing Jin et.al. | 2405.14808 | link |
2024-05-23 | Can LLMs Solve longer Math Word Problems Better? | Xin Xu et.al. | 2405.14804 | link |
2024-05-23 | Lessons from the Trenches on Reproducible Evaluation of Language Models | Stella Biderman et.al. | 2405.14782 | null |
2024-05-23 | WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | Peng Wang et.al. | 2405.14768 | link |
2024-05-23 | FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Hongyang Yang et.al. | 2405.14767 | link |
2024-05-23 | Evaluating Large Language Models for Public Health Classification and Extraction Tasks | Joshua Harris et.al. | 2405.14766 | null |
2024-05-23 | Large language models can be zero-shot anomaly detectors for time series? | Sarah Alnegheimish et.al. | 2405.14755 | link |
2024-05-23 | A Transformer-Based Approach for Smart Invocation of Automatic Code Completion | Aral de Moor et.al. | 2405.14753 | link |
2024-05-23 | MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs | Georgios Chatzigeorgakidis et.al. | 2405.14748 | null |
2024-05-23 | Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View | Xuan Liu et.al. | 2405.14744 | null |
2024-05-21 | Reducing Transformer Key-Value Cache Size with Cross-Layer Attention | William Brandon et.al. | 2405.12981 | null |
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979 | link |
2024-05-21 | BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once | Theodore Zhao et.al. | 2405.12971 | null |
2024-05-21 | Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale | Shriram Chennakesavalu et.al. | 2405.12961 | link |
2024-05-21 | Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models | Zhangyue Yin et.al. | 2405.12939 | link |
2024-05-21 | Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel et.al. | 2405.12933 | null |
2024-05-21 | Code-mixed Sentiment and Hate-speech Prediction | Anjali Yadav et.al. | 2405.12929 | link |
2024-05-21 | Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples | Tim Menzies et.al. | 2405.12920 | link |
2024-05-21 | G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation | Xingyuan Pan et.al. | 2405.12915 | link |
2024-05-21 | An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation | Zhiyu Tan et.al. | 2405.12914 | link |
2024-05-21 | Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment | Holli Sargeant et.al. | 2405.12910 | link |
2024-05-21 | Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents | San Kim et.al. | 2405.12900 | null |
2024-05-21 | Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models | Abdurahmman Alzahrani et.al. | 2405.12884 | null |
2024-05-21 | LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language | James Requeima et.al. | 2405.12856 | link |
2024-05-21 | OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models | Zhaojian Yu et.al. | 2405.12843 | link |
2024-05-21 | SmartFlow: Robotic Process Automation using LLMs | Arushi Jain et.al. | 2405.12842 | null |
2024-05-21 | Large Language Models Meet NLP: A Survey | Libo Qin et.al. | 2405.12819 | link |
2024-05-21 | Test Oracle Automation in the era of LLMs | Facundo Molina et.al. | 2405.12766 | null |
2024-05-21 | C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning | Ji Ma et.al. | 2405.12752 | null |
2024-05-21 | Generative AI and Large Language Models for Cyber Security: All Insights You Need | Mohamed Amine Ferrag et.al. | 2405.12750 | null |
2024-05-20 | Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning | Guanglin Zhou et.al. | 2405.12217 | link |
2024-05-20 | MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark | Hongwei Liu et.al. | 2405.12209 | link |
2024-05-20 | Developers’ Perceptions on the Impact of ChatGPT in Software Development: A Survey | Thiago S. Vaillant et.al. | 2405.12195 | link |
2024-05-20 | CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models | Haoxiang Shi et.al. | 2405.12174 | null |
2024-05-20 | Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging | Xiaobo Liang et.al. | 2405.12163 | link |
2024-05-20 | Eliciting Problem Specifications via Large Language Models | Robert E. Wray et.al. | 2405.12147 | null |
2024-05-20 | DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM | Xuchen Li et.al. | 2405.12139 | null |
2024-05-20 | MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning | Ting Jiang et.al. | 2405.12130 | link |
2024-05-20 | Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation | Zhankui He et.al. | 2405.12119 | null |
2024-05-20 | Imp: Highly Capable Large Multimodal Models for Mobile Devices | Zhenwei Shao et.al. | 2405.12107 | link |
2024-05-20 | DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction | Hao Chen et.al. | 2405.12100 | null |
2024-05-20 | Distributional Semantics, Holism, and the Instability of Meaning | Jumbly Grindrod et.al. | 2405.12084 | null |
2024-05-20 | PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation | Zhuobin Huang et.al. | 2405.12079 | null |
2024-05-20 | CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models | Tong Zhang et.al. | 2405.12063 | link |
2024-05-20 | STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents | Yue Chen et.al. | 2405.12059 | null |
2024-05-20 | KG-RAG: Bridging the Gap Between Knowledge and Creativity | Diego Sanmartin et.al. | 2405.12035 | null |
2024-05-20 | Can AI Relate: Testing Large Language Model Response for Mental Health Support | Saadia Gabriel et.al. | 2405.12021 | link |
2024-05-20 | MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering | Jingqun Tang et.al. | 2405.11985 | link |
2024-05-20 | A review on the use of large language models as virtual tutors | Silvia García-Méndez et.al. | 2405.11983 | null |
2024-05-20 | Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays | Zhichao Sun et.al. | 2405.11976 | link |
2024-05-17 | Observational Scaling Laws and the Predictability of Language Model Performance | Yangjun Ruan et.al. | 2405.10938 | link |
2024-05-17 | A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers | Kaiyu Huang et.al. | 2405.10936 | link |
2024-05-17 | The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks | Lucius Bushnaq et.al. | 2405.10928 | link |
2024-05-17 | Blackbox Adaptation for Medical Image Segmentation | Jay N. Paranjape et.al. | 2405.10913 | link |
2024-05-17 | COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain | Dimitrios P. Panagoulias et.al. | 2405.10893 | null |
2024-05-17 | Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: Systematic Literature Review | Hongyi Yang et.al. | 2405.10883 | null |
2024-05-17 | ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains | Zhaopei Huang et.al. | 2405.10860 | link |
2024-05-17 | The Future of Large Language Model Pre-training is Federated | Lorenzo Sani et.al. | 2405.10853 | null |
2024-05-17 | Open-Vocabulary Spatio-Temporal Action Detection | Tao Wu et.al. | 2405.10832 | null |
2024-05-17 | Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities | Hao Zhou et.al. | 2405.10825 | null |
2024-05-17 | ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios | Markus Bayer et.al. | 2405.10808 | null |
2024-05-17 | The Relational Machine Calculus | Chris Barrett et.al. | 2405.10801 | null |
2024-05-17 | Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings | Albert Sawczyn et.al. | 2405.10745 | null |
2024-05-17 | Efficient Multimodal Large Language Models: A Survey | Yizhang Jin et.al. | 2405.10739 | link |
2024-05-17 | INDUS: Effective and Efficient Language Models for Scientific Applications | Bishwaranjan Bhattacharjee et.al. | 2405.10725 | null |
2024-05-17 | SignLLM: Sign Languages Production Large Language Models | Sen Fang et.al. | 2405.10718 | null |
2024-05-17 | Persian Pronoun Resolution: Leveraging Neural Networks and Language Models | Hassan Haji Mohammadi et.al. | 2405.10714 | null |
2024-05-17 | SynDy: Synthetic Dynamic Dataset Generation Framework for Misinformation Tasks | Michael Shliselberg et.al. | 2405.10700 | null |
2024-05-17 | Revolutionizing Process Mining: A Novel Architecture for ChatGPT Integration and Enhanced User Experience through Optimized Prompt Engineering | Mehrdad Agha Mohammad Ali Kermani et.al. | 2405.10689 | null |
2024-05-17 | Realistic Evaluation of Toxicity in Large Language Models | Tinh Son Luong et.al. | 2405.10659 | null |
2024-05-16 | UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models | Sahel Sharifymoghaddam et.al. | 2405.10311 | link |
2024-05-16 | 4D Panoptic Scene Graph Generation | Jingkang Yang et.al. | 2405.10305 | link |
2024-05-16 | Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Yu Gui et.al. | 2405.10301 | link |
2024-05-16 | HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | Rhea Sanjay Sukthanker et.al. | 2405.10299 | link |
2024-05-17 | Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Yuexiang Zhai et.al. | 2405.10292 | null |
2024-05-16 | Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction | Jianhao Chen et.al. | 2405.10288 | link |
2024-05-16 | FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models | Adrian Bulat et.al. | 2405.10286 | null |
2024-05-16 | Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers | Tuo Zhang et.al. | 2405.10276 | null |
2024-05-16 | Keep It Private: Unsupervised Privatization of Online Text | Calvin Bao et.al. | 2405.10260 | link |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-16 | PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology | George Shaikovski et.al. | 2405.10254 | null |
2024-05-16 | A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks | Xuanfan Ni et.al. | 2405.10251 | null |
2024-05-16 | IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers | Hao Yan et.al. | 2405.10250 | null |
2024-05-16 | A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts | Xinru Zhang et.al. | 2405.10246 | link |
2024-05-16 | DocuMint: Docstring Generation for Python using Small Language Models | Bibek Poudel et.al. | 2405.10243 | link |
2024-05-16 | Low-Rank Adaptation of Time Series Foundational Models for Out-of-Domain Modality Forecasting | Divij Gupta et.al. | 2405.10216 | null |
2024-05-16 | CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations | Jiahao Zhao et.al. | 2405.10212 | link |
2024-05-16 | LFED: A Literary Fiction Evaluation Dataset for Large Language Models | Linhao Yu et.al. | 2405.10166 | link |
2024-05-16 | PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | Jiancheng Pan et.al. | 2405.10160 | link |
2024-05-16 | Speaker Verification in Agent-Generated Conversations | Yizhe Yang et.al. | 2405.10150 | null |
2024-05-15 | Modeling Bilingual Sentence Processing: Evaluating RNN and Transformer Architectures for Cross-Language Structural Priming | Bushi Xiao et.al. | 2405.09508 | null |
2024-05-15 | Constrained Learning for Causal Inference and Semiparametric Statistics | Tiffany Tianhui Cai et.al. | 2405.09493 | null |
2024-05-15 | Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts | Donya Rooein et.al. | 2405.09482 | null |
2024-05-15 | Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models | Majid Zarharan et.al. | 2405.09454 | link |
2024-05-15 | M $^4$ oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts | Yufeng Jiang et.al. | 2405.09446 | link |
2024-05-15 | Facilitating Opinion Diversity through Hybrid NLP Approaches | Michiel van der Meer et.al. | 2405.09439 | null |
2024-05-15 | A Survey On Text-to-3D Contents Generation In The Wild | Chenhan Jiang et.al. | 2405.09431 | null |
2024-05-15 | MicroPython Testbed for Federated Learning Algorithms | Miroslav Popovic et.al. | 2405.09423 | link |
2024-05-15 | Matching domain experts by training from scratch on domain knowledge | Xiaoliang Luo et.al. | 2405.09395 | null |
2024-05-15 | Compositional imprecise probability | Jack Liell-Cock et.al. | 2405.09391 | null |
2024-05-15 | PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models | Devansh Jain et.al. | 2405.09373 | link |
2024-05-15 | SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition | Weijie L et.al. | 2405.09365 | link |
2024-05-15 | Large Language Model Bias Mitigation from the Perspective of Knowledge Editing | Ruizhe Chen et.al. | 2405.09341 | null |
2024-05-15 | Prompting-based Synthetic Data Generation for Few-Shot Question Answering | Maximilian Schmidt et.al. | 2405.09335 | link |
2024-05-15 | Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls | Pedro Miguel Sánchez Sánchez et.al. | 2405.09318 | null |
2024-05-15 | Comparing the Efficacy of GPT-4 and Chat-GPT in Mental Health Care: A Blind Assessment of Large Language Models for Psychological Support | Birger Moell et.al. | 2405.09300 | null |
2024-05-15 | Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology | Hagyeong Shin et.al. | 2405.09293 | null |
2024-05-15 | Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection | Dylan Phelps et.al. | 2405.09279 | null |
2024-05-15 | Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study | Chi Ma et.al. | 2405.09274 | null |
2024-05-15 | New Textual Corpora for Serbian Language Modeling | Mihailo Škorić et.al. | 2405.09250 | null |
2024-05-14 | Efficient Vision-Language Pre-training by Cluster Masking | Zihao Wei et.al. | 2405.08815 | link |
2024-05-14 | Towards Enhanced RAC Accessibility: Leveraging Datasets and LLMs | Edison Jair Bejarano Sepulveda et.al. | 2405.08792 | link |
2024-05-14 | Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | Tiantian Zhang et.al. | 2405.08786 | link |
2024-05-14 | Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Intent Resolution in LLMs | Akhila Yerukola et.al. | 2405.08760 | link |
2024-05-14 | Distributed Threat Intelligence at the Edge Devices: A Large Language Model-Driven Approach | Syed Mhamudul Hasan et.al. | 2405.08755 | null |
2024-05-14 | Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Zhimin Li et.al. | 2405.08748 | link |
2024-05-14 | Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory | Xueyan Niu et.al. | 2405.08707 | null |
2024-05-14 | EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera | Beilei Cui et.al. | 2405.08672 | link |
2024-05-14 | Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research | Qinglong Cao et.al. | 2405.08668 | link |
2024-05-14 | Thinking Tokens for Language Modeling | David Herel et.al. | 2405.08644 | null |
2024-05-15 | ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation | Dimitris Gkoumas et.al. | 2405.08619 | null |
2024-05-14 | A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine | Hanguang Xiao et.al. | 2405.08603 | null |
2024-05-15 | EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark | Xiaohui Zhang et.al. | 2405.08596 | link |
2024-05-14 | Open-Vocabulary Object Detection via Neighboring Region Attention Alignment | Sunyuan Qiang et.al. | 2405.08593 | null |
2024-05-14 | Improving Transformers with Dynamically Composable Multi-Head Attention | Da Xiao et.al. | 2405.08553 | link |
2024-05-14 | Self-Distillation Improves DNA Sequence Inference | Tong Yu et.al. | 2405.08538 | link |
2024-05-14 | Falcon 7b for Software Mention Detection in Scholarly Documents | AmeerAli Khan et.al. | 2405.08514 | null |
2024-05-14 | Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure | Odysseas S. Chlapanis et.al. | 2405.08502 | link |
2024-05-14 | Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models | Agne Knietaite et.al. | 2405.08497 | link |
2024-05-14 | Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models | Andrea Piergentili et.al. | 2405.08477 | null |
2024-05-13 | Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots | Chengyue Wu et.al. | 2405.07990 | null |
2024-05-13 | A Generalist Learner for Multifaceted Medical Image Interpretation | Hong-Yu Zhou et.al. | 2405.07988 | null |
2024-05-13 | The Platonic Representation Hypothesis | Minyoung Huh et.al. | 2405.07987 | link |
2024-05-13 | Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation | Kevin Stangl et.al. | 2405.07969 | null |
2024-05-13 | PyZoBot: A Platform for Conversational Information Extraction and Synthesis from Curated Zotero Reference Libraries through Advanced Retrieval-Augmented Generation | Suad Alshammari et.al. | 2405.07963 | link |
2024-05-13 | AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | Samuel Schmidgall et.al. | 2405.07960 | null |
2024-05-13 | EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning | Yinzhu Quan et.al. | 2405.07938 | link |
2024-05-14 | PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition | Ziyang Zhang et.al. | 2405.07932 | link |
2024-05-13 | Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data | Mahdi Morafah et.al. | 2405.07925 | null |
2024-05-13 | Can Better Text Semantics in Prompt Tuning Improve VLM Generalization? | Hari Chandana Kuchibhotla et.al. | 2405.07921 | null |
2024-05-13 | A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking | Ferdinand Schlatt et.al. | 2405.07920 | link |
2024-05-13 | PLUTO: Pathology-Universal Transformer | Dinkar Juyal et.al. | 2405.07905 | null |
2024-05-13 | Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers | Alena Tsanda et.al. | 2405.07886 | link |
2024-05-13 | Zero-Shot Tokenizer Transfer | Benjamin Minixhofer et.al. | 2405.07883 | link |
2024-05-13 | RLHF Workflow: From Reward Modeling to Online RLHF | Hanze Dong et.al. | 2405.07863 | link |
2024-05-13 | Can LLMs Help Predict Elections? (Counter)Evidence from the World’s Largest Democracy | Pratik Gujral et.al. | 2405.07828 | null |
2024-05-13 | A View of How Language Models Will Transform Law | Frank Fagan et.al. | 2405.07826 | null |
2024-05-13 | FreeVA: Offline MLLM as Training-Free Video Assistant | Wenhao Wu et.al. | 2405.07798 | link |
2024-05-13 | DEPTH: Discourse Education through Pre-Training Hierarchically | Zachary Bamberger et.al. | 2405.07788 | link |
2024-05-13 | Generating Human Motion in 3D Scenes from Text Descriptions | Zhi Cen et.al. | 2405.07784 | null |
2024-05-10 | Linearizing Large Language Models | Jean Mercat et.al. | 2405.06640 | link |
2024-05-10 | Value Augmented Sampling for Language Model Alignment and Personalization | Seungwook Han et.al. | 2405.06639 | link |
2024-05-10 | Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark | Evan M. Williams et.al. | 2405.06634 | link |
2024-05-10 | Characterizing the Accuracy - Efficiency Trade-off of Low-rank Decomposition in Language Models | Chakshu Moar et.al. | 2405.06626 | null |
2024-05-10 | Explaining Text Similarity in Transformer Models | Alexandros Vasileiou et.al. | 2405.06604 | link |
2024-05-10 | Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach | Elham Ravanbakhsh et.al. | 2405.06586 | null |
2024-05-10 | What Can Natural Language Processing Do for Peer Review? | Ilia Kuznetsov et.al. | 2405.06563 | link |
2024-05-10 | Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval | Mengjia Niu et.al. | 2405.06545 | null |
2024-05-10 | Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts | Wenyu Huang et.al. | 2405.06524 | null |
2024-05-10 | UniDM: A Unified Framework for Data Manipulation with Large Language Models | Yichen Qian et.al. | 2405.06510 | null |
2024-05-10 | Storypark: Leveraging Large Language Models to Enhance Children Story Learning Through Child-AI collaboration Storytelling | Lyumanshan Ye et.al. | 2405.06495 | null |
2024-05-10 | Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | Yaoqin Ye et.al. | 2405.06468 | link |
2024-05-10 | Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | JoonHo Lee et.al. | 2405.06424 | link |
2024-05-10 | Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? | Hunter McNichols et.al. | 2405.06414 | link |
2024-05-10 | Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL | Ning Cheng et.al. | 2405.06410 | null |
2024-05-10 | Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus | Filipe Marinho Rocha et.al. | 2405.06399 | null |
2024-05-10 | Memory Mosaics | Jianyu Zhang et.al. | 2405.06394 | link |
2024-05-10 | LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play | Li-Chun Lu et.al. | 2405.06373 | link |
2024-05-10 | LMD3: Language Model Data Density Dependence | John Kirchenbauer et.al. | 2405.06331 | null |
2024-05-10 | Correlation Dimension of Natural Language in a Statistical Manifold | Xin Du et.al. | 2405.06321 | null |
2024-05-09 | Natural Language Processing RELIES on Linguistics | Juri Opitz et.al. | 2405.05966 | null |
2024-05-09 | OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning | Dan Qiao et.al. | 2405.05957 | link |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning | Junzhi Chen et.al. | 2405.05955 | link |
2024-05-09 | CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | Jiachen Li et.al. | 2405.05949 | link |
2024-05-09 | DOLOMITES: Domain-Specific Long-Form Methodical Tasks | Chaitanya Malaviya et.al. | 2405.05938 | null |
2024-05-09 | Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness | Siyuan Li et.al. | 2405.05930 | null |
2024-05-09 | Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman et.al. | 2405.05904 | null |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-05-09 | FlockGPT: Guiding UAV Flocking with Linguistic Orchestration | Artem Lykov et.al. | 2405.05872 | null |
2024-05-09 | Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Gunshi Gupta et.al. | 2405.05852 | link |
2024-05-09 | Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning | Artem Lykov et.al. | 2405.05824 | link |
2024-05-09 | Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference | Zhihang Lin et.al. | 2405.05803 | link |
2024-05-09 | Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language | Ronny Paul et.al. | 2405.05777 | null |
2024-05-09 | Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions | Polina Tsvilodub et.al. | 2405.05776 | null |
2024-05-09 | Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization | Zeyi Wang et.al. | 2405.05767 | null |
2024-05-09 | Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media | Zhizhen Zhang et.al. | 2405.05760 | null |
2024-05-09 | Exploring the Potential of Human-LLM Synergy in Advancing Qualitative Analysis: A Case Study on Mental-Illness Stigma | Han Meng et.al. | 2405.05758 | null |
2024-05-09 | Can large language models understand uncommon meanings of common words? | Jinyang Wu et.al. | 2405.05741 | null |
2024-05-09 | Evaluating Dialect Robustness of Language Models via Conversation Understanding | Dipankar Srirag et.al. | 2405.05688 | link |
2024-05-08 | THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models | Prannay Kaul et.al. | 2405.05256 | null |
2024-05-09 | You Only Cache Once: Decoder-Decoder Architectures for Language Models | Yutao Sun et.al. | 2405.05254 | link |
2024-05-08 | Open Source Language Models Can Provide Feedback: Evaluating LLMs’ Ability to Help Students Using GPT-4-As-A-Judge | Charles Koutcheme et.al. | 2405.05253 | link |
2024-05-09 | LLMs with Personalities in Multi-issue Negotiation Games | Sean Noh et.al. | 2405.05248 | null |
2024-05-08 | EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning | Jingfeng Yao et.al. | 2405.05237 | link |
2024-05-08 | SuFIA: Language-Guided Augmented Dexterity for Robotic Surgical Assistants | Masoud Moghani et.al. | 2405.05226 | null |
2024-05-08 | Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers | Jiuxiang Gu et.al. | 2405.05219 | null |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-08 | MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning | Inderjeet Nair et.al. | 2405.05189 | link |
2024-05-08 | Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming | Tommaso Pasini et.al. | 2405.05176 | null |
2024-05-08 | Air Gap: Protecting Privacy-Conscious Conversational Agents | Eugene Bagdasaryan et.al. | 2405.05175 | null |
2024-05-08 | XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Peiqin Lin et.al. | 2405.05116 | link |
2024-05-08 | QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs | Weijia Zhang et.al. | 2405.05109 | null |
2024-05-08 | Concerns on Bias in Large Language Models when Creating Synthetic Personae | Helena A. Haxvig et.al. | 2405.05080 | null |
2024-05-08 | Impact of Tone-Aware Explanations in Recommender Systems | Ayano Okoso et.al. | 2405.05061 | null |
2024-05-08 | Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models | Aylin Gunal et.al. | 2405.05060 | null |
2024-05-08 | Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources | Lasse Hyldig Hansen et.al. | 2405.05049 | null |
2024-05-08 | ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields | Ning Wang et.al. | 2405.05010 | null |
2024-05-08 | ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi et.al. | 2405.05008 | link |
2024-05-08 | NAVRepair: Node-type Aware C/C++ Code Vulnerability Repair | Ruoke Wang et.al. | 2405.04994 | null |
2024-05-07 | ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning | Jing Lin et.al. | 2405.04533 | null |
2024-05-07 | QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Yujun Lin et.al. | 2405.04532 | link |
2024-05-07 | NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts | Shudan Zhang et.al. | 2405.04520 | null |
2024-05-07 | xLSTM: Extended Long Short-Term Memory | Maximilian Beck et.al. | 2405.04517 | link |
2024-05-07 | A Transformer with Stack Attention | Jiaoda Li et.al. | 2405.04515 | link |
2024-05-08 | Unveiling Disparities in Web Task Handling Between Human and Web Agent | Kihoon Son et.al. | 2405.04497 | null |
2024-05-07 | Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions | Alexis Ross et.al. | 2405.04495 | null |
2024-05-07 | Representation Learning of Daily Movement Data Using Text Encoders | Alexander Capstick et.al. | 2405.04494 | link |
2024-05-08 | DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model | DeepSeek-AI et.al. | 2405.04434 | link |
2024-05-07 | The Silicone Ceiling: Auditing GPT’s Race and Gender Biases in Hiring | Lena Armstrong et.al. | 2405.04412 | null |
2024-05-07 | Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks | Georgios Pantazopoulos et.al. | 2405.04403 | link |
2024-05-07 | Large Language Models Cannot Explain Themselves | Advait Sarkar et.al. | 2405.04382 | null |
2024-05-07 | A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI | Hannah Chafetz et.al. | 2405.04333 | null |
2024-05-07 | Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation | Atharvan Dogra et.al. | 2405.04325 | null |
2024-05-07 | Granite Code Models: A Family of Open Foundation Models for Code Intelligence | Mayank Mishra et.al. | 2405.04324 | link |
2024-05-07 | Accelerating Speculative Decoding using Dynamic Speculation Length | Jonathan Mamou et.al. | 2405.04304 | null |
2024-05-07 | Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework | Xiangpeng Wan et.al. | 2405.04294 | link |
2024-05-07 | Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | Junchao Wu et.al. | 2405.04286 | null |
2024-05-07 | On the Foundations of Earth and Climate Foundation Models | Xiao Xiang Zhu et.al. | 2405.04285 | null |
2024-05-07 | Semantic API Alignment: Linking High-level User Goals to APIs | Robert Feldt et.al. | 2405.04236 | null |
2024-05-06 | Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs | Muhammad Uzair Khattak et.al. | 2405.03690 | null |
2024-05-06 | Pose Priors from Language Models | Sanjay Subramanian et.al. | 2405.03689 | null |
2024-05-06 | Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames | Keith Burghardt et.al. | 2405.03688 | link |
2024-05-06 | Language-Image Models with 3D Understanding | Jang Hyun Cho et.al. | 2405.03685 | null |
2024-05-06 | AtomGPT: Atomistic Generative Pre-trained Transformer for Forward and Inverse Materials Design | Kamal Choudhary et.al. | 2405.03680 | link |
2024-05-06 | When LLMs Meet Cybersecurity: A Systematic Literature Review | Jie Zhang et.al. | 2405.03644 | link |
2024-05-06 | A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama | Vlad-Andrei Cursaru et.al. | 2405.03616 | null |
2024-05-06 | GREEN: Generative Radiology Report Evaluation and Error Notation | Sophie Ostmeier et.al. | 2405.03595 | null |
2024-05-06 | Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment | Abhinav Agarwalla et.al. | 2405.03594 | null |
2024-05-06 | Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing | Han Liu et.al. | 2405.03565 | null |
2024-05-07 | ID-centric Pre-training for Recommendation | Yiqing Wu et.al. | 2405.03562 | null |
2024-05-06 | AlphaMath Almost Zero: process Supervision without process | Guoxin Chen et.al. | 2405.03553 | link |
2024-05-06 | MAmmoTH2: Scaling Instructions from the Web | Xiang Yue et.al. | 2405.03548 | null |
2024-05-06 | Position Paper: Leveraging Foundational Models for Black-Box Optimization: Benefits, Challenges, and Future Directions | Xingyou Song et.al. | 2405.03547 | null |
2024-05-06 | Are Human Rules Necessary? Generating Reusable APIs with CoT Reasoning and In-Context Learning | Yubo Mai et.al. | 2405.03509 | null |
2024-05-06 | UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images | Yiting Qu et.al. | 2405.03486 | null |
2024-05-06 | LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | Haowen Sun et.al. | 2405.03485 | link |
2024-05-06 | Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search | Hideaki Joko et.al. | 2405.03480 | link |
2024-05-07 | Large Language Models (LLMs) as Agents for Augmented Democracy | Jairo Gudiño-Rosero et.al. | 2405.03452 | null |
2024-05-06 | SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence | Hangyuan Ji et.al. | 2405.03446 | link |
2024-05-03 | Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models | Piotr Padlewski et.al. | 2405.02287 | link |
2024-05-03 | Structural Pruning of Pre-trained Language Models via Neural Architecture Search | Aaron Klein et.al. | 2405.02267 | link |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-03 | Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows | Jasmine Y. Shih et.al. | 2405.02260 | null |
2024-05-03 | What matters when building vision-language models? | Hugo Laurençon et.al. | 2405.02246 | null |
2024-05-03 | REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs | Deepa Tilwani et.al. | 2405.02228 | null |
2024-05-03 | Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks | Lujing Zhang et.al. | 2405.02225 | null |
2024-05-03 | FairEvalLLM. A Comprehensive Framework for Benchmarking Fairness in Large Language Model Recommender Systems | Yashar Deldjoo et.al. | 2405.02219 | null |
2024-05-03 | Automatic Programming: Large Language Models and Beyond | Michael R. Lyu et.al. | 2405.02213 | null |
2024-05-03 | Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh et.al. | 2405.02178 | null |
2024-05-03 | Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset | Hsuvas Borkakoty et.al. | 2405.02175 | link |
2024-05-03 | Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models | Mohamad Al Mdfaa et.al. | 2405.02162 | null |
2024-05-03 | Neural Context Flows for Learning Generalizable Dynamical Systems | Roussel Desmond Nzoyem et.al. | 2405.02154 | link |
2024-05-03 | The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates | Giuseppe Russo Latona et.al. | 2405.02150 | link |
2024-05-03 | MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang et.al. | 2405.02144 | null |
2024-05-03 | Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection | Guillem Ramírez et.al. | 2405.02134 | null |
2024-05-03 | Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets | Xuelong Geng et.al. | 2405.02132 | link |
2024-05-03 | Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph | Vladyslav Nechakhin et.al. | 2405.02105 | null |
2024-05-03 | Argumentative Large Language Models for Explainable and Contestable Decision-Making | Gabriel Freedman et.al. | 2405.02079 | link |
2024-05-03 | Comparative Analysis of Retrieval Systems in the Real World | Dmytro Mozolevskyi et.al. | 2405.02048 | null |
2024-05-02 | Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim et.al. | 2405.01535 | link |
2024-05-02 | Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks | Murtaza Dalal et.al. | 2405.01534 | null |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-05-02 | FLAME: Factuality-Aware Alignment for Large Language Models | Sheng-Chieh Lin et.al. | 2405.01525 | null |
2024-05-03 | A separability-based approach to quantifying generalization: which layer is best? | Luciano Dyballa et.al. | 2405.01524 | link |
2024-05-02 | Transformer-Aided Semantic Communications | Matin Mortaheb et.al. | 2405.01521 | null |
2024-05-02 | D2PO: Discriminator-Guided DPO with Response Evaluation Models | Prasann Singhal et.al. | 2405.01511 | link |
2024-05-02 | Analyzing the Role of Semantic Representations in the Era of Large Language Models | Zhijing Jin et.al. | 2405.01502 | link |
2024-05-02 | Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models | Raymond Fok et.al. | 2405.01501 | null |
2024-05-02 | Controllable Text Generation in the Instruction-Tuning Era | Dhananjay Ashok et.al. | 2405.01490 | null |
2024-05-02 | MANTIS: Interleaved Multi-Image Instruction Tuning | Dongfu Jiang et.al. | 2405.01483 | link |
2024-05-02 | NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | Gerald Shen et.al. | 2405.01481 | link |
2024-05-02 | V-FLUTE: Visual Figurative Language Understanding with Textual Explanations | Arkadiy Saakyan et.al. | 2405.01474 | link |
2024-05-02 | Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning | Théo Moutakanni et.al. | 2405.01469 | null |
2024-05-02 | Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models | Yifei Ming et.al. | 2405.01468 | null |
2024-05-02 | A Systematic Literature Review on Large Language Models for Automated Program Repair | Quanjun Zhang et.al. | 2405.01466 | link |
2024-05-02 | Natural Language to Verilog: Design of a Recurrent Spiking Neural Network using Large Language Models and ChatGPT | Paola Vitolo et.al. | 2405.01419 | null |
2024-05-02 | MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Yuan Tang et.al. | 2405.01413 | link |
2024-05-02 | Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | Xin Quan et.al. | 2405.01379 | link |
2024-05-02 | GAIA: A General AI Assistant for Intelligent Accelerator Operations | Frank Mayet et.al. | 2405.01359 | null |
2024-05-01 | Self-Play Preference Optimization for Language Model Alignment | Yue Wu et.al. | 2405.00675 | link |
2024-05-01 | Is Bigger Edit Batch Size Always Better? – An Empirical Study on Model Editing with Llama-3 | Junsang Yoon et.al. | 2405.00664 | link |
2024-05-01 | HalluVault: A Novel Logic Programming-aided Metamorphic Testing Framework for Detecting Fact-Conflicting Hallucinations in Large Language Models | Ningke Li et.al. | 2405.00648 | null |
2024-05-01 | When Quantization Affects Confidence of Large Language Models? | Irina Proskurina et.al. | 2405.00632 | link |
2024-05-01 | “I’m Not Sure, But…”: Examining the Impact of Large Language Models’ Uncertainty Expression on User Reliance and Trust | Sunnie S. Y. Kim et.al. | 2405.00623 | null |
2024-05-01 | Causal Evaluation of Language Models | Sirui Chen et.al. | 2405.00622 | link |
2024-05-01 | Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | Yida Mu et.al. | 2405.00611 | link |
2024-05-01 | Investigating Automatic Scoring and Feedback using Large Language Models | Gloria Ashiya Katuka et.al. | 2405.00602 | null |
2024-05-01 | Are Models Biased on Text without Gender-related Language? | Catarina G Belém et.al. | 2405.00588 | link |
2024-05-01 | The Real, the Better: Aligning Large Language Models with Online Human Behaviors | Guanying Jiang et.al. | 2405.00578 | null |
2024-05-01 | EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model | Deng Li et.al. | 2405.00574 | null |
2024-05-01 | NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance | Huan-Yi Su et.al. | 2405.00566 | null |
2024-05-01 | Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment | Zhili Liu et.al. | 2405.00557 | null |
2024-05-01 | Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs | Nicolas Gorlo et.al. | 2405.00552 | link |
2024-05-01 | ChatBI: Towards Natural Language to Complex Business Intelligence SQL | Jinqing Lian et.al. | 2405.00527 | null |
2024-05-01 | CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions | Donghee Choi et.al. | 2405.00523 | null |
2024-05-01 | Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning | Lucas-Andreï Thil et.al. | 2405.00516 | null |
2024-05-01 | GOLD: Geometry Problem Solver with Natural Language Description | Jiaxin Zhang et.al. | 2405.00494 | link |
2024-05-01 | Is Temperature the Creativity Parameter of Large Language Models? | Max Peeperkorn et.al. | 2405.00492 | link |
2024-05-01 | The Pyramid of Captions | Delong Chen et.al. | 2405.00485 | null |
2024-04-30 | Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | Yunhao Ge et.al. | 2404.19752 | null |
2024-04-30 | PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification | Leon Garza et.al. | 2404.19744 | null |
2024-04-30 | Better & Faster Large Language Models via Multi-token Prediction | Fabian Gloeckle et.al. | 2404.19737 | null |
2024-04-30 | A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications | Steph Buongiorno et.al. | 2404.19729 | null |
2024-04-30 | PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games | Steph Buongiorno et.al. | 2404.19721 | null |
2024-04-30 | Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns | Constantinos Patsakis et.al. | 2404.19715 | null |
2024-04-30 | Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models | Scott Sumpter et.al. | 2404.19713 | null |
2024-04-30 | When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively | Tiziano Labruna et.al. | 2404.19705 | link |
2024-04-30 | Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners | Chun Feng et.al. | 2404.19696 | null |
2024-04-30 | Towards Generalist Robot Learning from Internet Video: A Survey | Robert McCarthy et.al. | 2404.19664 | null |
2024-04-30 | MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation | Min Zhang et.al. | 2404.19644 | link |
2024-04-30 | On Training a Neural Network to Explain Binaries | Alexander Interrante-Grant et.al. | 2404.19631 | null |
2024-04-30 | Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model | Denys Godwin et.al. | 2404.19609 | null |
2024-04-30 | Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning | Xuanli He et.al. | 2404.19597 | null |
2024-04-30 | RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing | Yucheng Hu et.al. | 2404.19543 | link |
2024-04-30 | MoST: Multi-modality Scene Tokenization for Motion Prediction | Norman Mu et.al. | 2404.19531 | null |
2024-04-30 | Do Large Language Models Understand Conversational Implicature – A case study with a chinese sitcom | Shisen Yue et.al. | 2404.19509 | link |
2024-04-30 | More Compute Is What You Need | Zhen Guo et.al. | 2404.19484 | null |
2024-05-01 | Neuro-Vision to Language: Image Reconstruction and Language enabled Interaction via Brain Recordings | Guobin Shen et.al. | 2404.19438 | null |
2024-04-30 | Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships | D. Panas et.al. | 2404.19432 | null |
2024-04-29 | Hallucination of Multimodal Large Language Models: A Survey | Zechen Bai et.al. | 2404.18930 | link |
2024-04-29 | Holmes: Benchmark the Linguistic Competence of Language Models | Andreas Waldis et.al. | 2404.18923 | null |
2024-04-29 | DPO Meets PPO: Reinforced Token Optimization for RLHF | Han Zhong et.al. | 2404.18922 | link |
2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | link |
2024-04-29 | Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting | Fangcheng Liu et.al. | 2404.18911 | link |
2024-04-29 | Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking | Hong Jin Kang et.al. | 2404.18881 | link |
2024-04-29 | More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness | Aaron J. Li et.al. | 2404.18870 | link |
2024-04-29 | Truth-value judgment in language models: belief directions are context sensitive | Stefan F. Schouten et.al. | 2404.18865 | null |
2024-04-29 | Performance-Aligned LLMs for Generating Fast Code | Daniel Nichols et.al. | 2404.18864 | null |
2024-04-29 | A Survey on Vision Mamba: Models, Applications and Challenges | Rui Xu et.al. | 2404.18861 | link |
2024-04-29 | VERT: Verified Equivalent Rust Transpilation with Few-Shot Learning | Aidan Z. H. Yang et.al. | 2404.18852 | null |
2024-04-30 | FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition | Yuxuan Yan et.al. | 2404.18848 | null |
2024-04-29 | It’s Difficult to be Neutral – Human and LLM-based Sentiment Annotation of Patient Comments | Petter Mæhlum et.al. | 2404.18832 | null |
2024-04-29 | Benchmarking Benchmark Leakage in Large Language Models | Ruijie Xu et.al. | 2404.18824 | link |
2024-04-29 | AppPoet: Large Language Model based Android malware detection via multi-view prompt engineering | Wenxiang Zhao et.al. | 2404.18816 | null |
2024-04-29 | Unknown Script: Impact of Script on Cross-Lingual Transfer | Wondimagegnhue Tsegaye Tufa et.al. | 2404.18810 | link |
2024-04-29 | Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models | Pat Verga et.al. | 2404.18796 | null |
2024-04-29 | PECC: Problem Extraction and Coding Challenges | Patrick Haller et.al. | 2404.18766 | link |
2024-04-29 | Transitive Vision-Language Prompt Learning for Domain Generalization | Liyuan Wang et.al. | 2404.18758 | null |
2024-04-29 | Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models | Hongyi Zhu et.al. | 2404.18746 | null |
2024-04-26 | Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo | Stephen Zhao et.al. | 2404.17546 | link |
2024-04-26 | Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models | Yuhang Huang et.al. | 2404.17534 | null |
2024-04-26 | Large Language Model Agent as a Mechanical Designer | Yayati Jadhav et.al. | 2404.17525 | null |
2024-04-26 | On the Use of Large Language Models to Generate Capability Ontologies | Luis Miguel Vieira da Silva et.al. | 2404.17524 | link |
2024-04-26 | Enhancing Legal Compliance and Regulation Analysis with Large Language Models | Shabnam Hassani et.al. | 2404.17522 | null |
2024-04-26 | A Comprehensive Evaluation on Event Reasoning of Large Language Models | Zhengwei Tao et.al. | 2404.17513 | link |
2024-04-26 | CEval: A Benchmark for Evaluating Counterfactual Text Generation | Van Bach Nguyen et.al. | 2404.17475 | link |
2024-04-26 | Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System | Robin Schmucker et.al. | 2404.17460 | null |
2024-04-26 | “ChatGPT Is Here to Help, Not to Replace Anybody” – An Evaluation of Students’ Opinions On Integrating ChatGPT In CS Courses | Bruno Pereira Cipriano et.al. | 2404.17443 | null |
2024-04-26 | PromptCIR: Blind Compressed Image Restoration with Prompt Learning | Bingchen Li et.al. | 2404.17433 | link |
2024-04-26 | Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations | Rémy Decoupes et.al. | 2404.17401 | null |
2024-04-26 | UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning | Maoxun Yuan et.al. | 2404.17360 | link |
2024-04-26 | InspectorRAGet: An Introspection Platform for RAG Evaluation | Kshitij Fadnis et.al. | 2404.17347 | link |
2024-04-26 | Introducing cosmosGPT: Monolingual Training for Turkish Language Models | H. Toprak Kesgin et.al. | 2404.17336 | null |
2024-04-26 | A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation | Xin Zhang et.al. | 2404.17335 | null |
2024-04-26 | An Extendable Cloud-Native Alloy Property Explorer | Zhuoyuan Li et.al. | 2404.17330 | link |
2024-04-26 | When to Trust LLMs: Aligning Confidence with Response Quality | Shuchang Tao et.al. | 2404.17287 | link |
2024-04-26 | Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM | Xuan Zhang et.al. | 2404.17283 | link |
2024-04-26 | Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot | Michelle Terblanche et.al. | 2404.17216 | null |
2024-04-26 | Low-Rank Knowledge Decomposition for Medical Foundation Models | Yuhang Zhou et.al. | 2404.17184 | link |
2024-04-25 | The Third Monocular Depth Estimation Challenge | Jaime Spencer et.al. | 2404.16831 | null |
2024-04-25 | Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials | Ye Fang et.al. | 2404.16829 | null |
2024-04-25 | V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection | Xuanyu Zhang et.al. | 2404.16824 | null |
2024-04-25 | How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | Zhe Chen et.al. | 2404.16821 | link |
2024-04-25 | IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages | Harman Singh et.al. | 2404.16816 | link |
2024-04-26 | Make Your LLM Fully Utilize the Context | Shengnan An et.al. | 2404.16811 | link |
2024-04-25 | Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning | Tianhui Zhang et.al. | 2404.16807 | link |
2024-04-25 | AAPL: Adding Attributes to Prompt Learning for Vision-Language Models | Gahyeon Kim et.al. | 2404.16804 | link |
2024-04-25 | Weak-to-Strong Extrapolation Expedites Alignment | Chujie Zheng et.al. | 2404.16792 | link |
2024-04-25 | SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension | Bohao Li et.al. | 2404.16790 | link |
2024-04-25 | Continual Learning of Large Language Models: A Comprehensive Survey | Haizhou Shi et.al. | 2404.16789 | link |
2024-04-25 | Modeling Selective Feature Attention for Representation-based Siamese Text Matching | Jianxiang Zang et.al. | 2404.16776 | link |
2024-04-25 | REBEL: Reinforcement Learning via Regressing Relative Rewards | Zhaolin Gao et.al. | 2404.16767 | link |
2024-04-25 | Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model | Runzhe Zhan et.al. | 2404.16766 | null |
2024-04-25 | RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis | Xiaoman Zhang et.al. | 2404.16754 | link |
2024-04-25 | Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class | Mazda Moayeri et.al. | 2404.16717 | null |
2024-04-25 | Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding | Mostafa Elhoushi et.al. | 2404.16710 | link |
2024-04-25 | Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents | Giorgio Piatti et.al. | 2404.16698 | link |
2024-04-25 | Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4 | Lydia Uhler et.al. | 2404.16692 | null |
2024-04-25 | EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning | Hongxia Xie et.al. | 2404.16670 | link |
2024-04-24 | Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data | Aliaksei Vertsel et.al. | 2404.15604 | null |
2024-04-24 | ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction | Henry Peng Zou et.al. | 2404.15592 | link |
2024-04-24 | MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis | Jiaxin Zhuang et.al. | 2404.15580 | null |
2024-04-24 | Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations? | Hossein Salami et.al. | 2404.15578 | null |
2024-04-24 | Retrieval Head Mechanistically Explains Long-Context Factuality | Wenhao Wu et.al. | 2404.15574 | link |
2024-04-23 | PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models | Shashi Kant Gupta et.al. | 2404.15549 | null |
2024-04-23 | BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis | Shuhang Lin et.al. | 2404.15532 | link |
2024-04-23 | Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models | Mihir Parmar et.al. | 2404.15522 | link |
2024-04-23 | Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang et.al. | 2404.15516 | null |
2024-04-23 | ToM-LM: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models | Weizhi Tang et.al. | 2404.15515 | null |
2024-04-23 | IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents | Jean-Philippe Corbeil et.al. | 2404.15488 | link |
2024-04-23 | Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance | Het Patel et.al. | 2404.15485 | null |
2024-04-23 | Can Large Language Models Learn the Physics of Metamaterials? An Empirical Study with ChatGPT | Darui Lu et.al. | 2404.15458 | null |
2024-04-23 | XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference | João Monteiro et.al. | 2404.15420 | null |
2024-04-23 | Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs | Davide Caffagni et.al. | 2404.15406 | null |
2024-04-23 | Aligning LLM Agents by Learning Latent Preference from User Edits | Ge Gao et.al. | 2404.15269 | link |
2024-04-23 | XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts | Yifeng Ding et.al. | 2404.15247 | link |
2024-04-23 | CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Weiyan Shi et.al. | 2404.15238 | link |
2024-04-23 | Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models | Aidan Z. H. Yang et.al. | 2404.15236 | null |
2024-04-23 | Re-Thinking Inverse Graphics With Large Language Models | Peter Kulits et.al. | 2404.15228 | null |
2024-04-23 | Does Instruction Tuning Make LLMs More Consistent? | Constanza Fierro et.al. | 2404.15206 | null |
2024-04-23 | Setting up the Data Printer with Improved English to Ukrainian Machine Translation | Yurii Paniv et.al. | 2404.15196 | link |
2024-04-23 | Regressive Side Effects of Training Language Models to Mimic Student Misconceptions | Shashank Sonkar et.al. | 2404.15156 | null |
2024-04-23 | Bias patterns in the application of LLMs for clinical decision support: A comprehensive study | Raphael Poulain et.al. | 2404.15149 | link |
2024-04-23 | Rethinking LLM Memorization through the Lens of Adversarial Compression | Avi Schwarzschild et.al. | 2404.15146 | null |
2024-04-23 | MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning | Sunan He et.al. | 2404.15127 | link |
2024-04-23 | Identifying Fairness Issues in Automatically Generated Testing Content | Kevin Stowe et.al. | 2404.15104 | null |
2024-04-23 | Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation | Xun Wu et.al. | 2404.15100 | null |
2024-04-23 | Detection of circular permutations by Protein Language Models | Yue Hu et.al. | 2404.15087 | link |
2024-04-23 | Multi-Head Mixture-of-Experts | Xun Wu et.al. | 2404.15045 | link |
2024-04-23 | TAXI: Evaluating Categorical Knowledge Editing for Language Models | Derek Powell et.al. | 2404.15004 | link |
2024-04-23 | Transformers Can Represent $n$ -gram Language Models | Anej Svete et.al. | 2404.14994 | null |
2024-04-23 | A Short Review for Ontology Learning from Text: Stride from Shallow Learning, Deep Learning to Large Language Models Trend | Rick Du et.al. | 2404.14991 | null |
2024-04-23 | $\texttt{MiniMol}$ : A Parameter-Efficient Foundation Model for Molecular Learning | Kerstin Kläser et.al. | 2404.14986 | null |
2024-04-23 | Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case | Muhammad Asif Auyb et.al. | 2404.14977 | null |
2024-04-22 | AutoAD III: The Prequel – Back to the Pixels | Tengda Han et.al. | 2404.14412 | null |
2024-04-22 | SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Kevin Slagle et.al. | 2404.14408 | link |
2024-04-22 | RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios? | Adrian de Wynter et.al. | 2404.14397 | link |
2024-04-22 | SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation | Yuying Ge et.al. | 2404.14396 | link |
2024-04-22 | PARAMANU-GANITA: Language Model with Mathematical Capabilities | Mitodru Niyogi et.al. | 2404.14395 | null |
2024-04-22 | A Multimodal Automated Interpretability Agent | Tamar Rott Shaham et.al. | 2404.14394 | null |
2024-04-22 | A Survey on Self-Evolution of Large Language Models | Zhengwei Tao et.al. | 2404.14387 | link |
2024-04-22 | Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph | Xiaochen Kev Gao et.al. | 2404.14372 | link |
2024-04-23 | Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data | Fahim Tajwar et.al. | 2404.14367 | link |
2024-04-22 | Better Synthetic Data by Retrieving and Transforming Existing Datasets | Saumya Gandhi et.al. | 2404.14361 | link |
2024-04-22 | Rethinking Legal Compliance Automation: Opportunities with Large Language Models | Shabnam Hassani et.al. | 2404.14356 | null |
2024-04-22 | Calc-CMU at SemEval-2024 Task 7: Pre-Calc – Learning to Use the Calculator Improves Numeracy in Language Models | Vishruth Veerendranath et.al. | 2404.14355 | link |
2024-04-22 | Automated Long Answer Grading with RiceChem Dataset | Shashank Sonkar et.al. | 2404.14316 | link |
2024-04-22 | Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels | Jan-Philipp Fränken et.al. | 2404.14313 | link |
2024-04-22 | Explaining Arguments’ Strength: Unveiling the Role of Attacks and Supports (Technical Report) | Xiang Yin et.al. | 2404.14304 | link |
2024-04-22 | Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits | Shashank Sonkar et.al. | 2404.14301 | null |
2024-04-22 | Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach | Yao Wan et.al. | 2404.14296 | link |
2024-04-22 | A Survey on Efficient Inference for Large Language Models | Zixuan Zhou et.al. | 2404.14294 | null |
2024-04-22 | LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Dongge Han et.al. | 2404.14285 | null |
2024-04-22 | Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Wenyi Xiao et.al. | 2404.14233 | link |
2024-04-19 | MoVA: Adapting Mixture of Vision Experts to Multimodal Context | Zhuofan Zong et.al. | 2404.13046 | link |
2024-04-19 | Unified Scene Representation and Reconstruction for 3D Large Language Models | Tao Chu et.al. | 2404.13044 | null |
2024-04-19 | Data Alignment for Zero-Shot Concept Generation in Dermatology AI | Soham Gadgil et.al. | 2404.13043 | null |
2024-04-19 | Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs | Biyang Guo et.al. | 2404.13033 | link |
2024-04-19 | When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering | Stephen Choi et.al. | 2404.13028 | null |
2024-04-19 | Stronger Random Baselines for In-Context Learning | Gregory Yauney et.al. | 2404.13020 | link |
2024-04-19 | Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Chuofan Ma et.al. | 2404.13013 | link |
2024-04-19 | Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs | Clemencia Siro et.al. | 2404.12994 | link |
2024-04-19 | FineRec:Exploring Fine-grained Sequential Recommendation | Xiaokun Zhang et.al. | 2404.12975 | link |
2024-04-19 | Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models | Yian Li et.al. | 2404.12966 | null |
2024-04-19 | Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction | Qinyuan Wu et.al. | 2404.12957 | link |
2024-04-19 | Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models | Konstantinos Vilouras et.al. | 2404.12920 | link |
2024-04-19 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | Large Language Models for Networking: Workflow, Advances and Challenges | Chang Liu et.al. | 2404.12901 | null |
2024-04-19 | Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning | Ahmed Elshabrawy et.al. | 2404.12897 | null |
2024-04-19 | Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation | Guanhua Chen et.al. | 2404.12879 | null |
2024-04-19 | LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Zhaodonghui Li et.al. | 2404.12872 | link |
2024-04-19 | How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo et.al. | 2404.12866 | link |
2024-04-19 | Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation | Yilong Chen et.al. | 2404.12861 | null |
2024-04-19 | TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages | Aleksei Dorkin et.al. | 2404.12845 | null |
2024-04-18 | BLINK: Multimodal Large Language Models Can See but Not Perceive | Xingyu Fu et.al. | 2404.12390 | null |
2024-04-18 | Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models | Aitor Ormazabal et.al. | 2404.12387 | null |
2024-04-18 | MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale | Xiaotang Gai et.al. | 2404.12372 | null |
2024-04-18 | When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Asaf Yehudai et.al. | 2404.12365 | link |
2024-04-18 | From $r$ to $Q^*$ : Your Language Model is Secretly a Q-Function | Rafael Rafailov et.al. | 2404.12358 | null |
2024-04-18 | Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation | Jingmin Sun et.al. | 2404.12355 | link |
2024-04-18 | V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning | Hang Hua et.al. | 2404.12353 | null |
2024-04-18 | Evaluating AI for Law: Bridging the Gap with Open-Source Solutions | Rohan Bhambhoria et.al. | 2404.12349 | null |
2024-04-18 | Large Language Models in Targeted Sentiment Analysis | Nicolay Rusnachenko et.al. | 2404.12342 | link |
2024-04-18 | Normative Requirements Operationalization with Large Language Models | Nick Feng et.al. | 2404.12335 | null |
2024-04-18 | Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu et.al. | 2404.12318 | null |
2024-04-18 | Large Language Models for Synthetic Participatory Planning of Shared Automated Electric Mobility Systems | Jiangbo Yu et.al. | 2404.12317 | null |
2024-04-18 | Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai et.al. | 2404.12299 | null |
2024-04-18 | Augmenting emotion features in irony detection with Large language modeling | Yucheng Lin et.al. | 2404.12291 | null |
2024-04-18 | Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery | Yona Falinie A. Gaus et.al. | 2404.12285 | null |
2024-04-18 | Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting | Nicholas Harris et.al. | 2404.12283 | null |
2024-04-18 | Advancing the Robustness of Large Language Models through Self-Denoised Smoothing | Jiabao Ji et.al. | 2404.12274 | link |
2024-04-18 | FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom | Yuanqin He et.al. | 2404.12273 | null |
2024-04-18 | Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences | Shreya Shankar et.al. | 2404.12272 | null |
2024-04-18 | Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM | Michelle S. Lam et.al. | 2404.12259 | link |
2024-04-18 | Private federated discovery of out-of-vocabulary words for Gboard | Ziteng Sun et.al. | 2404.11607 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-17 | A Deep Dive into Large Language Models for Automated Bug Localization and Repair | Soneya Binta Hossain et.al. | 2404.11595 | null |
2024-04-17 | Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Zezhong Fan et.al. | 2404.11589 | null |
2024-04-17 | LLMTune: Accelerate Database Knob Tuning with Large Language Models | Xinmei Huang et.al. | 2404.11581 | link |
2024-04-17 | On the Scalability of GNNs for Molecular Graphs | Maciej Sypetkowski et.al. | 2404.11568 | null |
2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | null |
2024-04-17 | Quantifying Multilingual Performance of Large Language Models Across Languages | Zihao Li et.al. | 2404.11553 | link |
2024-04-17 | Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis | Soyoung Yang et.al. | 2404.11539 | null |
2024-04-17 | FedPFT: Federated Proxy Fine-Tuning of Foundation Models | Zhaopeng Peng et.al. | 2404.11536 | link |
2024-04-17 | Select and Reorder: A Novel Approach for Neural Sign Language Production | Harry Walsh et.al. | 2404.11532 | null |
2024-04-17 | Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization | Costas Mavromatis et.al. | 2404.11531 | link |
2024-04-17 | Embedding Privacy in Computational Social Science and Artificial Intelligence Research | Keenan Jones et.al. | 2404.11515 | null |
2024-04-17 | Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models | Yushuo Chen et.al. | 2404.11502 | link |
2024-04-17 | Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models | Yue Zhou et.al. | 2404.11500 | link |
2024-04-18 | Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent | Wei Chen et.al. | 2404.11459 | null |
2024-04-17 | Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models | Sunhao Dai et.al. | 2404.11457 | link |
2024-04-17 | AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Meng Jiang et.al. | 2404.11449 | link |
2024-04-17 | Open-Ended Wargames with Large Language Models | Daniel P. Hogan et.al. | 2404.11446 | link |
2024-04-17 | DUPE: Detection Undermining via Prompt Engineering for Deepfake Text | James Weichert et.al. | 2404.11408 | null |
2024-04-16 | Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback | Qiwei Di et.al. | 2404.10776 | null |
2024-04-16 | COMBO: Compositional World Models for Embodied Multi-Agent Cooperation | Hongxin Zhang et.al. | 2404.10775 | null |
2024-04-16 | Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Yu-Yang Li et.al. | 2404.10757 | link |
2024-04-16 | Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study | Shusheng Xu et.al. | 2404.10719 | link |
2024-04-17 | Dual Modalities of Text: Visual and Textual Generative Pre-training | Yekun Chai et.al. | 2404.10710 | link |
2024-04-16 | Question Difficulty Ranking for Multiple-Choice Reading Comprehension | Vatsal Raina et.al. | 2404.10704 | null |
2024-04-16 | An empirical study on code review activity prediction in practice | Doriane Olewicki et.al. | 2404.10703 | null |
2024-04-16 | Automating REST API Postman Test Cases Using LLM | S Deepika Sri et.al. | 2404.10678 | null |
2024-04-16 | Self-playing Adversarial Language Game Enhances LLM Reasoning | Pengyu Cheng et.al. | 2404.10642 | link |
2024-04-16 | HLAT: High-quality Large Language Model Pre-trained on AWS Trainium | Haozheng Fan et.al. | 2404.10630 | link |
2024-04-16 | Private Attribute Inference from Images with Vision-Language Models | Batuhan Tömekçe et.al. | 2404.10618 | link |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-16 | Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training | Masanori Hirano et.al. | 2404.10555 | null |
2024-04-16 | Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning | Xiao Wang et.al. | 2404.10552 | null |
2024-04-16 | Capturing the Macroscopic Behaviour of Molecular Dynamics with Membership Functions | Alexander Sikorski et.al. | 2404.10523 | link |
2024-04-16 | CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity | Moshe Berchansky et.al. | 2404.10513 | null |
2024-04-16 | White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency | Yixin Wan et.al. | 2404.10508 | null |
2024-04-16 | Self-Supervised Visual Preference Alignment | Ke Zhu et.al. | 2404.10501 | link |
2024-04-16 | When Emotional Stimuli meet Prompt Designing: An Auto-Prompt Graphical Paradigm | Chenggian Ma et.al. | 2404.10500 | null |
2024-04-16 | Spiral of Silences: How is Large Language Model Killing Information Retrieval? – A Case Study on Open Domain Question Answering | Xiaoyang Chen et.al. | 2404.10496 | link |
2024-04-15 | KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models | Avinash Anand et.al. | 2404.09763 | null |
2024-04-15 | Resilience of Large Language Models for Noisy Instructions | Bin Wang et.al. | 2404.09754 | null |
2024-04-15 | Personalized Collaborative Fine-Tuning for On-Device Large Language Models | Nicolas Wagner et.al. | 2404.09753 | link |
2024-04-15 | AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides | Kewei Li et.al. | 2404.09738 | link |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | link |
2024-04-15 | Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model | Hyunsoo Cho et.al. | 2404.09717 | null |
2024-04-15 | Enhancing Robot Explanation Capabilities through Vision-Language Models: a Preliminary Study by Interpreting Visual Inputs for Improved Human-Robot Interaction | David Sobrín-Hidalgo et.al. | 2404.09705 | null |
2024-04-15 | Generative AI for Game Theory-based Mobile Networking | Long He et.al. | 2404.09699 | null |
2024-04-15 | Are Large Language Models Reliable Argument Quality Annotators? | Nailia Mirzakhmedova et.al. | 2404.09696 | link |
2024-04-15 | LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models | Guangyan Li et.al. | 2404.09695 | null |
2024-04-15 | Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi et.al. | 2404.09682 | link |
2024-04-15 | Learn Your Reference Model for Real Good Alignment | Alexey Gorbatovski et.al. | 2404.09656 | null |
2024-04-15 | Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection | Jiaqi Zhu et.al. | 2404.09654 | null |
2024-04-15 | Bridging Vision and Language Spaces with Assignment Prediction | Jungin Park et.al. | 2404.09632 | link |
2024-04-15 | AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception | Yipo Huang et.al. | 2404.09624 | link |
2024-04-15 | UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark | Zhaokun Zhou et.al. | 2404.09619 | null |
2024-04-15 | A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions | Pengfei Liu et.al. | 2404.09606 | link |
2024-04-15 | Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction | Zepeng Ding et.al. | 2404.09593 | null |
2024-04-15 | Modelling Language | Jumbly Grindrod et.al. | 2404.09579 | null |
2024-04-15 | Transformers, Contextualism, and Polysemy | Jumbly Grindrod et.al. | 2404.09577 | link |
2024-04-15 | Large language models and linguistic intentionality | Jumbly Grindrod et.al. | 2404.09576 | null |
2024-04-12 | Probing the 3D Awareness of Visual Foundation Models | Mohamed El Banani et.al. | 2404.08636 | link |
2024-04-12 | Pre-training Small Base LMs with Fewer Tokens | Sunny Sanyal et.al. | 2404.08634 | link |
2024-04-12 | FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models | Yanting Wang et.al. | 2404.08631 | link |
2024-04-12 | Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Yanhao Zheng et.al. | 2404.08603 | link |
2024-04-12 | Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts | Övgü Özdemir et.al. | 2404.08589 | link |
2024-04-12 | Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation | Abu Bakor Hayat Arnob et.al. | 2404.08584 | link |
2024-04-12 | FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation | Riza Velioglu et.al. | 2404.08582 | link |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation | Hanlin Tian et.al. | 2404.08570 | link |
2024-04-12 | RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs | Shreyas Chaudhari et.al. | 2404.08555 | null |
2024-04-12 | Memory Traces: Are Transformers Tulving Machines? | Jean-Marie Chauvet et.al. | 2404.08543 | null |
2024-04-12 | Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward | Xuan Xie et.al. | 2404.08517 | null |
2024-04-12 | ChatGPT and general-purpose AI count fruits in pictures surprisingly well | Konlavach Mengsuwan et.al. | 2404.08515 | null |
2024-04-12 | Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | Haoran Qiu et.al. | 2404.08509 | link |
2024-04-12 | LaSagnA: Language-based Segmentation Assistant for Complex Queries | Cong Wei et.al. | 2404.08506 | link |
2024-04-12 | Strategic Interactions between Large Language Models-based Agents in Beauty Contests | Siting Lu et.al. | 2404.08492 | null |
2024-04-12 | Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation | Haozhe Zhao et.al. | 2404.08491 | link |
2024-04-12 | Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian | Stefano De Paoli et.al. | 2404.08488 | null |
2024-04-12 | Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | Hassan Ali et.al. | 2404.08424 | null |
2024-04-12 | Adapting the Segment Anything Model During Usage in Novel Situations | Robin Schön et.al. | 2404.08421 | null |
2024-04-11 | OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2404.07990 | link |
2024-04-11 | Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Yiwen Tang et.al. | 2404.07989 | link |
2024-04-11 | Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning | Simon Schrodi et.al. | 2404.07983 | link |
2024-04-11 | Language Imbalance Can Boost Cross-lingual Generalisation | Anton Schäfer et.al. | 2404.07982 | link |
2024-04-11 | Manipulating Large Language Models to Increase Product Visibility | Aounon Kumar et.al. | 2404.07981 | link |
2024-04-11 | LLoCO: Learning Long Contexts Offline | Sijun Tan et.al. | 2404.07979 | link |
2024-04-11 | Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Haotian Zhang et.al. | 2404.07973 | null |
2024-04-11 | Rho-1: Not All Tokens Are What You Need | Zhenghao Lin et.al. | 2404.07965 | link |
2024-04-11 | On Unified Prompt Tuning for Request Quality Assurance in Public Code Review | Xinyu Chen et.al. | 2404.07942 | null |
2024-04-11 | Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation | Jinkyung Park et.al. | 2404.07926 | null |
2024-04-11 | LaVy: Vietnamese Multimodal Large Language Model | Chi Tran et.al. | 2404.07922 | link |
2024-04-11 | AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs | Zeyi Liao et.al. | 2404.07921 | link |
2024-04-11 | DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering Documentation | Anna C. Doris et.al. | 2404.07917 | link |
2024-04-11 | HGRN2: Gated Linear RNNs with State Expansion | Zhen Qin et.al. | 2404.07904 | link |
2024-04-11 | High-Dimension Human Value Representation in Large Language Models | Samuel Cahyawijaya et.al. | 2404.07900 | link |
2024-04-11 | Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations | Dayeon Ki et.al. | 2404.07851 | link |
2024-04-11 | On Training Data Influence of GPT Models | Qingyi Liu et.al. | 2404.07840 | link |
2024-04-11 | RecurrentGemma: Moving Past Transformers for Efficient Open Language Models | Aleksandar Botev et.al. | 2404.07839 | link |
2024-04-11 | Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution | Handi Deng et.al. | 2404.07833 | null |
2024-04-11 | Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese | Yuichi Inoue et.al. | 2404.07824 | link |
2024-04-10 | BRAVE: Broadening the visual encoding of vision-language models | Oğuzhan Fatih Kar et.al. | 2404.07204 | null |
2024-04-10 | UMBRAE: Unified Multimodal Decoding of Brain Signals | Weihao Xia et.al. | 2404.07202 | link |
2024-04-10 | Scaling Laws for Data Filtering – Data Curation cannot be Compute Agnostic | Sachin Goyal et.al. | 2404.07177 | link |
2024-04-10 | Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention | Tsendsuren Munkhdalai et.al. | 2404.07143 | null |
2024-04-10 | Open reaction-diffusion systems: bridging probabilistic theory across scales | Mauricio J. del Razo et.al. | 2404.07119 | link |
2024-04-10 | Continuous Language Model Interpolation for Dynamic and Controllable Text Generation | Sara Kangaslahti et.al. | 2404.07117 | link |
2024-04-11 | From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications | Yongqiang Ma et.al. | 2404.07108 | null |
2024-04-10 | Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs | Bowen Jin et.al. | 2404.07103 | link |
2024-04-10 | Dynamic Generation of Personalities with Large Language Models | Jianzhi Liu et.al. | 2404.07084 | link |
2024-04-10 | VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning | Alexandros Xenos et.al. | 2404.07078 | link |
2024-04-10 | Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? | Mingyu Jin et.al. | 2404.07066 | link |
2024-04-10 | Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study | Alessandro Stolfo et.al. | 2404.07060 | null |
2024-04-10 | Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation | Elisa Sanchez-Bayona et.al. | 2404.07053 | link |
2024-04-10 | ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling | Ege Özsoy et.al. | 2404.07031 | link |
2024-04-10 | Improving Language Model Reasoning with Self-motivated Learning | Yunlong Feng et.al. | 2404.07017 | null |
2024-04-10 | A Mathematical Theory for Learning Semantic Languages by Abstract Learners | Kuo-Yu Liao et.al. | 2404.07009 | null |
2024-04-10 | WordDecipher: Enhancing Digital Workspace Communication with Explainable AI for Non-native English Speakers | Yuexi Chen et.al. | 2404.07005 | null |
2024-04-10 | LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models | Igor Tufanov et.al. | 2404.07004 | null |
2024-04-10 | Event Grounded Criminal Court View Generation withCooperative (Large) Language Models | Linan Yue et.al. | 2404.07001 | link |
2024-04-10 | Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study | Hongru Du et.al. | 2404.06962 | link |
2024-04-09 | InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD | Xiaoyi Dong et.al. | 2404.06512 | link |
2024-04-09 | Can Feedback Enhance Semantic Grounding in Large Vision-Language Models? | Yuan-Hong Liao et.al. | 2404.06510 | null |
2024-04-09 | On the Effect of (Near) Duplicate Subwords in Language Modelling | Anton Schäfer et.al. | 2404.06508 | link |
2024-04-09 | Pitfalls of Conversational LLMs on News Debiasing | Ipek Baris Schlicht et.al. | 2404.06488 | null |
2024-04-10 | Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks | Chonghua Wang et.al. | 2404.06480 | link |
2024-04-10 | Text-Based Reasoning About Vector Graphics | Zhenhailong Wang et.al. | 2404.06479 | null |
2024-04-09 | Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models | Zihan Fang et.al. | 2404.06448 | null |
2024-04-09 | Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems | Kunal Garg et.al. | 2404.06413 | null |
2024-04-09 | AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents | Luca Gioacchini et.al. | 2404.06411 | link |
2024-04-09 | Take a Look at it! Rethinking How to Evaluate Language Model Jailbreak | Hongyu Cai et.al. | 2404.06407 | link |
2024-04-09 | Apprentices to Research Assistants: Advancing Research with Large Language Models | M. Namvarpour et.al. | 2404.06404 | null |
2024-04-09 | MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies | Shengding Hu et.al. | 2404.06395 | link |
2024-04-10 | MuPT: A Generative Symbolic Music Pretrained Transformer | Xingwei Qu et.al. | 2404.06393 | null |
2024-04-09 | Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis | Mikel Zubillaga et.al. | 2404.06392 | null |
2024-04-09 | Latent Distance Guided Alignment Training for Large Language Models | Haotian Luo et.al. | 2404.06390 | null |
2024-04-09 | Model Generation from Requirements with LLMs: an Exploratory Study | Alessio Ferrari et.al. | 2404.06371 | null |
2024-04-09 | Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Valdecy Pereira et.al. | 2404.06370 | link |
2024-04-09 | VISION2UI: A Real-World Dataset with Layout for Code Generation from UI Designs | Yi Gui et.al. | 2404.06369 | null |
2024-04-09 | ClinLinker: Medical Entity Linking of Clinical Concept Mentions in Spanish | Fernando Gallego et.al. | 2404.06367 | null |
2024-04-09 | Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Sidra Aleem et.al. | 2404.06362 | link |
2024-04-08 | MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Bo He et.al. | 2404.05726 | link |
2024-04-08 | Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | Keen You et.al. | 2404.05719 | null |
2024-04-08 | Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding | Ahmad Idrissi-Yaghir et.al. | 2404.05694 | null |
2024-04-08 | Evaluating Mathematical Reasoning Beyond Accuracy | Shijie Xia et.al. | 2404.05692 | link |
2024-04-08 | Retrieval-Augmented Open-Vocabulary Object Detection | Jooyeon Kim et.al. | 2404.05687 | link |
2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | link |
2024-04-08 | CoReS: Orchestrating the Dance of Reasoning and Segmentation | Xiaoyi Bao et.al. | 2404.05673 | link |
2024-04-09 | Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data | Haitham Hammami et.al. | 2404.05632 | link |
2024-04-08 | LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking | Faren Yan et.al. | 2404.05624 | null |
2024-04-08 | MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning | Matteo Farina et.al. | 2404.05621 | link |
2024-04-08 | SpeechAlign: Aligning Speech Generation to Human Preferences | Dong Zhang et.al. | 2404.05600 | link |
2024-04-08 | MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering | Iñigo Alonso et.al. | 2404.05590 | null |
2024-04-08 | Enhancing Software Related Information Extraction with Generative Language Models through Single-Choice Question Answering | Wolfgang Otto et.al. | 2404.05587 | null |
2024-04-08 | Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model | Yue-Hua Han et.al. | 2404.05583 | null |
2024-04-08 | 360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System | Shen Gao et.al. | 2404.05569 | link |
2024-04-08 | Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models | Bowen Pan et.al. | 2404.05567 | null |
2024-04-08 | Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training | Longhui Zhang et.al. | 2404.05560 | link |
2024-04-08 | Evaluating Interventional Reasoning Capabilities of Large Language Models | Tejas Kasetty et.al. | 2404.05545 | null |
2024-04-08 | OPSD: an Offensive Persian Social media Dataset and its baseline evaluations | Mehran Safayani et.al. | 2404.05540 | null |
2024-04-08 | Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data | Tim Baumgärtner et.al. | 2404.05530 | null |
2024-04-05 | Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) | Michael Saxon et.al. | 2404.04251 | link |
2024-04-05 | Physical Property Understanding from Language-Embedded Feature Fields | Albert J. Zhai et.al. | 2404.04242 | null |
2024-04-05 | Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents | Harsh Kohli et.al. | 2404.04237 | null |
2024-04-05 | player2vec: A Language Modeling Approach to Understand Player Behavior in Games | Tianze Wang et.al. | 2404.04234 | null |
2024-04-05 | Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation | Ji-Jia Wu et.al. | 2404.04231 | link |
2024-04-05 | Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation | Tong Su et.al. | 2404.04212 | null |
2024-04-05 | Social Skill Training with Large Language Models | Diyi Yang et.al. | 2404.04204 | null |
2024-04-05 | Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? | Ilya Ilyankou et.al. | 2404.04169 | null |
2024-04-05 | Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model | Xinrun Du et.al. | 2404.04167 | null |
2024-04-05 | Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval | João Coelho et.al. | 2404.04163 | link |
2024-04-05 | BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Jacek Wiland et.al. | 2404.04113 | link |
2024-04-05 | Large language models as oracles for instantiating ontologies with domain-specific knowledge | Giovanni Ciatto et.al. | 2404.04108 | link |
2024-04-05 | Robust Preference Optimization with Provable Noise Tolerance for LLMs | Xize Liang et.al. | 2404.04102 | null |
2024-04-05 | Label Propagation for Zero-shot Classification with Vision-Language Models | Vladan Stojnić et.al. | 2404.04072 | link |
2024-04-05 | Assessing the quality of information extraction | Filip Seitl et.al. | 2404.04068 | null |
2024-04-05 | CLUE: A Clinical Language Understanding Evaluation for LLMs | Amin Dada et.al. | 2404.04067 | link |
2024-04-05 | VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots | Akhil Padmanabha et.al. | 2404.04066 | null |
2024-04-05 | A Comparison of Methods for Evaluating Generative IR | Negar Arabzadeh et.al. | 2404.04044 | link |
2024-04-05 | Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer | Hele-Andra Kuulmets et.al. | 2404.04042 | link |
2024-04-05 | Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds | Annerose Eichel et.al. | 2404.04031 | link |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-04 | AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent | Hanyu Lai et.al. | 2404.03648 | link |
2024-04-04 | Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra | Darioush Kevian et.al. | 2404.03647 | null |
2024-04-04 | Locating and Editing Factual Associations in Mamba | Arnab Sen Sharma et.al. | 2404.03646 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Standardizing Knowledge Engineering Practices with a Reference Architecture | Bradley P. Allen et.al. | 2404.03624 | null |
2024-04-04 | Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph | Marco Bronzini et.al. | 2404.03623 | link |
2024-04-04 | Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models | Wenshan Wu et.al. | 2404.03622 | null |
2024-04-04 | DeViDe: Faceted medical knowledge for improved medical vision-language pre-training | Haozhe Luo et.al. | 2404.03618 | null |
2024-04-04 | Sailor: Open Language Models for South-East Asia | Longxu Dou et.al. | 2404.03608 | link |
2024-04-04 | Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Aniruddha Nrusimha et.al. | 2404.03605 | link |
2024-04-04 | Evaluating LLMs at Detecting Errors in LLM Responses | Ryo Kamoi et.al. | 2404.03602 | link |
2024-04-04 | Intent Detection and Entity Extraction from BioMedical Literature | Ankan Mullick et.al. | 2404.03598 | link |
2024-04-04 | ReFT: Representation Finetuning for Language Models | Zhengxuan Wu et.al. | 2404.03592 | link |
2024-04-04 | SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Kailin Li et.al. | 2404.03590 | null |
2024-04-04 | Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models | Yantao Liu et.al. | 2404.03577 | link |
2024-04-04 | Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity | Jake Varley et.al. | 2404.03570 | null |
2024-04-04 | Personalized LLM Response Generation with Parameterized Memory Injection | Kai Zhang et.al. | 2404.03565 | link |
2024-04-04 | Select and Summarize: Scene Saliency for Movie Script Summarization | Rohit Saxena et.al. | 2404.03561 | link |
2024-04-04 | How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes | Harmon Bhasin et.al. | 2404.03558 | link |
2024-04-03 | ALOHa: A New Measure for Hallucination in Captioning Models | Suzanne Petryk et.al. | 2404.02904 | null |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899 | null |
2024-04-03 | ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline | Yifan Xu et.al. | 2404.02893 | link |
2024-04-03 | MODNO: Multi Operator Learning With Distributed Neural Operators | Zecheng Zhang et.al. | 2404.02892 | null |
2024-04-03 | Linear Attention Sequence Parallelism | Weigao Sun et.al. | 2404.02882 | link |
2024-04-03 | Integrating Explanations in Learning LTL Specifications from Demonstrations | Ashutosh Gupta et.al. | 2404.02872 | null |
2024-04-03 | Toward Inference-optimal Mixture-of-Expert Large Language Models | Longfei Yun et.al. | 2404.02852 | null |
2024-04-03 | I-Design: Personalized LLM Interior Designer | Ata Çelen et.al. | 2404.02838 | null |
2024-04-03 | Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models | Wanyun Cui et.al. | 2404.02837 | null |
2024-04-03 | Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison | Maxime Bouthors et.al. | 2404.02835 | null |
2024-04-03 | Empowering Biomedical Discovery with AI Agents | Shanghua Gao et.al. | 2404.02831 | null |
2024-04-03 | BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models | Qijun Luo et.al. | 2404.02827 | link |
2024-04-03 | Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models | Haoran Sun et.al. | 2404.02823 | link |
2024-04-03 | A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches | Zhigen Zhao et.al. | 2404.02817 | null |
2024-04-03 | The RealHumanEval: Evaluating Large Language Models’ Abilities to Support Programmers | Hussein Mozannar et.al. | 2404.02806 | link |
2024-04-03 | Efficient Multi-Vector Dense Retrieval Using Bit Vectors | Franco Maria Nardini et.al. | 2404.02805 | link |
2024-04-03 | AI and personalized learning: bridging the gap with modern educational goals | Kristjan-Julius Laak et.al. | 2404.02798 | null |
2024-04-03 | CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Jaehyeon Kim et.al. | 2404.02781 | null |
2024-04-03 | FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Ziyang Wang et.al. | 2404.02772 | link |
2024-04-03 | DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement | Hao Wu et.al. | 2404.02755 | null |
2024-04-02 | Segment Any 3D Object with Language | Seungjun Lee et.al. | 2404.02157 | null |
2024-04-02 | Iterated Learning Improves Compositionality in Large Vision-Language Models | Chenhao Zheng et.al. | 2404.02145 | null |
2024-04-02 | Topic-based Watermarks for LLM-Generated Text | Alexander Nemecek et.al. | 2404.02138 | null |
2024-04-02 | ViTamin: Designing Scalable Vision Models in the Vision-Language Era | Jienneg Chen et.al. | 2404.02132 | link |
2024-04-02 | FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning | Joel Niklaus et.al. | 2404.02127 | link |
2024-04-02 | Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Wanyong Feng et.al. | 2404.02124 | link |
2024-04-02 | GINopic: Topic Modeling with Graph Isomorphism Network | Suman Adhya et.al. | 2404.02115 | link |
2024-04-02 | CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Sara Rosenthal et.al. | 2404.02103 | link |
2024-04-02 | Advancing LLM Reasoning Generalists with Preference Trees | Lifan Yuan et.al. | 2404.02078 | link |
2024-04-02 | Red-Teaming Segment Anything Model | Krzysztof Jankowski et.al. | 2404.02067 | link |
2024-04-02 | Digital Forgetting in Large Language Models: A Survey of Unlearning Methods | Alberto Blanco-Justicia et.al. | 2404.02062 | null |
2024-04-02 | Long-context LLMs Struggle with Long In-context Learning | Tianle Li et.al. | 2404.02060 | link |
2024-04-02 | IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT | Junchen Fu et.al. | 2404.02059 | link |
2024-04-02 | Deconstructing In-Context Learning: Understanding Prompts via Corruption | Namrata Shivagunde et.al. | 2404.02054 | link |
2024-04-02 | A Survey on Large Language Model-Based Game Agents | Sihao Hu et.al. | 2404.02039 | link |
2024-04-02 | MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages | Daryna Dementieva et.al. | 2404.02037 | null |
2024-04-02 | Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts | Zhuo Chen et.al. | 2404.02022 | link |
2024-04-02 | Large Language Models for Orchestrating Bimanual Robots | Kun Chu et.al. | 2404.02018 | link |
2024-04-02 | MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving | Jiangfei Duan et.al. | 2404.02015 | link |
2024-04-02 | Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models | Stephan Linzbach et.al. | 2404.01992 | null |
2024-03-29 | Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models | Atsuyuki Miyai et.al. | 2403.20331 | link |
2024-03-29 | Are We on the Right Way for Evaluating Large Vision-Language Models? | Lin Chen et.al. | 2403.20330 | link |
2024-03-29 | ReALM: Reference Resolution As Language Modeling | Joel Ruben Antony Moniz et.al. | 2403.20329 | null |
2024-03-29 | Gecko: Versatile Text Embeddings Distilled from Large Language Models | Jinhyuk Lee et.al. | 2403.20327 | null |
2024-03-29 | Convolutional Prompting meets Language Models for Continual Learning | Anurag Roy et.al. | 2403.20317 | null |
2024-03-29 | Learn “No” to Say “Yes” Better: Improving Vision-Language Models via Negations | Jaisidh Singh et.al. | 2403.20312 | link |
2024-03-29 | Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference | Jovan Stojkovic et.al. | 2403.20306 | null |
2024-03-29 | Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain | Burcu Sayin et.al. | 2403.20288 | link |
2024-03-29 | LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang et.al. | 2403.20279 | link |
2024-04-01 | Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want | Weifeng Lin et.al. | 2403.20271 | link |
2024-03-29 | Latxa: An Open Language Model and Evaluation Suite for Basque | Julen Etxaniz et.al. | 2403.20266 | link |
2024-03-29 | ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models | Thibaut Thonet et.al. | 2403.20262 | link |
2024-03-29 | MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Taha Koleilat et.al. | 2403.20253 | link |
2024-03-29 | Using LLMs to Model the Beliefs and Preferences of Targeted Populations | Keiichi Namikoshi et.al. | 2403.20252 | null |
2024-03-29 | Long-Tailed Anomaly Detection with Learnable Class Names | Chih-Hui Ho et.al. | 2403.20236 | null |
2024-03-29 | H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model | Chao Pang et.al. | 2403.20213 | link |
2024-03-29 | Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science | Yazheng Yang et.al. | 2403.20208 | null |
2024-03-29 | The Future of Combating Rumors? Retrieval, Discrimination, and Generation | Junhao Xu et.al. | 2403.20204 | null |
2024-03-29 | ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models | Shuo Liu et.al. | 2403.20194 | null |
2024-03-29 | HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM | Shuangjian Li et.al. | 2403.20183 | null |
2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Kai Zhang et.al. | 2403.19651 | link |
2024-03-28 | Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Samuel Marks et.al. | 2403.19647 | link |
2024-03-28 | Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning | Chenyang Liu et.al. | 2403.19646 | link |
2024-03-28 | Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models | Yucheng Shi et.al. | 2403.19631 | link |
2024-03-28 | RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents | Zeren Chen et.al. | 2403.19622 | null |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation | Zhongliang Zhou et.al. | 2403.19584 | link |
2024-03-28 | Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics | Norman Di Palo et.al. | 2403.19578 | null |
2024-03-28 | WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models | Piotr Molenda et.al. | 2403.19548 | null |
2024-03-28 | Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models | Ang Lv et.al. | 2403.19521 | link |
2024-03-28 | Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data | Shan Chen et.al. | 2403.19511 | link |
2024-03-28 | LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae | Celia Chen et.al. | 2403.19506 | null |
2024-03-28 | Evolving Assembly Code in an Adversarial Environment | Irina Maliukov et.al. | 2403.19489 | link |
2024-03-28 | JDocQA: Japanese Document Question Answering Dataset for Generative Language Models | Eri Onami et.al. | 2403.19454 | link |
2024-03-28 | Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model | Qi Gou et.al. | 2403.19443 | null |
2024-03-28 | OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion | Xinyu Zhan et.al. | 2403.19417 | null |
2024-03-28 | BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation | Yuhong He et.al. | 2403.19414 | null |
2024-03-28 | Checkpoint Merging via Bayesian Optimization in LLM Pretraining | Deyuan Liu et.al. | 2403.19390 | null |
2024-03-27 | Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Yanwei Li et.al. | 2403.18814 | link |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807 | link |
2024-03-27 | Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation | Mateusz Klimaszewski et.al. | 2403.18804 | link |
2024-03-27 | Projective Methods for Mitigating Gender Bias in Pre-trained Language Models | Hillary Dawkins et.al. | 2403.18803 | link |
2024-03-27 | Long-form factuality in large language models | Jerry Wei et.al. | 2403.18802 | link |
2024-03-27 | Towards a World-English Language Model for On-Device Virtual Assistants | Rricha Jalota et.al. | 2403.18783 | null |
2024-03-27 | 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation | Ehsan Latif et.al. | 2403.18778 | null |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775 | link |
2024-03-27 | CheckEval: Robust Evaluation Framework using Large Language Model via Checklist | Yukyung Lee et.al. | 2403.18771 | null |
2024-03-27 | MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model | Yike Wu et.al. | 2403.18760 | link |
2024-03-27 | CYCLE: Learning to Self-Refine the Code Generation | Yangruibo Ding et.al. | 2403.18746 | link |
2024-03-27 | Understanding the Learning Dynamics of Alignment with Human Feedback | Shawn Im et.al. | 2403.18742 | link |
2024-03-27 | PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations | Ehsan Latif et.al. | 2403.18721 | null |
2024-03-27 | Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Xintong Wang et.al. | 2403.18715 | link |
2024-03-27 | The Invalsi Benchmark: measuring Language Models Mathematical and Language understanding in Italian | Andrea Esuli et.al. | 2403.18697 | null |
2024-03-27 | NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method | Jakub Hoscilowicz et.al. | 2403.18680 | link |
2024-03-27 | An Exploratory Study on Upper-Level Computing Students’ Use of Large Language Models as Tools in a Semester-Long Project | Ben Arie Tanay et.al. | 2403.18679 | null |
2024-03-27 | SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens | Chengbo Liu et.al. | 2403.18647 | link |
2024-03-27 | To Recommend or Not: Recommendability Identification in Conversations with Pre-trained Language Models | Zhefan Wang et.al. | 2403.18628 | link |
2024-03-27 | Vulnerability Detection with Code Language Models: How Far Are We? | Yangruibo Ding et.al. | 2403.18624 | link |
2024-03-26 | OmniVid: A Generative Framework for Universal Video Understanding | Junke Wang et.al. | 2403.17935 | link |
2024-03-26 | Track Everything Everywhere Fast and Robustly | Yunzhou Song et.al. | 2403.17931 | null |
2024-03-26 | MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution | Wei Tao et.al. | 2403.17927 | null |
2024-03-26 | LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Rui Pan et.al. | 2403.17919 | link |
2024-03-26 | Large scale paired antibody language models | Henry Kenlay et.al. | 2403.17889 | null |
2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation | Andreea Iana et.al. | 2403.17876 | link |
2024-03-26 | Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach | Andrea Ferrario et.al. | 2403.17873 | null |
2024-03-26 | Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications | Philip Lippmann et.al. | 2403.17860 | null |
2024-03-26 | ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages | Bhawna Piryani et.al. | 2403.17859 | link |
2024-03-26 | Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs | David R. Mortensen et.al. | 2403.17856 | null |
2024-03-26 | ArabicaQA: A Comprehensive Dataset for Arabic Question Answering | Abdelrahman Abdallah et.al. | 2403.17848 | link |
2024-03-26 | Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation | Abdelrhman Werby et.al. | 2403.17846 | null |
2024-03-26 | Mechanistic Design and Scaling of Hybrid Architectures | Michael Poli et.al. | 2403.17844 | link |
2024-03-26 | ReMamber: Referring Image Segmentation with Mamba Twister | Yuhuan Yang et.al. | 2403.17839 | link |
2024-03-26 | A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities | Ibrahim Ethem Hamamci et.al. | 2403.17834 | link |
2024-03-26 | Assessment of Multimodal Large Language Models in Alignment with Human Values | Zhelun Shi et.al. | 2403.17830 | null |
2024-03-26 | Accelerating Radio Spectrum Regulation Workflows with Large Language Models (LLMs) | Amir Ghasemi et.al. | 2403.17819 | null |
2024-03-26 | Graph Language Model (GLM): A new graph-based approach to detect social instabilities | Wallyson Lemes de Oliveira et.al. | 2403.17816 | null |
2024-03-26 | Are Compressed Language Models Less Subgroup Robust? | Leonidas Gee et.al. | 2403.17811 | link |
2024-03-25 | Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making | Shuai Ma et.al. | 2403.16812 | null |
2024-03-25 | An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems | Hanqing Yang et.al. | 2403.16809 | link |
2024-03-25 | Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback | Zhangqian Bi et.al. | 2403.16792 | link |
2024-03-25 | All Artificial, Less Intelligence: GenAI through the Lens of Formal Verification | Deepak Narayan Gadde et.al. | 2403.16750 | null |
2024-03-25 | A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models | Nils Ingelhag et.al. | 2403.16730 | null |
2024-03-25 | ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search | Zehan Li et.al. | 2403.16702 | link |
2024-03-25 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography | Jiayue Zhang et.al. | 2403.16687 | null |
2024-03-26 | RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict | Yirong Zeng et.al. | 2403.16662 | link |
2024-03-25 | Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT | Rohit Raju et.al. | 2403.16655 | null |
2024-03-26 | CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment | Feiteng Fang et.al. | 2403.16649 | link |
2024-03-25 | Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations | Fan Li et.al. | 2403.16645 | null |
2024-03-25 | Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts | Rabindra Lamsal et.al. | 2403.16614 | null |
2024-03-25 | Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units | Biswesh Mohapatra et.al. | 2403.16609 | null |
2024-03-25 | TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques | Ashok Urlana et.al. | 2403.16592 | null |
2024-03-25 | Can Large Language Models (or Humans) Distill Text? | Nicolas Audinet de Pieuchon et.al. | 2403.16584 | link |
2024-03-25 | NSINA: A News Corpus for Sinhala | Hansi Hettiarachchi et.al. | 2403.16571 | link |
2024-03-25 | Elysium: Exploring Object-level Perception in Videos via MLLM | Han Wang et.al. | 2403.16558 | link |
2024-03-25 | DOrA: 3D Visual Grounding with Order-Aware Referring | Tung-Yu Wu et.al. | 2403.16539 | null |
2024-03-25 | Open-Set Recognition in the Age of Vision-Language Models | Dimity Miller et.al. | 2403.16528 | link |
2024-03-25 | Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art | Neeloy Chakraborty et.al. | 2403.16527 | null |
2024-03-25 | Harnessing the power of LLMs for normative reasoning in MASs | Bastin Tony Roy Savarimuthu et.al. | 2403.16524 | null |
2024-03-25 | Norm Violation Detection in Multi-Agent Systems using Large Language Models: A Pilot Study | Shawn He et.al. | 2403.16517 | null |
2024-03-25 | Linguistically Differentiating Acts and Recalls of Racial Microaggressions on Social Media | Uma Sushmitha Gunturi et.al. | 2403.16514 | null |
2024-03-25 | LLMs Are Few-Shot In-Context Low-Resource Language Learners | Samuel Cahyawijaya et.al. | 2403.16512 | link |
2024-03-22 | LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Yuzhang Shang et.al. | 2403.15388 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | link |
2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377 | link |
2024-03-22 | Can large language models explore in-context? | Akshay Krishnamurthy et.al. | 2403.15371 | null |
2024-03-22 | CoLLEGe: Concept Embedding Generation for Large Language Models | Ryan Teehan et.al. | 2403.15362 | null |
2024-03-22 | Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities | Zhitong Xiong et.al. | 2403.15356 | link |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | null |
2024-03-22 | Sphere Neural-Networks for Rational Reasoning | Tiansi Dong et.al. | 2403.15297 | null |
2024-03-22 | Measuring Gender and Racial Biases in Large Language Models | Jiafu An et.al. | 2403.15281 | null |
2024-03-22 | Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review | Jinge Wang et.al. | 2403.15274 | null |
2024-03-22 | Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs | Xiaobin Zhang et.al. | 2403.15273 | null |
2024-03-22 | Imagination Augmented Generation: Learning to Imagine Richer Context for Question Answering over Large Language Models | Huanxuan Liao et.al. | 2403.15268 | link |
2024-03-22 | AI Exposure and Strategic Positioning on an Online Work Platform | Shun Yiu et.al. | 2403.15262 | null |
2024-03-22 | FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions | Orion Weller et.al. | 2403.15246 | link |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234 | link |
2024-03-22 | An Exploratory Investigation into Code License Infringements in Large Language Model Training Datasets | Jonathan Katzy et.al. | 2403.15230 | link |
2024-03-22 | Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models | Qiong Wu et.al. | 2403.15226 | link |
2024-03-22 | Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations | Pranav Kulkarni et.al. | 2403.15218 | link |
2024-03-22 | InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection | Thales Bertaglia et.al. | 2403.15214 | link |
2024-03-22 | MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection | Taeheon Kim et.al. | 2403.15209 | null |
2024-03-21 | MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Renrui Zhang et.al. | 2403.14624 | null |
2024-03-21 | Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey | Zeyu Han et.al. | 2403.14608 | null |
2024-03-21 | MyVLM: Personalizing VLMs for User-Specific Queries | Yuval Alaluf et.al. | 2403.14599 | null |
2024-03-21 | ReAct Meets ActRe: Autonomous Annotations of Agent Trajectories for Contrastive Self-Training | Zonghan Yang et.al. | 2403.14589 | null |
2024-03-21 | Large Language Models for Multi-Choice Question Classification of Medical Subjects | Víctor Ponce-López et.al. | 2403.14582 | null |
2024-03-21 | RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain | William James Bolton et.al. | 2403.14578 | link |
2024-03-21 | A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students’ Formative Assessment Responses in Science | Clayton Cohn et.al. | 2403.14565 | null |
2024-03-21 | The Era of Semantic Decoding | Maxime Peyrard et.al. | 2403.14562 | null |
2024-03-21 | Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Chengxu Zhuang et.al. | 2403.14551 | null |
2024-03-21 | EDT: Improving Large Language Models’ Generation by Entropy-based Dynamic Temperature Sampling | Shimao Zhang et.al. | 2403.14541 | link |
2024-03-21 | Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Han Zhao et.al. | 2403.14520 | link |
2024-03-21 | The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs) | Joschka Haltaufderheide et.al. | 2403.14473 | null |
2024-03-21 | Detoxifying Large Language Models via Knowledge Editing | Mengru Wang et.al. | 2403.14472 | link |
2024-03-21 | ChatGPT Alternative Solutions: Large Language Models Survey | Hanieh Alipour et.al. | 2403.14469 | null |
2024-03-21 | Recourse for reclamation: Chatting with generative language models | Jennifer Chien et.al. | 2403.14467 | null |
2024-03-21 | Towards Single-System Illusion in Software-Defined Vehicles – Automated, AI-Powered Workflow | Krzysztof Lebioda et.al. | 2403.14460 | null |
2024-03-21 | Multi-Level Explanations for Generative Language Models | Lucas Monteiro Paes et.al. | 2403.14459 | null |
2024-03-21 | gTBLS: Generating Tables from Text by Conditional Question Answering | Anirudh Sundar et.al. | 2403.14457 | null |
2024-03-21 | Language Models Can Reduce Asymmetry in Information Markets | Nasim Rahaman et.al. | 2403.14443 | null |
2024-03-21 | A Multimodal Approach to Device-Directed Speech Detection with Large Language Models | Dominik Wager et.al. | 2403.14438 | null |
2024-03-20 | RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition | Ziyu Liu et.al. | 2403.13805 | link |
2024-03-20 | Learning from Models and Data for Visual Grounding | Ruozhen He et.al. | 2403.13804 | null |
2024-03-20 | Reverse Training to Nurse the Reversal Curse | Olga Golovneva et.al. | 2403.13799 | null |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797 | null |
2024-03-20 | RewardBench: Evaluating Reward Models for Language Modeling | Nathan Lambert et.al. | 2403.13787 | link |
2024-03-20 | Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts | Guangzeng Han et.al. | 2403.13786 | link |
2024-03-20 | Information-Theoretic Distillation for Reference-less Summarization | Jaehun Jung et.al. | 2403.13780 | null |
2024-03-20 | Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation | Hugues Thomas et.al. | 2403.13777 | null |
2024-03-20 | Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models | Nicholas Bai et.al. | 2403.13771 | link |
2024-03-20 | Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model | Diwei Wang et.al. | 2403.13756 | null |
2024-03-20 | Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement | Catherine Arnett et.al. | 2403.13754 | null |
2024-03-20 | EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation | Atnafu Lambebo Tonja et.al. | 2403.13737 | null |
2024-03-20 | Large Language Models meet Network Slicing Management and Orchestration | Abdulhalim Dandoush et.al. | 2403.13721 | null |
2024-03-20 | SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning | Hongjun Wang et.al. | 2403.13684 | null |
2024-03-20 | PARAMANU-AYN: An Efficient Novel Generative and Instruction-tuned Language Model for Indian Legal Case Documents | Mitodru Niyogi et.al. | 2403.13681 | null |
2024-03-21 | RoleInteract: Evaluating the Social Interaction of Role-Playing Agents | Hongzhan Chen et.al. | 2403.13679 | link |
2024-03-20 | Grounding Spatial Relations in Text-Only Language Models | Gorka Azkune et.al. | 2403.13666 | link |
2024-03-21 | Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi et.al. | 2403.13638 | null |
2024-03-20 | VL-Mamba: Exploring State Space Models for Multimodal Learning | Yanyuan Qiao et.al. | 2403.13600 | null |
2024-03-20 | No more optimization rules: LLM-enabled policy-based multi-modal query optimizer (version 1) | Yifan Wang et.al. | 2403.13597 | null |
2024-03-19 | LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Zhuoshi Pan et.al. | 2403.12968 | link |
2024-03-19 | Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models | Zuyan Liu et.al. | 2403.12966 | link |
2024-03-19 | Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models | Ce Zhang et.al. | 2403.12964 | link |
2024-03-19 | Dated Data: Tracing Knowledge Cutoffs in Large Language Models | Jeffrey Cheng et.al. | 2403.12958 | link |
2024-03-19 | Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Elaine Sui et.al. | 2403.12952 | link |
2024-03-19 | Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models | Joana Ribeiro de Faria et.al. | 2403.12936 | null |
2024-03-19 | Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties | Efrain Torres-Lomas et.al. | 2403.12935 | null |
2024-03-19 | Rapid AIdeation: Generating Ideas With the Self and in Collaboration With Large Language Models | Gionnieve Lim et.al. | 2403.12928 | null |
2024-03-19 | Supporting Energy Policy Research with Large Language Models | Grant Buster et.al. | 2403.12924 | null |
2024-03-19 | Contextual AD Narration with Interleaved Multimodal Sequence | Hanlin Wang et.al. | 2403.12922 | link |
2024-03-19 | Semantic Layering in Room Segmentation via LLMs | Taehyeon Kim et.al. | 2403.12920 | null |
2024-03-19 | Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts | Sai Ashish Somayajula et.al. | 2403.12918 | link |
2024-03-19 | Yell At Your Robot: Improving On-the-Fly from Language Corrections | Lucy Xiaoyang Shi et.al. | 2403.12910 | null |
2024-03-19 | Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference | Baolin Li et.al. | 2403.12900 | null |
2024-03-19 | mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding | Anwen Hu et.al. | 2403.12895 | link |
2024-03-20 | MEDBind: Unifying Language and Multimodal Medical Data Embeddings | Yuan Gao et.al. | 2403.12894 | null |
2024-03-19 | HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning | Fucai Ke et.al. | 2403.12884 | link |
2024-03-19 | Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Zehui Chen et.al. | 2403.12881 | link |
2024-03-19 | Epistemology of Language Models: Do Language Models Have Holistic Knowledge? | Minsu Kim et.al. | 2403.12862 | null |
2024-03-19 | RASP: A Drone-based Reconfigurable Actuation and Sensing Platform Towards Ambient Intelligent Systems | Minghui Zhao et.al. | 2403.12853 | null |
2024-03-18 | Modality-Agnostic fMRI Decoding of Vision and Language | Mitja Nikolaus et.al. | 2403.11771 | null |
2024-03-18 | Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | M. Jehanzeb Mirza et.al. | 2403.11755 | link |
2024-03-18 | Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems | Aditya Narayan Sankaran et.al. | 2403.11752 | link |
2024-03-18 | Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovič et.al. | 2403.11747 | link |
2024-03-18 | TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | Lisa Weijler et.al. | 2403.11691 | null |
2024-03-18 | HDLdebugger: Streamlining HDL debugging with Large Language Models | Xufeng Yao et.al. | 2403.11671 | null |
2024-03-18 | Prioritized Semantic Learning for Zero-shot Instance Navigation | Xander Sun et.al. | 2403.11650 | link |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641 | link |
2024-03-18 | Compositional Kronecker Context Optimization for Vision-Language Models | Kun Ding et.al. | 2403.11631 | null |
2024-03-18 | Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model | Haoyun Xu et.al. | 2403.11621 | null |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines | Ekaterina Trofimova et.al. | 2403.11585 | null |
2024-03-18 | Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Wendi Li et.al. | 2403.11558 | link |
2024-03-18 | LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Shu Wang et.al. | 2403.11552 | link |
2024-03-18 | Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters | Jiazuo Yu et.al. | 2403.11549 | link |
2024-03-18 | DEE: Dual-stage Explainable Evaluation Method for Text Generation | Shenyu Zhang et.al. | 2403.11509 | null |
2024-03-18 | Do CLIPs Always Generalize Better than ImageNet Models? | Qizhou Wang et.al. | 2403.11497 | null |
2024-03-18 | VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding | Yue Fan et.al. | 2403.11481 | null |
2024-03-18 | HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models | Huy Nghiem et.al. | 2403.11456 | link |
2024-03-18 | Zero-shot Compound Expression Recognition with Visual Language Model at the 6th ABAW Challenge | Jiahe Wang et.al. | 2403.11450 | null |
2024-03-15 | VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Xiaohan Wang et.al. | 2403.10517 | null |
2024-03-15 | Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization | Ratnadira Widyasari et.al. | 2403.10507 | null |
2024-03-15 | ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment | Xiaofeng Wu et.al. | 2403.10504 | null |
2024-03-15 | Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study | Chenguang Wang et.al. | 2403.10499 | link |
2024-03-15 | Reconfigurable Robot Identification from Motion Data | Yuhang Hu et.al. | 2403.10496 | null |
2024-03-15 | Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst? | Bruno de Melo et.al. | 2403.10482 | null |
2024-03-15 | Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases | Jiarui Li et.al. | 2403.10446 | link |
2024-03-15 | Optimal Block-Level Draft Verification for Accelerating Speculative Decoding | Ziteng Sun et.al. | 2403.10444 | null |
2024-03-15 | Using an LLM to Turn Sign Spottings into Spoken Language Sentences | Ozge Mercanoglu Sincan et.al. | 2403.10434 | null |
2024-03-15 | SocialGenPod: Privacy-Friendly Generative AI Social Web Applications with Decentralised Personal Data Stores | Vidminas Vizgirda et.al. | 2403.10408 | link |
2024-03-15 | A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE | Hervé Déjean et.al. | 2403.10407 | null |
2024-03-15 | Monotonic Representation of Numeric Properties in Language Models | Benjamin Heinzerling et.al. | 2403.10381 | link |
2024-03-15 | EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models | Rocktim Jyoti Das et.al. | 2403.10378 | link |
2024-03-15 | TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale | Pengcheng Jiang et.al. | 2403.10351 | null |
2024-03-15 | Investigating grammatical abstraction in language models using few-shot learning of novel noun gender | Priyanka Sukumaran et.al. | 2403.10338 | null |
2024-03-15 | CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Shang-Hsuan Chiang et.al. | 2403.10326 | link |
2024-03-15 | NetBench: A Large-Scale and Comprehensive Network Traffic Benchmark Dataset for Foundation Models | Chen Qian et.al. | 2403.10319 | link |
2024-03-15 | Uni-SMART: Universal Science Multimodal Analysis and Research Transformer | Hengxing Cai et.al. | 2403.10301 | null |
2024-03-15 | Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models | Tian Meng et.al. | 2403.10287 | null |
2024-03-15 | Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning | Shang-Hsuan Chiang et.al. | 2403.10281 | link |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Piotr Nawrot et.al. | 2403.09636 | null |
2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635 | link |
2024-03-14 | OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning | Lingyi Hong et.al. | 2403.09634 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking | Eric Zelikman et.al. | 2403.09629 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616 | null |
2024-03-14 | MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training | Brandon McKinzie et.al. | 2403.09611 | null |
2024-03-14 | Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey | Xiaoyu Liu et.al. | 2403.09606 | null |
2024-03-14 | Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis | Gregory Coppola et.al. | 2403.09599 | null |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models | Runyu Ma et.al. | 2403.09583 | null |
2024-03-14 | Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation | Yunhao Gou et.al. | 2403.09572 | null |
2024-03-14 | Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models | Laura Fernández-Becerra et.al. | 2403.09567 | null |
2024-03-14 | Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models | Ali Nouri et.al. | 2403.09565 | null |
2024-03-14 | PreCurious: How Innocent Pre-Trained Language Models Turn into Privacy Traps | Ruixuan Liu et.al. | 2403.09562 | null |
2024-03-14 | Less is More: Data Value Estimation for Visual Instruction Tuning | Zikang Liu et.al. | 2403.09559 | null |
2024-03-14 | Logits of API-Protected LLMs Leak Proprietary Information | Matthew Finlayson et.al. | 2403.09539 | null |
2024-03-14 | VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding | Chris Kelly et.al. | 2403.09530 | null |
2024-03-14 | WavCraft: Audio Editing and Generation with Natural Language Prompts | Jinhua Liang et.al. | 2403.09527 | link |
2024-03-13 | Simple and Scalable Strategies to Continually Pre-train Large Language Models | Adam Ibrahim et.al. | 2403.08763 | link |
2024-03-13 | Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework | Jingling Li et.al. | 2403.08743 | null |
2024-03-13 | The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models | Carlo Nicolini et.al. | 2403.08739 | null |
2024-03-13 | ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation | Sayar Ghosh Roy et.al. | 2403.08737 | link |
2024-03-13 | Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Renjie Pi et.al. | 2403.08730 | null |
2024-03-14 | SOTOPIA- $π$ : Interactive Learning of Socially Intelligent Language Agents | Ruiyi Wang et.al. | 2403.08715 | link |
2024-03-13 | Review of Generative AI Methods in Cybersecurity | Yagmur Yigit et.al. | 2403.08701 | null |
2024-03-13 | TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning | Shangding Gu et.al. | 2403.08694 | link |
2024-03-13 | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages | Rik van Noord et.al. | 2403.08693 | null |
2024-03-13 | Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records | Erlend Frayling et.al. | 2403.08664 | null |
2024-03-13 | Self-Supervised Learning for Covariance Estimation | Tzvi Diskin et.al. | 2403.08662 | null |
2024-03-13 | Human Alignment of Large Language Models through Online Preference Optimisation | Daniele Calandriello et.al. | 2403.08635 | null |
2024-03-13 | MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models | Subash Neupane et.al. | 2403.08607 | null |
2024-03-14 | Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation | Daniel Honerkamp et.al. | 2403.08605 | link |
2024-03-13 | DevBench: A Comprehensive Benchmark for Software Development | Bowen Li et.al. | 2403.08604 | link |
2024-03-13 | Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments | Sitao Cheng et.al. | 2403.08593 | null |
2024-03-13 | Non-discrimination Criteria for Generative Language Models | Sara Sterlie et.al. | 2403.08564 | link |
2024-03-13 | AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Yifei Gao et.al. | 2403.08542 | link |
2024-03-13 | Language models scale reliably with over-training and on downstream tasks | Samir Yitzhak Gadre et.al. | 2403.08540 | link |
2024-03-13 | Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Christos Papadimitriou et.al. | 2403.08502 | link |
2024-03-12 | Beyond Text: Frozen Large Language Models in Visual Signal Comprehension | Lei Zhu et.al. | 2403.07874 | link |
2024-03-12 | Rethinking Generative Large Language Model Evaluation for Semantic Comprehension | Fangyun Wei et.al. | 2403.07872 | null |
2024-03-12 | Exploring Safety Generalization Challenges of Large Language Models via Code | Qibing Ren et.al. | 2403.07865 | link |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-12 | MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric | Haokun Lin et.al. | 2403.07839 | null |
2024-03-12 | DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies | William Xie et.al. | 2403.07832 | null |
2024-03-12 | The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing | Jianchen Wang et.al. | 2403.07825 | null |
2024-03-12 | Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Sainbayar Sukhbaatar et.al. | 2403.07816 | null |
2024-03-12 | Chronos: Learning the Language of Time Series | Abdul Fatir Ansari et.al. | 2403.07815 | link |
2024-03-12 | Beyond Memorization: The Challenge of Random Memory Access in Language Models | Tongyao Zhu et.al. | 2403.07805 | link |
2024-03-12 | Fine-tuning Large Language Models with Sequential Instructions | Hanxu Hu et.al. | 2403.07794 | link |
2024-03-12 | Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Carlos Jose Xavier Cruz et.al. | 2403.07769 | link |
2024-03-12 | Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings | Sahand Sharifzadeh et.al. | 2403.07750 | null |
2024-03-12 | FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models | Yan Liu et.al. | 2403.07747 | null |
2024-03-12 | Multi-modal Auto-regressive Modeling via Visual Words | Tianshuo Peng et.al. | 2403.07720 | link |
2024-03-12 | WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Alexandre Drouin et.al. | 2403.07718 | link |
2024-03-12 | StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models | Zhicheng Guo et.al. | 2403.07714 | link |
2024-03-12 | Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards | Wei Shen et.al. | 2403.07708 | null |
2024-03-12 | Large, Small or Both: A Novel Data Augmentation Framework Based on Language Models for Debiasing Opinion Summarization | Yanyue Zhang et.al. | 2403.07693 | null |
2024-03-12 | Reference-free Monolithic Preference Optimization with Odds Ratio | Jiwoo Hong et.al. | 2403.07691 | link |
2024-03-11 | Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena | Leonie Weissweiler et.al. | 2403.06965 | null |
2024-03-11 | Materials science in the era of large language models: a perspective | Ge Lei et.al. | 2403.06949 | null |
2024-03-11 | Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation | Xinyao Li et.al. | 2403.06946 | link |
2024-03-11 | Naming, Describing, and Quantifying Visual Objects in Humans and LLMs | Alberto Testoni et.al. | 2403.06935 | link |
2024-03-11 | ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Yanming Liu et.al. | 2403.06932 | link |
2024-03-11 | MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning | Yichuan Li et.al. | 2403.06914 | link |
2024-03-11 | Application of Quantum Tensor Networks for Protein Classification | Debarshi Kundu et.al. | 2403.06890 | null |
2024-03-11 | Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents | Nishchal Prasad et.al. | 2403.06872 | link |
2024-03-11 | Semantic Residual Prompts for Continual Learning | Martin Menabue et.al. | 2403.06870 | link |
2024-03-11 | Learning with Noisy Foundation Models | Hao Chen et.al. | 2403.06869 | null |
2024-03-11 | A Geospatial Approach to Predicting Desert Locust Breeding Grounds in Africa | Ibrahim Salihu Yusuf et.al. | 2403.06860 | null |
2024-03-11 | Development of a Reliable and Accessible Caregiving Language Model (CaLM) | Bambang Parmanto et.al. | 2403.06857 | null |
2024-03-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback | Yanming Liu et.al. | 2403.06840 | link |
2024-03-11 | ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts | Lyuye Zhang et.al. | 2403.06838 | null |
2024-03-11 | Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? | Egor Zverev et.al. | 2403.06833 | link |
2024-03-11 | The Power of Noise: Toward a Unified Multi-modal Knowledge Graph Representation Framework | Zhuo Chen et.al. | 2403.06832 | link |
2024-03-11 | ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Zhiwei Liu et.al. | 2403.06765 | link |
2024-03-11 | An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models | Liang Chen et.al. | 2403.06764 | link |
2024-03-11 | ALaRM: Align Language Models via Hierarchical Rewards Modeling | Yuhang Lai et.al. | 2403.06754 | link |
2024-03-08 | Bayesian Preference Elicitation with Language Models | Kunal Handa et.al. | 2403.05534 | null |
2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Machel Reid et.al. | 2403.05530 | null |
2024-03-08 | GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM | Hao Kang et.al. | 2403.05527 | link |
2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | Haoyu Lu et.al. | 2403.05525 | link |
2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | Yijiang Li et.al. | 2403.05523 | null |
2024-03-08 | Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT | Aisha Khatun et.al. | 2403.05519 | null |
2024-03-08 | Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | James Chua et.al. | 2403.05518 | link |
2024-03-08 | To Err Is Human, but Llamas Can Learn It Too | Agnes Luhtaru et.al. | 2403.05493 | link |
2024-03-08 | Will GPT-4 Run DOOM? | Adrian de Wynter et.al. | 2403.05468 | null |
2024-03-08 | Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs | Arijit Nag et.al. | 2403.05434 | null |
2024-03-08 | Towards Real-World Stickers Use: A New Dataset for Multi-Tag Sticker Recognition | Bingbing Wang et.al. | 2403.05428 | null |
2024-03-08 | FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Yuxi Liu et.al. | 2403.05408 | link |
2024-03-08 | Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | Xavier Bou et.al. | 2403.05381 | link |
2024-03-08 | VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model | Junsu Kim et.al. | 2403.05346 | null |
2024-03-08 | Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings | Wei Zhou et.al. | 2403.05338 | null |
2024-03-08 | ChatASU: Evoking LLM’s Reflexion to Truly Understand Aspect Sentiment in Dialogues | Yiding Liu et.al. | 2403.05326 | null |
2024-03-08 | RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Zihao Wang et.al. | 2403.05313 | null |
2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Jinyang Li et.al. | 2403.05307 | link |
2024-03-08 | ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications | Sotaro Takeshita et.al. | 2403.05303 | link |
2024-03-08 | Modeling Dynamic (De)Allocations of Local Memory for Translation Validation | Abhishek Rose et.al. | 2403.05302 | null |
2024-03-07 | iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries | Adam Coscia et.al. | 2403.04760 | link |
2024-03-07 | KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts | Adam Coscia et.al. | 2403.04758 | link |
2024-03-07 | LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error | Boshi Wang et.al. | 2403.04746 | link |
2024-03-08 | How Far Are We from Intelligent Visual Deductive Reasoning? | Yizhe Zhang et.al. | 2403.04732 | link |
2024-03-07 | Common 7B Language Models Already Possess Strong Math Capabilities | Chen Li et.al. | 2403.04706 | link |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701 | link |
2024-03-07 | Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification | Ekaterina Fadeeva et.al. | 2403.04696 | link |
2024-03-07 | Telecom Language Models: Must They Be Large? | Nicola Piovesan et.al. | 2403.04666 | null |
2024-03-07 | Yi: Open Foundation Models by 01.AI | 01. AI et.al. | 2403.04652 | link |
2024-03-07 | Teaching Large Language Models to Reason with Reinforcement Learning | Alex Havrilla et.al. | 2403.04642 | null |
2024-03-07 | CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Qilang Ye et.al. | 2403.04640 | link |
2024-03-07 | A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds | Xuenan Xu et.al. | 2403.04594 | link |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-07 | Wiki-TabNER:Advancing Table Interpretation Through Named Entity Recognition | Aneta Koleva et.al. | 2403.04577 | link |
2024-03-07 | Reducing self-supervised learning complexity improves weakly-supervised classification performance in computational pathology | Tim Lenz et.al. | 2403.04558 | null |
2024-03-07 | Enhancing Data Quality in Federated Fine-Tuning of Foundation Models | Wanru Zhao et.al. | 2403.04529 | null |
2024-03-07 | Where does In-context Translation Happen in Large Language Models | Suzanna Sia et.al. | 2403.04510 | null |
2024-03-07 | GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability | Zihan Luo et.al. | 2403.04483 | link |
2024-03-08 | Do Large Language Model Understand Multi-Intent Spoken Language ? | Shangjian Yin et.al. | 2403.04481 | link |
2024-03-08 | Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset | Minjin Kim et.al. | 2403.04460 | link |
2024-03-06 | Backtracing: Retrieving the Cause of the Query | Rose E. Wang et.al. | 2403.03956 | link |
2024-03-06 | Bridging Language and Items for Retrieval and Recommendation | Yupeng Hou et.al. | 2403.03952 | link |
2024-03-06 | The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models | Adithya Bhaskar et.al. | 2403.03942 | link |
2024-03-06 | Did Translation Models Get More Robust Without Anyone Even Noticing? | Ben Peters et.al. | 2403.03923 | null |
2024-03-06 | Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing | Asmita et.al. | 2403.03897 | link |
2024-03-06 | IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Indraneil Paul et.al. | 2403.03894 | link |
2024-03-06 | From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models | Luiza Pozzobon et.al. | 2403.03893 | link |
2024-03-06 | FaaF: Facts as a Function for the evaluation of RAG systems | Vasileios Katranidis et.al. | 2403.03888 | link |
2024-03-06 | SaulLM-7B: A pioneering Large Language Model for Law | Pierre Colombo et.al. | 2403.03883 | null |
2024-03-06 | Learning to Decode Collaboratively with Multiple Language Models | Shannon Zejiang Shen et.al. | 2403.03870 | link |
2024-03-06 | On the Origins of Linear Representations in Large Language Models | Yibo Jiang et.al. | 2403.03867 | null |
2024-03-06 | KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions | Fangyuan Xu et.al. | 2403.03866 | null |
2024-03-06 | Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning | Deepanway Ghosal et.al. | 2403.03864 | link |
2024-03-06 | X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification | Hanzi Xu et.al. | 2403.03863 | link |
2024-03-06 | Designing Informative Metrics for Few-Shot Example Selection | Rishabh Adiga et.al. | 2403.03861 | null |
2024-03-06 | Emojinize : Enriching Any Text with Emoji Translations | Lars Henning Klein et.al. | 2403.03857 | null |
2024-03-06 | ShortGPT: Layers in Large Language Models are More Redundant Than You Expect | Xin Men et.al. | 2403.03853 | null |
2024-03-06 | Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ | Carolin Holtermann et.al. | 2403.03814 | link |
2024-03-06 | Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery | Wei Zhang et.al. | 2403.03790 | null |
2024-03-06 | PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion | Zekai Zhang et.al. | 2403.03788 | link |
2024-03-05 | The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning | Nathaniel Li et.al. | 2403.03218 | null |
2024-03-05 | CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments | Savitha Sam Abraham et.al. | 2403.03203 | null |
2024-03-05 | Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement | Rafaela Martelo et.al. | 2403.03188 | link |
2024-03-05 | Reliable, Adaptable, and Attributable Language Models with Retrieval | Akari Asai et.al. | 2403.03187 | null |
2024-03-05 | MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting | Fangchen Liu et.al. | 2403.03174 | null |
2024-03-05 | SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Peng Qi et.al. | 2403.03170 | null |
2024-03-05 | PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset | Arda Uzunoğlu et.al. | 2403.03167 | link |
2024-03-05 | Quantum Many-Body Physics Calculations with Large Language Models | Haining Pan et.al. | 2403.03154 | null |
2024-03-05 | Language Guided Exploration for RL Agents in Text Environments | Hitesh Golchha et.al. | 2403.03141 | null |
2024-03-05 | CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following | Kaiyan Zhang et.al. | 2403.03129 | null |
2024-03-05 | Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution | Flor Miriam Plaza-del-Arco et.al. | 2403.03121 | link |
2024-03-05 | “In Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng et.al. | 2403.03102 | null |
2024-03-05 | KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents | Yuqi Zhu et.al. | 2403.03101 | link |
2024-03-05 | Learning to Use Tools via Cooperative and Interactive Agents | Zhengliang Shi et.al. | 2403.03031 | link |
2024-03-05 | Socratic Reasoning Improves Positive Text Rewriting | Anmol Goel et.al. | 2403.03029 | null |
2024-03-05 | Word Importance Explains How Prompts Affect Language Model Outputs | Stefan Hackmann et.al. | 2403.03028 | null |
2024-03-05 | OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following | Haochen Shi et.al. | 2403.03017 | null |
2024-03-05 | Knowledge Graphs as Context Sources for LLM-Based Explanations of Learning Recommendations | Hasan Abu-Rasheed et.al. | 2403.03008 | null |
2024-03-05 | Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models | Gen Luo et.al. | 2403.03003 | link |
2024-03-05 | Localized Zeroth-Order Prompt Optimization | Wenyang Hu et.al. | 2403.02993 | null |
2024-03-02 | LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems | Tasnim Ahmed et.al. | 2403.01342 | null |
2024-03-02 | Making Hybrid Languages: A Recipe | Leif Andersen et.al. | 2403.01335 | null |
2024-03-02 | Chaining thoughts and LLMs to learn DNA structural biophysics | Tyler D. Ross et.al. | 2403.01332 | link |
2024-03-02 | VBART: The Turkish LLM | Meliksah Turker et.al. | 2403.01308 | null |
2024-03-02 | ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation | Moran Yanuka et.al. | 2403.01306 | link |
2024-03-02 | Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Alexander Scarlatos et.al. | 2403.01304 | link |
2024-03-02 | NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Tianyi Zhang et.al. | 2403.01273 | link |
2024-03-02 | Employing LLMs for Incident Response Planning and Review | Sam Hays et.al. | 2403.01271 | null |
2024-03-02 | Dissecting Language Models: Machine Unlearning via Selective Pruning | Nicholas Pochinkov et.al. | 2403.01267 | link |
2024-03-02 | Accelerating Greedy Coordinate Gradient via Probe Sampling | Yiran Zhao et.al. | 2403.01251 | link |
2024-03-02 | SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Ziniu Hu et.al. | 2403.01248 | null |
2024-03-02 | Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal | Jianheng Huang et.al. | 2403.01244 | link |
2024-03-02 | IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Ruikang Liu et.al. | 2403.01241 | link |
2024-03-02 | Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy | Jamie Hayes et.al. | 2403.01218 | null |
2024-03-02 | API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access | Jiayuan Su et.al. | 2403.01216 | null |
2024-03-02 | Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Shuo Yang et.al. | 2403.01209 | null |
2024-03-02 | The Case for Animal-Friendly AI | Sankalpa Ghose et.al. | 2403.01199 | null |
2024-03-02 | DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Shanghaoran Quan et.al. | 2403.01197 | link |
2024-03-02 | RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots | Philip Feldman. James R. Foulds et.al. | 2403.01193 | null |
2024-03-02 | Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding | Ha-Thanh Nguyen et.al. | 2403.01185 | null |
2024-02-29 | The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? | Alex Gu et.al. | 2402.19475 | null |
2024-02-29 | The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Weiyun Wang et.al. | 2402.19474 | link |
2024-02-29 | Retrieval-Augmented Generation for AI-Generated Content: A Survey | Penghao Zhao et.al. | 2402.19473 | link |
2024-02-29 | Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling | Gabriel Grand et.al. | 2402.19471 | null |
2024-03-01 | TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders et.al. | 2402.19467 | null |
2024-02-29 | Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models | Chen Qian et.al. | 2402.19465 | link |
2024-02-29 | Curiosity-driven Red-teaming for Large Language Models | Zhang-Wei Hong et.al. | 2402.19464 | link |
2024-02-29 | Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap | Saurabh Srivastava et.al. | 2402.19450 | link |
2024-02-29 | Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models | Frederik Kunstner et.al. | 2402.19449 | null |
2024-02-29 | ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Yifei Zhou et.al. | 2402.19446 | link |
2024-02-29 | Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation | Jonathan Yang et.al. | 2402.19432 | null |
2024-02-29 | Compositional API Recommendation for Library-Oriented Code Generation | Zexiong Ma et.al. | 2402.19431 | null |
2024-02-29 | Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | Soham De et.al. | 2402.19427 | null |
2024-02-29 | Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines | Lijia Ma et.al. | 2402.19421 | null |
2024-02-29 | PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Mainak Ghosh et.al. | 2402.19411 | null |
2024-02-29 | On the Scaling Laws of Geographical Representation in Language Models | Nathan Godey et.al. | 2402.19406 | null |
2024-02-29 | Entity-Aware Multimodal Alignment Framework for News Image Captioning | Junzhe Zhang et.al. | 2402.19404 | null |
2024-02-29 | Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Match Human Crowd Accuracy | Philipp Schoenegger et.al. | 2402.19379 | null |
2024-02-29 | OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models | Jenish Maharjan et.al. | 2402.19371 | null |
2024-02-29 | SoK: Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency | Akila Wickramasekara et.al. | 2402.19366 | null |
2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571 | link |
2024-02-28 | Diffusion Language Models Are Versatile Protein Learners | Xinyou Wang et.al. | 2402.18567 | link |
2024-02-28 | A Categorization of Complexity Classes for Information Retrieval and Synthesis Using Natural Logic | Gregory Coppola et.al. | 2402.18566 | null |
2024-02-28 | Approaching Human-Level Forecasting with Language Models | Danny Halawi et.al. | 2402.18563 | null |
2024-02-28 | Implicit Bias of Next-Token Prediction | Christos Thrampoulidis et.al. | 2402.18551 | null |
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
2024-02-28 | Few-Shot Fairness: Unveiling LLM’s Potential for Fairness-Aware Classification | Garima Chhikara et.al. | 2402.18502 | null |
2024-02-28 | Language Models Represent Beliefs of Self and Others | Wentao Zhu et.al. | 2402.18496 | null |
2024-02-28 | IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding | Lanyun Zhu et.al. | 2402.18476 | null |
2024-02-28 | Meta-Task Prompting Elicits Embedding from Large Language Models | Yibin Lei et.al. | 2402.18458 | link |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-28 | Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication | Weize Chen et.al. | 2402.18439 | link |
2024-02-28 | A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models | Xiujie Song et.al. | 2402.18409 | link |
2024-02-28 | Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning | Hanyao Wang et.al. | 2402.18400 | null |
2024-02-28 | Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models | Ercong Nie et.al. | 2402.18397 | null |
2024-02-28 | The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA | Yiming Li et.al. | 2402.18385 | link |
2024-02-28 | Large Language Models As Evolution Strategies | Robert Tjarko Lange et.al. | 2402.18381 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | link |
2024-02-28 | VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models | Seoyeon Kim et.al. | 2402.18374 | link |
2024-02-28 | Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning | Jiachun Li et.al. | 2402.18344 | link |
2024-02-27 | ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Zekun Qi et.al. | 2402.17766 | link |
2024-02-27 | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | Shuming Ma et.al. | 2402.17764 | null |
2024-02-27 | Massive Activations in Large Language Models | Mingjie Sun et.al. | 2402.17762 | link |
2024-02-27 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | Evaluating Very Long-Term Conversational Memory of LLM Agents | Adyasha Maharana et.al. | 2402.17753 | null |
2024-02-27 | Tower: An Open Multilingual Large Language Model for Translation-Related Tasks | Duarte M. Alves et.al. | 2402.17733 | link |
2024-02-27 | AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa et.al. | 2402.17717 | link |
2024-02-27 | Case-Based or Rule-Based: How Do Transformers Do the Math? | Yi Hu et.al. | 2402.17709 | link |
2024-02-27 | RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Jing Huang et.al. | 2402.17700 | link |
2024-02-27 | NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents | Tamara Czinczoll et.al. | 2402.17682 | link |
2024-02-27 | The Emergence of Large Language Models in Static Analysis: A First Look through Micro-Benchmarks | Ashwin Prasad Shivarpatna Venkatesh et.al. | 2402.17679 | null |
2024-02-27 | CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention | Mohammad Sadil Khan et.al. | 2402.17678 | null |
2024-02-27 | Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models | Yunpeng Huang et.al. | 2402.17671 | null |
2024-02-27 | Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs | Tanise Ceron et.al. | 2402.17649 | null |
2024-02-27 | SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation | Shuangrui Ding et.al. | 2402.17645 | link |
2024-02-27 | Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data | Xiao Liu et.al. | 2402.17644 | link |
2024-02-27 | Variational Learning is Effective for Large Deep Networks | Yuesong Shen et.al. | 2402.17641 | link |
2024-02-27 | Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling | David S. W. Williams et.al. | 2402.17622 | null |
2024-02-27 | Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | Wenqi Zhang et.al. | 2402.17574 | link |
2024-02-27 | Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with Gradient-based Model Optimizers | Xinyu Tang et.al. | 2402.17564 | link |
2024-02-26 | Integrating Large Language Models with Graphical Session-Based Recommendation | Naicheng Guo et.al. | 2402.16539 | null |
2024-02-26 | LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments | Junzhe Chen et.al. | 2402.16499 | link |
2024-02-26 | On Languaging a Simulation Engine | Han Liu et.al. | 2402.16482 | null |
2024-02-26 | Unveiling ChatGPT’s Usage in Open Source Projects: A Mining-based Study | Rosalia Tufano et.al. | 2402.16480 | null |
2024-02-26 | mEdIT: Multilingual Text Editing via Instruction Tuning | Vipul Raheja et.al. | 2402.16472 | link |
2024-02-26 | Unveiling Vulnerability of Self-Attention | Khai Jiet Liong et.al. | 2402.16470 | link |
2024-02-26 | Defending LLMs against Jailbreaking Attacks via Backtranslation | Yihan Wang et.al. | 2402.16459 | link |
2024-02-26 | ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing | Liuzhenghao Lv et.al. | 2402.16445 | link |
2024-02-26 | ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors | Zhexin Zhang et.al. | 2402.16444 | link |
2024-02-26 | Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Tianyi Tang et.al. | 2402.16438 | link |
2024-02-26 | RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions | Yuansen Zhang et.al. | 2402.16431 | null |
2024-02-26 | Predicting Sustainable Development Goals Using Course Descriptions – from LLMs to Conventional Foundation Models | Lev Kharlashkin et.al. | 2402.16420 | null |
2024-02-26 | From RAGs to riches: Using large language models to write documents for clinical trials | Nigel Markey et.al. | 2402.16406 | null |
2024-02-26 | MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Shiwen Ni et.al. | 2402.16389 | link |
2024-02-26 | Immunization against harmful fine-tuning attacks | Domenic Rosati et.al. | 2402.16382 | null |
2024-02-26 | Improving LLM-based Machine Translation with Systematic Self-Correction | Zhaopeng Feng et.al. | 2402.16379 | link |
2024-02-26 | Unraveling Babel: Exploring Multilingual Activation Patterns within Large Language Models | Weize Liu et.al. | 2402.16367 | null |
2024-02-26 | LLM Inference Unveiled: Survey and Roofline Model Insights | Zhihang Yuan et.al. | 2402.16363 | link |
2024-02-26 | Layer-wise Regularized Dropout for Neural Language Models | Shiwen Ni et.al. | 2402.16361 | null |
2024-02-26 | An Integrated Data Processing Framework for Pretraining Foundation Models | Yiding Sun et.al. | 2402.16358 | link |
2024-02-23 | AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning | Jianguo Zhang et.al. | 2402.15506 | link |
2024-02-23 | API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Kinjal Basu et.al. | 2402.15491 | link |
2024-02-23 | Prejudice and Caprice: A Statistical Framework for Measuring Social Discrimination in Large Language Models | Yiran Liu et.al. | 2402.15481 | null |
2024-02-23 | Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization | Swaroop Nath et.al. | 2402.15473 | link |
2024-02-23 | Repetition Improves Language Model Embeddings | Jacob Mitchell Springer et.al. | 2402.15449 | link |
2024-02-23 | A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models | Stefan Hegselmann et.al. | 2402.15422 | link |
2024-02-23 | PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Simon Holk et.al. | 2402.15420 | null |
2024-02-23 | Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy? | Nader Asadi et.al. | 2402.15414 | null |
2024-02-23 | Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Kechun Xu et.al. | 2402.15402 | link |
2024-02-23 | Explorations of Self-Repair in Language Models | Cody Rushing et.al. | 2402.15390 | link |
2024-02-23 | Safe Task Planning for Language-Instructed Multi-Robot Systems using Conformal Prediction | Jun Wang et.al. | 2402.15368 | null |
2024-02-23 | Farsight: Fostering Responsible AI Awareness During AI Application Prototyping | Zijie J. Wang et.al. | 2402.15350 | link |
2024-02-23 | NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov et.al. | 2402.15343 | link |
2024-02-23 | Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies | Nitesh Kumar et.al. | 2402.15337 | link |
2024-02-23 | GPTVQ: The Blessing of Dimensionality for LLM Quantization | Mart van Baalen et.al. | 2402.15319 | null |
2024-02-23 | ArabianGPT: Native Arabic GPT-based Large Language | Anis Koubaa et.al. | 2402.15313 | null |
2024-02-23 | Counterfactual Generation with Identifiability Guarantees | Hanqi Yan et.al. | 2402.15309 | link |
2024-02-23 | Representing Online Handwriting for Recognition in Large Vision-Language Models | Anastasiia Fadeeva et.al. | 2402.15307 | null |
2024-02-23 | How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries | Somnath Banerjee et.al. | 2402.15302 | link |
2024-02-23 | Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models | Yuzhe Zhang et.al. | 2402.15301 | null |
2024-02-22 | PALO: A Polyglot Large Multimodal Model for 5B People | Muhammad Maaz et.al. | 2402.14818 | link |
2024-02-22 | Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging | Yuzhe Yang et.al. | 2402.14815 | link |
2024-02-22 | WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Lianghui Zhu et.al. | 2402.14812 | link |
2024-02-22 | Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking | Nikhil Prakash et.al. | 2402.14811 | null |
2024-02-22 | CriticBench: Benchmarking LLMs for Critique-Correct Reasoning | Zicheng Lin et.al. | 2402.14809 | link |
2024-02-22 | RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Lei Zhu et.al. | 2402.14808 | link |
2024-02-22 | A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Nikhil Behari et.al. | 2402.14807 | null |
2024-02-22 | Identifying Multiple Personalities in Large Language Models with External Evaluation | Xiaoyang Song et.al. | 2402.14805 | null |
2024-02-22 | Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu et.al. | 2402.14800 | link |
2024-02-22 | Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir et.al. | 2402.14798 | null |
2024-02-22 | Zero-shot cross-lingual transfer in instruction tuning of large language model | Nadezhda Chirkova et.al. | 2402.14778 | null |
2024-02-22 | 2D Matryoshka Sentence Embeddings | Xianming Li et.al. | 2402.14776 | link |
2024-02-22 | DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models | Yuhang Cao et.al. | 2402.14767 | link |
2024-02-22 | MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues | Ge Bai et.al. | 2402.14762 | link |
2024-02-22 | Generalizing Reward Modeling for Out-of-Distribution Preference Learning | Chen Jia et.al. | 2402.14760 | link |
2024-02-22 | Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation | Jiawei Wang et.al. | 2402.14744 | link |
2024-02-22 | Dependency Annotation of Ottoman Turkish with Multilingual BERT | Şaziye Betül Özateş et.al. | 2402.14743 | null |
2024-02-22 | Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs | Arash Ahmadian et.al. | 2402.14740 | null |
2024-02-22 | Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models | Seungduk Kim et.al. | 2402.14714 | link |
2024-02-22 | IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Honghao Gui et.al. | 2402.14710 | link |
2024-02-21 | Coercing LLMs to do and reveal (almost) anything | Jonas Geiping et.al. | 2402.14020 | link |
2024-02-21 | Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina et.al. | 2402.14016 | link |
2024-02-21 | OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Chaoqun He et.al. | 2402.14008 | link |
2024-02-21 | Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models | Zhiwei He et.al. | 2402.14007 | link |
2024-02-21 | Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models | Aline Ioste et.al. | 2402.14002 | null |
2024-02-21 | Analysing The Impact of Sequence Composition on Language Model Pre-Training | Yu Zhao et.al. | 2402.13991 | link |
2024-02-21 | Towards Building Multilingual Language Model for Medicine | Pengcheng Qiu et.al. | 2402.13963 | link |
2024-02-21 | Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Rahul Zalkikar et.al. | 2402.13954 | link |
2024-02-21 | Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning | Debjit Paul et.al. | 2402.13950 | null |
2024-02-21 | Do Efficient Transformers Really Save Computation? | Kai Yang et.al. | 2402.13934 | null |
2024-02-21 | Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content | Federico Bianchi et.al. | 2402.13926 | null |
2024-02-21 | SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra et.al. | 2402.13919 | link |
2024-02-21 | What Linguistic Features and Languages are Important in LLM Translation? | Ryandito Diandaru et.al. | 2402.13917 | null |
2024-02-21 | Calibrating Large Language Models with Sample Consistency | Qing Lyu et.al. | 2402.13904 | null |
2024-02-21 | Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models | Chenyang Lyu et.al. | 2402.13887 | null |
2024-02-21 | $\texttt{Se}^2$: $\textit{Se}$quential Example $\textit{Se}$ lection for In-Context Learning | Haoyu Liu et.al. | 2402.13874 | link |
2024-02-21 | An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | Mohammad Amaz Uddin et.al. | 2402.13871 | null |
2024-02-21 | Kuaiji: the First Chinese Accounting Large Language Model | Jiayuan Luo et.al. | 2402.13866 | null |
2024-02-21 | RealDex: Towards Human-like Grasping for Robotic Dexterous Hand | Yumeng Liu et.al. | 2402.13853 | null |
2024-02-21 | VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models | Jiawei Liang et.al. | 2402.13851 | null |
2024-02-20 | Towards audio language modeling – an overview | Haibin Wu et.al. | 2402.13236 | null |
2024-02-20 | Unlocking Insights: Semantic Search in Jupyter Notebooks | Lan Li et.al. | 2402.13234 | null |
2024-02-20 | A Touch, Vision, and Language Dataset for Multimodal Alignment | Letian Fu et.al. | 2402.13232 | link |
2024-02-20 | Investigating Cultural Alignment of Large Language Models | Badr AlKhamissi et.al. | 2402.13231 | link |
2024-02-20 | Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive | Arka Pal et.al. | 2402.13228 | link |
2024-02-20 | AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning | Qiao Jin et.al. | 2402.13225 | null |
2024-02-20 | RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian | Adrian Cosma et.al. | 2402.13222 | link |
2024-02-20 | How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts | Yusu Qian et.al. | 2402.13220 | link |
2024-02-20 | Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A | Benjamin Plaut et.al. | 2402.13213 | link |
2024-02-20 | Soft Self-Consistency Improves Language Model Agents | Han Wang et.al. | 2402.13212 | link |
2024-02-20 | Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation | Dongjin Kang et.al. | 2402.13211 | null |
2024-02-20 | Bayesian Reward Models for LLM Alignment | Adam X. Yang et.al. | 2402.13210 | null |
2024-02-20 | How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Marco Gaido et.al. | 2402.13208 | link |
2024-02-20 | Question Calibration and Multi-Hop Modeling for Temporal Question Answering | Chao Xue et.al. | 2402.13188 | null |
2024-02-20 | What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents | Mingyu Jin et.al. | 2402.13184 | link |
2024-02-20 | DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models | Norman Di Palo et.al. | 2402.13181 | null |
2024-02-20 | Benchmarking Retrieval-Augmented Generation for Medicine | Guangzhi Xiong et.al. | 2402.13178 | link |
2024-02-20 | Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou et.al. | 2402.13148 | null |
2024-02-20 | OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog | Adnen Abdessaied et.al. | 2402.13146 | null |
2024-02-20 | The Hidden Space of Transformer Language Adapters | Jesujoba O. Alabi et.al. | 2402.13137 | link |
2024-02-19 | Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Zhuoming Chen et.al. | 2402.12374 | link |
2024-02-19 | AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye et.al. | 2402.12370 | link |
2024-02-19 | A Critical Evaluation of AI Feedback for Aligning Large Language Models | Archit Sharma et.al. | 2402.12366 | link |
2024-02-19 | Emergent Word Order Universals from Cognitively-Motivated Language Models | Tatsuki Kuribayashi et.al. | 2402.12363 | link |
2024-02-19 | Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge | Julien Delile et.al. | 2402.12352 | null |
2024-02-19 | GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations | Jinhao Duan et.al. | 2402.12348 | link |
2024-02-19 | Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Zhanhui Zhou et.al. | 2402.12343 | link |
2024-02-19 | Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Christian Schlarmann et.al. | 2402.12336 | link |
2024-02-19 | Query-Based Adversarial Prompt Generation | Jonathan Hayase et.al. | 2402.12329 | null |
2024-02-19 | Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents | Zengqing Wu et.al. | 2402.12327 | link |
2024-02-19 | ARKS: Active Retrieval in Knowledge Soup for Code Generation | Hongjin Su et.al. | 2402.12317 | link |
2024-02-19 | Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports | Felix J. Dorfner et.al. | 2402.12298 | null |
2024-02-19 | KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu et.al. | 2402.12291 | null |
2024-02-19 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Adaptive Skeleton Graph Decoding | Shuowei Jin et.al. | 2402.12280 | null |
2024-02-19 | Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks | Nadezhda Chirkova et.al. | 2402.12279 | null |
2024-02-19 | Explain then Rank: Scale Calibration of Neural Rankers Using Natural Language Explanations from Large Language Models | Puxuan Yu et.al. | 2402.12276 | link |
2024-02-19 | High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models | Michela Lorandi et.al. | 2402.12267 | link |
2024-02-19 | Uncertainty quantification in fine-tuned LLMs using LoRA ensembles | Oleksandr Balabanov et.al. | 2402.12264 | link |
2024-02-19 | NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Jonathan Zheng et.al. | 2402.12261 | link |
2024-02-16 | PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter | Junfei Xiao et.al. | 2402.10896 | null |
2024-02-16 | RLVF: Learning from Verbal Feedback without Overgeneralization | Moritz Stephan et.al. | 2402.10893 | link |
2024-02-16 | Instruction Diversity Drives Generalization To Unseen Tasks | Dylan Zhang et.al. | 2402.10891 | null |
2024-02-16 | When is Tree Search Useful for LLM Planning? It Depends on the Discriminator | Ziru Chen et.al. | 2402.10890 | link |
2024-02-16 | Multi-modal preference alignment remedies regression of visual instruction tuning on language model | Shengzhi Li et.al. | 2402.10884 | link |
2024-02-16 | EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models | Muhammad Shihab Rashid et.al. | 2402.10866 | link |
2024-02-16 | Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities | Mingyu Jin et.al. | 2402.10835 | link |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Quantifying the Persona Effect in LLM Simulations | Tiancheng Hu et.al. | 2402.10811 | link |
2024-02-16 | Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond | Yongqi Li et.al. | 2402.10805 | null |
2024-02-16 | EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge | Xuan Shen et.al. | 2402.10787 | link |
2024-02-16 | A Condensed Transition Graph Framework for Zero-shot Link Prediction with Large Language Models | Mingchen Li et.al. | 2402.10779 | null |
2024-02-16 | AutoGPT+P: Affordance-based Task Planning with Large Language Models | Timo Birr et.al. | 2402.10778 | null |
2024-02-16 | How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs? | Ehsan Doostmohammadi et.al. | 2402.10770 | null |
2024-02-16 | Distillation Enhanced Generative Retrieval | Yongqi Li et.al. | 2402.10769 | null |
2024-02-16 | Inference to the Best Explanation in Large Language Models | Dhairya Dalal et.al. | 2402.10767 | null |
2024-02-16 | When Dataflow Analysis Meets Large Language Models | Chengpeng Wang et.al. | 2402.10754 | link |
2024-02-16 | ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages | Junjie Ye et.al. | 2402.10753 | link |
2024-02-16 | GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models | Pengcheng Jiang et.al. | 2402.10744 | link |
2024-02-16 | Let’s Learn Step by Step: Enhancing In-Context Learning Ability with Curriculum Learning | Yinpeng Liu et.al. | 2402.10738 | link |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210 | null |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207 | link |
2024-02-15 | Chain-of-Thought Reasoning Without Prompting | Xuezhi Wang et.al. | 2402.10200 | null |
2024-02-15 | A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents | Lingbo Mo et.al. | 2402.10196 | link |
2024-02-15 | BitDelta: Your Fine-Tune May Only Be Worth One Bit | James Liu et.al. | 2402.10193 | link |
2024-02-15 | Uncertainty Decomposition and Quantification for In-Context Learning of Large Language Models | Chen Ling et.al. | 2402.10189 | link |
2024-02-15 | Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective | Tianyi Qiu et.al. | 2402.10184 | null |
2024-02-15 | TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation | Yaoxiang Wang et.al. | 2402.10178 | link |
2024-02-15 | OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Shubham Toshniwal et.al. | 2402.10176 | link |
2024-02-15 | Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence | Yinhong Liu et.al. | 2402.10175 | link |
2024-02-15 | OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Ali AhmadiTeshnizi et.al. | 2402.10172 | link |
2024-02-15 | Data Engineering for Scaling Language Models to 128K Context | Yao Fu et.al. | 2402.10171 | link |
2024-02-15 | Knowledge-Infused LLM-Powered Conversational Health Agent: A Case Study for Diabetes Patients | Mahyar Abbasian et.al. | 2402.10153 | null |
2024-02-15 | ControlLM: Crafting Diverse Personalities for Language Models | Yixuan Weng et.al. | 2402.10151 | link |
2024-02-15 | TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles | Yinhong Liu et.al. | 2402.10137 | null |
2024-02-15 | Zero-Shot Reasoning: Personalized Content Generation Without the Cold Start Problem | Davor Hafnar et.al. | 2402.10133 | link |
2024-02-15 | Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning | Ming Li et.al. | 2402.10110 | link |
2024-02-15 | Quantized Embedding Vectors for Controllable Diffusion Language Models | Cheng Kang et.al. | 2402.10107 | null |
2024-02-15 | GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving | Jiaxin Zhang et.al. | 2402.10104 | link |
2024-02-15 | Any-Shift Prompting for Generalization over Distributions | Zehao Xiao et.al. | 2402.10099 | null |
2024-02-14 | AQA-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability | Siwei Yang et.al. | 2402.09404 | link |
2024-02-14 | Reinforcement Learning from Human Feedback with Active Queries | Kaixuan Ji et.al. | 2402.09401 | null |
2024-02-14 | Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference | Harry Dong et.al. | 2402.09398 | link |
2024-02-14 | LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset | Botao Yu et.al. | 2402.09391 | link |
2024-02-14 | HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation | Yihao Fang et.al. | 2402.09390 | link |
2024-02-14 | Transformers Can Achieve Length Generalization But Not Robustly | Yongchao Zhou et.al. | 2402.09371 | null |
2024-02-14 | Pseudorandom Error-Correcting Codes | Miranda Christ et.al. | 2402.09370 | null |
2024-02-14 | Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking | Yi Fung et.al. | 2402.09369 | link |
2024-02-14 | Copyright Traps for Large Language Models | Matthieu Meeus et.al. | 2402.09363 | link |
2024-02-14 | HiRE: High Recall Approximate Top- $k$ Estimation for Efficient LLM Inference | Yashas Samaga B L et.al. | 2402.09360 | null |
2024-02-14 | Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop | Maryam Amirizaniani et.al. | 2402.09346 | null |
2024-02-14 | Mitigating Reward Hacking via Information-Theoretic Reward Modeling | Yuchun Miao et.al. | 2402.09345 | link |
2024-02-14 | AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach | Maryam Amirizaniani et.al. | 2402.09334 | null |
2024-02-14 | ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization | Feifan Song et.al. | 2402.09320 | link |
2024-02-14 | Embracing the black box: Heading towards foundation models for causal discovery from time series data | Gideon Stein et.al. | 2402.09305 | link |
2024-02-14 | Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code | Vahid Majdinasab et.al. | 2402.09299 | link |
2024-02-14 | Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey | Zhichen Dong et.al. | 2402.09283 | link |
2024-02-14 | Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies | Yining Huang et.al. | 2402.09282 | null |
2024-02-14 | Personalized Large Language Models | Stanisław Woźniak et.al. | 2402.09269 | null |
2024-02-14 | Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation | Xiaoying Zhang et.al. | 2402.09267 | null |
2024-02-13 | Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance | Linxi Zhao et.al. | 2402.08680 | null |
2024-02-13 | COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability | Xingang Guo et.al. | 2402.08679 | link |
2024-02-13 | Human Curriculum Effects Emerge with In-Context Learning in Neural Networks | Jacob Russin et.al. | 2402.08674 | link |
2024-02-13 | Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models | Yuqing Liu et.al. | 2402.08670 | null |
2024-02-13 | Improving Generalization in Semantic Parsing by Increasing Natural Language Variation | Irina Saparina et.al. | 2402.08666 | link |
2024-02-13 | The Last JITAI? The Unreasonable Effectiveness of Large Language Models in Issuing Just-in-Time Adaptive Interventions: Fostering Physical Activity in a Prospective Cardiac Rehabilitation Setting | David Haag et.al. | 2402.08658 | null |
2024-02-13 | PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs | Michael Dorkenwald et.al. | 2402.08657 | null |
2024-02-13 | Tandem Transformers for Inference Efficient LLMs | Aishwarya P S et.al. | 2402.08644 | null |
2024-02-13 | SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages | Nedjma Ousidhoum et.al. | 2402.08638 | null |
2024-02-13 | Knowledge Editing on Black-box Large Language Models | Xiaoshuai Song et.al. | 2402.08631 | link |
2024-02-13 | Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning | Haeju Lee et.al. | 2402.08594 | link |
2024-02-13 | Test-Time Backdoor Attacks on Multimodal Large Language Models | Dong Lu et.al. | 2402.08577 | link |
2024-02-13 | Online Foundation Model Selection in Robotics | Po-han Li et.al. | 2402.08570 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Artificial Intelligence for Literature Reviews: Opportunities and Challenges | Francisco Bolanos et.al. | 2402.08565 | null |
2024-02-13 | Higher Layers Need More LoRA Experts | Chongyang Gao et.al. | 2402.08562 | link |
2024-02-13 | Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback | Vineet Bhat et.al. | 2402.08546 | null |
2024-02-13 | The Application of ChatGPT in Responding to Questions Related to the Boston Bowel Preparation Scale | Xiaoqiang Liu et.al. | 2402.08492 | null |
2024-02-13 | Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models | Shaeke Salman et.al. | 2402.08473 | null |
2024-02-13 | Large Language Models for the Automated Analysis of Optimization Algorithms | Camilo Chacón Sartori et.al. | 2402.08472 | link |
2024-02-12 | A systematic investigation of learnability from single child linguistic input | Yulu Qin et.al. | 2402.07899 | link |
2024-02-12 | Suppressing Pink Elephants with Direct Principle Feedback | Louis Castricato et.al. | 2402.07896 | null |
2024-02-12 | WildfireGPT: Tailored Large Language Model for Wildfire Analysis | Yangxinyu Xie et.al. | 2402.07877 | link |
2024-02-12 | Policy Improvement using Language Feedback Models | Victor Zhong et.al. | 2402.07876 | link |
2024-02-12 | PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Soroush Nasiriany et.al. | 2402.07872 | null |
2024-02-12 | Scaling Laws for Fine-Grained Mixture of Experts | Jakub Krajewski et.al. | 2402.07871 | link |
2024-02-12 | PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models | Wei Zou et.al. | 2402.07867 | link |
2024-02-12 | Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Siddharth Karamcheti et.al. | 2402.07865 | link |
2024-02-12 | AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy | Philipp Schoenegger et.al. | 2402.07862 | null |
2024-02-12 | Lissard: Long and Simple Sequential Reasoning Datasets | Mirelle Bueno et.al. | 2402.07859 | link |
2024-02-12 | Mercury: An Efficiency Benchmark for LLM Code Synthesis | Mingzhe Du et.al. | 2402.07844 | link |
2024-02-12 | Do Membership Inference Attacks Work on Large Language Models? | Michael Duan et.al. | 2402.07841 | link |
2024-02-12 | Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model | Ahmet Üstün et.al. | 2402.07827 | null |
2024-02-12 | Differentially Private Zeroth-Order Methods for Scalable Large Language Model Finetuning | Z Liu et.al. | 2402.07818 | null |
2024-02-12 | Injecting Wiktionary to improve token-level contextual representations using contrastive learning | Anna Mosolova et.al. | 2402.07817 | null |
2024-02-12 | Retrieval-Augmented Thought Process as Sequential Decision Making | Thomas Pouplin et.al. | 2402.07812 | null |
2024-02-12 | Empowering Federated Learning for Massive Models with NVIDIA FLARE | Holger R. Roth et.al. | 2402.07792 | null |
2024-02-12 | TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Hui Liu et.al. | 2402.07776 | link |
2024-02-12 | Quantitative knowledge retrieval from large language models | David Selby et.al. | 2402.07770 | link |
2024-02-12 | Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model | Mikail Khona et.al. | 2402.07757 | null |
2024-02-09 | Feedback Loops With Language Models Drive In-Context Reward Hacking | Alexander Pan et.al. | 2402.06627 | link |
2024-02-09 | Understanding the Effects of Iterative Prompting on Truthfulness | Satyapriya Krishna et.al. | 2402.06625 | null |
2024-02-09 | Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Shivalika Singh et.al. | 2402.06619 | null |
2024-02-09 | FaBERT: Pre-training BERT on Persian Blogs | Mostafa Masumi et.al. | 2402.06617 | null |
2024-02-09 | On the Out-Of-Distribution Generalization of Multimodal Large Language Models | Xingxuan Zhang et.al. | 2402.06599 | null |
2024-02-09 | CigaR: Cost-efficient Program Repair with LLMs | Dávid Hidvégi et.al. | 2402.06598 | link |
2024-02-09 | Understanding the Weakness of Large Language Model Agents within a Complex Android Environment | Mingzhe Xing et.al. | 2402.06596 | link |
2024-02-09 | Self-consistent context aware conformer transducer for speech recognition | Konstantin Kolokolov et.al. | 2402.06592 | null |
2024-02-09 | G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German | Ehsan Latif et.al. | 2402.06584 | link |
2024-02-09 | Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning | Amir Ziai et.al. | 2402.06560 | link |
2024-02-09 | The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical Model | Gregory Coppola et.al. | 2402.06557 | link |
2024-02-09 | Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Marek Šuppa et.al. | 2402.06549 | link |
2024-02-09 | Calibrating Long-form Generations from Large Language Models | Yukun Huang et.al. | 2402.06544 | link |
2024-02-09 | Introspective Planning: Guiding Language-Enabled Agents to Refine Their Own Uncertainty | Kaiqu Liang et.al. | 2402.06529 | link |
2024-02-09 | Multimodal Clinical Trial Outcome Prediction with Large Language Models | Wenhao Zheng et.al. | 2402.06512 | link |
2024-02-09 | Iris-SAM: Iris Segmentation Using a Foundational Model | Parisa Farmanifard et.al. | 2402.06497 | link |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-09 | V-STaR: Training Verifiers for Self-Taught Reasoners | Arian Hosseini et.al. | 2402.06457 | null |
2024-02-09 | StruQ: Defending Against Prompt Injection with Structured Queries | Sizhe Chen et.al. | 2402.06363 | link |
2024-02-09 | CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models | Peiyuan Gong et.al. | 2402.06360 | link |
2024-02-08 | SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Peng Gao et.al. | 2402.05935 | link |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-02-08 | WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Xing Han Lù et.al. | 2402.05930 | link |
2024-02-08 | An Interactive Agent Foundation Model | Zane Durante et.al. | 2402.05929 | null |
2024-02-08 | On the Convergence of Zeroth-Order Federated Tuning in Large Language Models | Zhenqing Ling et.al. | 2402.05926 | link |
2024-02-08 | Efficient Stagewise Pretraining via Progressive Subnetworks | Abhishek Panigrahi et.al. | 2402.05913 | null |
2024-02-08 | FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs | Eun Cheol Choi et.al. | 2402.05904 | link |
2024-02-08 | Large Language Model Meets Graph Neural Network in Knowledge Distillation | Shengxiang Hu et.al. | 2402.05894 | null |
2024-02-08 | Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking | Nikhil Sharma et.al. | 2402.05880 | null |
2024-02-08 | PromptCrypt: Prompt Encryption for Secure Communication with Large Language Models | Guo Lin et.al. | 2402.05868 | link |
2024-02-08 | How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis | Federico Bianchi et.al. | 2402.05863 | link |
2024-02-08 | Let Your Graph Do the Talking: Encoding Structured Data for LLMs | Bryan Perozzi et.al. | 2402.05862 | link |
2024-02-08 | Learning to Route Among Specialized Experts for Zero-Shot Generalization | Mohammed Muqeeth et.al. | 2402.05859 | link |
2024-02-08 | Limitations of Agents Simulated by Predictive Models | Raymond Douglas et.al. | 2402.05829 | null |
2024-02-08 | Is it Possible to Edit Large Language Models Robustly? | Xinbei Ma et.al. | 2402.05827 | link |
2024-02-08 | Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models | Lingzhi Wang et.al. | 2402.05813 | null |
2024-02-08 | Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning | Zhiheng Xi et.al. | 2402.05808 | link |
2024-02-08 | How do Transformers perform In-Context Autoregressive Learning? | Michael E. Sander et.al. | 2402.05787 | null |
2024-02-08 | Limits of Transformer Language Models on Algorithmic Learning | Jonathan Thomm et.al. | 2402.05785 | link |
2024-02-08 | Text-to-Code Generation with Modality-relative Pre-training | Fenia Christopoulou et.al. | 2402.05783 | null |
Autonomous Driving
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-11 | Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | Tim Z. Xiao et.al. | 2506.09998 | null |
2025-06-11 | ReSim: Reliable World Simulation for Autonomous Driving | Jiazhi Yang et.al. | 2506.09981 | null |
2025-06-11 | The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability | Jiachen Hu et.al. | 2506.09940 | null |
2025-06-11 | Assessing a Safety Case: Bottom-up Guidance for Claims and Evidence Evaluation | Scott Schnelle et.al. | 2506.09929 | null |
2025-06-11 | Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation | Siyu Chen et.al. | 2506.09881 | null |
2025-06-11 | Foundation Model-Aided Deep Reinforcement Learning for RIS-Assisted Wireless Communication | Mohammad Ghassemi et.al. | 2506.09855 | null |
2025-06-11 | Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving | Haochen Liu et.al. | 2506.09800 | null |
2025-06-11 | Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy | Davide Grossi et.al. | 2506.09789 | null |
2025-06-11 | Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring | Gusseppe Bravo-Rocca et.al. | 2506.09742 | null |
2025-06-11 | On the Virtues of Information Security in the UK Climate Movement | Mikaela Brough et.al. | 2506.09719 | null |
2025-06-11 | Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives | Wei Zeng et.al. | 2506.09656 | null |
2025-06-11 | DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy | Kaixuan Xu et.al. | 2506.09655 | null |
2025-06-11 | R-CARLA: High-Fidelity Sensor Simulations with Interchangeable Dynamics for Autonomous Racing | Maurice Brunner et.al. | 2506.09629 | null |
2025-06-11 | ECAM: A Contrastive Learning Approach to Avoid Environmental Collision in Trajectory Forecasting | Giacomo Rosin et.al. | 2506.09626 | null |
2025-06-11 | Integrating Quantized LLMs into Robotics Systems as Edge AI to Leverage their Natural Language Processing Capabilities | Miguel Á. González-Santamarta et.al. | 2506.09581 | null |
2025-06-12 | TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Songze Li et.al. | 2506.09562 | null |
2025-06-11 | AD^2-Bench: A Hierarchical CoT Benchmark for MLLM in Autonomous Driving under Adverse Conditions | Zhaoyang Wei et.al. | 2506.09557 | null |
2025-06-11 | GLD-Road:A global-local decoding road network extraction model for remote sensing images | Ligao Deng et.al. | 2506.09553 | null |
2025-06-11 | How attention simplifies mental representations for planning | Jason da Silva Castanheira et.al. | 2506.09520 | null |
2025-06-11 | A Survey on the Role of Artificial Intelligence and Machine Learning in 6G-V2X Applications | Donglin Wang et.al. | 2506.09512 | null |
2025-06-10 | The Decoupled Risk Landscape in Performative Prediction | Javier Sanguino et.al. | 2506.09044 | null |
2025-06-10 | HabSim: Architecture for modelling disruptions, propagation, detection and repair in deep space habitats | Luca Vaccino et.al. | 2506.08903 | null |
2025-06-10 | Real-Time Cascade Mitigation in Power Systems Using Influence Graph Improved by Reinforcement Learning | Kai Zhou et.al. | 2506.08893 | null |
2025-06-10 | Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents | Irene Testini et.al. | 2506.08800 | null |
2025-06-10 | Bayesian Inverse Physics for Neuro-Symbolic Robot Learning | Octavio Arriaga et.al. | 2506.08756 | null |
2025-06-10 | Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data | Muhammad Anwar et.al. | 2506.08750 | null |
2025-06-10 | Unveiling the Impact of Social and Environmental Determinants of Health on Lung Function Decline in Cystic Fibrosis through Data Integration using the US Registry | Eleni-Rosalina Andrinopoulou et.al. | 2506.08731 | null |
2025-06-10 | Causality-aware Safety Testing for Autonomous Driving Systems | Wenbing Tang et.al. | 2506.08688 | null |
2025-06-10 | HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning | Yang Lv et.al. | 2506.08580 | null |
2025-06-10 | Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations | Yibo Cui et.al. | 2506.08566 | null |
2025-06-10 | TrajFlow: Multi-modal Motion Prediction via Flow Matching | Qi Yan et.al. | 2506.08541 | null |
2025-06-10 | Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL) | Nihal Acharya Adde et.al. | 2506.08533 | null |
2025-06-10 | One Patch to Rule Them All: Transforming Static Patches into Dynamic Attacks in the Physical World | Xingshuo Han et.al. | 2506.08482 | null |
2025-06-10 | How to Provably Improve Return Conditioned Supervised Learning? | Zhishuai Liu et.al. | 2506.08463 | null |
2025-06-10 | Diffusion Models for Safety Validation of Autonomous Driving Systems | Juanran Wang et.al. | 2506.08459 | null |
2025-06-10 | Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy | Utkarsh Pratiush et.al. | 2506.08423 | null |
2025-06-10 | Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Neşet Ünver Akmandor et.al. | 2506.08344 | null |
2025-06-10 | Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study | Ira Ceka et.al. | 2506.08311 | null |
2025-06-09 | HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation | Hongjun Wu et.al. | 2506.08296 | null |
2025-06-09 | Scaling Laws of Motion Forecasting and Planning – A Technical Report | Mustafa Baniodeh et.al. | 2506.08228 | null |
2025-06-09 | ZeroVO: Visual Odometry with Minimal Assumptions | Lei Lai et.al. | 2506.08005 | null |
2025-06-09 | Diffusion of Responsibility in Collective Decision Making | Pavel Naumov et.al. | 2506.07935 | null |
2025-06-09 | CausalPFN: Amortized Causal Effect Estimation via In-Context Learning | Vahid Balazadeh et.al. | 2506.07918 | link |
2025-06-09 | LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Dimitris Panagopoulos et.al. | 2506.07915 | null |
2025-06-09 | Evaluating explainable AI for deep learning-based network intrusion detection system alert classification | Rajesh Kalakoti et.al. | 2506.07882 | null |
2025-06-09 | A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit | Andrea Tiranti et.al. | 2506.07877 | null |
2025-06-09 | Are Trees Really Green? A Detection Approach of IoT Malware Attacks | Silvia Lucia Sanna et.al. | 2506.07836 | null |
2025-06-09 | R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation | William Ljungbergh et.al. | 2506.07826 | null |
2025-06-09 | Identifiability in epidemic models with prior immunity and under-reporting | Fanny Bergström et.al. | 2506.07825 | null |
2025-06-09 | Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | Xintong Duan et.al. | 2506.07822 | null |
2025-06-09 | REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | Diego Forniés-Tabuenca et.al. | 2506.07759 | null |
2025-06-09 | SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding | Xuemei Chen et.al. | 2506.07737 | null |
2025-06-09 | Blending Participatory Design and Artificial Awareness for Trustworthy Autonomous Vehicles | Ana Tanevska et.al. | 2506.07633 | null |
2025-06-09 | SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis | Jianhui Wei et.al. | 2506.07603 | null |
2025-06-09 | Fractional Collisions: A Framework for Risk Estimation of Counterfactual Conflicts using Autonomous Driving Behavior Simulations | Sreeja Roy-Singh et.al. | 2506.07540 | null |
2025-06-09 | A Unified Anti-Jamming Design in Complex Environments Based on Cross-Modal Fusion and Intelligent Decision-Making | Huake Wang et.al. | 2506.07532 | null |
2025-06-09 | Improving Fairness of Large Language Models in Multi-document Summarization | Haoyuan Li Yusen Zhang et.al. | 2506.07479 | null |
2025-06-09 | Individual Treatment Effect: Prediction Intervals and Sharp Bounds | Zhehao Zhang et.al. | 2506.07469 | null |
2025-06-09 | LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning | Weijie Shi et.al. | 2506.07443 | null |
2025-06-09 | LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments | Jin Huang et.al. | 2506.07416 | null |
2025-06-06 | PyGemini: Unified Software Development towards Maritime Autonomy Systems | Kjetil Vasstein et.al. | 2506.06262 | null |
2025-06-06 | PDHCG: A Scalable First-Order Method for Large-Scale Competitive Market Equilibrium Computation | Huikang Liu et.al. | 2506.06258 | null |
2025-06-06 | STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving | Christian Fruhwirth-Reisinger et.al. | 2506.06218 | null |
2025-06-06 | CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting | Peter Lengyel et.al. | 2506.06128 | null |
2025-06-06 | Self driving algorithm for an active four wheel drive racecar | Gergely Bari et.al. | 2506.06077 | null |
2025-06-06 | Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes | Alessandro Montenegro et.al. | 2506.05953 | null |
2025-06-06 | SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction | Yuchao Zheng et.al. | 2506.05935 | null |
2025-06-06 | Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness | Steven Landgraf et.al. | 2506.05917 | null |
2025-06-06 | A Driving Regime-Embedded Deep Learning Framework for Modeling Intra-Driver Heterogeneity in Multi-Scale Car-Following Dynamics | Shirui Zhou et.al. | 2506.05902 | null |
2025-06-06 | Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM | Chongshang Yan et.al. | 2506.05896 | null |
2025-06-06 | Interpretable Clustering Ensemble | Hang Lv et.al. | 2506.05877 | null |
2025-06-06 | Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction | Yesheng Zhang et.al. | 2506.05810 | null |
2025-06-06 | Where Do We Look When We Teach? Analyzing Human Gaze Behavior Across Demonstration Devices in Robot Imitation Learning | Yutaro Ishida et.al. | 2506.05808 | null |
2025-06-06 | Discrete Minds in a Continuous World: Do Language Models Know Time Passes? | Minghan Wang et.al. | 2506.05790 | null |
2025-06-06 | There’s Waldo: PCB Tamper Forensic Analysis using Explainable AI on Impedance Signatures | Maryam Saadat Safa et.al. | 2506.05734 | null |
2025-06-06 | DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models | Yuhan Hao et.al. | 2506.05667 | null |
2025-06-06 | TissUnet: Improved Extracranial Tissue and Cranium Segmentation for Children through Adulthood | Markian Mandzak et.al. | 2506.05660 | link |
2025-06-05 | AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization | Saeed Hedayatian et.al. | 2506.05634 | null |
2025-06-05 | FaCTR: Factorized Channel-Temporal Representation Transformers for Efficient Time Series Forecasting | Yash Vijay et.al. | 2506.05597 | null |
2025-06-05 | Collaborative Learning in Agentic Systems: A Collective AI is Greater Than the Sum of Its Parts | Saptarshi Nath et.al. | 2506.05577 | null |
2025-06-05 | VideoMolmo: Spatio-Temporal Grounding Meets Pointing | Ghazi Shazan Ahmad et.al. | 2506.05336 | null |
2025-06-05 | Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Niv Eckhaus et.al. | 2506.05309 | link |
2025-06-05 | Stable Vision Concept Transformers for Medical Diagnosis | Lijie Hu et.al. | 2506.05286 | null |
2025-06-06 | Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting | Nan Wang et.al. | 2506.05280 | null |
2025-06-05 | Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning | Dravyansh Sharma et.al. | 2506.05252 | null |
2025-06-05 | Cooperation and the Design of Public Goods | J. Carlos Martínez Mori et.al. | 2506.05251 | null |
2025-06-05 | Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline | Yuzhi Huang et.al. | 2506.05175 | null |
2025-06-05 | LLM-Guided Scenario-based GUI Testing | Shengcheng Yu et.al. | 2506.05079 | null |
2025-06-05 | Reason-to-Recommend: Using Interaction-of-Thought Reasoning to Enhance LLM Recommendation | Keyu Zhao et.al. | 2506.05069 | null |
2025-06-05 | DemoSpeedup: Accelerating Visuomotor Policies via Entropy-Guided Demonstration Acceleration | Lingxiao Guo et.al. | 2506.05064 | null |
2025-06-05 | Artificial Intelligence Should Genuinely Support Clinical Reasoning and Decision Making To Bridge the Translational Gap | Kacper Sokol et.al. | 2506.05030 | null |
2025-06-05 | FinMultiTime: A Four-Modal Bilingual Dataset for Financial Time-Series Analysis | Wenyan Xu et.al. | 2506.05019 | null |
2025-06-05 | Agentic AI for Intent-Based Industrial Automation | Marcos Lima Romero et.al. | 2506.04980 | null |
2025-06-05 | Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining | Yong Sun et.al. | 2506.04950 | null |
2025-06-05 | Goal-Oriented Semantic Resource Allocation with Cumulative Prospect Theoretic Agents | Symeon Vaidanis et.al. | 2506.04947 | null |
2025-06-05 | Adapting Online Customer Reviews for Blind Users: A Case Study of Restaurant Reviews | Mohan Sunkara et.al. | 2506.04865 | null |
2025-06-05 | Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Changyue Wang et.al. | 2506.04832 | null |
2025-06-05 | Memory-Driven Bounded Confidence Opinion Dynamics: A Hegselmann-Krause Model Based on Fractional-Order Methods | Meiru Jiang et.al. | 2506.04701 | null |
2025-06-05 | Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling | Bihan Xu et.al. | 2506.04699 | null |
2025-06-05 | Real-Time LPV-Based Non-Linear Model Predictive Control for Robust Trajectory Tracking in Autonomous Vehicles | Nitish Kumar et.al. | 2506.04684 | null |
2025-06-04 | Finding signatures of low-dimensional geometric landscapes in high-dimensional cell fate transitions | Maria Yampolskaya et.al. | 2506.04219 | null |
2025-06-04 | Pseudo-Simulation for Autonomous Driving | Wei Cao et.al. | 2506.04218 | null |
2025-06-04 | OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Junting Chen et.al. | 2506.04217 | null |
2025-06-04 | Improving Regulatory Oversight in Online Content Moderation | Benedetta Tessa et.al. | 2506.04145 | null |
2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
2025-06-04 | CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues | Disha Sheshanarayana et.al. | 2506.04131 | null |
2025-06-04 | Leveraging External Data for Testing Experimental Therapies with Biomarker Interactions in Randomized Clinical Trials | Boyu Ren et.al. | 2506.04128 | null |
2025-06-04 | TextAtari: 100K Frames Game Playing with Language Agents | Wenhao Li et.al. | 2506.04098 | link |
2025-06-04 | Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration | Chengdong Wu et.al. | 2506.04040 | null |
2025-06-04 | FPGA-Enabled Machine Learning Applications in Earth Observation: A Systematic Review | Cédric Léonard et.al. | 2506.03938 | link |
2025-06-04 | Identifying Alzheimer’s Disease Prediction Strategies of Convolutional Neural Network Classifiers using R2* Maps and Spectral Clustering | Christian Tinauer et.al. | 2506.03890 | null |
2025-06-04 | PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading | Qiuhan Han et.al. | 2506.03861 | null |
2025-06-05 | Construction of Urban Greenland Resources Collaborative Management Platform | Dongyang Lyu et.al. | 2506.03830 | null |
2025-06-04 | Impact of friction force and retrieval speed on in silico mechanical thrombectomies: a sensitivity analysis | Mahesh S. Nagargoje et.al. | 2506.03812 | null |
2025-06-04 | FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning | Li Zhang et.al. | 2506.03777 | null |
2025-06-04 | Fast Non-Line-of-Sight Transient Data Simulation and an Open Benchmark Dataset | Yingjie Shi et.al. | 2506.03747 | null |
2025-06-04 | My Advisor, Her AI and Me: Evidence from a Field Experiment on Human-AI Collaboration and Investment Decisions | Cathy et.al. | 2506.03707 | null |
2025-06-04 | Trustworthy Medical Question Answering: An Evaluation-Centric Survey | Yinuo Wang et.al. | 2506.03659 | null |
2025-06-04 | Analyzing Transformer Models and Knowledge Distillation Approaches for Image Captioning on Edge AI | Wing Man Casca Kwok et.al. | 2506.03607 | null |
2025-06-04 | A Class Inference Scheme With Dempster-Shafer Theory for Learning Fuzzy-Classifier Systems | Hiroki Shiraishi et.al. | 2506.03588 | null |
2025-06-03 | Not All Tokens Are Meant to Be Forgotten | Xiangyu Zhou et.al. | 2506.03142 | null |
2025-06-03 | Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Weiqing Xiao et.al. | 2506.03134 | null |
2025-06-03 | Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff | Sophie Greenwood et.al. | 2506.03102 | null |
2025-06-03 | Non-stationary Bandit Convex Optimization: A Comprehensive Study | Xiaoqi Liu et.al. | 2506.02980 | null |
2025-06-03 | Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic | Stefan Orf et.al. | 2506.02932 | null |
2025-06-03 | Functionality Assessment Framework for Autonomous Driving Systems using Subjective Networks | Stefan Orf et.al. | 2506.02922 | null |
2025-06-03 | Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection | Yechi Ma et.al. | 2506.02914 | null |
2025-06-03 | GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation | Sohyun Lee et.al. | 2506.02882 | null |
2025-06-04 | Adaptive Configuration Selection for Multi-Model Inference Pipelines in Edge Computing | Jinhao Sheng et.al. | 2506.02814 | null |
2025-06-03 | Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings | Houssam Zenati et.al. | 2506.02793 | null |
2025-06-03 | Collective Intelligence Outperforms Individual Talent: A Case Study in League of Legends | Angelo Josey Caldeira et.al. | 2506.02706 | null |
2025-06-03 | Large-scale Self-supervised Video Foundation Model for Intelligent Surgery | Shu Yang et.al. | 2506.02692 | null |
2025-06-03 | From Prompts to Protection: Large Language Model-Enabled In-Context Learning for Smart Public Safety UAV | Yousef Emami et.al. | 2506.02649 | null |
2025-06-03 | Compositional Learning for Modular Multi-Agent Self-Organizing Networks | Qi Liao et.al. | 2506.02616 | null |
2025-06-03 | BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird’s-Eye View Representations | Weiduo Yuan et.al. | 2506.02587 | null |
2025-06-03 | V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving | Xuewen Luo et.al. | 2506.02580 | null |
2025-06-03 | HiLO: High-Level Object Fusion for Autonomous Driving using Transformers | Timo Osterburg et.al. | 2506.02554 | null |
2025-06-03 | Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making | Xu Wan et.al. | 2506.02522 | null |
2025-06-03 | A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Xuejiao Zhao et.al. | 2506.02470 | null |
2025-06-03 | Joint Modeling for Learning Decision-Making Dynamics in Behavioral Experiments | Yuan Bian et.al. | 2506.02394 | null |
2025-05-30 | Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks | Tajamul Ashraf et.al. | 2505.24876 | null |
2025-05-30 | Convex Approximations of Random Constrained Markov Decision Processes | V Varagapriya et.al. | 2505.24815 | null |
2025-06-03 | EVA-MILP: Towards Standardized Evaluation of MILP Instance Generation | Yidong Luo et.al. | 2505.24779 | null |
2025-05-30 | Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting | Wei Chen et.al. | 2505.24710 | null |
2025-06-02 | NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation | Xuzhi Wang et.al. | 2505.24634 | null |
2025-05-30 | Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success | Ben Griffin et.al. | 2505.24622 | null |
2025-05-30 | Interpretable phenotyping of Heart Failure patients with Dutch discharge letters | Vittorio Torri et.al. | 2505.24619 | null |
2025-05-30 | Multi-criteria Rank-based Aggregation for Explainable AI | Sujoy Chatterjee et.al. | 2505.24612 | null |
2025-05-30 | Fine-tuning for Data-enabled Predictive Control of Noisy Systems by Reinforcement Learning | Jinbao Wang et.al. | 2505.24572 | null |
2025-05-30 | Object Centric Concept Bottlenecks | David Steinmann et.al. | 2505.24492 | null |
2025-05-30 | SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation | Yuqi Fan et.al. | 2505.24390 | link |
2025-05-30 | Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | Roger Ferrod et.al. | 2505.24361 | null |
2025-05-30 | The Use of Alendronate to Enhance Transcranial Transmission of Focused Ultrasound for Successful Ablations in Brain | G. Sakharova et.al. | 2505.24349 | null |
2025-05-30 | ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving | Yongming Chen et.al. | 2505.24317 | null |
2025-05-30 | Data Fusion for Partial Identification of Causal Effects | Quinn Lanners et.al. | 2505.24296 | null |
2025-05-30 | Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games | Neemesh Yadav et.al. | 2505.24255 | link |
2025-05-30 | Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control | Zijie Xu et.al. | 2505.24161 | null |
2025-06-03 | S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation | Yichen Xie et.al. | 2505.24139 | null |
2025-05-30 | Federated Foundation Model for GI Endoscopy Images | Alina Devkota et.al. | 2505.24108 | null |
2025-05-30 | Training LLMs for EHR-Based Reasoning Tasks via Reinforcement Learning | Jiacheng Lin et.al. | 2505.24105 | null |
2025-05-29 | Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models | Haohan Chi et.al. | 2505.23757 | link |
2025-05-29 | Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time | Mohamad Chehade et.al. | 2505.23729 | null |
2025-05-29 | From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems | Zeinab Nezami et.al. | 2505.23710 | null |
2025-05-29 | DiCoFlex: Model-agnostic diverse counterfactuals with flexible control | Oleksii Furman et.al. | 2505.23700 | null |
2025-05-29 | Grounded Reinforcement Learning for Visual Reasoning | Gabriel Sarch et.al. | 2505.23678 | link |
2025-05-29 | Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | Hongxiang Zhang et.al. | 2505.23657 | null |
2025-05-29 | Autoregressive Meta-Actions for Unified Controllable Trajectory Generation | Jianbo Zhao et.al. | 2505.23612 | null |
2025-05-29 | BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | Adibvafa Fallahpour et.al. | 2505.23579 | link |
2025-05-29 | Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms | Jane Cleland-Huang et.al. | 2505.23576 | null |
2025-05-29 | Going from a Representative Agent to Counterfactuals in Combinatorial Choice | Yanqiu Ruan et.al. | 2505.23546 | null |
2025-05-29 | TRAP: Targeted Redirecting of Agentic Preferences | Hangoo Kang et.al. | 2505.23518 | null |
2025-05-29 | Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns | Xiang Li et.al. | 2505.23474 | null |
2025-05-29 | To Measure What Isn’t There – Visual Exploration of Missingness Structures Using Quality Metrics | Sara Johansson Fernstad et.al. | 2505.23447 | null |
2025-05-29 | Bounded-Abstention Pairwise Learning to Rank | Antonio Ferrara et.al. | 2505.23437 | null |
2025-05-29 | Emergent Risk Awareness in Rational Agents under Resource Constraints | Daniel Jarne Ornia et.al. | 2505.23436 | null |
2025-05-29 | A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy | Ahmad Mohsin et.al. | 2505.23397 | null |
2025-05-29 | Ordinal regression for meta-analysis of test accuracy: a flexible approach for utilising all threshold data | Enzo Cerullo et.al. | 2505.23393 | link |
2025-05-29 | Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | Xu Shen et.al. | 2505.23352 | link |
2025-05-29 | Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception | Guangyuan Liu et.al. | 2505.23275 | null |
2025-05-29 | Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | Chunlong Xie et.al. | 2505.23266 | null |
2025-05-28 | Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese | Hanjia Lyu et.al. | 2505.22645 | link |
2025-05-28 | PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization | Yezhi Shen et.al. | 2505.22616 | null |
2025-05-28 | Universal Visuo-Tactile Video Understanding for Embodied Interaction | Yifan Xie et.al. | 2505.22566 | null |
2025-05-28 | A Human-Centric Approach to Explainable AI for Personalized Education | Vinitra Swamy et.al. | 2505.22541 | link |
2025-05-29 | The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector | Aixuan Li et.al. | 2505.22499 | null |
2025-05-28 | Hypothesis Testing in Imaging Inverse Problems | Yiming Xi et.al. | 2505.22481 | null |
2025-05-29 | SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels | Qiucheng Yu et.al. | 2505.22461 | null |
2025-05-29 | GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | Anthony Chen et.al. | 2505.22421 | link |
2025-05-28 | Individualised Counterfactual Examples Using Conformal Prediction Intervals | James M. Adams et.al. | 2505.22326 | null |
2025-05-28 | Chain-of-Thought for Large Language Model-empowered Wireless Communications | Xudong Wang et.al. | 2505.22320 | null |
2025-05-28 | Rethinking BPS: A Utility-Based Evaluation Framework | Konrad Özdemir et.al. | 2505.22316 | null |
2025-05-28 | Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review | Zihan Xu et.al. | 2505.22280 | null |
2025-05-28 | Solver-Free Decision-Focused Learning for Linear Optimization Problems | Senne Berden et.al. | 2505.22224 | null |
2025-05-28 | Bayesian Learning in Structural Dynamics: A Comprehensive Review and Emerging Trends | Wang-Ji Yan et.al. | 2505.22223 | null |
2025-05-28 | Lifted Forward Planning in Relational Factored Markov Decision Processes with Concurrent Actions | Florian Andreas Marwitz et.al. | 2505.22147 | null |
2025-05-28 | A simulation framework for autonomous lunar construction work | Mattias Linde et.al. | 2505.22091 | null |
2025-05-28 | From Failures to Fixes: LLM-Driven Scenario Repair for Self-Evolving Autonomous Driving | Xinyu Xia et.al. | 2505.22067 | null |
2025-05-28 | Reinforced Reasoning for Embodied Planning | Di Wu et.al. | 2505.22050 | null |
2025-05-28 | Learnable Burst-Encodable Time-of-Flight Imaging for High-Fidelity Long-Distance Depth Sensing | Manchao Bao et.al. | 2505.22025 | null |
2025-05-29 | DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation | Tianjun Gu et.al. | 2505.21969 | null |
2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
2025-05-27 | Tissue-specific predictive performance: A unified estimation and inference framework for multi-category screening tests | A. Gregory DiRienzo et.al. | 2505.21482 | null |
2025-05-27 | Are Language Models Consequentialist or Deontological Moral Reasoners? | Keenan Samway et.al. | 2505.21479 | null |
2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
2025-05-27 | Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery | Lina Zhao et.al. | 2505.21418 | null |
2025-05-27 | A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment | Brett Bissey et.al. | 2505.21414 | null |
2025-05-27 | Enhancing JavaScript Malware Detection through Weighted Behavioral DFAs | Pedro Pereira et.al. | 2505.21406 | null |
2025-05-27 | A Structured Unplugged Approach for Foundational AI Literacy in Primary Education | Maria Cristina Carrisi et.al. | 2505.21398 | link |
2025-05-27 | DecisionFlow: Advancing Large Language Model as Principled Decision Maker | Xiusi Chen et.al. | 2505.21397 | null |
2025-05-27 | A first look at ROS~2 applications written in asynchronous Rust | Martin Škoudlil et.al. | 2505.21323 | null |
2025-05-27 | PACT: A Contract-Theoretic Framework for Pricing Agentic AI Services Powered by Large Language Models | Ya-Ting Yang et.al. | 2505.21286 | null |
2025-05-27 | Developing hybrid mechanistic and data-driven personalized prediction models for platelet dynamics | Marie Steinacker et.al. | 2505.21204 | link |
2025-05-27 | GGBond: Growing Graph-Based AI-Agent Society for Socially-Aware Recommender Simulation | Hailin Zhong et.al. | 2505.21154 | null |
2025-05-27 | Universal Value-Function Uncertainties | Moritz A. Zanger et.al. | 2505.21119 | null |
2025-05-27 | Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs) | Anna Neumann et.al. | 2505.21091 | null |
2025-05-27 | Agent-Environment Alignment via Automated Interface Generation | Kaiming Liu et.al. | 2505.21055 | null |
2025-05-27 | Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing | Dehao Wang et.al. | 2505.21049 | null |
2025-05-27 | Large Language Model-enhanced Reinforcement Learning for Low-Altitude Economy Networking | Lingyi Cai et.al. | 2505.21045 | null |
2025-05-27 | BIPNN: Learning to Solve Binary Integer Programming via Hypergraph Neural Networks | Sen Bai et.al. | 2505.20997 | null |
2025-05-27 | RF4D:Neural Radar Fields for Novel View Synthesis in Outdoor Dynamic Scenes | Jiarui Zhang et.al. | 2505.20967 | null |
2025-05-26 | Variational Deep Learning via Implicit Regularization | Jonathan Wenger et.al. | 2505.20235 | null |
2025-05-26 | Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | Yixin Cui et.al. | 2505.20223 | link |
2025-05-26 | Fine-grained List-wise Alignment for Generative Medication Recommendation | Chenxiao Fan et.al. | 2505.20218 | link |
2025-05-26 | URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning | Fengkang Ying et.al. | 2505.20175 | null |
2025-05-26 | Preference Disaggregation Analysis with Criteria Selection in a Regularization Framework | Kun Zhou et.al. | 2505.20111 | null |
2025-05-26 | SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale | Qi Li et.al. | 2505.20094 | null |
2025-05-26 | Explanation User Interfaces: A Systematic Literature Review | Eleonora Cappuccio et.al. | 2505.20085 | null |
2025-05-26 | Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation Benchmarks | Ali Forootani et.al. | 2505.20048 | link |
2025-05-26 | ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving | Xueyi Liu et.al. | 2505.20024 | link |
2025-05-26 | An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning | Andrew Zamai et.al. | 2505.19954 | null |
2025-05-26 | Uncertainty-Aware Safety-Critical Decision and Control for Autonomous Vehicles at Unsignalized Intersections | Ran Yu et.al. | 2505.19939 | null |
2025-05-26 | Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making | Yejin Son et.al. | 2505.19933 | null |
2025-05-26 | Attention! You Vision Language Model Could Be Maliciously Manipulated | Xiaosen Wang et.al. | 2505.19911 | null |
2025-05-26 | Large Language Models as Autonomous Spacecraft Operators in Kerbal Space Program | Alejandro Carrasco et.al. | 2505.19896 | link |
2025-05-26 | Deep Active Inference Agents for Delayed and Long-Horizon Environments | Yavar Taheri Yeganeh et.al. | 2505.19867 | link |
2025-05-26 | Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation | Nagito Saito et.al. | 2505.19846 | null |
2025-05-27 | PCDCNet: A Surrogate Model for Air Quality Forecasting with Physical-Chemical Dynamics and Constraints | Shuo Wang et.al. | 2505.19842 | null |
2025-05-26 | MedDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support | Qianyi Xu et.al. | 2505.19785 | null |
2025-05-26 | Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | Zican Hu et.al. | 2505.19761 | link |
2025-05-26 | DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving | Wenchao Sun et.al. | 2505.19692 | link |
2025-05-26 | Large Language Models for Planning: A Comprehensive and Systematic Survey | Pengfei Cao et.al. | 2505.19683 | link |
2025-05-26 | DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | Yichun Feng et.al. | 2505.19630 | link |
2025-05-26 | Software Engineering for Self-Adaptive Robotics: A Research Agenda | Shaukat Ali et.al. | 2505.19629 | null |
2025-05-26 | Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat | Pusheng Xu et.al. | 2505.19624 | null |
2025-05-23 | The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas | Ya Wu et.al. | 2505.18154 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077 | null |
2025-05-23 | Towards Uncertainty Aware Task Delegation and Human-AI Collaborative Decision-Making | Min Hun Lee et.al. | 2505.18066 | null |
2025-05-23 | Linear Mixture Distributionally Robust Markov Decision Processes | Zhishuai Liu et.al. | 2505.18044 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030 | null |
2025-05-23 | Empathic network learning for multi-expert emergency decision-making under incomplete and inconsistent information | Simin Shen et.al. | 2505.18009 | null |
2025-05-23 | Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL | Che Liu et.al. | 2505.17952 | null |
2025-05-23 | Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning | Zezhong Wang et.al. | 2505.17829 | null |
2025-05-23 | Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations | Boxu Chen et.al. | 2505.17812 | null |
2025-05-23 | Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Bálint Gyevnár et.al. | 2505.17801 | null |
2025-05-23 | TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving | Yanping Fu et.al. | 2505.17771 | link |
2025-05-23 | Soft-CAM: Making black box models self-explainable for high-stakes decisions | Kerol Djoumessi et.al. | 2505.17748 | null |
2025-05-23 | Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions | Ashwin George et.al. | 2505.17739 | link |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732 | null |
2025-05-23 | SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain | Jiawei Zhou et.al. | 2505.17727 | null |
2025-05-23 | A Distributionally-Robust Framework for Nuisance in Causal Effect Estimation | Akira Tanimoto et.al. | 2505.17717 | null |
2025-05-23 | SynRES: Towards Referring Expression Segmentation in the Wild via Synthetic Data | Dong-Hee Kim et.al. | 2505.17695 | null |
2025-05-23 | FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving | Shuang Zeng et.al. | 2505.17685 | null |
2025-05-22 | Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation | Moru Liu et.al. | 2505.16985 | link |
2025-05-22 | UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat | Desiree Fisker et.al. | 2505.16912 | null |
2025-05-22 | RealEngine: Simulating Autonomous Driving in Realistic Context | Junzhe Jiang et.al. | 2505.16902 | link |
2025-05-22 | A simulation and case study to evaluate the extrapolation performance of flexible Bayesian survival models when incorporating real-world data | Iain R. Timmins et.al. | 2505.16835 | link |
2025-05-22 | Chirp Delay-Doppler Domain Modulation: A New Paradigm of Integrated Sensing and Communication for Autonomous Vehicles | Zhuoran Li et.al. | 2505.16807 | link |
2025-05-22 | SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving | Xuesong Chen et.al. | 2505.16805 | null |
2025-05-22 | Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making | Qianlei Jia et.al. | 2505.16781 | null |
2025-05-22 | Sequential Monte Carlo for Policy Optimization in Continuous POMDPs | Hany Abdulsamad et.al. | 2505.16732 | null |
2025-05-22 | BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization | Xueyang Zhou et.al. | 2505.16640 | null |
2025-05-22 | Multivariate Latent Recalibration for Conditional Normalizing Flows | Victor Dheur et.al. | 2505.16636 | link |
2025-05-22 | SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | Sushant Gautam et.al. | 2505.16630 | null |
2025-05-22 | CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving | Huitong Yang et.al. | 2505.16524 | null |
2025-05-22 | AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios | Yuting Huang et.al. | 2505.16514 | link |
2025-05-22 | Human-like Semantic Navigation for Autonomous Driving using Knowledge Representation and Large Language Models | Augusto Luis Ballardini et.al. | 2505.16498 | null |
2025-05-22 | Internal Bias in Reasoning Models leads to Overthinking | Renfei Dang et.al. | 2505.16448 | null |
2025-05-22 | Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach | Xiaoran Yin et.al. | 2505.16422 | null |
2025-05-22 | WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | Zhepei Wei et.al. | 2505.16421 | link |
2025-05-22 | Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2) | Zhenjie Yang et.al. | 2505.16394 | null |
2025-05-22 | VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving | Yansong Qu et.al. | 2505.16377 | null |
2025-05-22 | No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery | Xiaoxue Han et.al. | 2505.16288 | null |
2025-05-21 | HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | Zhiwen Chen et.al. | 2505.15793 | null |
2025-05-21 | HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning | Xiaodong Mei et.al. | 2505.15703 | null |
2025-05-21 | Aligning Explanations with Human Communication | Jacopo Teneggi et.al. | 2505.15626 | link |
2025-05-21 | Trial and Return Option Strategy in Omnichannel Retailing | Yasuyuki Kusuda et.al. | 2505.15597 | null |
2025-05-21 | TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving | Hossein Hassani et.al. | 2505.15564 | null |
2025-05-21 | seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation | Andrew Caunes et.al. | 2505.15545 | link |
2025-05-21 | A Multi-Tiered Bayesian Network Coastal Compound Flood Analysis Framework | Ziyue Liu et.al. | 2505.15520 | null |
2025-05-21 | Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation | Ce Zhang et.al. | 2505.15491 | null |
2025-05-21 | Developing clinical informatics to support direct care and population health management: the VIEWER story | Robert Harland et.al. | 2505.15459 | null |
2025-05-21 | On the Generalization vs Fidelity Paradox in Knowledge Distillation | Suhas Kamasetty Ramesh et.al. | 2505.15442 | link |
2025-05-21 | Evaluation of Mobile Environment for Vehicular Visible Light Communication Using Multiple LEDs and Event Cameras | Ryota Soga et.al. | 2505.15412 | null |
2025-05-21 | RIS Beam Calibration for ISAC Systems: Modeling and Performance Analysis | Mengting Li et.al. | 2505.15403 | null |
2025-05-21 | Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control | Seongmin Park et.al. | 2505.15304 | null |
2025-05-21 | AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving | Kangan Qian et.al. | 2505.15298 | null |
2025-05-21 | Web-Shepherd: Advancing PRMs for Reinforcing Web Agents | Hyungjoo Chae et.al. | 2505.15277 | link |
2025-05-21 | Learning-based Autonomous Oversteer Control and Collision Avoidance | Seokjun Lee et.al. | 2505.15275 | null |
2025-05-21 | Identification of Probabilities of Causation: A Complete Characterization | Xin Shu et.al. | 2505.15274 | null |
2025-05-21 | LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval | Zhenyu Ning et.al. | 2505.15269 | null |
2025-05-21 | DC-Scene: Data-Centric Learning for 3D Scene Understanding | Ting Huang et.al. | 2505.15232 | link |
2025-05-21 | Finding separatrices of dynamical flows with Deep Koopman Eigenfunctions | Kabir V. Dabholkar et.al. | 2505.15231 | null |
2025-05-20 | Cost-Augmented Monte Carlo Tree Search for LLM-Assisted Planning | Zihao Zhang et.al. | 2505.14656 | null |
2025-05-20 | OSIRIS-REx Operational Key Decision Points: A Retrospective | Rich Burns et.al. | 2505.14632 | null |
2025-05-20 | Bellman operator convergence enhancements in reinforcement learning algorithms | David Krame Kadurha et.al. | 2505.14564 | null |
2025-05-20 | R2MED: A Benchmark for Reasoning-Driven Medical Retrieval | Lei Li et.al. | 2505.14558 | link |
2025-05-20 | Energy-Efficient Deep Reinforcement Learning with Spiking Transformers | Mohammad Irfan Uddin et.al. | 2505.14533 | null |
2025-05-20 | Interpretable Dual-Stream Learning for Local Wind Hazard Prediction in Vulnerable Communities | Mahmuda Akhter Nishu et.al. | 2505.14522 | null |
2025-05-20 | BACON: A fully explainable AI model with graded logic for decision making problems | Haishi Bai et.al. | 2505.14510 | null |
2025-05-20 | Enhanced Multimodal Aspect-Based Sentiment Analysis by LLM-Generated Rationales | Jun Cao et.al. | 2505.14499 | null |
2025-05-20 | MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance | Agam Goyal et.al. | 2505.14483 | null |
2025-05-20 | Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks | Kamal Singh et.al. | 2505.14459 | null |
2025-05-20 | Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI | Annika Bush et.al. | 2505.14435 | null |
2025-05-20 | MindVote: How LLMs Predict Human Decision-Making in Social Media Polls | Xutao Mao et.al. | 2505.14422 | null |
2025-05-20 | Solving Unit Commitment Problems with Graph Neural Network based Initial Commitment Prediction and Large Neighborhood Search | Linfeng Yang et.al. | 2505.14408 | null |
2025-05-20 | When Bias Backfires: The Modulatory Role of Counterfactual Explanations on the Adoption of Algorithmic Bias in XAI-Supported Human Decision-Making | Ulrike Kuhl et.al. | 2505.14377 | link |
2025-05-20 | Vid2World: Crafting Video Diffusion Models to Interactive World Models | Siqiao Huang et.al. | 2505.14357 | null |
2025-05-20 | EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection | Yijie Lu et.al. | 2505.14289 | null |
2025-05-20 | Embedded Mean Field Reinforcement Learning for Perimeter-defense Game | Li Wang et.al. | 2505.14209 | null |
2025-05-20 | High-dimensional Nonparametric Contextual Bandit Problem | Shogo Iwazaki et.al. | 2505.14102 | null |
2025-05-20 | CSAGC-IDS: A Dual-Module Deep Learning Network Intrusion Detection Model for Complex and Imbalanced Data | Yifan Zeng et.al. | 2505.14027 | null |
2025-05-20 | AUTOLAW: Enhancing Legal Compliance in Large Language Models via Case Law Generation and Jury-Inspired Deliberation | Tai D. Nguyen et.al. | 2505.14015 | null |
2025-05-19 | G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Liang Chen et.al. | 2505.13426 | link |
2025-05-19 | Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard | Si-Yang Liu et.al. | 2505.13421 | null |
2025-05-19 | Multi-Armed Bandits Meet Large Language Models | Djallel Bouneffouf et.al. | 2505.13355 | null |
2025-05-19 | Investigating the Vulnerability of LLM-as-a-Judge Architectures to Prompt-Injection Attacks | Narek Maloyan et.al. | 2505.13348 | null |
2025-05-19 | Cross-Cloud Data Privacy Protection: Optimizing Collaborative Mechanisms of AI Systems by Integrating Federated Learning and LLMs | Huaiying Luo et.al. | 2505.13292 | null |
2025-05-19 | Low-regret Strategies for Energy Systems Planning in a Highly Uncertain Future | Gabriel Wiest et.al. | 2505.13277 | null |
2025-05-19 | DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection | Yehao Liu et.al. | 2505.13266 | null |
2025-05-19 | Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities | Lili Zhang et.al. | 2505.13195 | null |
2025-05-19 | Role-Playing Evaluation for Large Language Models | Yassine El Boudouri et.al. | 2505.13157 | link |
2025-05-19 | Neurosymbolic Diffusion Models | Emile van Krieken et.al. | 2505.13138 | link |
2025-05-19 | Treatment Effect Estimation for Optimal Decision-Making | Dennis Frauen et.al. | 2505.13092 | null |
2025-05-19 | Orthogonal Survival Learners for Estimating Heterogeneous Treatment Effects from Time-to-Event Data | Dennis Frauen et.al. | 2505.13072 | null |
2025-05-19 | SNAPE-PM: Building and Utilizing Dynamic Partner Models for Adaptive Explanation Generation | Amelie S. Robrecht et.al. | 2505.13053 | link |
2025-05-19 | CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents | Rebecca Westhäußer et.al. | 2505.13044 | null |
2025-05-19 | EPIC: Explanation of Pretrained Image Classification Networks via Prototype | Piotr Borycki et.al. | 2505.12897 | link |
2025-05-19 | Scheduling of Flexible Manufacturing Systems Based on Place-Timed Petri Nets and Basis Reachability Graphs | Zhou He et.al. | 2505.12862 | null |
2025-05-19 | Geometric Formalization of First-Order Stochastic Dominance in $N$ Dimensions: A Tractable Path to Multi-Dimensional Economic Decision Analysis | Jingyuan Li et.al. | 2505.12840 | null |
2025-05-19 | Testing Identifiability and Transportability with Observational and Experimental Data | Konstantina Lelova et.al. | 2505.12801 | null |
2025-05-19 | Forewarned is Forearmed: A Survey on Large Language Model-based Agents in Autonomous Cyberattacks | Minrui Xu et.al. | 2505.12786 | null |
2025-05-19 | Beyond Individual UX: Defining Group Experience(GX) as a New Paradigm for Group-centered AI | Soohwan Lee et.al. | 2505.12780 | null |
2025-05-16 | REACT: Runtime-Enabled Active Collision-avoidance Technique for Autonomous Driving | Heye Huang et.al. | 2505.11474 | null |
2025-05-16 | Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis | Jing Liu et.al. | 2505.11401 | null |
2025-05-16 | Efficient End-to-End Learning for Decision-Making: A Meta-Optimization Approach | Rares Cristian et.al. | 2505.11360 | null |
2025-05-16 | LD-Scene: LLM-Guided Diffusion for Controllable Generation of Adversarial Safety-Critical Driving Scenarios | Mingxing Peng et.al. | 2505.11247 | null |
2025-05-16 | Learning traffic flows: Graph Neural Networks for Metamodelling Traffic Assignment | Oskar Bohn Lassen et.al. | 2505.11230 | null |
2025-05-16 | Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | Donghoon Lee et.al. | 2505.11221 | link |
2025-05-16 | Planar Velocity Estimation for Fast-Moving Mobile Robots Using Event-Based Optical Flow | Liam Boyle et.al. | 2505.11116 | null |
2025-05-16 | Blockchain-Enabled Decentralized Privacy-Preserving Group Purchasing for Energy Plans | Sid Chi-Kin Chau et.al. | 2505.11094 | null |
2025-05-16 | Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking | Changlun Li et.al. | 2505.11065 | link |
2025-05-16 | DRL-Based Injection Molding Process Parameter Optimization for Adaptive and Profitable Production | Joon-Young Kim et.al. | 2505.10988 | null |
2025-05-16 | Prior-Guided Diffusion Planning for Offline Reinforcement Learning | Donghyeon Ki et.al. | 2505.10881 | null |
2025-05-16 | Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | Conor F. Hayes et.al. | 2505.10762 | null |
2025-05-15 | Decision Making in Urban Traffic: A Game Theoretic Approach for Autonomous Vehicles Adhering to Traffic Rules | Keqi Shu et.al. | 2505.10690 | null |
2025-05-15 | GaussianFormer3D: Multi-Modal Gaussian-based Semantic Occupancy Prediction with 3D Deformable Attention | Lingjun Zhao et.al. | 2505.10685 | null |
2025-05-15 | Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | Annie Wong et.al. | 2505.10543 | link |
2025-05-15 | Batched Nonparametric Bandits via k-Nearest Neighbor UCB | Sakshi Arya et.al. | 2505.10498 | null |
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | Emotion-sensitive Explanation Model | Christian Schütze et.al. | 2505.10454 | null |
2025-05-15 | Can a Referendum Solve Problems of Shared Sovereignty on Mars? | Roxanne Ruixian Zhu et.al. | 2505.10434 | null |
2025-05-15 | Influence of prior and task generated emotions on XAI explanation retention and understanding | Birte Richter et.al. | 2505.10427 | null |
2025-05-15 | Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change | Jonathan Clifford Balloch et.al. | 2505.10330 | null |
2025-05-15 | Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning | Jiaju Qi et.al. | 2505.10296 | null |
2025-05-15 | From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making | Dubai Li et.al. | 2505.10282 | link |
2025-05-15 | Inferring Driving Maps by Deep Learning-based Trail Map Extraction | Michael Hubbertz et.al. | 2505.10258 | null |
2025-05-15 | Sage Deer: A Super-Aligned Driving Generalist Is Your Copilot | Hao Lu et.al. | 2505.10257 | null |
2025-05-15 | Context-aware collaborative pushing of heavy objects using skeleton-based intention prediction | Gokhan Solak et.al. | 2505.10239 | null |
2025-05-15 | Lost in Models? Structuring Managerial Decision Support in Process Mining with Multi-criteria Decision Making | Rob H. Bemthuis et.al. | 2505.10236 | null |
2025-05-15 | Force-Driven Validation for Collaborative Robotics in Automated Avionics Testing | Pietro Dardano et.al. | 2505.10224 | link |
2025-05-15 | A User Study Evaluating Argumentative Explanations in Diagnostic Decision Support | Felix Liedeker et.al. | 2505.10188 | null |
2025-05-15 | GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs | Longchao Da et.al. | 2505.10143 | null |
2025-05-15 | Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks | Guangjin Pan et.al. | 2505.10134 | link |
2025-05-15 | Application of YOLOv8 in monocular downward multiple Car Target detection | Shijie Lyu et.al. | 2505.10016 | null |
2025-05-15 | A Comprehensive Machine Learning Framework for Heart Disease Prediction: Performance Evaluation and Future Perspectives | Ali Azimi Lamir et.al. | 2505.09969 | null |
2025-05-15 | Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks | Ziyuan Zhang et.al. | 2505.09901 | link |
2025-05-14 | Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes | Nicola Marinello et.al. | 2505.09562 | null |
2025-05-14 | \textsc{rfPG}: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs | Maris F. L. Galesloot et.al. | 2505.09518 | null |
2025-05-14 | Risk-aware Markov Decision Processes Using Cumulative Prospect Theory | Thomas Brihaye et.al. | 2505.09514 | null |
2025-05-14 | A Bayesian Treatment Selection Design for Phase II Randomised Cancer Clinical Trials | Moka Komaki et.al. | 2505.09460 | null |
2025-05-15 | SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation | Achref Doula et.al. | 2505.09427 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models | Hongyang Wang et.al. | 2505.09415 | null |
2025-05-14 | Counterfactual Strategies for Markov Decision Processes | Paul Kobialka et.al. | 2505.09412 | null |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-14 | APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression | Srinivas Ravuri et.al. | 2505.09356 | link |
2025-05-14 | Adaptive control for multi-scale stochastic dynamical systems with stochastic next generation reservoir computing | Jiani Cheng et.al. | 2505.09327 | null |
2025-05-14 | TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving | Xuefeng Jiang et.al. | 2505.09315 | null |
2025-05-15 | Embodied Intelligent Industrial Robotics: Concepts and Techniques | Chaoran Zhang et.al. | 2505.09305 | link |
2025-05-14 | Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents” | Pedro M. P. Curvo et.al. | 2505.09289 | link |
2025-05-14 | Zero-Shot Multi-modal Large Language Model v.s. Supervised Deep Learning: A Comparative Study on CT-Based Intracranial Hemorrhage Subtyping | Yinuo Wang et.al. | 2505.09252 | link |
2025-05-14 | PreCare: Designing AI Assistants for Advance Care Planning (ACP) to Enhance Personal Value Exploration, Patient Knowledge, and Decisional Confidence | Yu Lun Hsu et.al. | 2505.09115 | null |
2025-05-14 | Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer | Minh Hoang Nguyen et.al. | 2505.09114 | link |
2025-05-14 | Sequential Treatment Effect Estimation with Unmeasured Confounders | Yingrong Wang et.al. | 2505.09113 | null |
2025-05-14 | OpenLKA: An Open Dataset of Lane Keeping Assist from Recent Car Models under Real-world Driving Conditions | Yuhang Wang et.al. | 2505.09092 | link |
2025-05-14 | Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions | Letian Wang et.al. | 2505.09074 | null |
2025-05-13 | Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Zongchuang Zhao et.al. | 2505.08725 | link |
2025-05-13 | Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering | Jason Zalev et.al. | 2505.08724 | null |
2025-05-13 | A Study of Data-driven Methods for Inventory Optimization | Lee Yeung Ping et.al. | 2505.08673 | null |
2025-05-13 | A Social Robot with Inner Speech for Dietary Guidance | Valerio Belcamino et.al. | 2505.08664 | link |
2025-05-13 | Chilean Avian flu and its marine impacts: an online Statistical Process Control task | Diego Carvalho do Nascimento et.al. | 2505.08629 | null |
2025-05-13 | Towards Resilient SDA: Graph Theory and Cooperative Control in Distributed Network Architectures | Nesrine Benchoubane et.al. | 2505.08520 | null |
2025-05-13 | Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting | Emlyn Williams et.al. | 2505.08458 | null |
2025-05-13 | Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models | Yanggang Xu et.al. | 2505.08448 | null |
2025-05-13 | Agent-as-a-Service based on Agent Network | Yuhan Zhu et.al. | 2505.08446 | null |
2025-05-13 | Explaining Autonomous Vehicles with Intention-aware Policy Graphs | Sara Montese et.al. | 2505.08404 | null |
2025-05-13 | A Comparison Between Human and Generative AI Decision-Making Attributes in Complex Health Services | Nandini Doreswamy et.al. | 2505.08360 | null |
2025-05-13 | An Identifiable Cost-Aware Causal Decision-Making Framework Using Counterfactual Reasoning | Ruichu Cai et.al. | 2505.08343 | null |
2025-05-13 | A Practical Introduction to Deep Reinforcement Learning | Yinghan Sun et.al. | 2505.08295 | null |
2025-05-13 | Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning | Ahmed Abouelazm et.al. | 2505.08264 | null |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-13 | DSADF: Thinking Fast and Slow for Decision Making | Alex Zhihao Dou et.al. | 2505.08189 | null |
2025-05-14 | Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage | Ruilin Liu et.al. | 2505.08167 | null |
2025-05-12 | Are LLMs complicated ethical dilemma analyzers? | Jiashen et.al. | 2505.08106 | link |
2025-05-12 | Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | Luu Tung Hai et.al. | 2505.08101 | link |
2025-05-12 | Fréchet Power-Scenario Distance: A Metric for Evaluating Generative AI Models across Multiple Time-Scales in Smart Grids | Yuting Cai et.al. | 2505.08082 | null |
2025-05-12 | Must Read: A Systematic Survey of Computational Persuasion | Nimet Beyza Bozdag et.al. | 2505.07775 | link |
2025-05-13 | Codifying Character Logic in Role-Playing | Letian Peng et.al. | 2505.07705 | link |
2025-05-12 | PatchTrack: A Comprehensive Analysis of ChatGPT’s Influence on Pull Request Outcomes | Daniel Ogenrwot et.al. | 2505.07700 | null |
2025-05-12 | JobHop: A Large-Scale Dataset of Career Trajectories | Iman Johary et.al. | 2505.07653 | null |
2025-05-12 | Noise Optimized Conditional Diffusion for Domain Adaptation | Lingkun Luo et.al. | 2505.07548 | null |
2025-05-12 | The Human-Data-Model Interaction Canvas for Visual Analytics | Jürgen Bernard et.al. | 2505.07534 | null |
2025-05-12 | Improved Mixing of Critical Hardcore Model | Zongchen Chen et.al. | 2505.07515 | null |
2025-05-12 | A Value of Information-based assessment of strain-based thickness loss monitoring in ship hull structures | Nicholas E. Silionis et.al. | 2505.07427 | null |
2025-05-12 | ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation | Truc Mai-Thanh Nguyen et.al. | 2505.07416 | link |
2025-05-12 | ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning | Hongyin Zhang et.al. | 2505.07395 | null |
2025-05-12 | Laypeople’s Attitudes Towards Fair, Affirmative, and Discriminatory Decision-Making Algorithms | Gabriel Lima et.al. | 2505.07339 | null |
2025-05-12 | Drive Fast, Learn Faster: On-Board RL for High Performance Autonomous Racing | Benedict Hildisch et.al. | 2505.07321 | null |
2025-05-12 | How Do Companies Manage the Environmental Sustainability of AI? An Interview Study About Green AI Efforts and Regulations | Ashmita Sampatsing et.al. | 2505.07317 | null |
2025-05-12 | Multi-Agent DRL for Multi-Objective Twin Migration Routing with Workload Prediction in 6G-enabled IoV | Peng Yin et.al. | 2505.07290 | null |
2025-05-12 | Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models | Yan Xie et.al. | 2505.07209 | null |
2025-05-12 | Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030 | Mouxiao Bian et.al. | 2505.07205 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-05-11 | Justiça Algorítmica: Instrumentalização, Limites Conceituais e Desafios na Engenharia de Software | Lucas Rodrigues Valença et.al. | 2505.07132 | null |
2025-05-11 | Constrained Online Decision-Making with Density Estimation Oracles | Haichen Hu et.al. | 2505.07101 | null |
2025-05-11 | DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models | Shucheng Huang et.al. | 2505.07084 | link |
2025-05-09 | Let Humanoids Hike! Integrative Skill Development on Complex Trails | Kwan-Yee Lin et.al. | 2505.06218 | null |
2025-05-09 | Robust Multi-Agent Decision-Making in Finite-Population Games | Shinkyu Park et.al. | 2505.06200 | null |
2025-05-09 | Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach | Tim Schneider et.al. | 2505.06182 | null |
2025-05-09 | Interaction-Aware Parameter Privacy-Preserving Data Sharing in Coupled Systems via Particle Filter Reinforcement Learning | Haokun Yu et.al. | 2505.06122 | null |
2025-05-09 | FIC-TSC: Learning Time Series Classification with Fisher Information Constraint | Xiwen Chen et.al. | 2505.06114 | null |
2025-05-09 | Centralized Decision-Making for Platooning By Using SPaT-Driven Reference Speeds | Melih Yazgan et.al. | 2505.06071 | null |
2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | link |
2025-05-09 | Efficient Information Updates in Compute-First Networking via Reinforcement Learning with Joint AoI and VoI | Jianpeng Qi et.al. | 2505.06025 | null |
2025-05-09 | From Pixels to Perception: Interpretable Predictions via Instance-wise Grouped Feature Selection | Moritz Vandenhirtz et.al. | 2505.06003 | link |
2025-05-09 | Differentiable Fuzzy Neural Networks for Recommender Systems | Stephan Bartl et.al. | 2505.06000 | link |
2025-05-09 | Priority-Driven Safe Model Predictive Control Approach to Autonomous Driving Applications | Francesco Prignoli et.al. | 2505.05933 | null |
2025-05-09 | Assessing the Dynamics of the Coffee Value Chain in Davao del Sur: An Agent-Based Modeling Approach | Lucia Stephanie B. Sibala et.al. | 2505.05797 | null |
2025-05-09 | Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition | Weiyi Zhang et.al. | 2505.05768 | null |
2025-05-08 | Adaptive Stress Testing Black-Box LLM Planners | Neeloy Chakraborty et.al. | 2505.05665 | null |
2025-05-08 | Closing the Loop: Motion Prediction Models beyond Open-Loop Benchmarks | Mohamed-Khalil Bouzidi et.al. | 2505.05638 | null |
2025-05-08 | Trading Under Uncertainty: A Distribution-Based Strategy for Futures Markets Using FutureQuant Transformer | Wenhao Guo et.al. | 2505.05595 | null |
2025-05-08 | Anticipating Gaming to Incentivize Improvement: Guiding Agents in (Fair) Strategic Classification | Sura Alhanouti et.al. | 2505.05594 | null |
2025-05-08 | Quantum-network nodes with real-time noise mitigation using spectator qubits | S. J. H. Loenen et.al. | 2505.05582 | null |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-08 | RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | Pouria Behnoudfar et.al. | 2505.05452 | null |
2025-05-08 | DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning | Wenru Liu et.al. | 2505.05360 | null |
2025-05-08 | ICNN-enhanced 2SP: Leveraging input convex neural networks for solving two-stage stochastic programming | Yu Liu et.al. | 2505.05261 | link |
2025-05-08 | PADriver: Towards Personalized Autonomous Driving | Genghua Kou et.al. | 2505.05240 | null |
2025-05-08 | Multi-Objective Reinforcement Learning for Adaptive Personalized Autonomous Driving | Hendrik Surmann et.al. | 2505.05223 | null |
2025-05-08 | Incentive-Aware Machine Learning; Robustness, Fairness, Improvement & Causality | Chara Podimata et.al. | 2505.05211 | null |
2025-05-08 | Dukawalla: Voice Interfaces for Small Businesses in Africa | Elizabeth Ankrah et.al. | 2505.05170 | null |
2025-05-08 | Bandit Max-Min Fair Allocation | Tsubasa Harada et.al. | 2505.05169 | null |
2025-05-08 | Day-Ahead Bidding Strategies for Wind Farm Operators under a One-Price Balancing Scheme | Max Bruninx et.al. | 2505.05153 | null |
2025-05-08 | Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | Xuyang Chen et.al. | 2505.05126 | null |
2025-05-08 | X-Driver: Explainable Autonomous Driving with Vision-Language Models | Wei Liu et.al. | 2505.05098 | null |
2025-05-08 | Hybrid Personalization Using Declarative and Procedural Memory Modules of the Cognitive Architecture ACT-R | Kevin Innerebner et.al. | 2505.05083 | null |
2025-05-08 | LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture | Kazuki Atsuta et.al. | 2505.04980 | null |
2025-05-08 | Network Digital Twin for Route Optimization in 5G/B5G Transport Slicing with What-If Analysis | Rebecca Aben-Athar et.al. | 2505.04879 | null |
2025-05-08 | Federated Learning for Cyber Physical Systems: A Comprehensive Survey | Minh K. Quan et.al. | 2505.04873 | null |
2025-05-07 | Is there Value in Reinforcement Learning? | Lior Fox et.al. | 2505.04822 | null |
2025-05-07 | ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling | Xiao Wang et.al. | 2505.04802 | null |
2025-05-07 | Robust ML Auditing using Prior Knowledge | Jade Garcia Bourrée et.al. | 2505.04796 | null |
2025-05-07 | Primal-dual algorithm for contextual stochastic combinatorial optimization | Louis Bouvier et.al. | 2505.04757 | null |
2025-05-07 | Active Sampling for MRI-based Sequential Decision Making | Yuning Du et.al. | 2505.04586 | link |
2025-05-07 | Stow: Robotic Packing of Items into Fabric Pods | Nicolas Hudson et.al. | 2505.04572 | null |
2025-05-07 | An imageless magnetic resonance framework for fast and cost-effective decision-making | Alba González-Cebrián et.al. | 2505.04550 | null |
2025-05-07 | DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once | Qi Zhou et.al. | 2505.04526 | link |
2025-05-07 | Do We Still Need to Work on Odometry for Autonomous Driving? | Cedric Le Gentil et.al. | 2505.04438 | null |
2025-05-07 | Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters | David Exler et.al. | 2505.04393 | null |
2025-05-07 | Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle | Petr Jahoda et.al. | 2505.04392 | null |
2025-05-07 | Design and Evaluation of an NDN-Based Network for Distributed Digital Twins | Chen Chen et.al. | 2505.04326 | null |
2025-05-07 | Verification of Digital Twins using Classical and Statistical Model Checking | Raghavendran Gunasekaran et.al. | 2505.04322 | null |
2025-05-07 | Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning | Ruize Zhang et.al. | 2505.04317 | null |
2025-05-07 | KERAIA: An Adaptive and Explainable Framework for Dynamic Knowledge Representation and Reasoning | Stephen Richard Varey et.al. | 2505.04313 | null |
2025-05-07 | PPO-ACT: Proximal Policy Optimization with Adversarial Curriculum Transfer for Spatial Public Goods Games | Zhaoqilin Yang et.al. | 2505.04302 | null |
2025-05-07 | GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance | Sofia Jamil et.al. | 2505.04284 | link |
2025-05-07 | Multi-Agent Reinforcement Learning-based Cooperative Autonomous Driving in Smart Intersections | Taoyuan Yu et.al. | 2505.04231 | null |
2025-05-07 | VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning | Trinh T. L. Vuong et.al. | 2505.04192 | link |
2025-05-07 | Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning – Empirical analysis based on UK COVID-19 epidemic data | Baida Zhang et.al. | 2505.04161 | null |
2025-05-07 | Natural Language Generation in Healthcare: A Review of Methods and Applications | Mengxian Lyu et.al. | 2505.04073 | null |
2025-05-07 | Shadow Wireless Intelligence: Large Language Model-Driven Reasoning in Covert Communications | Yuanai Xie et.al. | 2505.04068 | null |
2025-05-07 | Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks | Xuyang Wang et.al. | 2505.04046 | link |
2025-05-07 | Ethical Appetite: Consumer Preferences and Price Premiums for Animal Welfare-Friendly Food Products | Voraprapa Nakavachara et.al. | 2505.04042 | null |
2025-05-06 | Frenet Corridor Planner: An Optimal Local Path Planning Framework for Autonomous Driving | Faizan M. Tariq et.al. | 2505.03695 | null |
2025-05-06 | Moral Testing of Autonomous Driving Systems | Wenbing Tang et.al. | 2505.03683 | null |
2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
2025-05-06 | RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration | Huajie Tan et.al. | 2505.03673 | link |
2025-05-06 | Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time | Celeste Veronese et.al. | 2505.03668 | null |
2025-05-06 | Synthesizing Images on Perceptual Boundaries of ANNs for Uncovering and Manipulating Human Perceptual Variability | Chen Wei et.al. | 2505.03641 | null |
2025-05-06 | CUI-MET: Clinical Utility Index Dose Optimization Approach for Multiple-Dose, Multiple-Outcome Randomized Trial Designs | Fanni Zhang et.al. | 2505.03633 | null |
2025-05-06 | Meta-reasoning Using Attention Maps and Its Applications in Cloud Robotics | Adrian Lendinez et.al. | 2505.03587 | null |
2025-05-06 | Decision Making under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets | Charita Dellaporta et.al. | 2505.03585 | null |
2025-05-06 | Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning | Jiacheng Wang et.al. | 2505.03533 | null |
2025-05-06 | Coop-WD: Cooperative Perception with Weighting and Denoising for Robust V2V Communication | Chenguang Liu et.al. | 2505.03528 | null |
2025-05-06 | Real-time small area estimation of food security in Zimbabwe: integrating mobile-phone and face-to-face surveys using joint multilevel regression and poststratification | Sahoko Ishida et.al. | 2505.03517 | link |
2025-05-06 | LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs | Xinyuan Zhang et.al. | 2505.03460 | null |
2025-05-06 | Attention-aggregated Attack for Boosting the Transferability of Facial Adversarial Examples | Jian-Wei Li et.al. | 2505.03383 | null |
2025-05-06 | DroidRetriever: An Autonomous Navigation and Information Integration System Facilitating Mobile Sensemaking | Yiheng Bian et.al. | 2505.03364 | null |
2025-05-06 | RIFT: Closed-Loop RL Fine-Tuning for Realistic and Controllable Traffic Simulation | Keyu Chen et.al. | 2505.03344 | null |
2025-05-06 | Artificial Behavior Intelligence: Technology, Challenges, and Future Directions | Kanghyun Jo et.al. | 2505.03315 | null |
2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | null |
2025-05-06 | MDPs with a State Sensing Cost | Vansh Kapoor et.al. | 2505.03280 | null |
2025-05-06 | RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation | Tiantian Gan et.al. | 2505.03275 | null |
2025-05-05 | A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | Qianjun Pan et.al. | 2505.02665 | null |
2025-05-05 | Timing Is Everything: Finding the Optimal Fusion Points in Multimodal Medical Imaging | Valerio Guarrasi et.al. | 2505.02467 | null |
2025-05-05 | ReeM: Ensemble Building Thermodynamics Model for Efficient HVAC Control via Hierarchical Reinforcement Learning | Yang Deng et.al. | 2505.02439 | null |
2025-05-04 | Risk Assessment and Threat Modeling for safe autonomous driving technology | Ian Alexis Wong Paz et.al. | 2505.02231 | null |
2025-05-04 | Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning | Shangzhe Li et.al. | 2505.02228 | null |
2025-05-04 | LLM-Guided Probabilistic Program Induction for POMDP Model Estimation | Aidan Curtis et.al. | 2505.02216 | null |
2025-05-04 | Large Language Models are overconfident and amplify human bias | Fengfei Sun et.al. | 2505.02151 | null |
2025-05-04 | Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving | Alexey Nekrasov et.al. | 2505.02148 | null |
2025-05-04 | DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving | Xinmeng Hou et.al. | 2505.02123 | link |
2025-05-04 | Enhancing Safety Standards in Automated Systems Using Dynamic Bayesian Networks | Kranthi Kumar Talluri et.al. | 2505.02050 | null |
2025-05-04 | Sharp empirical Bernstein bounds for the variance of bounded random variables | Diego Martinez-Taboada et.al. | 2505.01987 | null |
2025-05-04 | D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | Chenran Zhao et.al. | 2505.01979 | null |
2025-05-04 | Analyzing Cognitive Differences Among Large Language Models through the Lens of Social Worldview | Jiatao Li et.al. | 2505.01967 | null |
2025-05-03 | DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks | Ali Al-Bustami et.al. | 2505.01893 | link |
2025-05-03 | Securing 5G and Beyond-Enabled UAV Networks: Resilience Through Multiagent Learning and Transformers Detection | Joseanne Viana et.al. | 2505.01885 | null |
2025-05-03 | CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | Vladimir Frants et.al. | 2505.01882 | null |
2025-05-03 | PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications | Trisanth Srinivasan et.al. | 2505.01881 | null |
2025-05-03 | Bayesian learning of the optimal action-value function in a Markov decision process | Jiaqi Guo et.al. | 2505.01859 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | Harnessing the Power of LLMs, Informers and Decision Transformers for Intent-driven RAN Management in 6G | Md Arafat Habib et.al. | 2505.01841 | null |
2025-05-02 | An Efficient Real-Time Planning Method for Swarm Robotics Based on an Optimal Virtual Tube | Pengda Mao et.al. | 2505.01380 | null |
2025-05-02 | Power System Transition Planning: An Industry-Aligned Framework for Long-Term Optimization | Ahmed Al-Shafei et.al. | 2505.01331 | null |
2025-05-02 | Core-Set Selection for Data-efficient Land Cover Segmentation | Keiller Nogueira et.al. | 2505.01225 | link |
2025-05-02 | Design for a Digital Twin in Clinical Patient Care | Anna-Katharina Nitschke et.al. | 2505.01206 | null |
2025-05-02 | A Secured Triad of IoT, Machine Learning, and Blockchain for Crop Forecasting in Agriculture | Najmus Sakib Sizan et.al. | 2505.01196 | null |
2025-05-02 | Exploring the Impact of Explainable AI and Cognitive Capabilities on Users’ Decisions | Federico Maria Cau et.al. | 2505.01192 | null |
2025-05-02 | Secure Cluster-Based Hierarchical Federated Learning in Vehicular Networks | M. Saeid HaghighiFard et.al. | 2505.01186 | null |
2025-05-02 | A flexible Bayesian non-parametric mixture model reveals multiple dependencies of swap errors in visual working memory | Puria Radmard et.al. | 2505.01178 | null |
2025-05-02 | Empirical Comparison of Lightweight Forecasting Models for Seasonal and Non-Seasonal Time Series | Thanh Son Nguyen et.al. | 2505.01163 | null |
2025-05-02 | Exploring Equity of Climate Policies using Multi-Agent Multi-Objective Reinforcement Learning | Palok Biswas et.al. | 2505.01115 | null |
2025-05-02 | Multi-Objective Reinforcement Learning for Water Management | Zuzanna Osika et.al. | 2505.01094 | null |
2025-05-02 | Retrieval Augmented Learning: A Retrial-based Large Language Model Self-Supervised Learning and Autonomous Knowledge Generation | Zongyuan Li et.al. | 2505.01073 | null |
2025-05-02 | LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment | Jiahuan Long et.al. | 2505.00980 | null |
2025-05-02 | A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | Xin Chen et.al. | 2505.00973 | null |
2025-05-02 | Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | Yuewen Mei et.al. | 2505.00972 | null |
2025-05-02 | What Makes Teamwork Work? A Multimodal Case Study on Emotions and Diagnostic Expertise in an Intelligent Tutoring System | Xiaoshan Huang et.al. | 2505.00948 | link |
2025-05-02 | SSRLBot: Designing and Developing an LLM-based Agent using Socially Shared Regulated Learning | Xiaoshan Huang et.al. | 2505.00945 | null |
2025-05-02 | Autonomous Embodied Agents: When Robotics Meets Deep Learning Reasoning | Roberto Bigazzi et.al. | 2505.00935 | link |
2025-05-01 | Co-Designing a Knowledge Graph Navigation Interface: A Participatory Approach | Stanislava Gardasevic et.al. | 2505.00907 | null |
2025-05-01 | LLM Ethics Benchmark: A Three-Dimensional Assessment System for Evaluating Moral Reasoning in Large Language Models | Junfeng Jiao et.al. | 2505.00853 | link |
2025-05-01 | Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 | Phanish Puranam et.al. | 2505.00603 | null |
2025-05-01 | A Novel Feature-Aware Chaotic Image Encryption Scheme For Data Security and Privacy in IoT and Edge Networks | Muhammad Shahbaz Khan et.al. | 2505.00593 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et.al. | 2505.00511 | null |
2025-05-01 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-01 | Variational OOD State Correction for Offline Reinforcement Learning | Ke Jiang et.al. | 2505.00503 | null |
2025-05-01 | UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces | Alaa Saleh et.al. | 2505.00472 | null |
2025-05-01 | Machine Learning Meets Transparency in Osteoporosis Risk Assessment: A Comparative Study of ML and Explainability Analysis | Farhana Elias et.al. | 2505.00410 | null |
2025-05-01 | iMacSR: Intermediate Multi-Access Supervision and Regularization in Training Autonomous Driving Models | Wei-Bin Kou et.al. | 2505.00404 | null |
2025-05-01 | Learning to Estimate Package Delivery Time in Mixed Imbalanced Delivery and Pickup Logistics Services | Jinhui Yi et.al. | 2505.00375 | null |
2025-05-01 | From GNNs to Trees: Multi-Granular Interpretability for Graph Neural Networks | Jie Yang et.al. | 2505.00364 | null |
2025-05-01 | CognitionNet: A Collaborative Neural Network for Play Style Discovery in Online Skill Gaming Platform | Rukma Talwadker et.al. | 2505.00325 | null |
2025-05-01 | FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving | Wei-Bin Kou et.al. | 2505.00318 | null |
2025-05-01 | Statistical Learning for Heterogeneous Treatment Effects: Pretraining, Prognosis, and Prediction | Maximilian Schuessler et.al. | 2505.00310 | null |
2025-05-01 | AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality | Biling Wang et.al. | 2505.00308 | null |
2025-05-01 | Temporal Attention Evolutional Graph Convolutional Network for Multivariate Time Series Forecasting | Xinlong Zhao et.al. | 2505.00302 | null |
2025-05-01 | LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving | Zhijie Qiao et.al. | 2505.00284 | link |
2025-05-02 | Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | Vishnu Sarukkai et.al. | 2505.00234 | null |
2025-05-01 | Predicting Estimated Times of Restoration for Electrical Outages Using Longitudinal Tabular Transformers | Bogireddy Sai Prasanna Teja et.al. | 2505.00225 | null |
2025-04-30 | PSN Game: Game-theoretic Planning via a Player Selection Network | Tianyu Qiu et.al. | 2505.00213 | null |
2025-04-30 | A Survey of Interactive Generative Video | Jiwen Yu et.al. | 2504.21853 | null |
2025-04-30 | Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic | Mikihisa Yuasa et.al. | 2504.21841 | null |
2025-04-30 | LLM-based Interactive Imitation Learning for Robotic Manipulation | Jonas Werner et.al. | 2504.21769 | link |
2025-04-30 | TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training | Shengqian Wang et.al. | 2504.21735 | null |
2025-04-30 | REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining | Abu Mohammed Raisuddin et.al. | 2504.21699 | null |
2025-04-30 | Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction | Zihan Zhou et.al. | 2504.21692 | null |
2025-04-30 | MF-LLM: Simulating Collective Decision Dynamics via a Mean-Field Large Language Model Framework | Qirui Mi et.al. | 2504.21582 | null |
2025-04-30 | A Study on Group Decision Making Problem Based on Fuzzy Reasoning and Bayesian Networks | Shui-jin Rong et.al. | 2504.21568 | null |
2025-04-30 | Online Experimental Design for Network Tomography | Xuchuang Wang et.al. | 2504.21549 | null |
2025-04-30 | Leveraging Systems and Control Theory for Social Robotics: A Model-Based Behavioral Control Approach to Human-Robot Interaction | Maria Morão Patrício et.al. | 2504.21548 | link |
2025-04-30 | UAV-VLN: End-to-End Vision Language guided Navigation for UAVs | Pranav Saxena et.al. | 2504.21432 | null |
2025-04-30 | Towards Improved Cervical Cancer Screening: Vision Transformer-Based Classification and Interpretability | Khoa Tuan Nguyen et.al. | 2504.21340 | null |
2025-04-30 | BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models | Zhiting Fan et.al. | 2504.21299 | null |
2025-04-30 | Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA | Xuanzhao Dong et.al. | 2504.21252 | link |
2025-04-30 | Data-driven operator learning for energy-efficient building control | Yuexin Bian et.al. | 2504.21243 | null |
2025-04-29 | Generalised Label-free Artefact Cleaning for Real-time Medical Pulsatile Time Series | Xuhang Chen et.al. | 2504.21209 | link |
2025-04-29 | Composite Safety Potential Field for Highway Driving Risk Assessment | Dachuan Zuo et.al. | 2504.21158 | null |
2025-04-29 | Comparative Analysis of Weather-Based Indexes and the Actuaries Climate Index $^{TM}$ for Crop Yield Prediction | Cem Yavrum et.al. | 2504.21143 | null |
2025-04-29 | Toward Efficient Exploration by Large Language Model Agents | Dilip Arumugam et.al. | 2504.20997 | null |
2025-04-29 | Real-Time Wayfinding Assistant for Blind and Low-Vision Users | Dabbrata Das et.al. | 2504.20976 | null |
2025-04-29 | XPG-RL: Reinforcement Learning with Explainable Priority Guidance for Efficiency-Boosted Mechanical Search | Yiting Zhang et.al. | 2504.20969 | null |
2025-04-29 | Opinion-Driven Decision-Making for Multi-Robot Navigation through Narrow Corridors | Norah K. Alghamdi et.al. | 2504.20947 | null |
2025-04-29 | GiBy: A Giant-Step Baby-Step Classifier For Anomaly Detection In Industrial Control Systems | Sarad Venugopalan et.al. | 2504.20906 | null |
2025-04-29 | Modeling AI-Human Collaboration as a Multi-Agent Adaptation | Prothit Sen et.al. | 2504.20903 | link |
2025-04-30 | Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms | Meltem Tatlı et.al. | 2504.20877 | null |
2025-04-29 | Bitcoin, a DAO? | Mark C. Ballandies et.al. | 2504.20838 | null |
2025-04-29 | Intelligent Task Offloading in VANETs: A Hybrid AI-Driven Approach for Low-Latency and Energy Efficiency | Tariq Qayyum et.al. | 2504.20735 | null |
2025-04-29 | Decision-centric fairness: Evaluation and optimization for resource allocation problems | Simon De Vos et.al. | 2504.20642 | link |
2025-04-29 | DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction | Chris Child et.al. | 2504.20535 | null |
2025-04-29 | Safe Bottom-Up Flexibility Provision from Distributed Energy Resources | Costas Mylonas et.al. | 2504.20529 | null |
2025-04-29 | SteelBlastQC: Shot-blasted Steel Surface Dataset with Interpretable Detection of Surface Defects | Irina Ruzavina et.al. | 2504.20510 | link |
2025-04-29 | The Panel Complexity of Sortition: Is 12 Angry Men Enough? | Johannes Brustle et.al. | 2504.20508 | null |
2025-04-29 | Neural Stereo Video Compression with Hybrid Disparity Compensation | Shiyin Jiang et.al. | 2504.20383 | null |
2025-04-29 | AKIBoards: A Structure-Following Multiagent System for Predicting Acute Kidney Injury | David Gordon et.al. | 2504.20368 | null |
2025-04-28 | A Virtual Cybersecurity Department for Securing Digital Twins in Water Distribution Systems | Mohammadhossein Homaei et.al. | 2504.20266 | null |
2025-04-28 | AI Recommendation Systems for Lane-Changing Using Adherence-Aware Reinforcement Learning | Weihao Sun et.al. | 2504.20187 | null |
2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | null |
2025-04-28 | Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models | Xin Wang et.al. | 2504.20020 | null |
2025-04-28 | Socially-Aware Autonomous Driving: Inferring Yielding Intentions for Safer Interactions | Jing Wang et.al. | 2504.20004 | null |
2025-04-28 | Automated decision-making for dynamic task assignment at scale | Riccardo Lo Bianco et.al. | 2504.19933 | link |
2025-04-28 | Demographic Parity-aware Individualized Treatment Rules | Wenhai Cui et.al. | 2504.19914 | null |
2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null |
2025-04-28 | LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects | Guangyi Liu et.al. | 2504.19838 | link |
2025-04-28 | Bias correction in treatment effect estimates following data-driven biomarker cutoff selection | Chi Zhang et.al. | 2504.19776 | null |
2025-04-28 | A New Decision- Making Method Based on Shannon Entropy Analysis | Hamid Babaei et.al. | 2504.19753 | null |
2025-04-28 | The ATLAS of Traffic Lights: A Reliable Perception Framework for Autonomous Driving | Rupert Polley et.al. | 2504.19722 | null |
2025-04-28 | Open-set Anomaly Segmentation in Complex Scenarios | Song Xia et.al. | 2504.19706 | null |
2025-04-28 | Explaining Vision GNNs: A Semantic and Visual Analysis of Graph-based Image Classification | Nikolaos Chaidos et.al. | 2504.19682 | null |
2025-04-28 | From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review | Mohamed Amine Ferrag et.al. | 2504.19678 | null |
2025-04-28 | ARTEMIS: Autoregressive End-to-End Trajectory Planning with Mixture of Experts for Autonomous Driving | Renju Feng et.al. | 2504.19580 | link |
2025-04-28 | CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes | Mohammad Altillawi et.al. | 2504.19557 | null |
2025-04-28 | \textit{From Freshness to Effectiveness}: Goal-Oriented Sampling for Remote Decision Making | Aimin Li et.al. | 2504.19507 | null |
2025-04-28 | An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination | Dixiao Wei et.al. | 2504.19480 | null |
2025-04-28 | Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks | Yi-Long Lu et.al. | 2504.19445 | null |
2025-04-28 | Quantum-Inspired Cournot Model | Amarendra Sharma et.al. | 2504.19420 | null |
2025-04-28 | Geometry of efficient weight vectors | Kristóf Ábele-Nagy et.al. | 2504.19400 | null |
2025-04-25 | Enhancing Visual Interpretability and Explainability in Functional Survival Trees and Forests | Giuseppe Loffredo et.al. | 2504.18498 | null |
2025-04-25 | Automatic Bias Detection in Source Code Review | Yoseph Berhanu Alebachew et.al. | 2504.18449 | null |
2025-04-25 | NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration | Haotian Dong et.al. | 2504.18448 | null |
2025-04-25 | Energy Security and Resilience: Reviewing Concepts and Advancing Planning Perspectives for Transforming Integrated Energy Systems | Richard Schmitz et.al. | 2504.18396 | null |
2025-04-25 | Explainable AI for UAV Mobility Management: A Deep Q-Network Approach for Handover Minimization | Irshad A. Meer et.al. | 2504.18371 | null |
2025-04-25 | Interpretable Affordance Detection on 3D Point Clouds with Probabilistic Prototypes | Maximilian Xiling Li et.al. | 2504.18355 | null |
2025-04-25 | Testing Individual Fairness in Graph Neural Networks | Roya Nasiri et.al. | 2504.18353 | null |
2025-04-25 | Depth-Constrained ASV Navigation with Deep RL and Limited Sensing | Amirhossein Zhalehmehrabi et.al. | 2504.18253 | null |
2025-04-25 | Automated Work Records for Precision Agriculture Management: A Low-Cost GNSS IoT Solution for Paddy Fields in Central Japan | M. Grosse et.al. | 2504.18222 | null |
2025-04-25 | What is the Added Value of UDA in the VFM Era? | Brunó B. Englert et.al. | 2504.18190 | null |
2025-04-25 | Study on Real-Time Road Surface Reconstruction Using Stereo Vision | Deepak Ghimire et.al. | 2504.18112 | null |
2025-04-25 | Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization | Jiayi Chen et.al. | 2504.18057 | null |
2025-04-25 | DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification | Guohao Huo et.al. | 2504.18046 | null |
2025-04-25 | Differential Privacy-Driven Framework for Enhancing Heart Disease Prediction | Yazan Otoum et.al. | 2504.18007 | null |
2025-04-25 | Chatperone: An LLM-Based Negotiable Scaffolding System for Mediating Adolescent Mobile Interactions | Suwon Yoon et.al. | 2504.17997 | null |
2025-04-24 | CaRL: Learning Scalable Planning Policies with Simple Rewards | Bernhard Jaeger et.al. | 2504.17838 | null |
2025-04-25 | Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | Yuanchang Ye et.al. | 2504.17671 | null |
2025-04-24 | SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Peng Ye et.al. | 2504.17603 | null |
2025-04-24 | Auditing the Ethical Logic of Generative AI Models | W. Russell Neuman et.al. | 2504.17544 | null |
2025-04-24 | An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm | Ahmadreza Shateri et.al. | 2504.17540 | null |
2025-04-24 | Learning Isometric Embeddings of Road Networks using Multidimensional Scaling | Juan Carlos Climent Pardo et.al. | 2504.17534 | null |
2025-04-24 | Goal-Oriented Time-Series Forecasting: Foundation Framework Design | Luca-Andrei Fechete et.al. | 2504.17493 | null |
2025-04-24 | Longitudinal Control for Autonomous Racing with Combustion Engine Vehicles | Phillip Pitschi et.al. | 2504.17418 | null |
2025-04-24 | S2S-Net: Addressing the Domain Gap of Heterogeneous Sensor Systems in LiDAR-Based Collective Perception | Sven Teufel et.al. | 2504.17399 | null |
2025-04-24 | Towards User-Centred Design of AI-Assisted Decision-Making in Law Enforcement | Vesna Nowack et.al. | 2504.17393 | null |
2025-04-25 | Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset | Oussema Dhaouadi et.al. | 2504.17371 | null |
2025-04-24 | Doubly Adaptive Social Learning | Marco Carpentiero et.al. | 2504.17370 | null |
2025-04-24 | Tokenizing Stock Prices for Enhanced Multi-Step Forecast and Prediction | Zhuohang Zhu et.al. | 2504.17313 | null |
2025-04-24 | Perturbed Gradient Descent via Convex Quadratic Approximation for Nonconvex Bilevel Optimization | Nazanin Abolfazli et.al. | 2504.17215 | null |
2025-04-24 | A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Yangxinyu Xie et.al. | 2504.17200 | null |
2025-04-24 | Causal rule ensemble approach for multi-arm data | Ke Wan et.al. | 2504.17166 | null |
2025-04-23 | A Novel Hybrid Approach Using an Attention-Based Transformer + GRU Model for Predicting Cryptocurrency Prices | Esam Mahdi et.al. | 2504.17079 | null |
2025-04-23 | A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs | Jalal Arabneydi et.al. | 2504.17006 | null |
2025-04-23 | Meta-Learning Online Dynamics Model Adaptation in Off-Road Autonomous Driving | Jacob Levy et.al. | 2504.16923 | null |
2025-04-23 | Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models | Xuyang Zhu et.al. | 2504.16883 | null |
2025-04-23 | Adversarial Knapsack for Sequential Competitive Resource Allocation | Omkar Thakoor et.al. | 2504.16752 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-23 | Bridging Data Gaps and Building Knowledge Networks in Indian Football Analytics | Sneha Nanavati et.al. | 2504.16572 | null |
2025-04-23 | Using Causal Inference to Test Systems with Hidden and Interacting Variables: An Evaluative Case Study | Michael Foster et.al. | 2504.16526 | null |
2025-04-23 | Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation | Junrong Yue et.al. | 2504.16516 | null |
2025-04-23 | Circinus: Efficient Query Planner for Compound ML Serving | Banruo Liu et.al. | 2504.16397 | null |
2025-04-23 | ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs | Fahmida Liza Piya et.al. | 2504.16394 | link |
2025-04-23 | SILM: A Subjective Intent Based Low-Latency Framework for Multiple Traffic Participants Joint Trajectory Prediction | Qu Weiming et.al. | 2504.16377 | null |
2025-04-23 | DPGP: A Hybrid 2D-3D Dual Path Potential Ghost Probe Zone Prediction Framework for Safe Autonomous Driving | Weiming Qu et.al. | 2504.16374 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-23 | Universal Online Contention Resolution with Preselected Order | Junyao Zhao et.al. | 2504.16327 | null |
2025-04-22 | FairPlay: A Collaborative Approach to Mitigate Bias in Datasets for Improved AI Fairness | Tina Behzad et.al. | 2504.16255 | null |
2025-04-22 | LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities | Thomas Schmied et.al. | 2504.16078 | null |
2025-04-22 | A Comparative and Measurement-Based Study on Real-Time Network KPI Extraction Methods for 5G and Beyond Applications | Batuhan Kaplan et.al. | 2504.16039 | null |
2025-04-22 | LLMs meet Federated Learning for Scalable and Secure IoT Management | Yazan Otoum et.al. | 2504.16032 | null |
2025-04-22 | Navigating the State of Cognitive Flow: Context-Aware AI Interventions for Effective Reasoning Support | Dinithi Dissanayake et.al. | 2504.16021 | null |
2025-04-22 | A UAV-Aided Digital Twin Framework for IoT Networks with High Accuracy and Synchronization | Ghofran Khalaf et.al. | 2504.15967 | null |
2025-04-22 | Supporting Data-Frame Dynamics in AI-assisted Decision Making | Chengbo Zheng et.al. | 2504.15894 | null |
2025-04-22 | MS-Occ: Multi-Stage LiDAR-Camera Fusion for 3D Semantic Occupancy Prediction | Zhiqiang Wei et.al. | 2504.15888 | null |
2025-04-22 | Beyond Attention: Investigating the Threshold Where Objective Robot Exclusion Becomes Subjective | Clarissa Sabrina Arlinghaus et.al. | 2504.15886 | null |
2025-04-23 | Bidirectional Task-Motion Planning Based on Hierarchical Reinforcement Learning for Strategic Confrontation | Qizhen Wu et.al. | 2504.15876 | null |
2025-04-22 | An Extended Horizon Tactical Decision-Making for Automated Driving Based on Monte Carlo Tree Search | Karim Essalmi et.al. | 2504.15869 | null |
2025-04-22 | The 2nd MERCADO Workshop at IEEE VIS 2025: Multimodal Experiences for Remote Communication Around Data Online | Wolfgang Büschel et.al. | 2504.15859 | null |
2025-04-22 | A closer look at how large language models trust humans: patterns and biases | Valeria Lerman et.al. | 2504.15801 | null |
2025-04-22 | Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models | Quentin Herau et.al. | 2504.15776 | null |
2025-04-22 | Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction | Tobias Demmler et.al. | 2504.15766 | null |
2025-04-22 | Enhancing Tennis Training with Real-Time Swing Data Visualisation in Immersive Virtual Reality | Ryan Najami et.al. | 2504.15746 | null |
2025-04-22 | SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Manjunath D et.al. | 2504.15728 | null |
2025-04-22 | Implementing Rational Choice Functions with LLMs and Measuring their Alignment with User Preferences | Anna Karnysheva et.al. | 2504.15719 | null |
2025-04-22 | FinTextSim: Enhancing Financial Text Analysis with BERTopic | Simon Jehnen et.al. | 2504.15683 | null |
2025-04-22 | Trustworthy Decentralized Autonomous Machines: A New Paradigm in Automation Economy | Fernando Castillo et.al. | 2504.15676 | null |
2025-04-22 | Symbolic Runtime Verification and Adaptive Decision-Making for Robot-Assisted Dressing | Yasmin Rafiq et.al. | 2504.15666 | null |
2025-04-21 | Diffusion Bridge Models for 3D Medical Image Translation | Shaorong Zhang et.al. | 2504.15267 | null |
2025-04-21 | Position: Bayesian Statistics Facilitates Stakeholder Participation in Evaluation of Generative AI | Yanan Long et.al. | 2504.15211 | null |
2025-04-21 | Scalable Discrete Event Simulation Tool for Large-Scale Cyber-Physical Energy Systems: Advancing System Efficiency and Scalability | Khandaker Akramul Haque et.al. | 2504.15198 | null |
2025-04-21 | Beyond Binary Opinions: A Deep Reinforcement Learning-Based Approach to Uncertainty-Aware Competitive Influence Maximization | Qi Zhang et.al. | 2504.15131 | null |
2025-04-21 | Optimal Behavior Planning for Implicit Communication using a Probabilistic Vehicle-Pedestrian Interaction Model | Markus Amann et.al. | 2504.15098 | null |
2025-04-21 | Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation | Wangyu Wu et.al. | 2504.15085 | null |
2025-04-21 | Distributed Cognition for AI-supported Remote Operations: Challenges and Research Directions | Rune Møberg Jacobsen et.al. | 2504.14996 | null |
2025-04-21 | Integrating Response Time and Attention Duration in Bayesian Preference Learning for Multiple Criteria Decision Aiding | Jiaxuan Jiang et.al. | 2504.14938 | null |
2025-04-21 | Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment | Jinwoo Choi et.al. | 2504.14805 | null |
2025-04-20 | Adaptive Field Effect Planner for Safe Interactive Autonomous Driving on Curved Roads | Qinghao Li et.al. | 2504.14747 | null |
2025-04-20 | Wireless Large AI Model: Shaping the AI-Native Future of 6G and Beyond | Fenghao Zhu et.al. | 2504.14653 | null |
2025-04-20 | Surrogate Fitness Metrics for Interpretable Reinforcement Learning | Philipp Altmann et.al. | 2504.14645 | null |
2025-04-20 | Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Polina Gordienko et.al. | 2504.14624 | null |
2025-04-20 | HealthGenie: Empowering Users with Healthy Dietary Guidance through Knowledge Graph and Large Language Models | Fan Gao et.al. | 2504.14594 | null |
2025-04-20 | SMTT: Novel Structured Multi-task Tracking with Graph-Regularized Sparse Representation for Robust Thermal Infrared Target Tracking | Shang Zhang et.al. | 2504.14566 | null |
2025-04-20 | Should Benevolent Deception be Allowed in EHMI? A Mechanism Explanation Based on Game Theory | Linkun Liu et.al. | 2504.14539 | null |
2025-04-20 | Causality for Natural Language Processing | Zhijing Jin et.al. | 2504.14530 | null |
2025-04-20 | Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding | Tong Zeng et.al. | 2504.14526 | link |
2025-04-22 | ApexNav: An Adaptive Exploration Strategy for Zero-Shot Object Navigation with Target-centric Semantic Fusion | Mingjie Zhang et.al. | 2504.14478 | link |
2025-04-20 | Seeing Through Risk: A Symbolic Approximation of Prospect Theory | Ali Arslan Yousaf et.al. | 2504.14448 | null |
2025-04-18 | Constrained Average-Reward Intermittently Observable MDPs | Konstantin Avrachenkov et.al. | 2504.13823 | null |
2025-04-18 | Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation | Xiangrong et.al. | 2504.13684 | null |
2025-04-18 | Continual Pre-Training is (not) What You Need in Domain Adaption | Pin-Er Chen et.al. | 2504.13603 | null |
2025-04-18 | LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals | Shanshuai Yuan et.al. | 2504.13596 | null |
2025-04-18 | Monitor and Recover: A Paradigm for Future Research on Distribution Shift in Learning-Enabled Cyber-Physical Systems | Vivian Lin et.al. | 2504.13484 | null |
2025-04-18 | LLM Sensitivity Evaluation Framework for Clinical Diagnosis | Chenwei Yan et.al. | 2504.13475 | null |
2025-04-18 | Testing the Fault-Tolerance of Multi-Sensor Fusion Perception in Autonomous Driving Systems | Haoxiang Tian et.al. | 2504.13420 | null |
2025-04-18 | A Model-Based Approach to Imitation Learning through Multi-Step Predictions | Haldun Balim et.al. | 2504.13413 | null |
2025-04-18 | LangCoop: Collaborative Driving with Language | Xiangbo Gao et.al. | 2504.13406 | link |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-18 | Intelligent data collection for network discrimination in material flow analysis using Bayesian optimal experimental design | Jiankan Liao et.al. | 2504.13382 | null |
2025-04-17 | ChartQA-X: Generating Explanations for Charts | Shamanthak Hegde et.al. | 2504.13275 | null |
2025-04-17 | Causal-Copilot: An Autonomous Causal Analysis Agent | Xinyue Wang et.al. | 2504.13263 | null |
2025-04-17 | CPG-EVAL: A Multi-Tiered Benchmark for Evaluating the Chinese Pedagogical Grammar Competence of Large Language Models | Dong Wang et.al. | 2504.13261 | null |
2025-04-17 | Long Range Navigator (LRN): Extending robot planning horizons beyond metric maps | Matt Schmittle et.al. | 2504.13149 | null |
2025-04-17 | PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition | Jongseo Lee et.al. | 2504.13140 | null |
2025-04-17 | Why Ask One When You Can Ask $k$ ? Two-Stage Learning-to-Defer to a Set of Experts | Yannis Montreuil et.al. | 2504.12988 | null |
2025-04-17 | Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | Jiatai Wang et.al. | 2504.12982 | null |
2025-04-17 | Safe Physics-Informed Machine Learning for Dynamics and Control | Jan Drgona et.al. | 2504.12952 | null |
2025-04-17 | Explainable AI in Usable Privacy and Security: Challenges and Opportunities | Vincent Freiberger et.al. | 2504.12931 | null |
2025-04-17 | Sliced-Wasserstein Distance-based Data Selection | Julien Pallage et.al. | 2504.12918 | null |
2025-04-17 | DashChat: Interactive Authoring of Industrial Dashboard Design Prototypes through Conversation with LLM-Powered Agents | S. Shen et.al. | 2504.12865 | null |
2025-04-17 | Enhancing Decentralization in Blockchain Decision-Making Through Quadratic Voting and Its Generalization | Lyudmila Kovalchuk et.al. | 2504.12859 | null |
2025-04-17 | Questions: A Taxonomy for Critical Reflection in Machine-Supported Decision-Making | Simon W. S. Fischer et.al. | 2504.12830 | null |
2025-04-17 | UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Pengxuan Yang et.al. | 2504.12826 | link |
2025-04-17 | Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks | Nassim Belmecheri et.al. | 2504.12817 | null |
2025-04-17 | Approaching Current Challenges in Developing a Software Stack for Fully Autonomous Driving | Simon Sagmeister et.al. | 2504.12813 | null |
2025-04-17 | Enhancing Explainability and Reliable Decision-Making in Particle Swarm Optimization through Communication Topologies | Nitin Gupta et.al. | 2504.12803 | null |
2025-04-17 | Distributed Intelligent Sensing and Communications for 6G: Architecture and Use Cases | Kyriakos Stylianopoulos et.al. | 2504.12765 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-17 | Collaborative Perception Datasets for Autonomous Driving: A Review | Naibang Wang et.al. | 2504.12696 | link |
2025-04-17 | Two Tasks, One Goal: Uniting Motion and Planning for Excellent End To End Autonomous Driving Performance | Lin Liu et.al. | 2504.12667 | null |
2025-04-17 | Autonomous Drone for Dynamic Smoke Plume Tracking | Srijan Kumar Pal et.al. | 2504.12664 | null |
2025-04-17 | Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs | Younghun Lee et.al. | 2504.12633 | null |
2025-04-16 | Towards Human-Centered Early Prediction Models for Academic Performance in Real-World Contexts | Han Zhang et.al. | 2504.12236 | null |
2025-04-16 | Predictive Multiplicity in Survival Models: A Method for Quantifying Model Uncertainty in Predictive Maintenance Applications | Mustafa Cavus et.al. | 2504.12156 | null |
2025-04-16 | Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging | Tristan S. W. Stevens et.al. | 2504.12154 | null |
2025-04-16 | Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving | Yafeng Bu et.al. | 2504.12109 | null |
2025-04-16 | Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework | Jack Preuveneers et.al. | 2504.12090 | null |
2025-04-16 | Contract-based hierarchical control using predictive feasibility value functions | Felix Berkel et.al. | 2504.12036 | null |
2025-04-16 | Evolutionary Reinforcement Learning for Interpretable Decision-Making in Supply Chain Management | Stefano Genetti et.al. | 2504.12023 | null |
2025-04-16 | Action Anticipation from SoccerNet Football Video Broadcasts | Mohamad Dalal et.al. | 2504.12021 | null |
2025-04-16 | Scaled Block Vecchia Approximation for High-Dimensional Gaussian Process Emulation on GPUs | Qilong Pan et.al. | 2504.12004 | null |
2025-04-16 | Novel-view X-ray Projection Synthesis through Geometry-Integrated Deep Learning | Daiqi Liu et.al. | 2504.11953 | link |
2025-04-17 | Causality-enhanced Decision-Making for Autonomous Mobile Robots in Dynamic Environments | Luca Castri et.al. | 2504.11901 | link |
2025-04-16 | Discrimination-free Insurance Pricing with Privatized Sensitive Attributes | Tianhe Zhang et.al. | 2504.11775 | null |
2025-04-16 | Inversion of biological strategies in engineering technology: in case underwater soft robot | Siqing Chen et.al. | 2504.11722 | null |
2025-04-16 | Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation | Ji Ma et.al. | 2504.11671 | null |
2025-04-15 | DamageCAT: A Deep Learning Transformer Framework for Typology-Based Post-Disaster Building Damage Categorization | Yiming Xiao et.al. | 2504.11637 | link |
2025-04-15 | Graph-Theoretic Measures for Interpretable Multicriteria Decision Making in Emergency Department Layout Optimization | Ola Sarhan et.al. | 2504.11620 | null |
2025-04-15 | Dueling Deep Reinforcement Learning for Financial Time Series | Bruno Giorgio et.al. | 2504.11601 | null |
2025-04-15 | eXplainable AI for data driven control: an inverse optimal control approach | Federico Porcari et.al. | 2504.11446 | null |
2025-04-15 | Measures of Variability for Risk-averse Policy Gradient | Yudong Luo et.al. | 2504.11412 | null |
2025-04-15 | A Winner-Takes-All Mechanism for Event Generation | Yongkang Huo et.al. | 2504.11374 | link |
2025-04-15 | Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Wang Bill Zhu et.al. | 2504.11373 | link |
2025-04-15 | Evaluating DAO Sustainability and Longevity Through On-Chain Governance Metrics | Silvio Meneguzzo et.al. | 2504.11341 | null |
2025-04-15 | Uncertainty Estimation for Trust Attribution to Speed-of-Sound Reconstruction with Variational Networks | Sonia Laguna et.al. | 2504.11307 | null |
2025-04-15 | DeepSelective: Feature Gating and Representation Matching for Interpretable Clinical Prediction | Ruochi Zhang et.al. | 2504.11264 | null |
2025-04-15 | The Lifetime of the Covid Memorial Wall: Modelling with Collections Demography, Social Media Data and Citizen Science | Josep Grau-Bové et.al. | 2504.11196 | null |
2025-04-15 | Clinically Interpretable Survival Risk Stratification in Head and Neck Cancer Using Bayesian Networks and Markov Blankets | Keyur D. Shah et.al. | 2504.11188 | null |
2025-04-15 | Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items | Minjie Zou et.al. | 2504.11186 | null |
2025-04-15 | Revealing Covert Attention by Analyzing Human and Reinforcement Learning Agent Gameplay | Henrik Krauss et.al. | 2504.11118 | null |
2025-04-15 | Towards global equity in political polarization research | Max Falkenberg et.al. | 2504.11090 | null |
2025-04-15 | “Even explanations will not help in trusting [this] fundamentally biased system”: A Predictive Policing Case-Study | Siddharth Mehrotra et.al. | 2504.11020 | null |
2025-04-16 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | Why am I seeing this? Towards recognizing social media recommender systems with missing recommendations | Sabrina Guidotti et.al. | 2504.11000 | null |
2025-04-15 | The Effectiveness of Business Process Visualisations: a Systematic Literature Review | E. C. Overes et.al. | 2504.10971 | null |
2025-04-15 | Can Vision-Language Models Understand and Interpret Dynamic Gestures from Pedestrians? Pilot Datasets and Exploration Towards Instructive Nonverbal Commands for Cooperative Autonomous Vehicles | Tonko E. W. Bossen et.al. | 2504.10873 | null |
2025-04-15 | Towards Spatially-Aware and Optimally Faithful Concept-Based Explanations | Shubham Kumar et.al. | 2504.10833 | null |
2025-04-15 | CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives | Ayoung Lee et.al. | 2504.10823 | null |
2025-04-15 | PatrolVision: Automated License Plate Recognition in the wild | Anmol Singhal Navya Singhal et.al. | 2504.10810 | null |
2025-04-14 | Decoupled Diffusion Sparks Adaptive Scene Generation | Yunsong Zhou et.al. | 2504.10485 | null |
2025-04-14 | The Price of Competitive Information Disclosure | Siddhartha Banerjee et.al. | 2504.10459 | null |
2025-04-15 | Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA | Michał Turski et.al. | 2504.10419 | link |
2025-04-14 | Performance of Large Language Models in Supporting Medical Diagnosis and Treatment | Diogo Sousa et.al. | 2504.10405 | null |
2025-04-14 | Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling? | Olha Shaposhnyk et.al. | 2504.10397 | null |
2025-04-14 | Flying Hand: End-Effector-Centric Framework for Versatile Aerial Manipulation Teleoperation and Policy Learning | Guanqi He et.al. | 2504.10334 | null |
2025-04-14 | Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving | Xiaoshan Zhou et.al. | 2504.10296 | null |
2025-04-14 | Characterizing LLM-driven Social Network: The Chirper.ai Case | Yiming Zhu et.al. | 2504.10286 | null |
2025-04-14 | Who Speaks for Ethics? How Demographics Shape Ethical Advocacy in Software Development | Lauren Olson et.al. | 2504.10276 | null |
2025-04-14 | LMFormer: Lane based Motion Prediction Transformer | Harsh Yadav et.al. | 2504.10275 | null |
2025-04-14 | Vision based driving agent for race car simulation environments | Gergely Bári et.al. | 2504.10266 | null |
2025-04-14 | Can Competition Enhance the Proficiency of Agents Powered by Large Language Models in the Realm of News-driven Time Series Forecasting? | Yuxuan Zhang et.al. | 2504.10210 | null |
2025-04-14 | Towards Quantifying Commonsense Reasoning with Mechanistic Insights | Abhinav Joshi et.al. | 2504.10077 | null |
2025-04-14 | Using Reinforcement Learning to Integrate Subjective Wellbeing into Climate Adaptation Decision Making | Arthur Vandervoort et.al. | 2504.10031 | null |
2025-04-14 | Sequence models for by-trial decoding of cognitive strategies from neural data | Rick den Otter et.al. | 2504.10028 | link |
2025-04-14 | Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration | Jiani Ni et.al. | 2504.10007 | null |
2025-04-14 | Towards Resilient Tracking in Autonomous Vehicles: A Distributionally Robust Input and State Estimation Approach | Kasra Azizi et.al. | 2504.09974 | null |
2025-04-14 | Truncated Matrix Completion - An Empirical Study | Rishhabh Naik et.al. | 2504.09873 | null |
2025-04-14 | EthosGPT: Mapping Human Value Diversity to Advance Sustainable Development Goals (SDGs) | Luyao Zhang et.al. | 2504.09861 | link |
2025-04-14 | Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand | Korel Gundem et.al. | 2504.09831 | link |
2025-04-11 | Interaction-Required Suggestions for Control, Ownership, and Awareness in Human-AI Co-Writing | Kenneth C. Arnold et.al. | 2504.08726 | null |
2025-04-11 | Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Vinal Asodia et.al. | 2504.08704 | null |
2025-04-11 | TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing | Neil Reichlin et.al. | 2504.08655 | link |
2025-04-11 | Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage Constraints | Mohamed S. Talamali et.al. | 2504.08585 | null |
2025-04-11 | Control Co-Design Under Uncertainty for Offshore Wind Farms: Optimizing Grid Integration, Energy Storage, and Market Participation | Himanshu Sharma et.al. | 2504.08555 | null |
2025-04-11 | Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications | Mohamed Sabry et.al. | 2504.08551 | null |
2025-04-11 | Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review | Jörg Gamerdinger et.al. | 2504.08540 | null |
2025-04-11 | Dark Haptics: Exploring Manipulative Haptic Design in Mobile User Interfaces | Chenge Tang et.al. | 2504.08471 | null |
2025-04-11 | Road Grip Uncertainty Estimation Through Surface State Segmentation | Jyri Maanpää et.al. | 2504.08452 | null |
2025-04-11 | DRIP: DRop unImportant data Points – Enhancing Machine Learning Efficiency with Grad-CAM-Based Real-Time Data Prioritization for On-Device Training | Marcus Rüb et.al. | 2504.08364 | null |
2025-04-11 | SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis | Yi Chen et.al. | 2504.08361 | link |
2025-04-11 | Scalable Conflict-free Decision Making with Photons | Kohei Konaka et.al. | 2504.08331 | null |
2025-04-11 | Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare | Yonchanok Khaokaew et.al. | 2504.08260 | null |
2025-04-11 | InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement | Zhaoliang Zheng et.al. | 2504.08240 | null |
2025-04-11 | CATCH-FORM-ACTer: Compliance-Aware Tactile Control and Hybrid Deformation Regulation-Based Action Transformer for Viscoelastic Object Manipulation | Hongjun Ma et.al. | 2504.08232 | null |
2025-04-11 | VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions | Ziyan Liu et.al. | 2504.08219 | null |
2025-04-11 | Optimizing Power Grid Topologies with Reinforcement Learning: A Survey of Methods and Challenges | Erica van der Sar et.al. | 2504.08210 | link |
2025-04-11 | Advancing Autonomous Vehicle Safety: A Combined Fault Tree Analysis and Bayesian Network Approach | Lansu Dai et.al. | 2504.08206 | null |
2025-04-11 | EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models | Minjae Seo et.al. | 2504.08205 | null |
2025-04-11 | Neural Encoding and Decoding at Scale | Yizi Zhang et.al. | 2504.08201 | null |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | link |
2025-04-10 | Open Datasets for Grid Modeling and Visualization: An Alberta Power Network Case | Ben Cheng et.al. | 2504.07870 | link |
2025-04-10 | Probabilistic Multi-Criteria Decision-Making for Circularity Performance of Modern Methods of Construction Products | Yiping Meng et.al. | 2504.07850 | null |
2025-04-10 | RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions | Youngwan Jin et.al. | 2504.07603 | null |
2025-04-10 | Enhancements for Developing a Comprehensive AI Fairness Assessment Standard | Avinash Agarwal et.al. | 2504.07516 | null |
2025-04-10 | Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning | Zhiwei Zhang et.al. | 2504.07507 | null |
2025-04-10 | Bottleneck Identification in Resource-Constrained Project Scheduling via Constraint Relaxation | Lukáš Nedbálek et.al. | 2504.07495 | null |
2025-04-10 | Probability Estimation and Scheduling Optimization for Battery Swap Stations via LRU-Enhanced Genetic Algorithm and Dual-Factor Decision System | Anzhen Li et.al. | 2504.07453 | link |
2025-04-10 | Estimand framework development for eGFR slope estimation and comparative analyses across various estimation methods | Tuo Wang et.al. | 2504.07411 | null |
2025-04-09 | RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models | Lv Qingsong et.al. | 2504.07282 | null |
2025-04-09 | Language Modeling for the Future of Finance: A Quantitative Survey into Metrics, Tasks, and Data Opportunities | Nikita Tatarinov et.al. | 2504.07274 | null |
2025-04-09 | Better Decisions through the Right Causal World Model | Elisabeth Dillies et.al. | 2504.07257 | null |
2025-04-09 | Reinforcement Learning Dynamics of Network Vaccination and Hysteresis: A Double-Edged Sword for Addressing Vaccine Hesitancy | Atticus McWhorter et.al. | 2504.07254 | link |
2025-04-09 | FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Gene Chou et.al. | 2504.07093 | link |
2025-04-09 | AssistanceZero: Scalably Solving Assistance Games | Cassidy Laidlaw et.al. | 2504.07091 | link |
2025-04-11 | Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration | Kostas Hatalis et.al. | 2504.06943 | null |
2025-04-09 | Conformal Robust Beamforming via Generative Channel Models | Xin Su et.al. | 2504.06934 | null |
2025-04-09 | A Game Theoretic Treatment of Contagion in Trade Networks | John S. McAlister et.al. | 2504.06905 | null |
2025-04-09 | Persona Dynamics: Unveiling the Impact of Personality Traits on Agents in Text-Based Games | Seungwon Lim et.al. | 2504.06868 | link |
2025-04-09 | MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Chang Nie et.al. | 2504.06863 | null |
2025-04-09 | Integrated Sensing and Communications Over the Years: An Evolution Perspective | Di Zhang et.al. | 2504.06830 | null |
2025-04-09 | Regret Bounds for Robust Online Decision Making | Alexander Appel et.al. | 2504.06820 | null |
2025-04-09 | A Meaningful Perturbation Metric for Evaluating Explainability Methods | Danielle Cohen et.al. | 2504.06800 | null |
2025-04-09 | Communicating complex statistical models to a public health audience: translating science into action with the FARSI approach | Mattia Stival et.al. | 2504.06787 | null |
2025-04-09 | Probabilistic Grading and Classification System for End-of-Life Building Components Toward Circular Economy Loop | Yiping Meng et.al. | 2504.06782 | null |
2025-04-09 | AI, Help Me Think $\unicode{x2014}$ but for Myself: Assisting People in Complex Decision-Making by Providing Different Kinds of Cognitive Support | Leon Reicherts et.al. | 2504.06771 | null |
2025-04-09 | Learning-Inspired Fuzzy Logic Algorithms for Enhanced Control of Oscillatory Systems | Vuong Anh Trung et.al. | 2504.06706 | null |
2025-04-09 | Bridging Research and Standardization: Innovations and Methodology for 6G Standard Contributions | Francesca Conserva et.al. | 2504.06682 | null |
2025-04-09 | Ranking alternatives from opinions on criteria | Takahiro Suzuki et.al. | 2504.06676 | null |
2025-04-09 | Dynamic Residual Safe Reinforcement Learning for Multi-Agent Safety-Critical Scenarios Decision-Making | Kaifeng Wang et.al. | 2504.06670 | null |
2025-04-10 | Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction | Nan Peng et.al. | 2504.06647 | link |
2025-04-09 | A Multi-Modal Interaction Framework for Efficient Human-Robot Collaborative Shelf Picking | Abhinav Pathak et.al. | 2504.06593 | null |
2025-04-09 | Recasting Arrow’s Impossibility Theorem as Gödelian Incomputability | Ori Livson et.al. | 2504.06589 | null |
2025-04-08 | V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models | Xiangxi Zheng et.al. | 2504.06148 | link |
2025-04-09 | Robo-taxi Fleet Coordination at Scale via Reinforcement Learning | Luigi Tresca et.al. | 2504.06125 | link |
2025-04-08 | Co-evolution of cooperation and resource allocation in the advantageous environment-based spatial multi-game using adaptive control | Chengbin Sun et.al. | 2504.06112 | null |
2025-04-08 | Uncertainty-Aware Hybrid Machine Learning in Virtual Sensors for Vehicle Sideslip Angle Estimation | Abinav Kalyanasundaram et.al. | 2504.06105 | null |
2025-04-08 | Explainable AI for building energy retrofitting under data scarcity | Panagiota Rempi et.al. | 2504.06055 | null |
2025-04-08 | Smart Exploration in Reinforcement Learning using Bounded Uncertainty Models | J. S. van Hulst et.al. | 2504.05978 | null |
2025-04-08 | AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems | Zhuoli Zhuang et.al. | 2504.05950 | null |
2025-04-08 | Widening the Role of Group Recommender Systems with CAJO | Francesco Ricci et.al. | 2504.05934 | null |
2025-04-08 | Interpreting the Win Ratio in Hierarchical Composite Endpoints: Challenges, Limitations, and Perspectives with Examples from Chronic Kidney Disease Trials | Henrik F. Thomsen et.al. | 2504.05909 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | Why do zeroes happen? A model-based approach for demand classification | Ivan Svetunkov et.al. | 2504.05894 | null |
2025-04-08 | Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments | Dolton Fernandes et.al. | 2504.05840 | null |
2025-04-08 | MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models | Pengfei Zhou et.al. | 2504.05782 | link |
2025-04-09 | Unraveling Human-AI Teaming: A Review and Outlook | Bowen Lou et.al. | 2504.05755 | null |
2025-04-08 | SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes | Minghao Ning et.al. | 2504.05727 | link |
2025-04-08 | VADIS: A Visual Analytics Pipeline for Dynamic Document Representation and Information-Seeking | Rui Qiu et.al. | 2504.05697 | null |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-08 | A Lightweight Large Vision-language Model for Multimodal Medical Images | Belal Alsinglawi et.al. | 2504.05575 | null |
2025-04-07 | Choices in the transformative Anthropocene | Miguel Pinheiro et.al. | 2504.05538 | null |
2025-04-07 | Deep Reinforcement Learning Algorithms for Option Hedging | Andrei Neagu et.al. | 2504.05521 | link |
2025-04-07 | The challenge of uncertainty quantification of large language models in medicine | Zahra Atf et.al. | 2504.05278 | null |
2025-04-07 | Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images | Wenzhao Tang et.al. | 2504.05249 | null |
2025-04-07 | A moving target in AI-assisted decision-making: Dataset shift, model updating, and the problem of update opacity | Joshua Hatherley et.al. | 2504.05210 | null |
2025-04-07 | Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework | Yu Min Park et.al. | 2504.05187 | null |
2025-04-07 | Interpretable Style Takagi-Sugeno-Kang Fuzzy Clustering | Suhang Gu et.al. | 2504.05125 | null |
2025-04-07 | Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Jon Gutiérrez Zaballa et.al. | 2504.05119 | null |
2025-04-07 | Flexible Estimation of the Heterogeneous Non-Parametric Component in a Relative Survival Cure Model | Fabrizio Di Mari et.al. | 2504.05093 | link |
2025-04-07 | AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends | Victor Monzon Baeza et.al. | 2504.05071 | null |
2025-04-07 | MIAT: Maneuver-Intention-Aware Transformer for Spatio-Temporal Trajectory Prediction | Chandra Raskoti et.al. | 2504.05059 | null |
2025-04-07 | Deconstructing Jazz Piano Style Using Machine Learning | Huw Cheston et.al. | 2504.05009 | null |
2025-04-07 | Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs | Ling Hu et.al. | 2504.04994 | null |
2025-04-07 | Constrained Gaussian Process Motion Planning via Stein Variational Newton Inference | Jiayun Li et.al. | 2504.04936 | null |
2025-04-07 | GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network | Yunxiang Liu et.al. | 2504.04862 | null |
2025-04-07 | Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation | Sebastian Schmidt et.al. | 2504.04841 | null |
2025-04-07 | Explanation-Driven Interventions for Artificial Intelligence Model Customization: Empowering End-Users to Tailor Black-Box AI in Rhinocytology | Andrea Esposito et.al. | 2504.04833 | null |
2025-04-07 | Multimodal Agricultural Agent Architecture (MA3): A New Paradigm for Intelligent Agricultural Decision-Making | Zhuoning Xu et.al. | 2504.04789 | null |
2025-04-07 | Playing Non-Embedded Card-Based Games with Reinforcement Learning | Tianyang Wu et.al. | 2504.04783 | link |
2025-04-07 | TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context | Shubham Kumar Nigam et.al. | 2504.04737 | null |
2025-04-07 | Usability Testing of an Explainable AI-enhanced Tool for Clinical Decision Support: Insights from the Reflexive Thematic Analysis | Mohammad Golam Kibria et.al. | 2504.04703 | null |
2025-04-07 | Large-Scale Mixed-Traffic and Intersection Control using Multi-agent Reinforcement Learning | Songyang Liu et.al. | 2504.04691 | link |
2025-04-04 | Epicast 2.0: A large-scale, demographically detailed, agent-based model for simulating respiratory pathogen spread in the United States | Prescott C. Alexander et.al. | 2504.03604 | null |
2025-04-04 | Towards deployment-centric multimodal AI beyond vision and language | Xianyuan Liu et.al. | 2504.03603 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-04 | Agentic Knowledgeable Self-awareness | Shuofei Qiao et.al. | 2504.03553 | link |
2025-04-04 | Optimistic Learning for Communication Networks | George Iosifidis et.al. | 2504.03499 | null |
2025-04-04 | Multi-encoder nnU-Net outperforms Transformer models with self-supervised pretraining | Seyedeh Sahar Taheri Otaghsara et.al. | 2504.03474 | null |
2025-04-04 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-04 | DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models | Sathish Kumar et.al. | 2504.03423 | null |
2025-04-04 | A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications | Sebastian Gasche et.al. | 2504.03256 | null |
2025-04-04 | Augmenting Human Cognition With Generative AI: Lessons From AI-Assisted Decision-Making | Zelun Tony Zhang et.al. | 2504.03207 | null |
2025-04-04 | A Systematic Review on Women’s Participation in Agricultural Work and Nutritional Outcomes | Pallavi Gupta et.al. | 2504.03202 | null |
2025-04-04 | Water Mapping and Change Detection Using Time Series Derived from the Continuous Monitoring of Land Disturbance Algorithm | Huong Pham et.al. | 2504.03170 | null |
2025-04-04 | NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Kexin Tian et.al. | 2504.03164 | null |
2025-04-04 | MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories | Natalie Tirabassi et.al. | 2504.03153 | null |
2025-04-04 | Performance-Aware Control of Modular Batteries For Fast Frequency Response | Yutong He et.al. | 2504.03150 | null |
2025-04-03 | Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence | Anita Rau et.al. | 2504.02799 | null |
2025-04-03 | On Composable and Parametric Uncertainty in Systems Co-Design | Yujun Huang et.al. | 2504.02766 | null |
2025-04-03 | Echoes of the hidden: Uncovering coordination beyond network structure | Shahar Somin et.al. | 2504.02757 | null |
2025-04-03 | TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models | Xinquan Wang et.al. | 2504.02712 | null |
2025-04-06 | Semiparametric Counterfactual Regression | Kwangho Kim et.al. | 2504.02694 | link |
2025-04-03 | A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts | Francesco Bianchin et.al. | 2504.02679 | null |
2025-04-03 | Digital Twins for Internet of Battlespace Things (IoBT) Coalitions | Athanasios Gkelias et.al. | 2504.02561 | null |
2025-04-03 | Exploring Individual Factors in the Adoption of LLMs for Specific Software Engineering Tasks | Stefano Lambiase et.al. | 2504.02553 | null |
2025-04-03 | Human-Centered Development of an Explainable AI Framework for Real-Time Surgical Risk Surveillance | Andrea E Davidson et.al. | 2504.02551 | null |
2025-04-03 | Online Multivariate Regularized Distributional Regression for High-dimensional Probabilistic Electricity Price Forecasting | Simon Hirsch et.al. | 2504.02518 | link |
2025-04-03 | Am I Being Treated Fairly? A Conceptual Framework for Individuals to Ascertain Fairness | Juliett Suárez Ferreira et.al. | 2504.02461 | null |
2025-04-03 | CHARMS: Cognitive Hierarchical Agent with Reasoning and Motion Styles | Jingyi Wang et.al. | 2504.02450 | link |
2025-04-03 | Revolutionizing Medical Data Transmission with IoMT: A Comprehensive Survey of Wireless Communication Solutions and Future Directions | Jiasi Zhou et.al. | 2504.02446 | null |
2025-04-03 | AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology | Xiang Feng et.al. | 2504.02404 | link |
2025-04-03 | Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge | Yudi Sang et.al. | 2504.02382 | null |
2025-04-03 | A Comparative Study of MINLP and MPVC Formulations for Solving Complex Nonlinear Decision-Making Problems in Aerospace Applications | Andrea Ghezzi et.al. | 2504.02375 | null |
2025-04-03 | Liquid Neural Networks: Next-Generation AI for Telecom from First Principles | Fenghao Zhu et.al. | 2504.02352 | null |
2025-04-03 | Distributed Log-driven Anomaly Detection System based on Evolving Decision Making | Zhuoran Tan et.al. | 2504.02322 | null |
2025-04-03 | MinkOcc: Towards real-time label-efficient semantic occupancy prediction | Samuel Sze et.al. | 2504.02270 | null |
2025-04-03 | LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks | Seunghyun Yoo et.al. | 2504.02254 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-02 | End-to-End Driving with Online Trajectory Evaluation via BEV World Model | Yingyan Li et.al. | 2504.01941 | link |
2025-04-02 | Gen-C: Populating Virtual Worlds with Generative Crowds | Andreas Panayiotou et.al. | 2504.01924 | null |
2025-04-02 | GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning | Yanzhou Su et.al. | 2504.01886 | null |
2025-04-02 | Tunable Thresholds and Frequency Encoding in a Spiking NOD Controller | Ian Xul Belaustegui et.al. | 2504.01878 | null |
2025-04-02 | Rethinking industrial artificial intelligence: a unified foundation framework | Jay Lee et.al. | 2504.01797 | null |
2025-04-02 | A Novel Dynamic Epidemic Model for Successive Opinion Diffusion in Social Networks | Bin Han et.al. | 2504.01718 | null |
2025-04-02 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
2025-04-02 | The Mini-SiTian Array: Design and application of Master Control System | Zheng Wang et.al. | 2504.01613 | null |
2025-04-02 | Vers une modélisation de la confiance dans le renseignement sur les menaces cyber | Laurent Bobelin et.al. | 2504.01606 | null |
2025-04-02 | Building Knowledge from Interactions: An LLM-Based Architecture for Adaptive Tutoring and Social Reasoning | Luca Garello et.al. | 2504.01588 | null |
2025-04-02 | A microscopic traffic flow model on network with destination-aware V2V communications and rational decision-making | Emiliano Cristiani et.al. | 2504.01480 | null |
2025-04-02 | Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | Ting Meng et.al. | 2504.01457 | null |
2025-04-02 | Dynamic Incentive Strategies for Smart EV Charging Stations: An LLM-Driven User Digital Twin Approach | Yichen Sun et.al. | 2504.01423 | null |
2025-04-02 | DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow | Shu Han et.al. | 2504.01416 | null |
2025-04-02 | Balancing Subjectivity and Objectivity in Network Selection: A Decision-Making Framework Towards Digital Twins | Brahim Mefgouda et.al. | 2504.01414 | null |
2025-04-02 | Pedestrian-Aware Motion Planning for Autonomous Driving in Complex Urban Scenarios | Korbinian Moller et.al. | 2504.01409 | link |
2025-04-02 | From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous Driving | Korbinian Moller et.al. | 2504.01408 | link |
2025-04-02 | An Explainable Reconfiguration-Based Optimization Algorithm for Industrial and Reliability-Redundancy Allocation Problems | Dikshit Chauhan et.al. | 2504.01331 | null |
2025-04-02 | A Retina-Inspired Pathway to Real-Time Motion Prediction inside Image Sensors for Extreme-Edge Intelligence | Subhradip Chakraborty et.al. | 2504.01275 | null |
2025-03-31 | UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving | Yuping Wang et.al. | 2503.24381 | link |
2025-04-01 | Self-Supervised Pretraining for Aerial Road Extraction | Rupert Polley et.al. | 2503.24326 | null |
2025-03-31 | Can Test-Time Scaling Improve World Foundation Model? | Wenyan Cong et.al. | 2503.24320 | link |
2025-03-31 | BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models | Alok Abhishek et.al. | 2503.24310 | null |
2025-03-31 | New Statistical Framework for Extreme Error Probability in High-Stakes Domains for Reliable Machine Learning | Umberto Michelucci et.al. | 2503.24262 | null |
2025-03-31 | PAARS: Persona Aligned Agentic Retail Shoppers | Saab Mansour et.al. | 2503.24228 | null |
2025-03-31 | Moving Edge for On-Demand Edge Computing: An Uncertainty-aware Approach | Fangtong Zhou et.al. | 2503.24214 | null |
2025-03-31 | Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany | Abdul Sittar et.al. | 2503.24199 | null |
2025-03-31 | LLM4FS: Leveraging Large Language Models for Feature Selection and How to Improve It | Jianhao Li et.al. | 2503.24157 | null |
2025-03-31 | Convexity of chance constraints for elliptical and skewed distributions with copula structures dependent on decision variables | Heng Zhang et.al. | 2503.24153 | null |
2025-03-31 | 4D mmWave Radar in Adverse Environments for Autonomous Driving: A Survey | Xiangyuan Peng et.al. | 2503.24091 | null |
2025-03-31 | Frequency-Aware Attention-LSTM for PM $_{2.5}$ Time Series Forecasting | Jiahui LU et.al. | 2503.24043 | null |
2025-03-31 | DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model | Ming Yuan et.al. | 2503.23993 | null |
2025-03-31 | The more the merrier: logical and multistage processors in credit scoring | Arturo Pérez-Peralta et.al. | 2503.23979 | link |
2025-03-31 | Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving | Miao Fan et.al. | 2503.23965 | null |
2025-03-31 | A Benchmark for Vision-Centric HD Mapping by V2I Systems | Miao Fan et.al. | 2503.23963 | null |
2025-03-31 | GLane3D : Detecting Lanes with Graph of 3D Keypoints | Halil İbrahim Öztürk et.al. | 2503.23882 | null |
2025-04-01 | When Counterfactual Reasoning Fails: Chaos and Real-World Complexity | Yahya Aalaila et.al. | 2503.23820 | null |
2025-03-31 | XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? | Fengxiang Wang et.al. | 2503.23771 | null |
2025-03-31 | STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? | Yun Li et.al. | 2503.23765 | null |
2025-03-31 | Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Jingzheng Li et.al. | 2503.23708 | null |
2025-03-28 | Comparing methods to assess treatment effect heterogeneity in general parametric regression models | Yao Chen et.al. | 2503.22548 | null |
2025-03-28 | SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles | Haicheng Liao et.al. | 2503.22541 | null |
2025-03-28 | Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement | Wenqiang Luo et.al. | 2503.22512 | null |
2025-03-28 | A Causal Framework to Measure and Mitigate Non-binary Treatment Discrimination | Ayan Majumdar et.al. | 2503.22454 | link |
2025-03-28 | NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Fuhao Li et.al. | 2503.22436 | null |
2025-03-28 | Collapse and Collision Aware Grasping for Cluttered Shelf Picking | Abhinav Pathak et.al. | 2503.22427 | null |
2025-03-28 | VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow | Ada Gorgun et.al. | 2503.22399 | link |
2025-03-28 | VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow | Yancong Lin et.al. | 2503.22328 | link |
2025-03-28 | Estimation of Building Energy Demand Characteristics using Bayesian Statistics and Energy Signature Models | Justinas Smertinas et.al. | 2503.22321 | null |
2025-03-28 | A Dataset for Semantic Segmentation in the Presence of Unknowns | Zakaria Laskar et.al. | 2503.22309 | null |
2025-03-28 | CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous Driving | Xinwei Gao et.al. | 2503.22248 | null |
2025-03-28 | CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving | Yishen Ji et.al. | 2503.22231 | null |
2025-03-28 | Multi-modal Knowledge Distillation-based Human Trajectory Forecasting | Jaewoo Jeong et.al. | 2503.22201 | link |
2025-03-28 | An Advanced Ensemble Deep Learning Framework for Stock Price Prediction Using VAE, Transformer, and LSTM Model | Anindya Sarkar et.al. | 2503.22192 | null |
2025-03-28 | Synergistic Bleeding Region and Point Detection in Surgical Videos | Jialun Pei et.al. | 2503.22174 | null |
2025-03-28 | When Autonomy Breaks: The Hidden Existential Risk of AI | Joshua Krook et.al. | 2503.22151 | null |
2025-03-28 | Leveraging LLMs for Predicting Unknown Diagnoses from Clinical Notes | Dina Albassam et.al. | 2503.22092 | null |
2025-03-28 | Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction | Seokha Moon et.al. | 2503.22087 | link |
2025-03-28 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation | Tai An et.al. | 2503.22050 | null |
2025-03-27 | Cognitive Prompts Using Guilford’s Structure of Intellect Model | Oliver Kramer et.al. | 2503.22036 | null |
2025-03-27 | Energy Minimization for Participatory Federated Learning in IoT Analyzed via Game Theory | Alessandro Buratto et.al. | 2503.21722 | null |
2025-03-27 | Learning to Represent Individual Differences for Choice Decision Making | Yan-Ying Chen et.al. | 2503.21704 | null |
2025-03-27 | MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX | Liuyue Xie et.al. | 2503.21699 | null |
2025-03-27 | LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning | Hui Wang et.al. | 2503.21683 | null |
2025-03-27 | A friendly introduction to triangular transport | Maximilian Ramgraber et.al. | 2503.21673 | null |
2025-03-27 | InteractionMap: Improving Online Vectorized HDMap Construction with Interaction | Kuang Wu et.al. | 2503.21659 | null |
2025-03-27 | Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities | Yongshuai Liu et.al. | 2503.21640 | null |
2025-03-27 | KRAFT – A Knowledge-Graph-Based Resource Allocation Framework | Leon Bein et.al. | 2503.21636 | null |
2025-03-27 | Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving | Yue Li et.al. | 2503.21505 | link |
2025-03-27 | Fine-Grained Behavior and Lane Constraints Guided Trajectory Prediction Method | Wenyi Xiong et.al. | 2503.21477 | null |
2025-03-27 | OCEP: An Ontology-Based Complex Event Processing Framework for Healthcare Decision Support in Big Data Analytics | Ritesh Chandra et.al. | 2503.21453 | null |
2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
2025-03-27 | Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Tong Nie et.al. | 2503.21411 | link |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | Investigating the Duality of Interpretability and Explainability in Machine Learning | Moncef Garouani et.al. | 2503.21356 | null |
2025-03-27 | Large Language Models for Traffic and Transportation Research: Methodologies, State of the Art, and Future Opportunities | Yimo Yan et.al. | 2503.21330 | null |
2025-03-27 | A Theoretical Framework for Distribution-Aware Dataset Search | Aryan Esmailpour et.al. | 2503.21235 | null |
2025-03-27 | Knowledge Graphs as World Models for Semantic Material-Aware Obstacle Handling in Autonomous Vehicles | Ayush Bheemaiah et.al. | 2503.21232 | null |
2025-03-27 | Are We Solving a Well-Defined Problem? A Task-Centric Perspective on Recommendation Tasks | Aixin Sun et.al. | 2503.21188 | null |
2025-03-27 | Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits | Shaik Jani Babu et.al. | 2503.21165 | null |
2025-03-26 | Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark | Sondos Mahmoud Bsharat et.al. | 2503.20786 | link |
2025-03-26 | MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Yunhai Hu et.al. | 2503.20757 | null |
2025-03-26 | ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems | Chenxi Wang et.al. | 2503.20756 | link |
2025-03-26 | Continual learning via probabilistic exchangeable sequence modelling | Hanwen Xing et.al. | 2503.20725 | null |
2025-03-26 | Data-driven Distributionally Robust Control Based on Sinkhorn Ambiguity Sets | Riccardo Cescon et.al. | 2503.20703 | link |
2025-03-26 | Benchmarking Machine Learning Methods for Distributed Acoustic Sensing | Shuaikai Shi et.al. | 2503.20681 | null |
2025-03-27 | DR-PETS: Learning-Based Control With Planning in Adversarial Environments | Hozefa Jesawada et.al. | 2503.20660 | null |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | Diffusion Counterfactuals for Image Regressors | Trung Duc Ha et.al. | 2503.20595 | link |
2025-03-26 | A Theoretical Framework for Prompt Engineering: Approximating Smooth Functions with Transformer Prompts | Ryumei Nakada et.al. | 2503.20561 | null |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering | Zehui Liao et.al. | 2503.20504 | null |
2025-03-26 | Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Yingdong Shi et.al. | 2503.20483 | null |
2025-03-26 | Multi-agent Uncertainty-Aware Pessimistic Model-Based Reinforcement Learning for Connected Autonomous Vehicles | Ruoqi Wen et.al. | 2503.20462 | null |
2025-03-26 | Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Kevin Alcedo et.al. | 2503.20425 | link |
2025-03-26 | Wasserstein Distributionally Robust Bayesian Optimization with Continuous Context | Francesco Micheli et.al. | 2503.20341 | link |
2025-03-26 | A Blockchain-based Quantum Binary Voting for Decentralized IoT Towards Industry 5.0 | Utkarsh Azad et.al. | 2503.20247 | null |
2025-03-26 | Dynamic Learning and Productivity for Data Analysts: A Bayesian Hidden Markov Model Perspective | Yue Yin et.al. | 2503.20233 | null |
2025-03-26 | Network Inversion for Generating Confidently Classified Counterfeits | Pirzada Suhail et.al. | 2503.20187 | null |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912 | link |
2025-03-25 | LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset | Manjushree Aithal et.al. | 2503.19804 | null |
2025-03-25 | Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion | Konyul Park et.al. | 2503.19776 | null |
2025-03-25 | ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Haoyu Fu et.al. | 2503.19755 | null |
2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | link |
2025-03-25 | High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting | Qian Wang et.al. | 2503.19703 | null |
2025-03-25 | Risk-Aware Reinforcement Learning for Autonomous Driving: Improving Safety When Driving through Intersection | Bo Leng et.al. | 2503.19690 | null |
2025-03-25 | Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review | Edward Gu et.al. | 2503.19607 | link |
2025-03-26 | Multi-agent Application System in Office Collaboration Scenarios | Songtao Sun et.al. | 2503.19584 | null |
2025-03-25 | A Comprehensive Bandwidth Testing Framework for the LHCb Upgrade Trigger System | Luke Grazette et.al. | 2503.19582 | null |
2025-03-25 | A theory of anticipated surprise for understanding risky intertemporal choices | Ho Ka Chan et.al. | 2503.19514 | null |
2025-03-25 | Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving with RICS-Assisted MEC | Xueyao Zhang et.al. | 2503.19418 | null |
2025-03-25 | Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM | Mehul Shetty et.al. | 2503.19394 | null |
2025-03-26 | ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models | Dohwan Ko et.al. | 2503.19355 | null |
2025-03-25 | A Reliable and Efficient 5G Vehicular MEC: Guaranteed Task Completion with Minimal Latency | Mahsa Paknejad et.al. | 2503.19320 | null |
2025-03-25 | A Social Dynamical System for Twitter Analysis | Zhiping Xiao et.al. | 2503.19316 | null |
2025-03-25 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation | Hanshuo Qiu et.al. | 2503.19303 | null |
2025-03-25 | Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Yunuo Zhang et.al. | 2503.19302 | null |
2025-03-25 | CubeRobot: Grounding Language in Rubik’s Cube Manipulation via Vision-Language Model | Feiyang Wang et.al. | 2503.19281 | null |
2025-03-25 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications | Ben Rahman et.al. | 2503.19276 | null |
2025-03-24 | SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction | Enrico Pallotta et.al. | 2503.18933 | link |
2025-03-24 | Causal Links Between Anthropogenic Emissions and Air Pollution Dynamics in Delhi | Sourish Das et.al. | 2503.18912 | null |
2025-03-24 | Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection | Moussa Kassem Sbeyti et.al. | 2503.18903 | null |
2025-03-24 | Statistical Proof of Execution (SPEX) | Michele Dallachiesa et.al. | 2503.18899 | null |
2025-03-24 | EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments | Sara Fish et.al. | 2503.18825 | null |
2025-03-24 | AttenMfg: An Attention Network Based Optimization Framework for Sensor-Driven Operations & Maintenance in Manufacturing Systems | Iman Kazemian et.al. | 2503.18780 | null |
2025-03-24 | Group Decision-Making System with Sentiment Analysis of Discussion Chat and Fuzzy Consensus Modeling | Adilet Yerkin et.al. | 2503.18765 | null |
2025-03-24 | Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving | Hongkuan Zhou et.al. | 2503.18730 | null |
2025-03-24 | AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents | Haoyu Wang et.al. | 2503.18666 | null |
2025-03-24 | Robust Lane Detection with Wavelet-Enhanced Context Modeling and Adaptive Sampling | Kunyang Li et.al. | 2503.18631 | null |
2025-03-24 | Reinforcement Learning in Switching Non-Stationary Markov Decision Processes: Algorithms and Convergence Analysis | Mohsen Amiri et.al. | 2503.18607 | null |
2025-03-24 | Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Zequn Zeng et.al. | 2503.18483 | link |
2025-03-24 | The On-Board Computer of the AcubeSAT Mission | Konstantinos Tsoupos et.al. | 2503.18473 | null |
2025-03-24 | ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation | Guosheng Zhao et.al. | 2503.18438 | null |
2025-03-24 | Generative AI in Knowledge Work: Design Implications for Data Navigation and Decision-Making | Bhada Yun et.al. | 2503.18419 | null |
2025-03-24 | Global Profits, Local Decisions: Why Global Cooperation Falters in Multi-level Games | Jinhua Zhao et.al. | 2503.18398 | null |
2025-03-24 | Agent-based Modeling meets the Capability Approach for Human Development: Simulating Homelessness Policy-making | Alba Aguilera et.al. | 2503.18389 | null |
2025-03-24 | Efficient Inference in First Passage Time Models | Sicheng Liu et.al. | 2503.18381 | null |
2025-03-24 | Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners | Wen Zheng Terence Ng et.al. | 2503.18347 | null |
2025-03-24 | DeepFund: Will LLM be Professional at Fund Investment? A Live Arena Perspective | Changlun Li et.al. | 2503.18313 | null |
2025-03-21 | Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique | Yansi Li et.al. | 2503.17363 | null |
2025-03-21 | Semi-Automated Design of Data-Intensive Architectures | Arianna Dragoni et.al. | 2503.17259 | link |
2025-03-21 | Decentralization: A Qualitative Survey of Node Operators | Alex Lynham et.al. | 2503.17246 | null |
2025-03-21 | LoGoFair: Post-Processing for Local and Global Fairness in Federated Learning | Li Zhang et.al. | 2503.17231 | link |
2025-03-21 | How to Promote Autonomous Driving with Evolving Technology: Business Strategy and Pricing Decision | Mingliang Li et.al. | 2503.17174 | null |
2025-03-21 | Hi-ALPS – An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-21 | Enhancing Steering Estimation with Semantic-Aware GNNs | Fouad Makiyeh et.al. | 2503.17153 | null |
2025-03-21 | Structural and Practical Identifiability of Phenomenological Growth Models for Epidemic Forecasting | Yuganthi R. Liyanage et.al. | 2503.17135 | null |
2025-03-21 | R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception | Jonas Mirlach et.al. | 2503.17122 | null |
2025-03-21 | Modelling the Climate Change Debate in Italy through Information Supply and Demand | Irene Scalco et.al. | 2503.17026 | null |
2025-03-21 | When Words Outperform Vision: VLMs Can Self-Improve Via Text-Only Training For Human-Centered Decision Making | Zhe Hu et.al. | 2503.16965 | null |
2025-03-21 | Uncertainty-Driven Modeling of Microporosity and Permeability in Clastic Reservoirs Using Random Forest | Muhammad Risha et.al. | 2503.16957 | null |
2025-03-21 | Sparse Additive Contextual Bandits: A Nonparametric Approach for Online Decision-making with High-dimensional Covariates | Wenjia Wang et.al. | 2503.16941 | null |
2025-03-21 | Interpretable Machine Learning for Oral Lesion Diagnosis through Prototypical Instances Identification | Alessio Cascione et.al. | 2503.16938 | null |
2025-03-21 | MerGen: Micro-electrode recording synthesis using a generative data-driven approach | Thibault Martin et.al. | 2503.16928 | null |
2025-03-21 | Temporal Action Detection Model Compression by Progressive Block Drop | Xiaoyong Chen et.al. | 2503.16916 | null |
2025-03-21 | Rotatable RIS-Assisted Edge Computing: Orientation, Task Offloading, and Resource Optimization | Bin Li et.al. | 2503.16879 | null |
2025-03-21 | Early-MFC: Enhanced Flow Correlation Attacks on Tor via Multi-view Triplet Networks with Early Network Traffic | Yali Yuan et.al. | 2503.16847 | null |
2025-03-21 | When Debate Fails: Bias Reinforcement in Large Language Models | Jihwan Oh et.al. | 2503.16814 | null |
2025-03-21 | A-IDE : Agent-Integrated Denoising Experts | Uihyun Cho et.al. | 2503.16780 | null |
2025-03-20 | Truthful Elicitation of Imprecise Forecasts | Anurag Singh et.al. | 2503.16395 | null |
2025-03-20 | Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions | Tzu-Yun Tseng et.al. | 2503.16378 | null |
2025-03-20 | JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | Muyao Li et.al. | 2503.16365 | null |
2025-03-20 | Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education | Giovanni Adorni et.al. | 2503.16307 | null |
2025-03-21 | Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Zhaowei Liu et.al. | 2503.16252 | link |
2025-03-20 | Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate Experts | Andrea Pugnana et.al. | 2503.16199 | null |
2025-03-20 | Large Language Models for Water Distribution Systems Modeling and Decision-Making | Yinon Goldshtein et.al. | 2503.16191 | null |
2025-03-21 | GreenIQ: A Deep Search Platform for Comprehensive Carbon Market Analysis and Automated Report Generation | Oluwole Fagbohun et.al. | 2503.16041 | null |
2025-03-20 | The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement | Ruihan Yang et.al. | 2503.16024 | null |
2025-03-20 | BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models | Zenghui Yuan et.al. | 2503.16023 | null |
2025-03-20 | Information maximization for a broad variety of multi-armed bandit games | Alex Barbier-Chebbah et.al. | 2503.15962 | null |
2025-03-21 | Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment | Gaole Dai et.al. | 2503.15937 | null |
2025-03-20 | BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers | Hui Zhang et.al. | 2503.15927 | null |
2025-03-20 | MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Haiguang Wang et.al. | 2503.15875 | link |
2025-03-20 | Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey | Xiaoou Liu et.al. | 2503.15850 | null |
2025-03-20 | Temporal Point Process Modeling of Aggressive Behavior Onset in Psychiatric Inpatient Youths with Autism | Michael Potter et.al. | 2503.15821 | null |
2025-03-20 | Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing | Vishnu Asutosh Dasu et.al. | 2503.15815 | null |
2025-03-20 | AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models | Boshra Khalili et.al. | 2503.15778 | null |
2025-03-20 | Nano-3D: Metasurface-Based Neural Depth Imaging | Bingxuan Li et.al. | 2503.15770 | null |
2025-03-19 | Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat | Joseph Emmanuel DL Dayo et.al. | 2503.15726 | null |
2025-03-19 | More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns | Kushagra Gupta et.al. | 2503.15486 | null |
2025-03-19 | Evaluating Bias in Retrieval-Augmented Medical Question-Answering Systems | Yuelyu Ji et.al. | 2503.15454 | null |
2025-03-19 | V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception | Baolu Li et.al. | 2503.15435 | null |
2025-03-19 | Advancing MG Energy Management: A Rolling Horizon Optimization Framework for Three-Phase Unbalanced Networks Integrating Convex Formulations | Pablo Cortés et.al. | 2503.15394 | null |
2025-03-19 | Real-world validation of a multimodal LLM-powered pipeline for High-Accuracy Clinical Trial Patient Matching leveraging EHR data | Anatole Callies et.al. | 2503.15374 | link |
2025-03-19 | Probabilistic Delay Forecasting in 5G Using Recurrent and Attention-Based Architectures | Samie Mostafavi et.al. | 2503.15297 | link |
2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | link |
2025-03-19 | CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification | Wenlong Yu et.al. | 2503.15234 | link |
2025-03-19 | When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection | Tittaya Mairittha et.al. | 2503.15204 | null |
2025-03-19 | Learning Topology Actions for Power Grid Control: A Graph-Based Soft-Label Imitation Learning Approach | Mohamed Hassouna et.al. | 2503.15190 | null |
2025-03-19 | Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems | George Stamatelis et.al. | 2503.15172 | null |
2025-03-19 | World Models in Artificial Intelligence: Sensing, Learning, and Reasoning Like a Child | Javier Del Ser et.al. | 2503.15168 | null |
2025-03-19 | A proposal of smooth interpolation to optimal transport for restoring biased data for algorithmic fairness | Elena M. De Diego et.al. | 2503.15119 | null |
2025-03-19 | VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mohamed Salim Aissi et.al. | 2503.15108 | null |
2025-03-19 | Diffusion-Based Forecasting for Uncertainty-Aware Model Predictive Control | Stelios Zarifis et.al. | 2503.15095 | null |
2025-03-19 | An Investigation of Beam Density on LiDAR Object Detection Performance | Christoph Griesbacher et.al. | 2503.15087 | null |
2025-03-19 | DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling | Jianbo Zhao et.al. | 2503.15029 | null |
2025-03-19 | Manifold Learning for Hyperspectral Images | Fethi Harkat et.al. | 2503.15016 | null |
2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
2025-03-19 | Generating Multimodal Driving Scenes via Next-Scene Prediction | Yanhao Wu et.al. | 2503.14945 | null |
2025-03-19 | Advances in 4D Generation: A Survey | Qiaowei Miao et.al. | 2503.14501 | link |
2025-03-18 | Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.14498 | link |
2025-03-18 | Characterizing Data Visualization Literacy: a Systematic Literature Review | Sara Beschi et.al. | 2503.14468 | null |
2025-03-18 | VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms | Seungwon Lim et.al. | 2503.14427 | link |
2025-03-18 | On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller? | Pouria Sarhadi et.al. | 2503.14379 | link |
2025-03-18 | MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration | Yisen Xu et.al. | 2503.14340 | null |
2025-03-18 | ADAPT: An Autonomous Forklift for Construction Site Operation | Johannes Huemer et.al. | 2503.14331 | null |
2025-03-18 | Video Streaming with Kairos: An MPC-Based ABR with Streaming-Aware Throughput Prediction | Ziyu Zhong et.al. | 2503.14271 | null |
2025-03-18 | DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal | Vaibhav Aggarwal et.al. | 2503.14269 | link |
2025-03-18 | Conversational Agents as Catalysts for Critical Thinking: Challenging Social Influence in Group Decision-making | Soohwan Lee et.al. | 2503.14263 | null |
2025-03-18 | A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Yuxuan Wang et.al. | 2503.14250 | link |
2025-03-18 | Stochastic Trajectory Prediction under Unstructured Constraints | Hao Ma et.al. | 2503.14203 | null |
2025-03-18 | Driving behavior recognition via self-discovery learning | Yilin Wang et.al. | 2503.14194 | null |
2025-03-18 | Inferring Event Descriptions from Time Series with Language Models | Mingtian Tan et.al. | 2503.14190 | link |
2025-03-18 | Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Bozhou Zhang et.al. | 2503.14182 | link |
2025-03-18 | A Modular Edge Device Network for Surgery Digitalization | Vincent Schorp et.al. | 2503.14049 | null |
2025-03-18 | Predicting Human Choice Between Textually Described Lotteries | Eyal Marantz et.al. | 2503.14004 | null |
2025-03-18 | Empowering LLMs in Decision Games through Algorithmic Data Synthesis | Haolin Wang et.al. | 2503.13980 | null |
2025-03-18 | SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Xinqing Li et.al. | 2503.13952 | link |
2025-03-18 | ChatBEV: A Visual Language Model that Understands BEV Maps | Qingyao Xu et.al. | 2503.13938 | null |
2025-03-17 | Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance | Noah Y. Siegel et.al. | 2503.13445 | null |
2025-03-17 | Deep Belief Markov Models for POMDP Inference | Giacomo Arcieri et.al. | 2503.13438 | null |
2025-03-17 | Uncovering Utility Functions from Observed Outcomes | Marta Grzeskiewicz et.al. | 2503.13432 | null |
2025-03-17 | AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction | Thomas Monninger et.al. | 2503.13430 | null |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-17 | Agents Play Thousands of 3D Video Games | Zhongwen Xu et.al. | 2503.13356 | null |
2025-03-17 | Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning | Thomas Banker et.al. | 2503.13289 | null |
2025-03-17 | Artificial Intelligence-Driven Prognostic Classification of COVID-19 Using Chest X-rays: A Deep Learning Approach | Alfred Simbun et.al. | 2503.13277 | null |
2025-03-17 | Knowledge-Aware Iterative Retrieval for Multi-Agent Systems | Seyoung Song et.al. | 2503.13275 | null |
2025-03-17 | Robust Decision-Making Via Free Energy Minimization | Allahkaram Shafiei et.al. | 2503.13223 | null |
2025-03-17 | MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | Zhen Chen et.al. | 2503.13205 | null |
2025-03-17 | Clustering is back: Reaching state-of-the-art LiDAR instance segmentation without training | Corentin Sautier et.al. | 2503.13203 | null |
2025-03-17 | GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation | Jungwon Seo et.al. | 2503.13180 | link |
2025-03-17 | Collaborative AI Enhances Image Understanding in Materials Science | Ruoyan Avery Yin et.al. | 2503.13169 | null |
2025-03-17 | Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images | Yaxi Chen et.al. | 2503.13131 | null |
2025-03-17 | Exploring the Potential of Bilevel Optimization for Calibrating Neural Networks | Gabriele Sanguin et.al. | 2503.13113 | null |
2025-03-17 | InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving | Ruiqi Song et.al. | 2503.13047 | null |
2025-03-17 | Knowledge Distillation: Enhancing Neural Network Compression with Integrated Gradients | David E. Hernandez et.al. | 2503.13008 | null |
2025-03-17 | SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Yunshuang Yuan et.al. | 2503.12982 | null |
2025-03-17 | OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Guanhua Ding et.al. | 2503.12968 | null |
2025-03-14 | Centaur: Robust End-to-End Autonomous Driving with Test-Time Training | Chonghao Sima et.al. | 2503.11650 | null |
2025-03-14 | Finding a Fair Scoring Function for Top- $k$ Selection: Hardness, Algorithms, and Experiments | Guangya Cai et.al. | 2503.11575 | link |
2025-03-14 | VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity | Jing Bi et.al. | 2503.11557 | null |
2025-03-14 | A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving | Tin Stribor Sohn et.al. | 2503.11400 | null |
2025-03-14 | Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding | David Gastager et.al. | 2503.11392 | null |
2025-03-14 | Certified Inductive Synthesis for Online Mixed-Integer Optimization | Marco Zamponi et.al. | 2503.11388 | null |
2025-03-14 | Hierarchical Information-Guided Spatio-Temporal Mamba for Stock Time Series Forecasting | Wenbo Yan et.al. | 2503.11387 | null |
2025-03-14 | BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model | Ziyue Wang et.al. | 2503.11372 | link |
2025-03-14 | Learning-Based MPC for Efficient Control of Autonomous Vehicles | Samuel Mallick et.al. | 2503.11359 | link |
2025-03-14 | Advancements in Real-Time Oncology Diagnosis: Harnessing AI and Image Fusion Techniques | Leila Bagheriye et.al. | 2503.11332 | null |
2025-03-14 | EmoAgent: Multi-Agent Collaboration of Plan, Edit, and Critic, for Affective Image Manipulation | Qi Mao et.al. | 2503.11290 | null |
2025-03-14 | AI and Deep Learning for Automated Segmentation and Quantitative Measurement of Spinal Structures in MRI | Praveen Shastry et.al. | 2503.11281 | null |
2025-03-14 | DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models | Xirui Zhou et.al. | 2503.11265 | null |
2025-03-14 | Reliable and Cost-Efficient IoT Connectivity for Smart Agriculture: A Comparative Study of LPWAN, 5G, and Hybrid Connectivity Models | Mohamed Shabeer Mohamed Rafi et.al. | 2503.11162 | null |
2025-03-14 | Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective | Guanhua Zheng et.al. | 2503.11160 | null |
2025-03-14 | DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Hongbin Lin et.al. | 2503.11122 | link |
2025-03-14 | DeepSeek Powered Solid Dosage Formulation Design and Development | Leqi Lin et.al. | 2503.11068 | null |
2025-03-14 | Active Learning from Scene Embeddings for End-to-End Autonomous Driving | Wenhao Jiang et.al. | 2503.11062 | null |
2025-03-14 | Fourier Neural Operator based surrogates for $CO_2$ storage in realistic geologies | Anirban Chandra et.al. | 2503.11031 | null |
2025-03-14 | A Weighted Predict-and-Optimize Framework for Power System Operation Considering Varying Impacts of Uncertainty | Yingrui Zhuang et.al. | 2503.11001 | null |
2025-03-13 | Uncertainty in Action: Confidence Elicitation in Embodied Agents | Tianjiao Yu et.al. | 2503.10628 | null |
2025-03-13 | DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.10621 | link |
2025-03-13 | OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction | Severin Heidrich et.al. | 2503.10605 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-13 | Unlock the Power of Unlabeled Data in Language Driving Model | Chaoqun Wang et.al. | 2503.10586 | null |
2025-03-13 | LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions | Gaurav Kumar Gupta et.al. | 2503.10486 | null |
2025-03-13 | Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback | Derun Li et.al. | 2503.10434 | null |
2025-03-13 | CODEI: Resource-Efficient Task-Driven Co-Design of Perception and Decision Making for Mobile Robots Applied to Autonomous Vehicles | Dejan Milojevic et.al. | 2503.10296 | null |
2025-03-13 | LLM Agents Display Human Biases but Exhibit Distinct Learning Patterns | Idan Horowitz et.al. | 2503.10248 | null |
2025-03-13 | Interpretable Image Classification via Non-parametric Part Prototype Learning | Zhijie Zhu et.al. | 2503.10247 | null |
2025-03-13 | SCOOP: A Framework for Proactive Collaboration and Social Continual Learning through Natural Language Interaction andCausal Reasoning | Dimitri Ognibene et.al. | 2503.10241 | null |
2025-03-13 | CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition | Kaixiang Yang et.al. | 2503.10216 | link |
2025-03-13 | TARS: Traffic-Aware Radar Scene Flow Estimation | Jialong Wu et.al. | 2503.10210 | null |
2025-03-13 | PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning | Yirong Sun et.al. | 2503.10177 | null |
2025-03-13 | GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Jianheng Liu et.al. | 2503.10170 | link |
2025-03-13 | Unlocking Generalization Power in LiDAR Point Cloud Registration | Zhenxuan Zeng et.al. | 2503.10149 | link |
2025-03-13 | Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space | Yuheng Liang et.al. | 2503.10104 | link |
2025-03-13 | Semantic Synergy: Unlocking Policy Insights and Learning Pathways Through Advanced Skill Mapping | Phoebe Koundouri et.al. | 2503.10094 | null |
2025-03-13 | Enhanced Route Planning with Calibrated Uncertainty Set | Lingxuan Tang et.al. | 2503.10088 | null |
2025-03-13 | Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey | Yu Qiao et.al. | 2503.09956 | null |
2025-03-12 | SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Katrin Renz et.al. | 2503.09594 | null |
2025-03-12 | Evaluating Visual Explanations of Attention Maps for Transformer-based Medical Imaging | Minjae Chung et.al. | 2503.09535 | null |
2025-03-12 | CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Peng Chen et.al. | 2503.09527 | null |
2025-03-12 | Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment | Nazanin Moradinasab et.al. | 2503.09498 | link |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-13 | PCLA: A Framework for Testing Autonomous Agents in the CARLA Simulator | Masoud Jamshidiyan Tehrani et.al. | 2503.09385 | link |
2025-03-12 | Revisiting Medical Image Retrieval via Knowledge Consolidation | Yang Nan et.al. | 2503.09370 | null |
2025-03-12 | Post-interactive Multimodal Trajectory Prediction for Autonomous Driving | Ziyi Huang et.al. | 2503.09366 | null |
2025-03-12 | Unmask It! AI-Generated Product Review Detection in Dravidian Languages | Somsubhra De et.al. | 2503.09289 | link |
2025-03-12 | A Case Study on Model Checking and Runtime Verification for Awkernel | Akira Hasegawa et.al. | 2503.09282 | null |
2025-03-12 | COLA: A Scalable Multi-Agent Framework For Windows UI Task Automation | Di Zhao et.al. | 2503.09263 | link |
2025-03-12 | Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latant Space | Jian Zhu et.al. | 2503.09215 | null |
2025-03-12 | AI-Driven Decision Support in Oncology: Evaluating Data Readiness for Skin Cancer Treatment | Joscha Grüger et.al. | 2503.09164 | null |
2025-03-12 | Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection | Chaowei Zhang et.al. | 2503.09153 | null |
2025-03-12 | Specification languages for computational laws versus basic legal principles | Petia Guintchev et.al. | 2503.09129 | null |
2025-03-12 | Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Maximilian Abstreiter et.al. | 2503.09114 | null |
2025-03-12 | Impact of Short-Duration Aerobic Exercise Intensity on Executive Function and Sleep | Yu Peng et.al. | 2503.09077 | null |
2025-03-12 | StratIncon Detector: Analyzing Strategy Inconsistencies Between Real-Time Strategy and Preferred Professional Strategy in MOBA Esports | Ruofei Ma et.al. | 2503.09060 | null |
2025-03-12 | Incentive Analysis for Agent Participation in Federated Learning | Lihui Yi et.al. | 2503.09039 | null |
2025-03-12 | Traffic Regulation-aware Path Planning with Regulation Databases and Vision-Language Models | Xu Han et.al. | 2503.09024 | null |
2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683 | link |
2025-03-11 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems | Yufeng Diao et.al. | 2503.08661 | null |
2025-03-11 | SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Muzhi Zhu et.al. | 2503.08625 | link |
2025-03-11 | HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Yingqi Tang et.al. | 2503.08612 | link |
2025-03-11 | Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion | Mehdi Hosseini Chagahi et.al. | 2503.08609 | null |
2025-03-11 | LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Dušan Malić et.al. | 2503.08601 | null |
2025-03-11 | DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning | Sergey Alyaev et.al. | 2503.08509 | link |
2025-03-11 | Data Driven Decision Making with Time Series and Spatio-temporal Data | Bin Yang et.al. | 2503.08473 | null |
2025-03-12 | An Autonomous RL Agent Methodology for Dynamic Web UI Testing in a BDD Framework | Ali Hassaan Mughal et.al. | 2503.08464 | null |
2025-03-11 | JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data | Runjian Chen et.al. | 2503.08422 | null |
2025-03-11 | A Distributed Clustering Algorithm based on Coalition Game for Intelligent Vehicles | Weiyi Yang et.al. | 2503.08416 | null |
2025-03-11 | Clustered Flexible Calibration Plots For Binary Outcomes Using Random Effects Modeling | Lasai Barreñada et.al. | 2503.08389 | null |
2025-03-11 | V-Max: Making RL practical for Autonomous Driving | Valentin Charraut et.al. | 2503.08388 | link |
2025-03-11 | Distributed Satellites Dynamic Allocation for Grids with Time Windows: A Potential Game Approach | Weiyi Yang et.al. | 2503.08385 | null |
2025-03-11 | InfluenceNet: AI Models for Banzhaf and Shapley Value Prediction | Benjamin Kempinski et.al. | 2503.08381 | null |
2025-03-11 | Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving | Runwei Guan et.al. | 2503.08336 | null |
2025-03-11 | Towards Scalable and Cross-Lingual Specialist Language Models for Oncology | Morteza Rohanian et.al. | 2503.08323 | null |
2025-03-11 | Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan et.al. | 2503.08317 | null |
2025-03-11 | Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Steeven Janny et.al. | 2503.08306 | null |
2025-03-10 | Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru | Dunant Cusipuma et.al. | 2503.07587 | null |
2025-03-10 | Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Fateme Jamshidi et.al. | 2503.07555 | null |
2025-03-10 | AI-Enabled Knowledge Sharing for Enhanced Collaboration and Decision-Making in Non-Profit Healthcare Organizations: A Scoping Review Protocol | Maurice Ongala et.al. | 2503.07540 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link |
2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null |
2025-03-10 | GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts | Minwen Liao et.al. | 2503.07417 | null |
2025-03-10 | LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction | Kangan Qian et.al. | 2503.07367 | null |
2025-03-10 | Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future | Yannick Oswald et.al. | 2503.07364 | null |
2025-03-10 | Now you see me! A framework for obtaining class-relevant saliency maps | Nils Philipp Walter et.al. | 2503.07346 | null |
2025-03-10 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-03-10 | Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | Weicheng He et.al. | 2503.07330 | null |
2025-03-10 | Decision-Dependent Stochastic Optimization: The Role of Distribution Dynamics | Zhiyu He et.al. | 2503.07324 | link |
2025-03-10 | The Influence Operation Ontology (IOO) | Alejandro David Cayuela Tudela et.al. | 2503.07304 | null |
2025-03-10 | CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting | Haicheng Liao et.al. | 2503.07234 | null |
2025-03-10 | HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking | Jing Yang et.al. | 2503.07168 | null |
2025-03-10 | Controllable 3D Outdoor Scene Generation via Scene Graphs | Yuheng Liu et.al. | 2503.07152 | link |
2025-03-10 | Hierarchical Neuro-Symbolic Decision Transformer | Ali Baheri et.al. | 2503.07148 | null |
2025-03-10 | Photometric Decision-Making During the Dawn Choruses of Cicadas | Rakesh Khanna A. et.al. | 2503.07121 | null |
2025-03-10 | Correctness Learning: Deductive Verification Guided Learning for Human-AI Collaboration | Zhao Jin et.al. | 2503.07096 | null |
2025-03-10 | RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations | Ruidan Xing et.al. | 2503.07085 | null |
2025-03-07 | GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Zebin Xing et.al. | 2503.05689 | link |
2025-03-07 | Algorithmic Data Minimization for Machine Learning over Internet-of-Things Data Streams | Ted Shaowang et.al. | 2503.05675 | null |
2025-03-07 | A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Yu Zhang et.al. | 2503.05659 | link |
2025-03-07 | InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model | Feeza Khan Khanzada et.al. | 2503.05573 | null |
2025-03-07 | Tractable Representations for Convergent Approximation of Distributional HJB Equations | Julie Alhosh et.al. | 2503.05563 | null |
2025-03-07 | Cognitive Bias Detection Using Advanced Prompt Engineering | Frederic Lemieux et.al. | 2503.05516 | null |
2025-03-07 | FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework | Haotian Hu et.al. | 2503.05492 | link |
2025-03-07 | DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction | Miaowei Wang et.al. | 2503.05484 | null |
2025-03-07 | A Hybrid Approach for Extending Automotive Radar Operation to NLOS Urban Scenarios | Aviran Gal et.al. | 2503.05413 | null |
2025-03-07 | Data-Driven Decision Making for Enhancing Small-Signal Stability in Hybrid AC/DC Grids Through Converter Control Role Assignment | Francesca Rossi et.al. | 2503.05386 | null |
2025-03-07 | Constrained Reinforcement Learning for the Dynamic Inventory Routing Problem under Stochastic Supply and Demand | Umur Hasturk et.al. | 2503.05276 | null |
2025-03-07 | Evidential Uncertainty Estimation for Multi-Modal Trajectory Prediction | Sajad Marvi et.al. | 2503.05274 | null |
2025-03-07 | L-FUSION: Laplacian Fetal Ultrasound Segmentation & Uncertainty Estimation | Johanna P. Müller et.al. | 2503.05245 | null |
2025-03-07 | Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving | Kalle Kujanpää et.al. | 2503.05229 | null |
2025-03-07 | Reward-Centered ReST-MCTS: A Robust Decision-Making Framework for Robotic Manipulation in High Uncertainty Environments | Xibai Wang et.al. | 2503.05226 | null |
2025-03-07 | Operationalizing Cybersecurity Knowledge: Design, Implementation & Evaluation of a Knowledge Management System for CACAO Playbooks | Orestis Tsirakis et.al. | 2503.05206 | link |
2025-03-07 | Uncertainty-Aware Explainable Federated Learning | Yanci Zhang et.al. | 2503.05194 | null |
2025-03-07 | FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance | Fengbin Zhu et.al. | 2503.05185 | null |
2025-03-07 | A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation | Shanhe You et.al. | 2503.05164 | link |
2025-03-07 | Generative Trajectory Stitching through Diffusion Composition | Yunhao Luo et.al. | 2503.05153 | null |
2025-03-06 | The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making | Stephen Pilli et.al. | 2503.04692 | null |
2025-03-06 | Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases | Pengcheng Qiu et.al. | 2503.04691 | null |
2025-03-06 | Coarse graining and reduced order models for plume ejection dynamics | Ike Griss Salas et.al. | 2503.04690 | null |
2025-03-06 | ValuePilot: A Two-Phase Framework for Value-Driven Decision-Making | Yitong Luo et.al. | 2503.04569 | null |
2025-03-06 | Research on a Driver’s Perceived Risk Prediction Model Considering Traffic Scene Interaction | Chenhao Yang et.al. | 2503.04516 | null |
2025-03-06 | Energy-Aware Task Offloading for Rotatable STAR-RIS-Enhanced Mobile Edge Computing Systems | Dongdong Yang et.al. | 2503.04397 | null |
2025-03-06 | Delay-Aware Digital Twin Synchronization in Mobile Edge Networks with Semantic Communications | Bin Li et.al. | 2503.04387 | null |
2025-03-06 | Guidelines for Applying RL and MARL in Cybersecurity Applications | Vasilios Mavroudis et.al. | 2503.04262 | null |
2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
2025-03-06 | Quantum-Inspired Reinforcement Learning in the Presence of Epistemic Ambivalence | Alireza Habibi et.al. | 2503.04219 | null |
2025-03-06 | Simulation-based Analysis Of Highway Trajectory Planning Using High-Order Polynomial For Highly Automated Driving Function | Milin Patel et.al. | 2503.04159 | link |
2025-03-06 | Organize, Then Vote: Exploring Cognitive Load in Quadratic Survey Interfaces | Ti-Chung Cheng et.al. | 2503.04114 | link |
2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
2025-03-06 | Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure | Aleksandrs Slivkins et.al. | 2503.04010 | null |
2025-03-05 | Enhancing Autonomous Driving Safety with Collision Scenario Integration | Zi Wang et.al. | 2503.03957 | null |
2025-03-05 | Safe LLM-Controlled Robots with Formal Guarantees via Reachability Analysis | Ahmad Hafez et.al. | 2503.03911 | link |
2025-03-05 | Optimal Policy Choices Under Uncertainty | Sarah Moon et.al. | 2503.03910 | null |
2025-03-05 | Pretrained LLMs as Real-Time Controllers for Robot Operated Serial Production Line | Muhammad Waseem et.al. | 2503.03889 | null |
2025-03-05 | Are Cognitive Biases as Important as they Seem for Data Visualization? | Ali Baigelenov et.al. | 2503.03852 | null |
2025-03-05 | RiskAgent: Autonomous Medical AI Copilot for Generalist Risk Prediction | Fenglin Liu et.al. | 2503.03802 | link |
2025-03-05 | Optimal Policy Design for Repeated Decision-Making under Social Influence | Chiara Ravazzi et.al. | 2503.03657 | null |
2025-03-05 | Large language models in finance: estimating financial sentiment for stock prediction | Kemal Kirtac et.al. | 2503.03612 | null |
2025-03-05 | Towards an Emotion-Aware Metaverse: A Human-Centric Shipboard Fire Drill Simulator | Musaab H. Hamed-Ahmed et.al. | 2503.03570 | null |
2025-03-05 | Higher Stakes, Healthier Trust? An Application-Grounded Approach to Assessing Healthy Trust in High-Stakes Human-AI Collaboration | David S. Johnson et.al. | 2503.03529 | link |
2025-03-05 | Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Yaoru Li et.al. | 2503.03505 | link |
2025-03-05 | CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization | Junhao Xu et.al. | 2503.03430 | link |
2025-03-05 | Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments | Hanyu Duan et.al. | 2503.03399 | link |
2025-03-05 | IoT Integration Protocol for Enhanced Hospital Care | Ellie Zontou et.al. | 2503.03334 | null |
2025-03-05 | Trajectory Prediction for Autonomous Driving: Progress, Limitations, and Future Directions | Nadya Abdel Madjid et.al. | 2503.03262 | null |
2025-03-05 | A Survey of Foundation Models for Environmental Science | Runlong Yu et.al. | 2503.03142 | null |
2025-03-05 | Exploring Neural Ordinary Differential Equations as Interpretable Healthcare classifiers | Shi Li et.al. | 2503.03129 | null |
2025-03-05 | Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving | Ziying Song et.al. | 2503.03125 | link |
2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
2025-03-05 | Hopfield Networks Meet Big Data: A Brain-Inspired Deep Learning Framework for Semantic Data Linking | Ashwin Viswanathan Kannan et.al. | 2503.03084 | null |
2025-03-05 | BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Katharina Winter et.al. | 2503.03074 | link |
2025-03-04 | Adopt a PET! An Exploration of PETs, Policy, and Practicalities for Industry in Canada | Masoumeh Shafieinejad et.al. | 2503.03027 | null |
2025-03-04 | Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment | Matthew DosSantos DiSorbo et.al. | 2503.02976 | null |
2025-03-04 | A Theoretical Model for Grit in Pursuing Ambitious Ends | Avrim Blum et.al. | 2503.02952 | null |
2025-03-05 | Multiaccuracy and Multicalibration via Proxy Groups | Beepul Bharti et.al. | 2503.02870 | null |
2025-03-04 | Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data | Amin Honarmandi Shandiz et.al. | 2503.02849 | link |
2025-03-04 | RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration | Alicia Russell-Gilbert et.al. | 2503.02800 | null |
2025-03-04 | Implicit Bias in LLMs: A Survey | Xinru Lin et.al. | 2503.02776 | null |
2025-03-04 | From Metaphor to Mechanism: How LLMs Decode Traditional Chinese Medicine Symbolic Language for Modern Clinical Relevance | Jiacheng Tang et.al. | 2503.02760 | null |
2025-03-04 | Bridging VLM and KMP: Enabling Fine-grained robotic manipulation via Semantic Keypoints Representation | Junjie Zhu et.al. | 2503.02748 | null |
2025-03-04 | Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems | Jakob Weber et.al. | 2503.02693 | link |
2025-03-04 | State of play and future directions in industrial computer vision AI standards | Artemis Stefanidou et.al. | 2503.02675 | null |
2025-03-04 | Human-aligned Safe Reinforcement Learning for Highway On-Ramp Merging in Dense Traffic | Yang Li et.al. | 2503.02624 | link |
2025-03-04 | Playing games with Large language models: Randomness and strategy | Alicia Vidler et.al. | 2503.02582 | null |
2025-03-04 | TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping | Xinying Hong et.al. | 2503.02578 | link |
2025-03-04 | RaceVLA: VLA-based Racing Drone Navigation with Human-like Behaviour | Valerii Serpiva et.al. | 2503.02572 | null |
2025-03-04 | Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent | Xingzuo Li et.al. | 2503.02519 | link |
2025-03-04 | UAV-VLPA*: A Vision-Language-Path-Action System for Optimal Route Generation on a Large Scales | Oleg Sautenkov et.al. | 2503.02454 | null |
2025-03-04 | PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers | Wooju Lee et.al. | 2503.02388 | null |
2025-03-04 | A Multi-Objective Portfolio of Portfolios Problem with Qualitative Performance Assessments | Maria Barbati et.al. | 2503.02373 | null |
2025-03-04 | Iterative Value Function Optimization for Guided Decoding | Zhenhua Liu et.al. | 2503.02368 | null |
2025-03-04 | Are Large Vision Language Models Good Game Players? | Xinyu Wang et.al. | 2503.02358 | null |
2025-03-04 | Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined Behaviors | Renshuang Jiang et.al. | 2503.02335 | null |
2025-03-04 | A Game-Theoretic Approach for High-Resolution Automotive FMCW Radar Interference Avoidance | Yunian Pan et.al. | 2503.02327 | null |
2025-02-28 | Enabling AutoML for Zero-Touch Network Security: Use-Case Driven Analysis | Li Yang et.al. | 2502.21286 | link |
2025-02-28 | Towards Developing Ethical Reasoners: Integrating Probabilistic Reasoning and Decision-Making for Complex AI Systems | Nijesh Upreti et.al. | 2502.21250 | null |
2025-03-03 | Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Baiting Luo et.al. | 2502.21186 | link |
2025-02-28 | Prospection and dispersal in metapopulations: a perspective from opinion dynamics models | Daniela Molas et.al. | 2502.21178 | null |
2025-02-28 | Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning | Léopold Maytié et.al. | 2502.21142 | null |
2025-02-28 | Predicting clinical outcomes from patient care pathways represented with temporal knowledge graphs | Jong Ho Jhee et.al. | 2502.21138 | null |
2025-02-28 | Dynamically Local-Enhancement Planner for Large-Scale Autonomous Driving | Nanshan Deng et.al. | 2502.21134 | null |
2025-02-28 | Optimizing Large Language Models for ESG Activity Detection in Financial Texts | Mattia Birti et.al. | 2502.21112 | link |
2025-02-28 | AuthSim: Towards Authentic and Effective Safety-critical Scenario Generation for Autonomous Driving Tests | Yukuan Yang et.al. | 2502.21100 | null |
2025-02-28 | Explainable Biomedical Claim Verification with Large Language Models | Siting Liang et.al. | 2502.21014 | null |
2025-02-28 | A Deep User Interface for Exploring LLaMa | Divya Perumal et.al. | 2502.20938 | null |
2025-02-28 | The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents | Yifan Duan et.al. | 2502.20859 | null |
2025-02-28 | Recent Advances in Numerical Solutions for Hamilton-Jacobi PDEs | Tingwei Meng et.al. | 2502.20833 | null |
2025-02-28 | Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Jiawei Wang et.al. | 2502.20807 | link |
2025-02-28 | Multimodal Learning for Just-In-Time Software Defect Prediction in Autonomous Driving Systems | Faisal Mohammad et.al. | 2502.20806 | null |
2025-02-28 | MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Qiao Yan et.al. | 2502.20780 | link |
2025-02-28 | WorldModelBench: Judging Video Generation Models As World Models | Dacheng Li et.al. | 2502.20694 | null |
2025-02-28 | Delayed-Decision Motion Planning in the Presence of Multiple Predictions | David Isele et.al. | 2502.20636 | null |
2025-02-28 | LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation | Zhefan Xu et.al. | 2502.20607 | link |
2025-02-28 | Map Space Belief Prediction for Manipulation-Enhanced Mapping | Joao Marcos Correia Marques et.al. | 2502.20606 | null |
2025-02-27 | Expertise Is What We Want | Alan Ashworth et.al. | 2502.20335 | null |
2025-02-27 | Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application | Thomas Hickling et.al. | 2502.20326 | null |
2025-02-27 | EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants | Franck Cappello et.al. | 2502.20309 | link |
2025-02-27 | Electric power system security: the case for an integrated cyber-physical risk management framework | Efthymios Karangelos et.al. | 2502.20287 | null |
2025-02-27 | A review of Bayesian sensor-based estimation and uncertainty quantification of aerodynamic flows | Jeff D. Eldredge et.al. | 2502.20280 | null |
2025-02-27 | On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+( $λ$,$λ$ ))-GA | Tai Nguyen et.al. | 2502.20265 | null |
2025-02-27 | Explainable physics-based constraints on reinforcement learning for accelerator controls | Jonathan Colen et.al. | 2502.20247 | null |
2025-02-27 | MARVEL: Multi-Agent Reinforcement Learning for constrained field-of-View multi-robot Exploration in Large-scale environments | Jimmy Chiun et.al. | 2502.20217 | link |
2025-02-27 | Similarity-Distance-Magnitude Universal Verification | Allen Schmaltz et.al. | 2502.20167 | link |
2025-02-27 | VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers | Ziang Guo et.al. | 2502.20108 | null |
2025-02-27 | Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights | Haicheng Liao et.al. | 2502.20084 | null |
2025-02-27 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | link |
2025-02-27 | Cooperative games defined by multi-objective optimization in competition for subsurface resources | Per Pettersson et.al. | 2502.19987 | null |
2025-02-27 | LLM-driven Effective Knowledge Tracing by Integrating Dual-channel Difficulty | Jiahui Cen et.al. | 2502.19915 | null |
2025-02-27 | CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving | Dongkun Zhang et.al. | 2502.19908 | null |
2025-02-27 | Shared Autonomy for Proximal Teaching | Megha Srivastava et.al. | 2502.19899 | null |
2025-02-27 | ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Jinghao Xin et.al. | 2502.19892 | link |
2025-02-27 | Postponing the choice: advantage of deferred measurements in quantum information processing | C. Carmeli et.al. | 2502.19871 | null |
2025-02-27 | Can a calibration metric be both testable and actionable? | Raphael Rossellini et.al. | 2502.19851 | link |
2025-02-27 | Fair and Actionable Causal Prescription Ruleset | Benton Li et.al. | 2502.19846 | null |
2025-02-26 | CryptoPulse: Short-Term Cryptocurrency Forecasting with Dual-Prediction and Cross-Correlated Market Indicators | Amit Kumar et.al. | 2502.19349 | link |
2025-02-26 | WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | William Solow et.al. | 2502.19308 | link |
2025-02-26 | EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region | Nadya Abdel Madjid et.al. | 2502.19260 | link |
2025-02-26 | AI-Powered Bayesian Inference | Veronika Ročková et.al. | 2502.19231 | null |
2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | null |
2025-02-26 | MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis | Daniel Rose et.al. | 2502.19175 | null |
2025-02-26 | Voting or Consensus? Decision-Making in Multi-Agent Debate | Lars Benedikt Kaesberg et.al. | 2502.19130 | link |
2025-02-26 | Policy Testing with MDPFuzz (Replicability Study) | Quentin Mazouni et.al. | 2502.19116 | link |
2025-02-26 | Developing heuristic solution techniques for large-scale unit commitment models | Nils-Christian Kempke et.al. | 2502.19012 | null |
2025-02-26 | Impact of deep learning model uncertainty on manual corrections to auto-segmentation in prostate cancer radiotherapy | Viktor Rogowski et.al. | 2502.18973 | null |
2025-02-26 | A Causal Lens for Evaluating Faithfulness Metrics | Kerem Zaman et.al. | 2502.18848 | null |
2025-02-26 | FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting | Yifan Hu et.al. | 2502.18834 | link |
2025-02-26 | Data-Efficient Multi-Agent Spatial Planning with LLMs | Huangyuan Su et.al. | 2502.18822 | null |
2025-02-26 | CommGPT: A Graph and Retrieval-Augmented Multimodal Communication Foundation Model | Feibo Jiang et.al. | 2502.18763 | null |
2025-02-26 | Learning Autonomy: Off-Road Navigation Enhanced by Human Input | Akhil Nagariya et.al. | 2502.18760 | null |
2025-02-26 | Random Forest-of-Thoughts: Uncertainty-aware Reasoning for Computational Social Science | Xiaohua Wu et.al. | 2502.18729 | null |
2025-02-26 | Scaling Optimization Over Uncertainty via Compilation | Minsung Cho et.al. | 2502.18728 | null |
2025-02-25 | Wireless sensor networks data synchronization using node MCU memory for precision agriculture applications | Kashif Sattar et.al. | 2502.18671 | null |
2025-02-25 | Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces | Amirhossein Roknilamouki et.al. | 2502.18655 | null |
2025-02-25 | Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT | Hediyeh Baban et.al. | 2502.18653 | null |
2025-02-25 | How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities | Minhua Lin et.al. | 2502.18387 | null |
2025-02-25 | Semantic and Goal-oriented Wireless Network Coverage: The Area of Effectiveness | Mattia Merluzzi et.al. | 2502.18381 | null |
2025-02-25 | Global-Decision-Focused Neural ODEs for Proactive Grid Resilience Management | Shuyi Chen et.al. | 2502.18321 | null |
2025-02-25 | Uncertainty Modeling in Multimodal Speech Analysis Across the Psychosis Spectrum | Morteza Rohanian et.al. | 2502.18285 | null |
2025-02-25 | Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Guoxin Wang et.al. | 2502.18274 | link |
2025-02-25 | The Electric Location-Routing Problem: Improved Formulations and Effects of Nonlinear Charging | Luiz Eduardo Cotta Monteiro et.al. | 2502.18234 | null |
2025-02-25 | Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent | Xiaofeng Wang et.al. | 2502.18228 | null |
2025-02-25 | VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion | Pei Liu et.al. | 2502.18042 | null |
2025-02-25 | Exploring the Effects of Traditional Chinese Medicine Scents on Mitigating Driving Fatigue | Nengyue Su et.al. | 2502.18013 | null |
2025-02-25 | Generalized Decision Focused Learning under Imprecise Uncertainty–Theoretical Study | Keivan Shariatmadar et.al. | 2502.17984 | null |
2025-02-25 | XGBoost-Based Prediction of ICU Mortality in Sepsis-Associated Acute Kidney Injury Patients Using MIMIC-IV Database with Validation from eICU Database | Shuheng Chen et.al. | 2502.17978 | null |
2025-02-25 | InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer | Bo Zhang et.al. | 2502.17949 | null |
2025-02-25 | DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning | Pusheng Xu et.al. | 2502.17947 | null |
2025-02-25 | Assessing Large Language Models in Agentic Multilingual National Bias | Qianying Liu et.al. | 2502.17945 | null |
2025-02-25 | Integrating Boosted learning with Differential Evolution (DE) Optimizer: A Prediction of Groundwater Quality Risk Assessment in Odisha | Sonalika Subudhi et.al. | 2502.17929 | null |
2025-02-25 | VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution | Rui Lu et.al. | 2502.17880 | null |
2025-02-25 | Certified Decisions | Isaiah Andrews et.al. | 2502.17830 | null |
2025-02-25 | Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking | Peng Zhang et.al. | 2502.17822 | null |
2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | null |
2025-02-25 | An Overview of Large Language Models for Statisticians | Wenlong Ji et.al. | 2502.17814 | null |
2025-02-24 | From System 1 to System 2: A Survey of Reasoning Large Language Models | Zhong-Zhi Li et.al. | 2502.17419 | link |
2025-02-24 | Bayesian Hierarchical Emulators for Multi-Level Models: BayHEm | Louise Kimpton et.al. | 2502.17367 | null |
2025-02-24 | User-Centric Evaluation Methods for Digital Twin Applications in Extended Reality | Francesco Vona et.al. | 2502.17346 | null |
2025-02-24 | GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow | Simon Boeder et.al. | 2502.17288 | null |
2025-02-24 | CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought | Boxuan Zhang et.al. | 2502.17214 | link |
2025-02-24 | AIRIS2 : a Smart Gateway Diversity Algorithm for Very High-Throughput Satellite Systems | Justin Cano et.al. | 2502.17181 | null |
2025-02-24 | Generative Models in Decision Making: A Survey | Yinchuan Li et.al. | 2502.17100 | null |
2025-02-24 | StochasticDominance.jl: A Julia Package for Higher Order Stochastic Dominance | Rajmadan Lakshmanan et.al. | 2502.17043 | null |
2025-02-24 | Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | Jaskaran Singh Walia et.al. | 2502.17011 | null |
2025-02-24 | MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation | Jiehao Luo et.al. | 2502.16907 | link |
2025-02-24 | A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis | Yuzhi Hao et.al. | 2502.16879 | null |
2025-02-24 | Toward Agentic AI: Generative Information Retrieval Inspired Intelligent Communications and Networking | Ruichen Zhang et.al. | 2502.16866 | null |
2025-02-24 | A Novel Multi-Task Teacher-Student Architecture with Self-Supervised Pretraining for 48-Hour Vasoactive-Inotropic Trend Analysis in Sepsis Mortality Prediction | Houji Jin et.al. | 2502.16834 | null |
2025-02-24 | Uncertainty Quantification of Large Language Models through Multi-Dimensional Responses | Tiejin Chen et.al. | 2502.16820 | null |
2025-02-24 | Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances | Yaozu Wu et.al. | 2502.16804 | null |
2025-02-23 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
2025-02-23 | Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error | Haoran Li et.al. | 2502.16734 | link |
2025-02-23 | DeepSeek reshaping healthcare in China’s tertiary hospitals | Jishizhan Chen et.al. | 2502.16732 | null |
2025-02-23 | From Text to Space: Mapping Abstract Spatial Models in LLMs during a Grid-World Navigation Task | Nicolas Martorell et.al. | 2502.16690 | link |
2025-02-23 | Reasoning within and between collective action problems | Ofer Tchernichovski et.al. | 2502.16677 | null |
2025-02-21 | VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Florent Bartoccioni et.al. | 2502.15672 | link |
2025-02-21 | Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis | Ziqian Ni et.al. | 2502.15635 | null |
2025-02-21 | Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid | Yunfeng Li et.al. | 2502.15583 | null |
2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null |
2025-02-21 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | A modular risk concept for complex systems | Dag McGeorge et.al. | 2502.15482 | null |
2025-02-21 | Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence | Yufeng Diao et.al. | 2502.15472 | null |
2025-02-21 | Enhancing Vehicle Make and Model Recognition with 3D Attention Modules | Narges Semiromizadeh et.al. | 2502.15398 | null |
2025-02-21 | Beyond Tools: Understanding How Heavy Users Integrate LLMs into Everyday Tasks and Decision-Making | Eunhye Kim et.al. | 2502.15395 | null |
2025-02-21 | PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments | Yueting Liu et.al. | 2502.15342 | link |
2025-02-21 | Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions | Shoubin Chen et.al. | 2502.15336 | null |
2025-02-21 | Detecting Future-related Contexts of Entity Mentions | Puneet Prashar et.al. | 2502.15332 | null |
2025-02-21 | The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Sheila Schoepp et.al. | 2502.15214 | null |
2025-02-21 | Learning to Collaborate: A Capability Vectors-based Architecture for Adaptive Human-AI Decision Making | Renlong Jie et.al. | 2502.15196 | null |
2025-02-21 | OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework | Junliang Chen et.al. | 2502.15180 | link |
2025-02-21 | Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems | Tianjie Ju et.al. | 2502.15153 | link |
2025-02-21 | CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models | Zihao Sheng et.al. | 2502.15119 | null |
2025-02-20 | Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Richard Marcus et.al. | 2502.15076 | link |
2025-02-20 | Visualizing Machine Learning Models for Enhanced Financial Decision-Making and Risk Management | Priyam Ganguly et.al. | 2502.15073 | null |
2025-02-20 | An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions | Huseyin Cavus et.al. | 2502.15066 | null |
2025-02-20 | AVD2: Accident Video Diffusion for Accident Video Description | Cheng Li et.al. | 2502.14801 | null |
2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | null |
2025-02-20 | Making Universal Policies Universal | Niklas Höpner et.al. | 2502.14777 | link |
2025-02-20 | Multi-Objective Causal Bayesian Optimization | Shriya Bhatija et.al. | 2502.14755 | link |
2025-02-20 | MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders | Maya Varma et.al. | 2502.14753 | link |
2025-02-20 | Human Misperception of Generative-AI Alignment: A Laboratory Experiment | Kevin He et.al. | 2502.14708 | null |
2025-02-20 | I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Zujie Liang et.al. | 2502.14693 | null |
2025-02-20 | BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction | Ruochen Li et.al. | 2502.14676 | link |
2025-02-20 | AlphaMaze: Enhancing Large Language Models’ Spatial Intelligence via GRPO | Alan Dao et.al. | 2502.14669 | link |
2025-02-20 | How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation | Rui Li et.al. | 2502.14642 | link |
2025-02-20 | Real-world Troublemaker: A Novel Track Testing Framework for Automated Driving Systems in Safety-critical Interaction Scenarios | Xinrui Zhang et.al. | 2502.14574 | null |
2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
2025-02-20 | CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond | Yukai Shi et.al. | 2502.14493 | null |
2025-02-20 | madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes | Matilde Gargiani et.al. | 2502.14474 | null |
2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
2025-02-20 | A meta-model of belief dynamics with Personal, Expressed and Social beliefs | Filippo Zimmaro et.al. | 2502.14362 | null |
2025-02-20 | Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Bingyu Yan et.al. | 2502.14321 | null |
2025-02-20 | ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Tianyou Jiang et.al. | 2502.14314 | null |
2025-02-20 | The Impact and Feasibility of Self-Confidence Shaping for AI-Assisted Decision-Making | Takehiro Takayanagi et.al. | 2502.14311 | null |
2025-02-20 | MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Shrey Pandit et.al. | 2502.14302 | null |
2025-02-19 | Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region | Chak Tou Leong et.al. | 2502.13946 | null |
2025-02-19 | AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence | Yuliang Liu et.al. | 2502.13943 | link |
2025-02-19 | Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks | Guilherme Palma et.al. | 2502.13918 | link |
2025-02-19 | MEX: Memory-efficient Approach to Referring Multi-Object Tracking | Huu-Thien Tran et.al. | 2502.13875 | null |
2025-02-19 | RobustX: Robust Counterfactual Explanations Made Easy | Junqi Jiang et.al. | 2502.13751 | null |
2025-02-19 | A Framework for Semantics-based Situational Awareness during Mobile Robot Deployments | Tianshu Ruan et.al. | 2502.13677 | null |
2025-02-19 | Scalable Multi-Level optimization for Sequentially Cleared Energy Markets with a Case Study on Gas and Carbon Aware Unit Commitment | Yuxin Xia et.al. | 2502.13643 | null |
2025-02-19 | HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks | Hongjin Qian et.al. | 2502.13465 | null |
2025-02-19 | Integrating Sequential Hypothesis Testing into Adversarial Games: A Sun Zi-Inspired Framework | Haosheng Zhou et.al. | 2502.13462 | null |
2025-02-19 | MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation | Lingfeng Zhang et.al. | 2502.13451 | null |
2025-02-19 | MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering | Guanming Xiong et.al. | 2502.13428 | null |
2025-02-19 | Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning | Ningke Li et.al. | 2502.13416 | null |
2025-02-19 | Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Swati Kar et.al. | 2502.13373 | link |
2025-02-19 | RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Sichu Liang et.al. | 2502.13361 | null |
2025-02-18 | Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm | Vasudha Varadarajan et.al. | 2502.13326 | null |
2025-02-18 | Adjust for Trust: Mitigating Trust-Induced Inappropriate Reliance on AI Assistance | Tejas Srinivasan et.al. | 2502.13321 | null |
2025-02-18 | Value Gradient Sampler: Sampling as Sequential Decision Making | Sangwoong Yoon et.al. | 2502.13280 | link |
2025-02-18 | RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning | Hao Gao et.al. | 2502.13144 | null |
2025-02-19 | STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | Narun Raman et.al. | 2502.13119 | null |
2025-02-18 | Improving Clinical Question Answering with Multi-Task Learning: A Joint Approach for Answer Extraction and Medical Categorization | Priyaranjan Pattnayak et.al. | 2502.13108 | null |
2025-02-18 | AI and the Transformation of Accountability and Discretion in Urban Governance | Stephen Goldsmith et.al. | 2502.13101 | null |
2025-02-18 | AI-Assisted Decision Making with Human Learning | Gali Noti et.al. | 2502.13062 | null |
2025-02-18 | AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks | Yurun Chen et.al. | 2502.13053 | null |
2025-02-18 | Fragility-aware Classification for Understanding Risk and Improving Generalization | Chen Yang et.al. | 2502.13024 | null |
2025-02-18 | Efficient and Sharp Off-Policy Learning under Unobserved Confounding | Konstantin Hess et.al. | 2502.13022 | null |
2025-02-18 | Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger | Wenjun Li et.al. | 2502.12961 | null |
2025-02-18 | Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements | Shu Yang et.al. | 2502.12904 | null |
2025-02-18 | MediaMind: Revolutionizing Media Monitoring using Agentification | Ahmet Gunduz et.al. | 2502.12745 | null |
2025-02-18 | IPSR Model: Misinformation Intervention through Prebunking in Social Networks | Robert Rai et.al. | 2502.12740 | null |
2025-02-18 | myEye2Wheeler: A Two-Wheeler Indian Driver Real-World Eye-Tracking Dataset | Bhaiya Vaibhaw Kumar et.al. | 2502.12723 | null |
2025-02-18 | RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation | Yiheng Wang et.al. | 2502.12686 | null |
2025-02-18 | Label Drop for Multi-Aspect Relation Modeling in Universal Information Extraction | Lu Yang et.al. | 2502.12614 | link |
2025-02-18 | Hypernetwork-based approach for optimal composition design in partially controlled multi-agent systems | Kyeonghyeon Park et.al. | 2502.12605 | null |
2025-02-18 | Seamless Graph Task Scheduling over Dynamic Vehicular Clouds: A Hybrid Methodology for Integrating Pilot and Instantaneous Decisions | Bingshuo Guo et.al. | 2502.12557 | null |
2025-02-18 | Cohesive Subgraph Discovery in Hypergraphs: A Locality-Driven Indexing Framework | Song Kim et.al. | 2502.12523 | null |
2025-02-18 | Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Wichayaporn Wongkamjan et.al. | 2502.12436 | null |
2025-02-17 | Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions | Sujan Sai Gannamaneni et.al. | 2502.12360 | null |
2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | null |
2025-02-17 | A-MEM: Agentic Memory for LLM Agents | Wujiang Xu et.al. | 2502.12110 | link |
2025-02-17 | Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception | Peng Gao et.al. | 2502.12098 | null |
2025-02-17 | Using economic value signals from primate prefrontal cortex in neuro-engineering applications | Tevin C. Rouse et.al. | 2502.12092 | null |
2025-02-17 | QoS based resource management for concurrent operation using MCTS | Sebastian Durst et.al. | 2502.11938 | null |
2025-02-17 | From Text to Trust: Empowering AI-assisted Decision Making with Adaptive LLM-powered Analysis | Zhuoyan Li et.al. | 2502.11919 | null |
2025-02-17 | Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration | Shao Zhang et.al. | 2502.11882 | link |
2025-02-17 | Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving? | Natalie Grabowsky et.al. | 2502.11864 | null |
2025-02-17 | Residual Learning towards High-fidelity Vehicle Dynamics Modeling with Transformer | Jinyu Miao et.al. | 2502.11800 | null |
2025-02-17 | MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Jingcheng Ni et.al. | 2502.11663 | link |
2025-02-17 | Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation | Amin Qasmi et.al. | 2502.11649 | null |
2025-02-17 | DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing | Yi Wang et.al. | 2502.11647 | null |
2025-02-17 | User-Centric Data Management in Decentralized Internet of Behaviors System | Shiqi Zhang et.al. | 2502.11616 | null |
2025-02-17 | BIG-AOME: Designing Bodily Interaction Gamification towards Anti-sedentary Online Meeting Environments | Jiaqi Jiang et.al. | 2502.11463 | null |
2025-02-17 | \textsc{FLAG-Trader}: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading | Guojun Xiong et.al. | 2502.11433 | null |
2025-02-17 | PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies | Morgan Byrd et.al. | 2502.11377 | null |
2025-02-17 | HI-GVF: Shared Control based on Human-Influenced Guiding Vector Fields for Human-multi-robot Cooperation | Pengming Zhu et.al. | 2502.11370 | null |
2025-02-17 | “Nuclear Deployed!”: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents | Rongwu Xu et.al. | 2502.11355 | link |
2025-02-17 | A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems | Zikang Xiong et.al. | 2502.11352 | null |
2025-02-17 | ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability | Ryuto Koike et.al. | 2502.11336 | null |
2025-02-14 | SegX: Improving Interpretability of Clinical Image Diagnosis with Segmentation-based Enhancement | Yuhao Zhang et.al. | 2502.10296 | link |
2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | link |
2025-02-14 | Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Laurin Luttmann et.al. | 2502.10233 | link |
2025-02-14 | Do Large Language Models Reason Causally Like Us? Even Better? | Hanna M. Dettki et.al. | 2502.10215 | null |
2025-02-14 | STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning | Mingcong Lei et.al. | 2502.10177 | null |
2025-02-14 | Technical Risks of (Lethal) Autonomous Weapons Systems | Heramb Podar et.al. | 2502.10174 | null |
2025-02-14 | Modeling biases in binary decision-making within the generalized nonlinear q-voter model | Maciej Doniec et.al. | 2502.10172 | link |
2025-02-14 | Cooperative Multi-Agent Planning with Adaptive Skill Synthesis | Zhiyuan Li et.al. | 2502.10148 | null |
2025-02-14 | Interpretable Concept-based Deep Learning Framework for Multimodal Human Behavior Modeling | Xinyu Li et.al. | 2502.10145 | null |
2025-02-14 | COMBINEX: A Unified Counterfactual Explainer for Graph Neural Networks via Node Feature and Structural Perturbations | Flavio Giorgi et.al. | 2502.10111 | null |
2025-02-14 | Structuring the Environment Nudges Participants Toward Hierarchical Over Shortest Path Planning | Valeria Simonelli et.al. | 2502.10098 | null |
2025-02-14 | DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery | Utkarsh Mall et.al. | 2502.10060 | null |
2025-02-14 | InterGridNet: An Electric Network Frequency Approach for Audio Source Location Classification Using Convolutional Neural Networks | Christos Korgialas et.al. | 2502.10011 | null |
2025-02-14 | Decision Information Meets Large Language Models: The Future of Explainable Operations Research | Yansen Zhang et.al. | 2502.09994 | link |
2025-02-14 | V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models | Hsu-kuang Chiu et.al. | 2502.09980 | null |
2025-02-14 | Dual Control for Interactive Autonomous Merging with Model Predictive Diffusion | Jacob Knaup et.al. | 2502.09918 | null |
2025-02-14 | Automated Hypothesis Validation with Agentic Sequential Falsifications | Kexin Huang et.al. | 2502.09858 | link |
2025-02-14 | Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Zhuangzhuang Jia et.al. | 2502.09831 | null |
2025-02-13 | Medical Applications of Graph Convolutional Networks Using Electronic Health Records: A Survey | Garrik Hoyt et.al. | 2502.09781 | null |
2025-02-13 | SoK: Come Together – Unifying Security, Information Theory, and Cognition for a Mixed Reality Deception Attack Ontology & Analysis Framework | Ali Teymourian et.al. | 2502.09763 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | PenTest++: Elevating Ethical Hacking with AI and Automation | Haitham S. Al-Sinani et.al. | 2502.09484 | null |
2025-02-13 | Generalizable Reinforcement Learning with Biologically Inspired Hyperdimensional Occupancy Grid Maps for Exploration and Goal-Directed Path Planning | Shay Snyder et.al. | 2502.09393 | null |
2025-02-13 | Language Agents as Digital Representatives in Collective Decision-Making | Daniel Jarrett et.al. | 2502.09369 | null |
2025-02-13 | Revisiting Topological Interference Management: A Learning-to-Code on Graphs Perspective | Zhiwei Shan et.al. | 2502.09344 | null |
2025-02-13 | Towards Seamless Hierarchical Federated Learning under Intermittent Client Participation: A Stagewise Decision-Making Methodology | Minghong Wu et.al. | 2502.09303 | null |
2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | null |
2025-02-13 | GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Hongyin Zhang et.al. | 2502.09268 | null |
2025-02-13 | Properties of Path-Independent Choice Correspondences and Their Applications to Efficient and Stable Matchings | Keisuke Bando et.al. | 2502.09265 | null |
2025-02-13 | LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement | Daocheng Fu et.al. | 2502.09170 | link |
2025-02-13 | Show Me the Work: Fact-Checkers’ Requirements for Explainable Automated Fact-Checking | Greta Warren et.al. | 2502.09083 | null |
2025-02-13 | The Datafication of Care in Public Homelessness Services | Erina Seh-Young Moon et.al. | 2502.09043 | null |
2025-02-13 | Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning | Yiming Yang et.al. | 2502.08974 | null |
2025-02-13 | A Comprehensive Survey on Imbalanced Data Learning | Xinyi Gao et.al. | 2502.08960 | null |
2025-02-13 | Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based Planners | Fengming Zhu et.al. | 2502.08950 | link |
2025-02-13 | PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology | Fatemeh Ghezloo et.al. | 2502.08916 | null |
2025-02-13 | Estimating Probabilities of Causation with Machine Learning Models | Shuai Wang et.al. | 2502.08858 | null |
2025-02-12 | A procedure for assessing of machine health index data prediction quality | Daniel Kuzio et.al. | 2502.08837 | null |
2025-02-12 | A method for classification of data with uncertainty using hypothesis testing | Shoma Yokura et.al. | 2502.08582 | null |
2025-02-12 | Beyond Predictions: A Participatory Framework for Multi-Stakeholder Decision-Making | Vittoria Vineis et.al. | 2502.08542 | null |
2025-02-12 | MoDitector: Module-Directed Testing for Autonomous Driving Systems | Renzhi Wang et.al. | 2502.08504 | null |
2025-02-12 | Accelerating Stable Matching between Workers and Time-Dependent Tasks for Dynamic MCS: A Stagewise Service Trading Approach | Houyi Qi et.al. | 2502.08386 | null |
2025-02-12 | AdvSwap: Covert Adversarial Perturbation with High Frequency Info-swapping for Autonomous Driving Perception | Yuanhao Huang et.al. | 2502.08374 | null |
2025-02-12 | Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection | Ziyue Yang et.al. | 2502.08373 | link |
2025-02-12 | Model-Free Counterfactual Subset Selection at Scale | Minh Hieu Nguyen et.al. | 2502.08326 | null |
2025-02-12 | FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation | Yang Sun et.al. | 2502.08260 | link |
2025-02-12 | DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias | Song Park et.al. | 2502.08167 | null |
2025-02-12 | Provably Robust Federated Reinforcement Learning | Minghong Fang et.al. | 2502.08123 | null |
2025-02-12 | Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD) | Naomi Akhras et.al. | 2502.08073 | null |
2025-02-12 | Multi-Agent Performative Prediction Beyond the Insensitivity Assumption: A Case Study for Mortgage Competition | Guanghui Wang et.al. | 2502.08063 | null |
2025-02-12 | End-to-End Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-11 | Joint Modelling Histology and Molecular Markers for Cancer Classification | Xiaofei Wang et.al. | 2502.07979 | link |
2025-02-11 | VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning | Qingyuan Wu et.al. | 2502.07949 | null |
2025-02-11 | Breaking Down Bias: On The Limits of Generalizable Pruning Strategies | Sibo Ma et.al. | 2502.07771 | null |
2025-02-11 | An Advanced NLP Framework for Automated Medical Diagnosis with DeBERTa and Dynamic Contextual Positional Gating | Mohammad Ali Labbaf Khaniki et.al. | 2502.07755 | null |
2025-02-11 | Whole-Genome Phenotype Prediction with Machine Learning: Open Problems in Bacterial Genomics | Tamsin James et.al. | 2502.07749 | null |
2025-02-11 | Human Decision-making is Susceptible to AI-driven Manipulation | Sahand Sabour et.al. | 2502.07663 | link |
2025-02-11 | Response rate estimation in single-stage basket trials: A comparison of estimators that allow for borrowing across cohorts | Antonios Daletzakis et.al. | 2502.07639 | null |
2025-02-11 | Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving | Yinzhe Shen et.al. | 2502.07631 | null |
2025-02-11 | Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy | Kristijan Atanasov et.al. | 2502.07593 | null |
2025-02-11 | Logarithmic Regret for Online KL-Regularized Reinforcement Learning | Heyang Zhao et.al. | 2502.07460 | null |
2025-02-11 | Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Novendra Setyawan et.al. | 2502.07417 | null |
2025-02-11 | USRNet: Unified Scene Recovery Network for Enhancing Traffic Imaging under Multiple Adverse Weather Conditions | Yuxu Lu et.al. | 2502.07372 | link |
2025-02-11 | Coarse Set Theory: A Mathematical Foundation for Coarse Ethics | Takashi Izumo et.al. | 2502.07347 | null |
2025-02-11 | Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Xiang Li et.al. | 2502.07309 | link |
2025-02-11 | Fairness in Multi-Agent AI: A Unified Framework for Ethical and Equitable Autonomous Systems | Rajesh Ranjan et.al. | 2502.07254 | null |
2025-02-11 | Pareto Optimal Algorithmic Recourse in Multi-cost Function | Wen-Ling Chen et.al. | 2502.07214 | null |
2025-02-11 | Space-Aware Instruction Tuning: Dataset and Benchmark for Guide Dog Robots Assisting the Visually Impaired | ByungOk Han et.al. | 2502.07183 | link |
2025-02-11 | Online Aggregation of Trajectory Predictors | Alex Tong et.al. | 2502.07178 | null |
2025-02-11 | Advancing Geological Carbon Storage Monitoring With 3d Digital Shadow Technology | Abhinav Prakash Gahlot et.al. | 2502.07169 | null |
2025-02-11 | Bayesian Optimization for Building Social-Influence-Free Consensus | Masaki Adachi et.al. | 2502.07166 | null |
2025-02-11 | Mesh2SSM++: A Probabilistic Framework for Unsupervised Learning of Statistical Shape Model of Anatomies from Surface Meshes | Krithika Iyer et.al. | 2502.07145 | null |
2025-02-10 | Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | Denis Bakushev et.al. | 2502.07124 | null |
2025-02-10 | Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty | Valia Efthymiou et.al. | 2502.06749 | null |
2025-02-10 | Application of Artificial Intelligence (AI) in Civil Engineering | Temitope Funmilayo Awolusi et.al. | 2502.06727 | null |
2025-02-10 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-10 | Koopman-Equivariant Gaussian Processes | Petar Bevanda et.al. | 2502.06645 | null |
2025-02-10 | A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems | Linxiao Gong et.al. | 2502.06581 | null |
2025-02-10 | The Minimal Search Space for Conditional Causal Bandits | Francisco N. F. Q. Simoes et.al. | 2502.06577 | null |
2025-02-10 | SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Shuhao Liao et.al. | 2502.06440 | link |
2025-02-10 | Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models | Tianshuo Xu et.al. | 2502.06419 | null |
2025-02-10 | Toolbox for Developing Physics Informed Neural Networks for Power Systems Components | Ioannis Karampinis et.al. | 2502.06412 | null |
2025-02-10 | Habitizing Diffusion Planning for Efficient and Effective Decision Making | Haofei Lu et.al. | 2502.06401 | link |
2025-02-10 | Occlusion-Aware Contingency Safety-Critical Planning for Autonomous Vehicles | Lei Zheng et.al. | 2502.06359 | null |
2025-02-10 | Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation | Lingkun Luo et.al. | 2502.06272 | null |
2025-02-10 | Amplifying Minority Voices: AI-Mediated Devil’s Advocate System for Inclusive Group Decision-Making | Soohwan Lee et.al. | 2502.06251 | null |
2025-02-10 | Words or Numbers? How Framing Uncertainties Affects Risk Assessment and Decision-Making | Robin Bodenberger et.al. | 2502.06241 | null |
2025-02-10 | Predicting Energy Demand with Tensor Factor Models | Mattia Banin et.al. | 2502.06213 | null |
2025-02-10 | Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement | Junyu Lu et.al. | 2502.06207 | link |
2025-02-10 | Actual Achieved Gain and Optimal Perceived Gain: Modeling Human Take-over Decisions Towards Automated Vehicles’ Suggestions | Shuning Zhang et.al. | 2502.06179 | null |
2025-02-10 | Dynamic Pricing with Adversarially-Censored Demands | Jianyu Xu et.al. | 2502.06168 | null |
2025-02-10 | The Value of Information in Human-AI Decision-making | Ziyang Guo et.al. | 2502.06152 | null |
2025-02-07 | Bridging Voting and Deliberation with Algorithms: Field Insights from vTaiwan and Kultur Komitee | Joshua C. Yang et.al. | 2502.05017 | null |
2025-02-07 | Conformal Prediction for Electricity Price Forecasting in the Day-Ahead and Real-Time Balancing Market | Ciaran O’Connor et.al. | 2502.04935 | null |
2025-02-07 | Mobile Network-specialized Large Language Models for 6G: Architectures, Innovations, Challenges, and Future Trends | Abdelaali Chaoub et.al. | 2502.04933 | null |
2025-02-07 | Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects | Levente Zólyomi et.al. | 2502.04899 | null |
2025-02-07 | Adaptive Learning-based Model Predictive Control Strategy for Drift Vehicles | Bei Zhou et.al. | 2502.04696 | null |
2025-02-07 | Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization | Zelai Xu et.al. | 2502.04686 | null |
2025-02-07 | Shifting Attention to You: Personalized Brain-Inspired AI Models | Stephen Chong Zhao et.al. | 2502.04658 | null |
2025-02-07 | Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Junde Wu et.al. | 2502.04644 | link |
2025-02-06 | Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Hongliang Chi et.al. | 2502.04554 | null |
2025-02-06 | Regulating Reality: Exploring Synthetic Media Through Multistakeholder AI Governance | Claire R. Leibowicz et.al. | 2502.04526 | null |
2025-02-06 | KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference | Xing Li et.al. | 2502.04420 | link |
2025-02-06 | MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Xuejiao Zhao et.al. | 2502.04413 | link |
2025-02-06 | SMART: Advancing Scalable Map Priors for Driving Topology Reasoning | Junjie Ye et.al. | 2502.04329 | null |
2025-02-06 | Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study | Michael Walters et.al. | 2502.04249 | null |
2025-02-06 | Safeguarding connected autonomous vehicle communication: Protocols, intra- and inter-vehicular attacks and defenses | Mohammed Aledhari et.al. | 2502.04201 | null |
2025-02-06 | Strategic Learning with Local Explanations as Feedback | Kiet Q. H. Vo et.al. | 2502.04058 | null |
2025-02-07 | Towards Explainable Spoofed Speech Attribution and Detection:a Probabilistic Approach for Characterizing Speech Synthesizer Components | Jagabandhu Mishra et.al. | 2502.04049 | null |
2025-02-06 | Debiasing Architectural Decision-Making: An Experiment With Students and Practitioners | Klara Borowa et.al. | 2502.04011 | null |
2025-02-06 | A Self-supervised Multimodal Deep Learning Approach to Differentiate Post-radiotherapy Progression from Pseudoprogression in Glioblastoma | Ahmed Gomaa et.al. | 2502.03999 | null |
2025-02-06 | Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections | Zengqi Peng et.al. | 2502.03960 | null |
2025-02-06 | Fairness Aware Reinforcement Learning via Proximal Policy Optimization | Gabriele La Malfa et.al. | 2502.03953 | null |
2025-02-06 | Rule-Based Modeling of Low-Dimensional Data with PCA and Binary Particle Swarm Optimization (BPSO) in ANFIS | Afnan Al-Ali et.al. | 2502.03895 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-06 | PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication | Zhuohui Zhang et.al. | 2502.03845 | null |
2025-02-06 | Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation | Xuan Li et.al. | 2502.03813 | null |
2025-02-06 | PRISM: A Robust Framework for Skill-based Meta-Reinforcement Learning with Noisy Demonstrations | Sanghyeon Lee et.al. | 2502.03752 | null |
2025-02-06 | More Modality, More AI: Exploring Design Opportunities of AI-Based Multi-modal Remote Monitoring Technologies for Early Detection of Mental Health Sequelae in Youth Concussion Patients | Bingsheng Yao et.al. | 2502.03732 | null |
2025-02-06 | Reduce Lap Time for Autonomous Racing with Curvature-Integrated MPCC Local Trajectory Planning Method | Zhouheng Li et.al. | 2502.03695 | link |
2025-02-05 | Vehicle Routing Problems in the Age of Semi-Autonomous Driving | Hins Hu et.al. | 2502.03655 | null |
2025-02-05 | Investigating Corporate Social Responsibility Initiatives: Examining the case of corporate Covid-19 response | Meheli Basu et.al. | 2502.03421 | null |
2025-02-05 | CAPE: Covariate-Adjusted Pre-Training for Epidemic Time Series Forecasting | Zewen Liu et.al. | 2502.03393 | null |
2025-02-05 | A Structured Reasoning Framework for Unbalanced Data Classification Using Probabilistic Models | Junliang Du et.al. | 2502.03386 | null |
2025-02-05 | Robust Autonomy Emerges from Self-Play | Marco Cusumano-Towner et.al. | 2502.03349 | null |
2025-02-05 | Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes | Haotian Wu et.al. | 2502.03335 | null |
2025-02-05 | A Scalable Approach to Probabilistic Neuro-Symbolic Verification | Vasileios Manginas et.al. | 2502.03274 | null |
2025-02-05 | Cooperation, satisfaction, and rationality in social games on complex networks with aspiration-driven players | M. Aguilar-Janita et.al. | 2502.03109 | null |
2025-02-05 | Driver Assistance System Based on Multimodal Data Hazard Detection | Long Zhouxiang et.al. | 2502.03005 | null |
2025-02-05 | Label Anything: An Interpretable, High-Fidelity and Prompt-Free Annotator | Wei-Bin Kou et.al. | 2502.02972 | null |
2025-02-05 | ScholaWrite: A Dataset of End-to-End Scholarly Writing Process | Linghe Wang et.al. | 2502.02904 | null |
2025-02-05 | Data-driven Causal Discovery for Pedestrians-Autonomous Personal Mobility Vehicle Interactions with eHMIs: From Psychological States to Walking Behaviors | Hailong Liu et.al. | 2502.02805 | null |
2025-02-05 | Early Stopping in Contextual Bandits and Inferences | Zihan Cui et.al. | 2502.02793 | null |
2025-02-04 | Runway capacity expansion planning for public airports under demand uncertainty | Ziyue Li et.al. | 2502.02783 | null |
2025-02-04 | SD++: Enhancing Standard Definition Maps by Incorporating Road Knowledge using LLMs | Hitvarth Diwanji et.al. | 2502.02773 | null |
2025-02-04 | How Inclusively do LMs Perceive Social and Moral Norms? | Michael Galarnyk et.al. | 2502.02696 | link |
2025-02-04 | Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges | Amit Ranjan Trivedi et.al. | 2502.02692 | null |
2025-02-04 | QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search | Zongyu Lin et.al. | 2502.02584 | link |
2025-02-04 | Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach | Tianyang Xie et.al. | 2502.02567 | null |
2025-02-04 | Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents | Shayan Kiyani et.al. | 2502.02561 | null |
2025-02-04 | Anytime Incremental $ρ$ POMDP Planning in Continuous Spaces | Ron Benchetrit et.al. | 2502.02549 | null |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | The Skin Game: Revolutionizing Standards for AI Dermatology Model Comparison | Łukasz Miętkiewicz et.al. | 2502.02500 | link |
2025-02-04 | Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models | Haoran Ye et.al. | 2502.02444 | null |
2025-02-04 | Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment | Yaling Shen et.al. | 2502.02438 | null |
2025-02-04 | Event-aided Semantic Scene Completion | Shangwei Guo et.al. | 2502.02334 | link |
2025-02-04 | Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation | Siyu Wang et.al. | 2502.02327 | null |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | link |
2025-02-04 | Human-Aided Trajectory Planning for Automated Vehicles through Teleoperation and Arbitration Graphs | Nick Le Large et.al. | 2502.02207 | null |
2025-02-04 | VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Siyu Xu et.al. | 2502.02175 | null |
2025-02-04 | On the Guidance of Flow Matching | Ruiqi Feng et.al. | 2502.02150 | link |
2025-02-04 | Risk-Aware Driving Scenario Analysis with Large Language Models | Yuan Gao et.al. | 2502.02145 | link |
2025-02-04 | Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification | Rudolf Reiter et.al. | 2502.02133 | null |
2025-02-04 | Online Clustering of Dueling Bandits | Zhiyong Wang et.al. | 2502.02079 | null |
2025-02-04 | CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics | Saad Alqithami et.al. | 2502.02060 | null |
2025-02-04 | From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing | Siwei Luo et.al. | 2502.02025 | null |
2025-02-04 | Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach | Mohammed Alsakabi et.al. | 2502.01940 | null |
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400 | link |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | link |
2025-01-31 | Learning Contracts in Hierarchical Multi-Agent Systems | Antoine Scheid et.al. | 2501.19388 | null |
2025-01-31 | CoSTI: Consistency Models for (a faster) Spatio-Temporal Imputation | Javier Solís-García et.al. | 2501.19364 | link |
2025-01-31 | Towards Adaptive Self-Improvement for Smarter Energy Systems | Alexander Sommer et.al. | 2501.19340 | null |
2025-01-31 | Offline Learning for Combinatorial Multi-armed Bandits | Xutong Liu et.al. | 2501.19300 | null |
2025-01-31 | Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge | Amogh Joshi et.al. | 2501.19259 | null |
2025-01-31 | Rethinking Early Stopping: Refine, Then Calibrate | Eugène Berta et.al. | 2501.19195 | link |
2025-01-31 | Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play | Ching-Chun Chang et.al. | 2501.19143 | null |
2025-01-31 | Quantum Internet Use Case Analysis for the Automotive Industry | K. L. van der Enden et.al. | 2501.19070 | null |
2025-01-31 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging | Javier Montalvo et.al. | 2501.19035 | link |
2025-01-31 | Open-Source Autonomous Driving Software Platforms: Comparison of Autoware and Apollo | Hee-Yang Jung et.al. | 2501.18942 | null |
2025-01-30 | Deceptive Sequential Decision-Making via Regularized Policy Optimization | Yerin Kim et.al. | 2501.18803 | null |
2025-01-30 | Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning | Maya Kruse et.al. | 2501.18724 | null |
2025-01-30 | Bandits with Anytime Knapsacks | Eray Can Elumar et.al. | 2501.18560 | null |
2025-01-30 | A Hybrid Data-Driven Approach For Analyzing And Predicting Inpatient Length Of Stay In Health Centre | Tasfia Noor Chowdhury et.al. | 2501.18535 | null |
2025-01-30 | Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems | Parth Ganeriwala et.al. | 2501.18506 | null |
2025-01-30 | Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation | Youngjoon Lee et.al. | 2501.18416 | null |
2025-01-30 | MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding | Yuxin Zuo et.al. | 2501.18362 | null |
2025-01-30 | Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Haichen Hu et.al. | 2501.18359 | null |
2025-01-30 | Adaptive Video Streaming with AI-Based Optimization for Dynamic Network Conditions | Mohammad Tarik et.al. | 2501.18332 | null |
2025-01-30 | Functional-Ordinal Canonical Correlation Analysis With Application to Data from Optical Sensors | Giulia Patanè et.al. | 2501.18317 | null |
2025-01-30 | Statistical multi-metric evaluation and visualization of LLM system predictive performance | Samuel Ackerman et.al. | 2501.18243 | null |
2025-01-30 | Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | ShuiDe Wen et.al. | 2501.18190 | null |
2025-01-30 | Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization | Kevin Cooper et.al. | 2501.18174 | null |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-01-30 | Using Computer Vision for Skin Disease Diagnosis in Bangladesh Enhancing Interpretability and Transparency in Deep Learning Models for Skin Cancer Classification | Rafiul Islam et.al. | 2501.18161 | null |
2025-01-30 | VQLTI: Long-Term Tropical Cyclone Intensity Forecasting with Physical Constraints | Xinyu Wang et.al. | 2501.18122 | link |
2025-01-30 | DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Se-Wook Yoo et.al. | 2501.18086 | null |
2025-01-30 | Normative Evaluation of Large Language Models with Everyday Moral Dilemmas | Pratik S. Sachdeva et.al. | 2501.18081 | null |
2025-01-29 | Learning the Optimal Stopping for Early Classification within Finite Horizons via Sequential Probability Ratio Test | Akinori F. Ebihara et.al. | 2501.18059 | link |
2025-01-29 | Dynamic Coalitions in Games on Graphs with Preferences over Temporal Goals | A. Kaan Ata Yilmaz et.al. | 2501.18022 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-29 | GRACE: Generalizing Robot-Assisted Caregiving with User Functionality Embeddings | Ziang Liu et.al. | 2501.17855 | null |
2025-01-29 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | Ajinkya Khoche et.al. | 2501.17821 | link |
2025-01-29 | LEKA:LLM-Enhanced Knowledge Augmentation | Xinhao Zhang et.al. | 2501.17802 | null |
2025-01-29 | Decision-Theoretic Approaches in Learning-Augmented Algorithms | Spyros Angelopoulos et.al. | 2501.17701 | null |
2025-01-29 | CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization | Derui Wang et.al. | 2501.17667 | link |
2025-01-29 | Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Gaole He et.al. | 2501.17546 | link |
2025-01-29 | Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models | Yuxuan Li et.al. | 2501.17420 | null |
2025-01-29 | A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning | Zhengpeng Xie et.al. | 2501.17384 | null |
2025-01-29 | ASAP: Learning Generalizable Online Bin Packing via Adaptive Selection After Pruning | Han Fang et.al. | 2501.17377 | null |
2025-01-28 | Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems | Mert İnan et.al. | 2501.17348 | null |
2025-01-28 | Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication | Ashish Bastola et.al. | 2501.17329 | link |
2025-01-28 | WASUP: Interpretable Classification with Weight-Input Alignment and Class-Discriminative SUPports Vectors | Tom Nuno Wolf et.al. | 2501.17328 | null |
2025-01-28 | A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts | Hossein Mirzaei et.al. | 2501.17289 | null |
2025-01-28 | Scenario Understanding of Traffic Scenes Through Large Visual Language Models | Rivera Esteban et.al. | 2501.17131 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-01-28 | Using Sustainability Impact Scores for Software Architecture Evaluation | Iffat Fatima et.al. | 2501.17004 | null |
2025-01-28 | Pareto sensitivity, most-changing sub-fronts, and knee solutions | Tommaso Giovannelli et.al. | 2501.16993 | link |
2025-01-28 | The Third Moment of AI Ethics: Developing Relatable and Contextualized Tools | Sarah Hladikova et.al. | 2501.16954 | null |
2025-01-28 | Quantifying Uncertainty and Variability in Machine Learning: Confidence Intervals for Quantiles in Performance Metric Distributions | Christoph Lehmann et.al. | 2501.16931 | null |
2025-01-28 | RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains | Shady Nasrat et.al. | 2501.16899 | link |
2025-01-28 | HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | Marco Angioli et.al. | 2501.16863 | null |
2025-01-28 | Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | Pengfei Zhu et.al. | 2501.16767 | null |
2025-01-28 | Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction | Hy Nguyen et.al. | 2501.16753 | null |
2025-01-28 | Dream to Drive with Predictive Individual World Model | Yinfeng Gao et.al. | 2501.16733 | link |
2025-01-28 | Explainability and AI Confidence in Clinical Decision Support Systems: Effects on Trust, Diagnostic Performance, and Cognitive Load in Breast Cancer Care | Olya Rezaeian et.al. | 2501.16693 | null |
2025-01-28 | SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation | Jianing Li et.al. | 2501.16684 | link |
2025-01-28 | Vehicle occupancy estimation in Automated Guideway Transit via deep learning with Wi-Fi probe requests | Ziyue Li et.al. | 2501.16644 | null |
2025-01-28 | Towards Resource-Efficient Compound AI Systems | Gohar Irfan Chaudhry et.al. | 2501.16634 | null |
2025-01-28 | Engaging with AI: How Interface Design Shapes Human-AI Collaboration in High-Stakes Decision-Making | Zichen Chen et.al. | 2501.16627 | null |
2025-01-28 | Impact and influence of modern AI in metadata management | Wenli Yang et.al. | 2501.16605 | null |
2025-01-27 | Reconciling Predictive Multiplicity in Practice | Tina Behzad et.al. | 2501.16549 | link |
2025-01-27 | Sample-Efficient Behavior Cloning Using General Domain Knowledge | Feiyu Zhu et.al. | 2501.16546 | null |
2025-01-27 | Responsible Generative AI Use by Product Managers: Recoupling Ethical Principles and Practices | Genevieve Smith et.al. | 2501.16531 | null |
2025-01-27 | Sequential Decision Making in Stochastic Games with Incomplete Preferences over Temporal Objectives | Abhishek Ninad Kulkarni et.al. | 2501.16291 | null |
2025-01-27 | Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models | Huayu Li et.al. | 2501.16215 | link |
2025-01-27 | Towards Explainable Multimodal Depression Recognition for Clinical Interviews | Wenjie Zheng et.al. | 2501.16106 | link |
2025-01-27 | Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection | Eslam Eldeeb et.al. | 2501.16098 | null |
2025-01-27 | Value-oriented forecast reconciliation for renewables in electricity markets | Honglin Wen et.al. | 2501.16086 | null |
2025-01-27 | Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Anh-Kiet Duong et.al. | 2501.16037 | link |
2025-01-27 | Demographic Benchmarking: Bridging Socio-Technical Gaps in Bias Detection | Gemma Galdon Clavell et.al. | 2501.15985 | null |
2025-01-27 | Integrating Probabilistic Trees and Causal Networks for Clinical and Epidemiological Data | Sheresh Zahoor et.al. | 2501.15973 | null |
2025-01-27 | LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models | Yuewen Mei et.al. | 2501.15850 | null |
2025-01-27 | Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum | Lanpei Li et.al. | 2501.15802 | null |
2025-01-27 | Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Yu Li et.al. | 2501.15791 | link |
2025-01-27 | Prioritized Value-Decomposition Network for Explainable AI-Enabled Network Slicing | Shavbo Salehi et.al. | 2501.15734 | null |
2025-01-26 | An Empirical Study on Decision-Making Aspects in Responsible Software Engineering for AI | Lekshmi Murali Rani et.al. | 2501.15691 | null |
2025-01-26 | A Comprehensive Survey on Self-Interpretable Neural Networks | Yang Ji et.al. | 2501.15638 | link |
2025-01-26 | Be Intentional About Fairness!: Fairness, Size, and Multiplicity in the Rashomon Set | Gordon Dai et.al. | 2501.15634 | null |
2025-01-26 | Engage and Mobilize! Understanding Evolving Patterns of Social Media Usage in Emergency Management | Hemant Purohit et.al. | 2501.15608 | null |
2025-01-26 | Diffusion-Based Planning for Autonomous Driving with Flexible Guidance | Yinan Zheng et.al. | 2501.15564 | null |
2025-01-26 | Preventing Household Bankruptcy: The One-Third Rule in Financial Planning with Mathematical Validation and Game-Theoretic Insights | Aditi Godbole et.al. | 2501.15557 | null |
2025-01-26 | UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning | Oubo Ma et.al. | 2501.15529 | link |
2025-01-26 | A general, flexible and harmonious framework to construct interpretable functions in regression analysis | Tianyu Zhan et.al. | 2501.15526 | link |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | Decision-Focused Learning for Complex System Identification: HVAC Management System Application | Pietro Favaro et.al. | 2501.14708 | null |
2025-01-24 | Approach to Designing CV Systems for Medical Applications: Data, Architecture and AI | Dmitry Ryabtsev et.al. | 2501.14689 | null |
2025-01-24 | ACT-JEPA: Joint-Embedding Predictive Architecture Improves Policy Representation Learning | Aleksandar Vujinovic et.al. | 2501.14622 | null |
2025-01-24 | QuIP: Experimental design for expensive simulators with many Qualitative factors via Integer Programming | Yen-Chun Liu et.al. | 2501.14616 | null |
2025-01-24 | 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Jules Sanchez et.al. | 2501.14605 | link |
2025-01-24 | Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Wenzhang Liu et.al. | 2501.14543 | link |
2025-01-24 | Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models | Zhenguang Zhong et.al. | 2501.14530 | link |
2025-01-24 | Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data | Faiz Muhammad Chaudhry et.al. | 2501.14510 | null |
2025-01-24 | A decomposition of Fisher’s information to inform sample size for developing fair and precise clinical prediction models – Part 2: time-to-event outcomes | Richard D Riley et.al. | 2501.14482 | link |
2025-01-24 | MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems | Linfeng Liang et.al. | 2501.14451 | null |
2025-01-24 | Additive Manufacturing Processes Protocol Prediction by Artificial Intelligence using X-ray Computed Tomography data | Sunita Khod et.al. | 2501.14306 | null |
2025-01-24 | Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays | Yiming Lei et.al. | 2501.14279 | null |
2025-01-24 | Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game | Rong Ye et.al. | 2501.14225 | null |
2025-01-23 | Development of a Validation and Inspection Tool for Armband-based Lifelog Data (VITAL) to Facilitate the Clinical Use of Wearable Data: A Prototype and Usability Evaluation | Im Eunyoung et.al. | 2501.14133 | null |
2025-01-23 | Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation | Derek Yotheringhay et.al. | 2501.14119 | null |
2025-01-23 | Collaborating in a competitive world: Heterogeneous Multi-Agent Decision Making in Symbiotic Supply Chain Environments | Wan Wang et.al. | 2501.14111 | link |
2025-01-23 | Making Reliable and Flexible Decisions in Long-tailed Classification | Bolian Li et.al. | 2501.14090 | link |
2025-01-23 | Human-Alignment Influences the Utility of AI-assisted Decision Making | Nina L. Corvelo Benz et.al. | 2501.14035 | link |
2025-01-23 | What Does an Audio Deepfake Detector Focus on? A Study in the Time Domain | Petr Grinberg et.al. | 2501.13887 | null |
2025-01-23 | On the Reasoning Capacity of AI Models and How to Quantify It | Santosh Kumar Radha et.al. | 2501.13833 | null |
2025-01-23 | Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving | Lu Wang et.al. | 2501.13563 | null |
2025-01-23 | Text-driven Online Action Detection | Manuel Benavent-Lledo et.al. | 2501.13518 | link |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-23 | BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch | Yulong Hu et.al. | 2501.13448 | null |
2025-01-23 | GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization | Jaewon Lee et.al. | 2501.13417 | null |
2025-01-22 | State Combinatorial Generalization In Decision Making With Conditional Diffusion Models | Xintong Duan et.al. | 2501.13241 | null |
2025-01-22 | On the development of open geographical data infrastructures in Latin America: progress and challenges | Daniela Ballari et.al. | 2501.13235 | null |
2025-01-22 | Safe and Efficient Robot Action Planning in the Presence of Unconcerned Humans | Mohsen Amiri et.al. | 2501.13203 | null |
2025-01-22 | QuFeX: Quantum feature extraction module for hybrid quantum-classical deep neural networks | Naman Jain et.al. | 2501.13165 | null |
2025-01-22 | Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems | Marco Angioli et.al. | 2501.13139 | null |
2025-01-22 | Autonomy-of-Experts Models | Ang Lv et.al. | 2501.13074 | null |
2025-01-23 | AdaWM: Adaptive World Model based Planning for Autonomous Driving | Hang Wang et.al. | 2501.13072 | null |
2025-01-22 | FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces | Zhenran Xu et.al. | 2501.12909 | null |
2025-01-22 | A Functional Software Reference Architecture for LLM-Integrated Systems | Alessio Bucaioni et.al. | 2501.12904 | null |
2025-01-22 | Designing and Evaluating an Educational Recommender System with Different Levels of User Control | Qurat Ul Ain et.al. | 2501.12894 | null |
2025-01-22 | Closed-loop robust control of long-term diabetes progression via physical activity management | Pierluigi Francesco De Paola et.al. | 2501.12892 | null |
2025-01-22 | As Confidence Aligns: Exploring the Effect of AI Confidence on Human Self-confidence in Human-AI Decision Making | Jingshu Li et.al. | 2501.12868 | null |
2025-01-22 | ACEBench: Who Wins the Match Point in Tool Learning? | Chen Chen et.al. | 2501.12851 | null |
2025-01-22 | To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning | Hilmy Baja et.al. | 2501.12823 | link |
2025-01-22 | Unveiling Zero-Space Detection: A Novel Framework for Autonomous Ransomware Identification in High-Velocity Environments | Lafedi Svet et.al. | 2501.12811 | null |
2025-01-22 | Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning | Xiaolei Chen et.al. | 2501.12799 | null |
2025-01-22 | Cost Optimization for Serverless Edge Computing with Budget Constraints using Deep Reinforcement Learning | Chen Chen et.al. | 2501.12783 | null |
2025-01-22 | Online Rack Placement in Large-Scale Data Centers | Saumil Baxi et.al. | 2501.12725 | null |
2025-01-22 | PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X | Qiong Wu et.al. | 2501.12656 | link |
2025-01-22 | Inverse Reinforcement Learning with Switching Rewards and History Dependency for Characterizing Animal Behaviors | Jingyang Ke et.al. | 2501.12633 | null |
2025-01-22 | Improved Detection and Diagnosis of Faults in Deep Neural Networks Using Hierarchical and Explainable Classification | Sigma Jahan et.al. | 2501.12560 | null |
2025-01-21 | R2D2: Remembering, Reflecting and Dynamic Decision Making for Web Agents | Tenghao Huang et.al. | 2501.12485 | null |
2025-01-21 | Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management | Arthur Vitui et.al. | 2501.12461 | link |
2025-01-21 | UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Yujia Qin et.al. | 2501.12326 | link |
2025-01-21 | RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning | Jiacheng Zuo et.al. | 2501.12296 | link |
2025-01-21 | Solar Panel Selection using Extended WASPAS with Disc Intuitionistic Fuzzy Choquet Integral Operators: CASPAS Methodology | Mahmut Can Bozyiğit et.al. | 2501.12251 | null |
2025-01-21 | Video Deblurring by Sharpness Prior Detection and Edge Information | Yang Tian et.al. | 2501.12246 | link |
2025-01-21 | Convergence of time-delayed opinion dynamics with complex interaction types | Lingling Yao et.al. | 2501.12219 | null |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Explainability for Vision Foundation Models: A Survey | Rémi Kazmierczak et.al. | 2501.12203 | null |
2025-01-21 | DataPro – A Standardized Data Understanding and Processing Procedure: A Case Study of an Eco-Driving Project | Zhipeng Ma et.al. | 2501.12176 | null |
2025-01-21 | Optimizing Portfolio Performance through Clustering and Sharpe Ratio-Based Optimization: A Comparative Backtesting Approach | Keon Vin Park et.al. | 2501.12074 | null |
2025-01-21 | Adaptive Class Learning to Screen Diabetic Disorders in Fundus Images of Eye | Shramana Dey et.al. | 2501.12048 | null |
2025-01-21 | Select2Drive: Pragmatic Communications for Real-Time Collaborative Autonomous Driving | Jiahao Huang et.al. | 2501.12040 | null |
2025-01-21 | A Stochastic Geometry Based Techno-Economic Analysis of RIS-Assisted Cellular Networks | Guodong Sun et.al. | 2501.12037 | link |
2025-01-21 | Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jie Zhao et.al. | 2501.11968 | null |
2025-01-21 | Make Full Use of Testing Information: An Integrated Accelerated Testing and Evaluation Method for Autonomous Driving Systems | Xinzheng Wu et.al. | 2501.11924 | null |
2025-01-21 | Bridging the Communication Gap: Evaluating AI Labeling Practices for Trustworthy AI Development | Raphael Fischer et.al. | 2501.11909 | link |
2025-01-21 | Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems | Ludovico Crippa et.al. | 2501.11897 | null |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-01-21 | Utilising Deep Learning to Elicit Expert Uncertainty | Julia R. Falconer et.al. | 2501.11813 | link |
2025-01-20 | Human-AI Collaborative Game Testing with Vision Language Models | Boran Zhang et.al. | 2501.11782 | null |
2025-01-20 | Optimizing for aggressive-style strategies in Flesh and Blood is NP-hard | Leonardo Gasparini Romão et.al. | 2501.11683 | null |
2025-01-17 | Uncertainty-Aware Digital Twins: Robust Model Predictive Control using Time-Series Deep Quantile Learning | Yi-Ping Chen et.al. | 2501.10337 | null |
2025-01-17 | Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling | Suvodip Dey et.al. | 2501.10316 | link |
2025-01-17 | Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture | Suvidha Mhatre et.al. | 2501.10292 | null |
2025-01-17 | Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy | Ishank Juneja et.al. | 2501.10290 | null |
2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | null |
2025-01-17 | Michscan: Black-Box Neural Network Integrity Checking at Runtime Through Power Analysis | Robi Paul et.al. | 2501.10174 | null |
2025-01-17 | Small Decision Trees for MDPs with Deductive Synthesis | Roman Andriushchenko et.al. | 2501.10126 | null |
2025-01-17 | LLM Reasoner and Automated Planner: A new NPC approach | Israel Puerta-Merino et.al. | 2501.10106 | null |
2025-01-17 | A recursive Bayesian neural network for constitutive modeling of sands under monotonic loading | Toiba Noor et.al. | 2501.10088 | null |
2025-01-17 | AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search | Wenfeng Feng et.al. | 2501.10053 | null |
2025-01-17 | Mapping scientific communities at scale | Victor Barbier et.al. | 2501.10035 | link |
2025-01-17 | Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks | Junlan Chen et.al. | 2501.10017 | null |
2025-01-17 | A note on the theoretical approach to Grassmannians and Plücker coordinates for additive skew-symmetric pairwise comparisons matrices | Waldemar W. Koczkodaj et.al. | 2501.10014 | null |
2025-01-17 | Explainable artificial intelligence (XAI): from inherent explainability to large language models | Fuseini Mumuni et.al. | 2501.09967 | null |
2025-01-17 | Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph | Jiawei Xu et.al. | 2501.09909 | null |
2025-01-16 | Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing | Wenhan Wang et.al. | 2501.09866 | null |
2025-01-16 | Detection of Vascular Leukoencephalopathy in CT Images | Z. Cernekova et.al. | 2501.09863 | null |
2025-01-16 | Empirical Evaluation of Embedding Models in the Context of Text Classification in Document Review in Construction Delay Disputes | Fusheng Wei et.al. | 2501.09859 | null |
2025-01-16 | From Explainability to Interpretability: Interpretable Policies in Reinforcement Learning Via Model Explanation | Peilang Li et.al. | 2501.09858 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes | Nathaniel S. Keplinger et.al. | 2501.09646 | link |
2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
2025-01-16 | Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Zhaocheng Liu et.al. | 2501.09484 | link |
2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | null |
2025-01-16 | RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jianrui Shi et.al. | 2501.09465 | null |
2025-01-16 | Agile System Development Lifecycle for AI Systems: Decision Architecture | Asif Q. Gill et.al. | 2501.09434 | null |
2025-01-16 | On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Zichang Ge et.al. | 2501.09327 | link |
2025-01-16 | Modeling Language for Scenario Development of Autonomous Driving Systems | Toshiaki Aoki et.al. | 2501.09319 | null |
2025-01-16 | SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs | Anbang Ye et.al. | 2501.09316 | null |
2025-01-16 | Community attitudes towards the environmental cost of computational fluid dynamics research | Miranda van Heel et.al. | 2501.09314 | null |
2025-01-16 | Redefining Affordance via Computational Rationality | Yi-Chi Liao et.al. | 2501.09233 | null |
2025-01-15 | Valid post-selection inference for penalized G-estimation with longitudinal observational data | Ajmery Jaman et.al. | 2501.09196 | null |
2025-01-15 | Embodied Scene Understanding for Vision Language Models via MetaVQA | Weizhen Wang et.al. | 2501.09167 | null |
2025-01-15 | Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG | Aditi Singh et.al. | 2501.09136 | link |
2025-01-15 | Stochastic Optimal Control of Prosumers in a District Heating System | Maalvladédon Ganet Somé et.al. | 2501.09088 | null |
2025-01-15 | Training-Aware Risk Control for Intensity Modulated Radiation Therapies Quality Assurance with Conformal Prediction | Kevin He et.al. | 2501.08963 | link |
2025-01-15 | Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action | Fouad Bousetouane et.al. | 2501.08944 | null |
2025-01-15 | Visual WetlandBirds Dataset: Bird Species Identification and Behavior Recognition in Videos | Javier Rodriguez-Juan et.al. | 2501.08931 | link |
2025-01-15 | Modeling Melt Pool Features and Spatter Using Symbolic Regression and Machine Learning | Olabode T. Ajenifujah et.al. | 2501.08922 | null |
2025-01-15 | PAC Learnability of Scenario Decision-Making Algorithms: Necessary and Sufficient Conditions | Guillaume O. Berger et.al. | 2501.08887 | null |
2025-01-15 | Improved Compression Bounds for Scenario Decision Making | Guillaume O. Berger et.al. | 2501.08884 | null |
2025-01-15 | The geometry of moral decision making | Roland M. Friedrich et.al. | 2501.08865 | null |
2025-01-15 | Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving | Tengpeng Li et.al. | 2501.08861 | link |
2025-01-15 | Beating Competitive Ratio 4 for Graphic Matroid Secretary | Kiarash Banihashem et.al. | 2501.08846 | null |
2025-01-15 | Visualisation of multi-indication randomised control trial evidence to support decision-making in oncology: a case study on bevacizumab | Sumayya Anwer et.al. | 2501.08744 | null |
2025-01-15 | Consensus ranking by quantum annealing | Daniele Franch et.al. | 2501.08664 | null |
2025-01-15 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
2025-01-15 | Transformer-based Multivariate Time Series Anomaly Localization | Charalampos Shimillas et.al. | 2501.08628 | null |
2025-01-15 | ANSR-DT: An Adaptive Neuro-Symbolic Learning and Reasoning Framework for Digital Twins | Safayat Bin Hakim et.al. | 2501.08561 | link |
2025-01-15 | Learning Hyperplane Tree: A Piecewise Linear and Fully Interpretable Decision-making Framework | Hongyi Li et.al. | 2501.08515 | null |
2025-01-14 | OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Hao Chen et.al. | 2501.08406 | link |
2025-01-14 | Decoding Interpretable Logic Rules from Neural Networks | Chuqin Geng et.al. | 2501.08281 | null |
2025-01-14 | Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation | Aleix Nicolás Olivé et.al. | 2501.08280 | null |
2025-01-14 | Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning | Enrique Adrian Villarrubia-Martin et.al. | 2501.08234 | null |
2025-01-14 | Big Batch Bayesian Active Learning by Considering Predictive Probabilities | Sebastian W. Ober et.al. | 2501.08223 | null |
2025-01-14 | LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Yukai Ma et.al. | 2501.08168 | null |
2025-01-14 | Potential and Perils of Large Language Models as Judges of Unstructured Textual Data | Rewina Bedemariam et.al. | 2501.08167 | null |
2025-01-14 | FairTTTS: A Tree Test Time Simulation Method for Fairness-Aware Classification | Nurit Cohen-Inger et.al. | 2501.08155 | link |
2025-01-14 | Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving | Guizhe Jin et.al. | 2501.08096 | null |
2025-01-14 | Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Nert Keser et.al. | 2501.08083 | null |
2025-01-14 | LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS | Muhammad Ashfaq et.al. | 2501.07992 | null |
2025-01-14 | GAC-Net_Geometric and attention-based Network for Depth Completion | Kuang Zhu et.al. | 2501.07988 | null |
2025-01-14 | Phase of Flight Classification in Aviation Safety using LSTM, GRU, and BiLSTM: A Case Study with ASN Dataset | Aziida Nanyonga et.al. | 2501.07925 | null |
2025-01-14 | Examining the Representation of Youth in the US Policy Documents through the Lens of Research | Miftahul Jannat Mokarrama et.al. | 2501.07858 | link |
2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
2025-01-14 | Visual Language Models as Operator Agents in the Space Domain | Alejandro Carrasco et.al. | 2501.07802 | null |
2025-01-14 | Black-box Optimization with Simultaneous Statistical Inference for Optimal Performance | Teng Lian et.al. | 2501.07795 | null |
2025-01-14 | HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference | Yiming Gao et.al. | 2501.07767 | null |
2025-01-13 | ML-assisted Randomization Tests for Detecting Treatment Effects in A/B Experiments | Wenxuan Guo et.al. | 2501.07722 | null |
2025-01-13 | Energy-Efficient Cryogenic Neuromorphic Network with Superconducting Memristor | Md Mazharul Islam et.al. | 2501.07683 | null |
2025-01-13 | RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning | Mingkang Wu et.al. | 2501.07502 | null |
2025-01-13 | Encrypted Computation of Collision Probability for Secure Satellite Conjunction Analysis | Jihoon Suh et.al. | 2501.07476 | null |
2025-01-13 | Predicting System Dynamics of Universal Growth Patterns in Complex Systems | Leila Hedayatifar et.al. | 2501.07349 | null |
2025-01-13 | Code and Pixels: Multi-Modal Contrastive Pre-training for Enhanced Tabular Data Analysis | Kankana Roy et.al. | 2501.07304 | null |
2025-01-13 | PO-GVINS: Tightly Coupled GNSS-Visual-Inertial Integration with Pose-Only Representation | Zhuo Xu et.al. | 2501.07259 | null |
2025-01-13 | Self-organized institutions in evolutionary dynamical-systems game | Kenji Itao et.al. | 2501.07249 | null |
2025-01-13 | Privacy-Preserving Data Quality Assessment for Time-Series IoT Sensors | Novoneel Chakraborty et.al. | 2501.07154 | null |
2025-01-13 | TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments | Chenyang Qi et.al. | 2501.07146 | null |
2025-01-13 | How GPT learns layer by layer | Jason Du et.al. | 2501.07108 | link |
2025-01-13 | Optimization with Multi-sourced Reference Information and Unknown Trust: A Distributionally Robust Approach | Yanru Guo et.al. | 2501.07057 | null |
2025-01-13 | Statistical Modeling of Networked Evolutionary Public Goods Games | Hiroyasu Ando et.al. | 2501.07007 | link |
2025-01-13 | LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models | Mozhgan Nasr Azadani et.al. | 2501.06986 | link |
2025-01-12 | MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis | Sadia Kamal et.al. | 2501.06887 | null |
2025-01-12 | Integrators at War: Mediating in AI-assisted Resort-to-Force Decisions | Dennis Müller et.al. | 2501.06861 | null |
2025-01-12 | Faithful Counterfactual Visual Explanations (FCVE) | Bismillah Khan et.al. | 2501.06841 | null |
2025-01-12 | Pareto Set Learning for Multi-Objective Reinforcement Learning | Erlong Liu et.al. | 2501.06773 | null |
2025-01-12 | Procedural Fairness and Its Relationship with Distributive Fairness in Machine Learning | Ziming Wang et.al. | 2501.06753 | link |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-11 | A Geometric Analysis-Based Safety Assessment Framework for MASS Route Decision-Making in Restricted Waters | Zilong Xu et.al. | 2501.06670 | null |
2025-01-11 | Common Sense Is All You Need | Hugo Latapie et.al. | 2501.06642 | null |
2025-01-10 | Probabilistic Forecasts of Load, Solar and Wind for Electricity Price Forecasting | Bartosz Uniejewski et.al. | 2501.06180 | null |
2025-01-10 | CoDriveVLM: VLM-Enhanced Urban Cooperative Dispatching and Motion Planning for Future Autonomous Mobility on Demand Systems | Haichao Liu et.al. | 2501.06132 | link |
2025-01-10 | Vehicle-in-Virtual-Environment (VVE) Based Autonomous Driving Function Development and Evaluation Methodology for Vulnerable Road User Safety | Haochong Chen et.al. | 2501.06113 | null |
2025-01-10 | From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy | Elham Aghakhani et.al. | 2501.06101 | null |
2025-01-10 | All AI Models are Wrong, but Some are Optimal | Akhil S Anand et.al. | 2501.06086 | null |
2025-01-10 | Distilling Calibration via Conformalized Credal Inference | Jiayi Huang et.al. | 2501.06066 | null |
2025-01-10 | COMIX: Compositional Explanations using Prototypes | Sarath Sivaprasad et.al. | 2501.06059 | null |
2025-01-10 | Nonlinear partial differential equations in neuroscience: from modelling to mathematical theory | José A Carrillo et.al. | 2501.06015 | null |
2025-01-10 | Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion | Sanjay Kumar et.al. | 2501.05997 | null |
2025-01-10 | Diffusion Models for Smarter UAVs: Decision-Making and Modeling | Yousef Emami et.al. | 2501.05819 | null |
2025-01-10 | Robust Counterfactual Explanations under Model Multiplicity Using Multi-Objective Optimization | Keita Kinjo et.al. | 2501.05795 | null |
2025-01-10 | TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos | Korawat Charoenpitaks et.al. | 2501.05733 | link |
2025-01-09 | Datasheets for Healthcare AI: A Framework for Transparency and Bias Mitigation | Marjia Siddik et.al. | 2501.05617 | null |
2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
2025-01-09 | Bayesian Model Selection for Network Discrimination and Risk-informed Decision Making in Material Flow Analysis | Jiankan Liao et.al. | 2501.05556 | null |
2025-01-09 | The more polypersonal the better – a short look on space geometry of fine-tuned layers | Sergei Kudriashov et.al. | 2501.05503 | null |
2025-01-09 | Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents | Jonathan Keane et.al. | 2501.05501 | null |
2025-01-09 | Explainable AI-Enhanced Deep Learning for Pumpkin Leaf Disease Detection: A Comparative Analysis of CNN Architectures | Md. Arafat Alam Khandaker et.al. | 2501.05449 | null |
2025-01-09 | From Images to Insights: Transforming Brain Cancer Diagnosis with Explainable AI | Md. Arafat Alam Khandaker et.al. | 2501.05426 | null |
2025-01-09 | Mechanistic understanding and validation of large AI models with SemanticLens | Maximilian Dreyer et.al. | 2501.05398 | link |
2025-01-09 | The global consensus on the risk management of autonomous driving | Sebastian Krügel et.al. | 2501.05391 | null |
2025-01-09 | Integrating Explainable AI for Effective Malware Detection in Encrypted Network Traffic | Sileshi Nibret Zeleke et.al. | 2501.05387 | null |
2025-01-09 | Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing | Mengfan Liu et.al. | 2501.05313 | null |
2025-01-09 | Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments | Ritam Guha et.al. | 2501.05278 | null |
2025-01-09 | Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions | Shishir Muralidhara et.al. | 2501.05246 | null |
2025-01-09 | CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection | Xiang Zhang et.al. | 2501.05132 | null |
2025-01-09 | Variable Goal Approach (VGA) Enhancing Pedestrian Dynamics Modeling | Kanika Jain et.al. | 2501.05100 | null |
2025-01-09 | DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving | Xuran Zheng et.al. | 2501.05081 | null |
2025-01-09 | LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models | Zengqi Peng et.al. | 2501.05057 | null |
2025-01-09 | UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Oleg Sautenkov et.al. | 2501.05014 | link |
2025-01-09 | CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving | Bhargava Uppuluri et.al. | 2501.04982 | null |
2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | link |
2025-01-09 | Balancing Exploration and Cybersickness: Investigating Curiosity-Driven Behavior in Virtual Environments | Tangyao Li et.al. | 2501.04905 | null |
2025-01-08 | Leveraging Log Probabilities in Language Models to Forecast Future Events | Tommaso Soru et.al. | 2501.04880 | null |
2025-01-08 | Deep Transfer $Q$ -Learning for Offline Non-Stationary Reinforcement Learning | Jinhang Chai et.al. | 2501.04870 | null |
2025-01-08 | Universal quasi-particle kinetics control the cell death decision | Felix Meige et.al. | 2501.04862 | null |
2025-01-08 | A mixture transition distribution approach to portfolio optimization | Riccardo De Blasis et.al. | 2501.04646 | null |
2025-01-08 | Analysis of Climatic Trends and Variability in Indian Topography | Ayush Prusty et.al. | 2501.04578 | null |
2025-01-08 | Effective Two-Stage Double Auction for Dynamic Resource Trading in Edge Networks via Overbooking | Sicheng Wu et.al. | 2501.04507 | null |
2025-01-08 | Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction | Guofeng Yang et.al. | 2501.04487 | null |
2025-01-08 | Motif Discovery Framework for Psychiatric EEG Data Classification | Melanija Kraljevska et.al. | 2501.04441 | null |
2025-01-08 | Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions | Doaa Mahmud et.al. | 2501.04437 | null |
2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | null |
2025-01-08 | Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts | Preethi Seshadri et.al. | 2501.04316 | link |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-08 | Drift-oriented Self-evolving Encrypted Traffic Application Classification for Actual Network Environment | Zihan Chen et.al. | 2501.04246 | null |
2025-01-08 | Making school choice lotteries transparent | Lingbo Huang et.al. | 2501.04243 | null |
2025-01-08 | GNN-based Decentralized Perception in Multirobot Systems for Predicting Worker Actions | Ali Imran et.al. | 2501.04193 | null |
2025-01-07 | A cautious use of auxiliary outcomes for decision-making in randomized clinical trials | Massimiliano Russo et.al. | 2501.04187 | link |
2025-01-07 | Deep Learning-based Feature Discovery for Decoding Phenotypic Plasticity in Pediatric High-Grade Gliomas Single-Cell Transcriptomics | Abicumaran Uthamacumaran et.al. | 2501.04181 | null |
2025-01-07 | Machine Learning for Identifying Grain Boundaries in Scanning Electron Microscopy (SEM) Images of Nanoparticle Superlattices | Aanish Paruchuri et.al. | 2501.04172 | null |
2025-01-07 | SEIHRDV: a multi-age multi-group epidemiological model and its validation on the COVID-19 epidemics in Italy | Luca Dede’ et.al. | 2501.04148 | null |
2025-01-07 | Bridging Impulse Control of Piecewise Deterministic Markov Processes and Markov Decision Processes: Frameworks, Extensions, and Open Challenges | Alice Cleynen et.al. | 2501.04120 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Shaoyuan Xie et.al. | 2501.04003 | link |
2025-01-07 | Multi-Hypothesis Prediction for Portfolio Optimization: A Structured Ensemble Learning Approach to Risk Diversification | Alejandro Rodriguez Dominguez et.al. | 2501.03919 | null |
2025-01-07 | Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study | Ramya Jonnala et.al. | 2501.03904 | null |
2025-01-07 | Explainable Reinforcement Learning via Temporal Policy Decomposition | Franco Ruggeri et.al. | 2501.03902 | null |
2025-01-07 | Three-dimensional attention Transformer for state evaluation in real-time strategy games | Yanqing Ye et.al. | 2501.03832 | null |
2025-01-07 | Image Segmentation: Inducing graph-based learning | Aryan Singh et.al. | 2501.03765 | link |
2025-01-07 | Controlling the low-temperature Ising model using spatiotemporal Markov decision theory | M. C. de Jongh et.al. | 2501.03668 | null |
2025-01-07 | Hybrid Machine Learning Model with a Constrained Action Space for Trajectory Prediction | Alexander Fertig et.al. | 2501.03666 | null |
2025-01-07 | Collision Risk Quantification and Conflict Resolution in Trajectory Tracking for Acceleration-Actuated Multi-Robot Systems | Xiaoxiao Li et.al. | 2501.03585 | null |
2025-01-07 | SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving | Xuewen Luo et.al. | 2501.03535 | null |
2025-01-07 | Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment | Prashant Trivedi et.al. | 2501.03486 | null |
2025-01-07 | Radar Signal Recognition through Self-Supervised Learning and Domain Adaptation | Zi Huang et.al. | 2501.03461 | link |
2025-01-06 | Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection | Donatella Genovese et.al. | 2501.03432 | null |
2025-01-06 | On factors influencing consumer preference in pipeline stages: an experiment | Paramahansa Pramanik et.al. | 2501.03418 | null |
2025-01-06 | High-frequency Density Nowcasts of U.S. State-Level Carbon Dioxide Emissions | Ignacio Garrón et.al. | 2501.03380 | null |
2025-01-06 | Quantum Feature-Empowered Deep Classification for Fast Mangrove Mapping | Chia-Hsiang Lin et.al. | 2501.03360 | null |
2025-01-06 | Data integrity vs. inference accuracy in large AIS datasets | Adam Kiersztyn et.al. | 2501.03358 | null |
2025-01-06 | Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook’s Holistic Bias Dataset: Implications for Language Model Training | Sabine Wehnert et.al. | 2501.03324 | null |
2025-01-06 | BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Beichen Zhang et.al. | 2501.03226 | link |
2025-01-06 | RW-Net: Enhancing Few-Shot Point Cloud Classification with a Wavelet Transform Projection-based Network | Haosheng Zhang et.al. | 2501.03221 | null |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Dennis Gross et.al. | 2501.03142 | link |
2025-01-06 | PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Mingyang Song et.al. | 2501.03124 | link |
2025-01-06 | Assessing the impact of external factors on the occurrence of emergencies | Félicien Hêche et.al. | 2501.03111 | null |
2025-01-06 | Investigating Discontinuous X-ray Irradiation as a Damage Mitigation Strategy for [M(COD)Cl] $_2$ Catalysts | Nathalie K. Fernando et.al. | 2501.03057 | null |
2025-01-06 | CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Chuanbo Hua et.al. | 2501.02977 | link |
2025-01-06 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation | Jiexi Zhong et.al. | 2501.02937 | null |
2025-01-06 | Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis | Daisuke Komura et.al. | 2501.02909 | null |
2025-01-06 | Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective | Chuxiong Sun et.al. | 2501.02888 | null |
2025-01-06 | Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans | Rezkellah Noureddine Khiati et.al. | 2501.02867 | null |
2025-01-06 | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | Toomas Tahves et.al. | 2501.02858 | null |
2025-01-06 | ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction | Binyu Zhang et.al. | 2501.02778 | link |
2025-01-06 | LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating | Deguo Xia et.al. | 2501.02763 | null |
2025-01-06 | Beyond $\mathcal{O}(\sqrt{T})$ Regret: Decoupling Learning and Decision-making in Online Linear Programming | Wenzhi Gao et.al. | 2501.02761 | null |
2025-01-05 | Markov Decision Processes for Satellite Maneuver Planning and Collision Avoidance | William Kuhl et.al. | 2501.02667 | null |
2025-01-05 | A review on reinforcement learning methods for mobility on demand systems | Tarek Chouaki et.al. | 2501.02569 | null |
2025-01-05 | UDMC: Unified Decision-Making and Control Framework for Urban Autonomous Driving with Motion Prediction of Traffic Participants | Haichao Liu et.al. | 2501.02530 | link |
2025-01-05 | The Explore of Knowledge Management Dynamic Capabilities, AI-Driven Knowledge Sharing, Knowledge-Based Organizational Support, and Organizational Learning on Job Performance: Evidence from Chinese Technological Companies | Jun Cui et.al. | 2501.02468 | null |
2025-01-03 | Evaluating Scenario-based Decision-making for Interactive Autonomous Driving Using Rational Criteria: A Survey | Zhen Tian et.al. | 2501.01886 | null |
2025-01-03 | DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data | Jiaqi Yang et.al. | 2501.01874 | null |
2025-01-03 | ASKCOS: an open source software suite for synthesis planning | Zhengkai Tu et.al. | 2501.01835 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | Beyond Non-Degeneracy: Revisiting Certainty Equivalent Heuristic for Online Linear Programming | Yilun Chen et.al. | 2501.01716 | null |
2025-01-03 | Enhancing Large Vision Model in Street Scene Semantic Understanding through Leveraging Posterior Optimization Trajectory | Wei-Bin Kou et.al. | 2501.01710 | null |
2025-01-03 | FairSense: Long-Term Fairness Analysis of ML-Enabled Systems | Yining She et.al. | 2501.01665 | link |
2025-01-03 | MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments | Cai Yin et.al. | 2501.01652 | link |
2025-01-02 | Interruption Handling for Conversational Robots | Shiye Cao et.al. | 2501.01568 | link |
2025-01-02 | AI-Enabled Operations at Fermi Complex: Multivariate Time Series Prediction for Outage Prediction and Diagnosis | Milan Jain et.al. | 2501.01509 | null |
2025-01-02 | Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective | Julian Barreiro-Gomez et.al. | 2501.01389 | null |
2025-01-02 | Marketing Mix Modeling in Lemonade | Roy Ravid et.al. | 2501.01276 | null |
2025-01-02 | Design of mechanisms for ensuring the execution of tasks in project planning | Oksana Mulesa et.al. | 2501.01255 | null |
2025-01-02 | Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects | Abdullah Mushtaq et.al. | 2501.01205 | null |
2025-01-02 | Machine Learning-Based Prediction of ICU Readmissions in Intracerebral Hemorrhage Patients: Insights from the MIMIC Databases | Shuheng Chen et.al. | 2501.01183 | null |
2025-01-02 | Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method | Ruichen Zhang et.al. | 2501.01141 | null |
2025-01-02 | Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning | Min Whoo Lee et.al. | 2501.01140 | null |
2025-01-02 | Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches | Alireza Safarzadeh et.al. | 2501.01067 | null |
2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | null |
2025-01-02 | Reasoning based on symbolic and parametric knowledge bases: a survey | Mayi Xu et.al. | 2501.01030 | null |
2025-01-02 | Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review | Yan Gu et.al. | 2501.01007 | null |
2025-01-01 | Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Zhuohua Li et.al. | 2501.00891 | null |
2025-01-01 | On the Parameterized Complexity of Controlling Amendment and Successive Winners | Yongjie Yang et.al. | 2501.00860 | null |
2025-01-01 | LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management | Yichen Luo et.al. | 2501.00826 | null |
2025-01-01 | LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity | Muhammet Anil Yagiz et.al. | 2501.00790 | null |
2024-12-31 | Improving Policy-Oriented Agent-Based Modeling with History Matching: A Case Study | David O’Gara et.al. | 2501.00616 | null |
2024-12-31 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method | Zhenpeng Huang et.al. | 2501.00584 | null |
2024-12-31 | Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs | Harit Vishwakarma et.al. | 2501.00555 | null |
2024-12-31 | Toward Information Theoretic Active Inverse Reinforcement Learning | Ondrej Bajgar et.al. | 2501.00381 | null |
2024-12-30 | Open RAN-Enabled Deep Learning-Assisted Mobility Management for Connected Vehicles | Maria Barbosa et.al. | 2412.21161 | null |
2024-12-30 | Learning Epidemiological Dynamics via the Finite Expression Method | Jianda Du et.al. | 2412.21049 | null |
2024-12-30 | Plancraft: an evaluation dataset for planning with LLM agents | Gautier Dagan et.al. | 2412.21033 | link |
2024-12-30 | KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation | Siyuan Fang et.al. | 2412.20995 | null |
2024-12-30 | TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Shaoqing Xu et.al. | 2412.20911 | link |
2024-12-30 | Rethinking Aleatoric and Epistemic Uncertainty | Freddie Bickford Smith et.al. | 2412.20892 | null |
2024-12-31 | A Tale of Two Imperatives: Privacy and Explainability | Supriya Manna et.al. | 2412.20798 | link |
2024-12-30 | DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles | Chengyue Wang et.al. | 2412.20784 | null |
2024-12-30 | Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction | Rubi Hudson et.al. | 2412.20732 | null |
2024-12-30 | Residual Connection Networks in Medical Image Processing: Exploration of ResUnet++ Model Driven by Human Computer Interaction | Peixin Dai et.al. | 2412.20709 | null |
2024-12-30 | Revolutionizing Mobility:The Latest Advancements in Autonomous Vehicle Technology | Venkata Sai Chandra Prasanth Narisetty et.al. | 2412.20688 | null |
2024-12-29 | Conformable Convolution for Topologically Aware Learning of Complex Anatomical Structures | Yousef Yeganeh et.al. | 2412.20608 | null |
2024-12-29 | MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation | Minjae Seong et.al. | 2412.20480 | null |
2024-12-29 | A Comprehensive Framework for Reliable Legal AI: Combining Specialized Expert Systems and Adaptive Refinement | Sidra Nasir et.al. | 2412.20468 | null |
2024-12-29 | Treatment Effect Estimation for Graph-Structured Targets | Shonosuke Harada et.al. | 2412.20436 | null |
2024-12-29 | Automated Demand Forecasting in small to medium-sized enterprises | Thomas Gaertner et.al. | 2412.20420 | null |
2024-12-28 | High-fidelity social learning via shared episodic memories enhances collaborative foraging through mnemonic convergence | Ismael T. Freire et.al. | 2412.20271 | null |
2024-12-28 | Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception | Athanasios Karagounis et.al. | 2412.20230 | null |
2024-12-31 | Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights | Bharath Kumar Agnur et.al. | 2412.20210 | null |
2024-12-28 | Robust Quickest Change Detection with Sampling Control | Yingze Hou et.al. | 2412.20207 | null |
2024-12-27 | ProKAN: Progressive Stacking of Kolmogorov-Arnold Networks for Efficient Liver Segmentation | Bhavesh Gyanchandani et.al. | 2412.19713 | null |
2024-12-27 | From prediction to explanation: managing influential negative reviews through explainable AI | Rongping Shen et.al. | 2412.19692 | null |
2024-12-27 | A Review on the Integration of Artificial Intelligence and Medical Imaging in IVF Ovarian Stimulation | Jana Zakall et.al. | 2412.19688 | null |
2024-12-27 | xFLIE: Leveraging Actionable Hierarchical Scene Representations for Autonomous Semantic-Aware Inspection Missions | Vignesh Kottayam Viswanathan et.al. | 2412.19571 | link |
2024-12-27 | Quantiles under ambiguity and risk sharing | Peng Liu et.al. | 2412.19546 | null |
2024-12-27 | Uncertainty quantification for improving radiomic-based models in radiation pneumonitis prediction | Chanon Puttanawarut et.al. | 2412.19511 | null |
2024-12-27 | DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-27 | Casevo: A Cognitive Agents and Social Evolution Simulator | Zexun Jiang et.al. | 2412.19498 | link |
2024-12-27 | Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases | Ioannis Bilionis et.al. | 2412.19495 | null |
2024-12-27 | An Overview of Machine Learning-Driven Resource Allocation in IoT Networks | Zhengdong Li et.al. | 2412.19478 | null |
2024-12-27 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Jiaqi Fan et.al. | 2412.19406 | link |
2024-12-27 | Fully Data-driven but Interpretable Human Behavioural Modelling with Differentiable Discrete Choice Model | Fumiyasu Makinoshima et.al. | 2412.19403 | null |
2024-12-27 | Two-echelon Electric Vehicle Routing Problem in Parcel Delivery: A Literature Review | Nima Moradi et.al. | 2412.19395 | null |
2024-12-26 | Central limit theorems for vector-valued composite functionals with smoothing and applications | Huhui Chen et.al. | 2412.19367 | null |
2024-12-26 | xSRL: Safety-Aware Explainable Reinforcement Learning – Safety as a Product of Explainability | Risal Shahriar Shefin et.al. | 2412.19311 | link |
2024-12-26 | Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | Shamik Bhattacharjee et.al. | 2412.19215 | null |
2024-12-26 | Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation | Yixin Chen et.al. | 2412.19026 | link |
2024-12-26 | A theory of appropriateness with applications to generative artificial intelligence | Joel Z. Leibo et.al. | 2412.19010 | null |
2024-12-25 | TravelAgent: Generative Agents in the Built Environment | Ariel Noyman et.al. | 2412.18985 | null |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
2024-12-24 | Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making | David Shoresh et.al. | 2412.18593 | link |
2024-12-24 | ClassifyViStA:WCE Classification with Visual understanding through Segmentation and Attention | S. Balasubramanian et.al. | 2412.18591 | link |
2024-12-24 | A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs | OpenMind et.al. | 2412.18588 | null |
2024-12-24 | Dynamic Optimization of Portfolio Allocation Using Deep Reinforcement Learning | Gang Huang et.al. | 2412.18563 | link |
2024-12-24 | Accelerating process control and optimization via machine learning: A review | Ilias Mitrai et.al. | 2412.18529 | null |
2024-12-24 | Bayesian Optimization of Bilevel Problems | Omer Ekmekcioglu et.al. | 2412.18518 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving | Gnaneswar Villuri et.al. | 2412.18489 | null |
2024-12-24 | Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents | Kaiwen Ning et.al. | 2412.18371 | link |
2024-12-24 | Point-DeepONet: A Deep Operator Network Integrating PointNet for Nonlinear Analysis of Non-Parametric 3D Geometries and Load Conditions | Jangseop Park et.al. | 2412.18362 | link |
2024-12-24 | Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Qi Liu et.al. | 2412.18296 | link |
2024-12-24 | MinsStudio: A Streamlined Package for Minecraft AI Agent Development | Shaofei Cai et.al. | 2412.18293 | link |
2024-12-24 | GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications | Zhenzhou Jin et.al. | 2412.18281 | null |
2024-12-24 | Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications | Yuhan Wang et.al. | 2412.18222 | null |
2024-12-24 | Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search | Thet Htar Su et.al. | 2412.18208 | null |
2024-12-24 | INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent | Haohang Li et.al. | 2412.18174 | null |
2024-12-24 | Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing | Suwesh Prasad Sah et.al. | 2412.18165 | link |
2024-12-24 | GeneSUM: Large Language Model-based Gene Summary Extraction | Zhijian Chen et.al. | 2412.18154 | null |
2024-12-24 | An Instrumental Value for Data Production and its Application to Data Pricing | Rui Ai et.al. | 2412.18140 | null |
2024-12-23 | Observation Interference in Partially Observable Assistance Games | Scott Emmons et.al. | 2412.17797 | null |
2024-12-23 | HyperQ-Opt: Q-learning for Hyperparameter Optimization | Md. Tarek Hasan et.al. | 2412.17765 | null |
2024-12-23 | Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection | Yikang Zhang et.al. | 2412.17699 | null |
2024-12-23 | EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities | Zhe Chen et.al. | 2412.17677 | link |
2024-12-23 | Dynamic safety cases for frontier AI | Carmen Cârlan et.al. | 2412.17618 | null |
2024-12-23 | PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World | Yanheng He et.al. | 2412.17589 | link |
2024-12-23 | Enhancing Cancer Diagnosis with Explainable & Trustworthy Deep Learning Models | Badaru I. Olumuyiwa et.al. | 2412.17527 | null |
2024-12-23 | DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation | Yizhe Li et.al. | 2412.17487 | null |
2024-12-23 | Collective dynamics behind success | Manuel S. Mariani et.al. | 2412.17472 | null |
2024-12-23 | The Role of XAI in Transforming Aeronautics and Aerospace Systems | Francisco Javier Cantero Zorita et.al. | 2412.17440 | null |
2024-12-23 | MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models | Beibei Yu et.al. | 2412.17339 | null |
2024-12-23 | A Dual-Perspective Metaphor Detection Framework Using Large Language Models | Yujie Lin et.al. | 2412.17332 | link |
2024-12-23 | Feature Based Methods Domain Adaptation for Object Detection: A Review Paper | Helia Mohamadi et.al. | 2412.17325 | null |
2024-12-23 | LegalAgentBench: Evaluating LLM Agents in Legal Domain | Haitao Li et.al. | 2412.17259 | link |
2024-12-23 | An Intrinsically Explainable Approach to Detecting Vertebral Compression Fractures in CT Scans via Neurosymbolic Modeling | Blanca Inigo et.al. | 2412.17258 | null |
2024-12-23 | Asymptotically Optimal Distributionally Robust Solutions through Forecasting and Operations Decentralization | Yue Lin et.al. | 2412.17257 | null |
2024-12-23 | OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Tianyi Yan et.al. | 2412.17226 | null |
2024-12-22 | Fairness in Reinforcement Learning with Bisimulation Metrics | Sahand Rezaei-Shoshtari et.al. | 2412.17123 | null |
2024-12-22 | Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Hai Ye et.al. | 2412.17061 | link |
2024-12-22 | Modular Conversational Agents for Surveys and Interviews | Jiangbo Yu et.al. | 2412.17049 | null |
2024-12-20 | Camera-Based Localization and Enhanced Normalized Mutual Information | Vishnu Teja Kunde et.al. | 2412.16137 | null |
2024-12-20 | Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli | Lucila G. Alvarez-Zuzek et.al. | 2412.16121 | null |
2024-12-20 | Convolutional Deep Operator Networks for Learning Nonlinear Focused Ultrasound Wave Propagation in Heterogeneous Spinal Cord Anatomy | Avisha Kumar et.al. | 2412.16118 | link |
2024-12-20 | Quantifying the benefit of load uncertainty reduction for the design of district energy systems under grid constraints using the Value of Information | Max Langtry et.al. | 2412.16105 | link |
2024-12-20 | Explainable AI for Multivariate Time Series Pattern Exploration: Latent Space Visual Analytics with Time Fusion Transformer and Variational Autoencoders in Power Grid Event Diagnosis | Haowen Xu et.al. | 2412.16098 | null |
2024-12-20 | On the Impact of 3D Visualization of Repository Metrics in Software Engineering Education | Dario Di Dario et.al. | 2412.16061 | null |
2024-12-20 | Segmentation of arbitrary features in very high resolution remote sensing imagery | Henry Cording et.al. | 2412.16046 | link |
2024-12-20 | Applying Predictive Analytics to Occupational Health and Safety in India | Ritwik Raj Saxena et.al. | 2412.16038 | null |
2024-12-20 | Designing Visual Explanations and Learner Controls to Engage Adolescents in AI-Supported Exercise Selection | Jeroen Ooge et.al. | 2412.16034 | null |
2024-12-20 | Simulation-based Bayesian predictive probability of success for interim monitoring of clinical trials with competing event data: two case studies | Chiara Micoli et.al. | 2412.15899 | link |
2024-12-20 | Sparse Point Clouds Assisted Learned Image Compression | Yiheng Jiang et.al. | 2412.15752 | null |
2024-12-20 | Prompt-based Unifying Inference Attack on Graph Neural Networks | Yuecen Wei et.al. | 2412.15735 | link |
2024-12-20 | Climate Impact Assessment Requires Weighting: Introducing the Weighted Climate Dataset | Marco Gortan et.al. | 2412.15699 | null |
2024-12-20 | Parameterized Complexity of (d,r)-Domination via Modular Decomposition | Gennaro Cordasco et.al. | 2412.15671 | null |
2024-12-20 | Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning | Lunjun Liu et.al. | 2412.15639 | null |
2024-12-20 | Microservices-Based Framework for Predictive Analytics and Real-time Performance Enhancement in Travel Reservation Systems | Biman Barua et.al. | 2412.15616 | null |
2024-12-20 | Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2412.15595 | null |
2024-12-20 | To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models | Jessica Y. Bo et.al. | 2412.15584 | null |
2024-12-20 | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | Zilin Huang et.al. | 2412.15544 | null |
2024-12-20 | Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models | Zhisheng Tang et.al. | 2412.15501 | null |
2024-12-19 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-19 | Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents | Jessica Woodgate et.al. | 2412.15163 | null |
2024-12-19 | Probabilistic Strategy Logic with Degrees of Observability | Chunyan Mu et.al. | 2412.15135 | null |
2024-12-19 | Measuring, Modeling, and Helping People Account for Privacy Risks in Online Self-Disclosures with AI | Isadora Krsek et.al. | 2412.15047 | null |
2024-12-19 | Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker | Davide Plozza et.al. | 2412.15000 | null |
2024-12-19 | From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model | Jerome Garnier-Brun et.al. | 2412.14996 | null |
2024-12-19 | Co-optimization of Vehicle Dynamics and Powertrain Management for Connected and Automated Electric Vehicles | Zongtan Li et.al. | 2412.14984 | null |
2024-12-19 | Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models | Zijun Chen et.al. | 2412.14660 | link |
2024-12-19 | Optimization of Collective Bayesian Decision-Making in a Swarm of Miniaturized Vibration-Sensing Robots | Thiemen Siemensma et.al. | 2412.14646 | null |
2024-12-19 | A Shapley Value Estimation Speedup for Efficient Explainable Quantum AI | Iain Burge et.al. | 2412.14639 | link |
2024-12-19 | A Model-free Biomimetics Algorithm for Deterministic Partially Observable Markov Decision Process | Yide Yu et.al. | 2412.14614 | null |
2024-12-19 | Leveraging Time Series Categorization and Temporal Fusion Transformers to Improve Cryptocurrency Price Forecasting | Arash Peik et.al. | 2412.14529 | null |
2024-12-19 | Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles | Chuang Lin et.al. | 2412.14494 | null |
2024-12-19 | Mediation Analysis for Probabilities of Causation | Yuta Kawakami et.al. | 2412.14491 | null |
2024-12-19 | VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision | Yi Xu et.al. | 2412.14446 | null |
2024-12-19 | DriveGPT: Scaling Autoregressive Behavior Models for Driving | Xin Huang et.al. | 2412.14415 | null |
2024-12-18 | In-Group Love, Out-Group Hate: A Framework to Measure Affective Polarization via Contentious Online Discussions | Buddhika Nettasinghe et.al. | 2412.14414 | null |
2024-12-18 | Uncertainty Awareness in Wireless Communications, Sensing, and Learning | Shixiong Wang et.al. | 2412.14369 | null |
2024-12-18 | On Calibration in Multi-Distribution Learning | Rajeev Verma et.al. | 2412.14142 | null |
2024-12-18 | Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts | Jihye Choi et.al. | 2412.14097 | null |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal’Col et.al. | 2412.14088 | link |
2024-12-18 | Online MDP with Transition Prototypes: A Robust Adaptive Approach | Shuo Sun et.al. | 2412.14075 | null |
2024-12-18 | A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future | Shilin Sun et.al. | 2412.14056 | link |
2024-12-18 | What If: Causal Analysis with Graph Databases | Amedeo Pachera et.al. | 2412.13965 | null |
2024-12-18 | Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Martin Kurečka et.al. | 2412.13962 | null |
2024-12-18 | A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection | Fu Wang et.al. | 2412.13913 | link |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems | Huiwen Yang et.al. | 2412.13802 | null |
2024-12-18 | Designing an LLM-Based Copilot for Manufacturing Equipment Selection | Jonas Werheid et.al. | 2412.13774 | null |
2024-12-18 | An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training | Haiming Zhang et.al. | 2412.13772 | null |
2024-12-18 | From Risk to Readiness: VR-Based Safety Training for Industrial Hazards | Gianni Vercelli et.al. | 2412.13725 | null |
2024-12-18 | Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration | Dominik Werner Wolf et.al. | 2412.13695 | null |
2024-12-18 | Exploring Multi-Modal Integration with Tool-Augmented LLM Agents for Precise Causal Discovery | ChengAo Shen et.al. | 2412.13667 | link |
2024-12-18 | 4.5 Million (Suspected) Fake Stars in GitHub: A Growing Spiral of Popularity Contests, Scams, and Malware | Hao He et.al. | 2412.13459 | link |
2024-12-18 | Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Xiaoqi An et.al. | 2412.13454 | link |
2024-12-18 | Detecting Machine-Generated Music with Explainability – A Challenge and Early Benchmarks | Yupei Li et.al. | 2412.13421 | null |
2024-12-18 | Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction | Chandra Raskoti et.al. | 2412.13419 | null |
2024-12-17 | Quantitative Predictive Monitoring and Control for Safe Human-Machine Interaction | Shuyang Dong et.al. | 2412.13365 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | A Conformal Approach to Feature-based Newsvendor under Model Misspecification | Junyu Cao et.al. | 2412.13159 | null |
2024-12-17 | Unlocking the Potential of Digital Pathology: Novel Baselines for Compression | Maximilian Fischer et.al. | 2412.13137 | null |
2024-12-17 | Previous Knowledge Utilization In Online Anytime Belief Space Planning | Michael Novitsky et.al. | 2412.13128 | link |
2024-12-17 | Active Reinforcement Learning Strategies for Offline Policy Improvement | Ambedkar Dukkipati et.al. | 2412.13106 | null |
2024-12-17 | Incremental Online Learning of Randomized Neural Network with Forward Regularization | Junda Wang et.al. | 2412.13096 | null |
2024-12-17 | SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Mátyás Vincze et.al. | 2412.13053 | link |
2024-12-17 | A New Adversarial Perspective for LiDAR-based 3D Object Detection | Shijun Zheng et.al. | 2412.13017 | null |
2024-12-17 | Strengthened and Faster Linear Approximation to Joint Chance Constraints with Wasserstein Ambiguity | Yihong Zhou et.al. | 2412.12992 | link |
2024-12-17 | Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks | Xiaxin Zhu et.al. | 2412.12843 | null |
2024-12-17 | A Survey on Recommendation Unlearning: Fundamentals, Taxonomy, Evaluation, and Open Questions | Yuyuan Li et.al. | 2412.12836 | null |
2024-12-17 | Ask for More Than Bayes Optimal: A Theory of Indecisions for Classification | Mohamed Ndaoud et.al. | 2412.12807 | null |
2024-12-17 | Open-World Panoptic Segmentation | Matteo Sodano et.al. | 2412.12740 | null |
2024-12-17 | Using LLM-Generated Draft Replies to Support Human Experts in Responding to Stakeholder Inquiries in Maritime Industry: A Real-World Case Study of Industrial AI | Tita Alissa Bach et.al. | 2412.12732 | null |
2024-12-17 | Information, entropy and the paradox of choice: A theoretical framework for understanding choice satisfaction | Mojtaba Madadi Asl et.al. | 2412.12721 | link |
2024-12-17 | MapExpert: Online HD Map Construction with Simple and Efficient Sparse Map Element Expert | Dapeng Zhang et.al. | 2412.12704 | null |
2024-12-17 | Preference Robust Ordinal Priority Approach and its Satisficing Extension for Multi-Attribute Decision-Making with Incomplete Information | Renlong Wang et.al. | 2412.12690 | null |
2024-12-17 | DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing | Mingfei Cheng et.al. | 2412.12656 | link |
2024-12-17 | Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs | Shiyu Hu et.al. | 2412.12626 | null |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Marius Belly et.al. | 2412.12063 | link |
2024-12-16 | Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps | Linfeng Zhao et.al. | 2412.12024 | null |
2024-12-16 | CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception | Senkang Hu et.al. | 2412.12000 | null |
2024-12-16 | Weak Strategyproofness in Randomized Social Choice | Felix Brandt et.al. | 2412.11977 | null |
2024-12-16 | Ensemble Learning and 3D Pix2Pix for Comprehensive Brain Tumor Analysis in Multimodal MRI | Ramy A. Zeineldin et.al. | 2412.11849 | null |
2024-12-16 | Evaluating the Efficacy of Vectocardiographic and ECG Parameters for Efficient Tertiary Cardiology Care Allocation Using Decision Tree Analysis | Lucas José da Costa et.al. | 2412.11839 | null |
2024-12-16 | But Can You Use It? Design Recommendations for Differentially Private Interactive Systems | Liudas Panavas et.al. | 2412.11794 | null |
2024-12-16 | Prediction of social dilemmas in networked populations via graph neural networks | Huaiyu Tan et.al. | 2412.11775 | null |
2024-12-16 | Point Cloud-Assisted Neural Image Compression | Ziqun Li et.al. | 2412.11771 | null |
2024-12-16 | GHIssuemarket: A Sandbox Environment for SWE-Agents Economic Experimentation | Mohamed A. Fouad et.al. | 2412.11722 | link |
2024-12-16 | Multimodal LLM for Intelligent Transportation Systems | Dexter Le et.al. | 2412.11683 | null |
2024-12-16 | NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving | Chengyue Wang et.al. | 2412.11682 | null |
2024-12-16 | DINO-Foresight Looking into the Future with DINO | Efstathios Karypidis et.al. | 2412.11673 | link |
2024-12-16 | BioBridge: Unified Bio-Embedding with Bridging Modality in Code-Switched EMR | Jangyeong Jeon et.al. | 2412.11671 | link |
2024-12-16 | A New Sampling Method Base on Sequential Tests with Fixed Sample Size Upper Limit | Dihong Huang et.al. | 2412.11651 | null |
2024-12-16 | Aligning Visual and Semantic Interpretability through Visually Grounded Concept Bottleneck Models | Patrick Knab et.al. | 2412.11576 | link |
2024-12-16 | Embodied CoT Distillation From LLM To Off-the-shelf Agents | Wonje Choi et.al. | 2412.11499 | link |
2024-12-16 | AEPHORA: AI/ML-Based Energy-Efficient Proactive Handover and Resource Allocation | Bowen Xie et.al. | 2412.11491 | null |
2024-12-16 | HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Zijian Gu et.al. | 2412.11489 | link |
2024-12-13 | GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Sicheng Zuo et.al. | 2412.10373 | link |
2024-12-13 | GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2412.10371 | link |
2024-12-13 | Trustworthy and Explainable Decision-Making for Workforce allocation | Guillaume Povéda et.al. | 2412.10272 | null |
2024-12-13 | Deep Gaussian Process Priors for Bayesian Image Reconstruction | Jonas Latz et.al. | 2412.10248 | link |
2024-12-13 | Physics Instrument Design with Reinforcement Learning | Shah Rukh Qasim et.al. | 2412.10237 | null |
2024-12-13 | Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Tobias Meggendorfer et.al. | 2412.10185 | null |
2024-12-13 | Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving | Zhihang Song et.al. | 2412.10033 | null |
2024-12-13 | WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Songyan Zhang et.al. | 2412.09951 | link |
2024-12-13 | Analyzing Fairness of Computer Vision and Natural Language Processing Models | Ahmed Rashed et.al. | 2412.09900 | null |
2024-12-13 | Analyzing Fairness of Classification Machine Learning Model with Structured Dataset | Ahmed Rashed et.al. | 2412.09896 | null |
2024-12-13 | Is it the model or the metric – On robustness measures of deeplearning models | Zhijin Lyu et.al. | 2412.09795 | null |
2024-12-13 | EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models | Hanchu Zhou et.al. | 2412.09782 | null |
2024-12-13 | A Novel Methodology in Credit Spread Prediction Based on Ensemble Learning and Feature Selection | Yu Shao et.al. | 2412.09769 | null |
2024-12-12 | Double-Exponential Increases in Inference Energy: The Cost of the Race for Accuracy | Zeyu Yang et.al. | 2412.09731 | null |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2024-12-13 | Hidden Biases of End-to-End Driving Datasets | Julian Zimmerlin et.al. | 2412.09602 | link |
2024-12-12 | Wait-Less Offline Tuning and Re-solving for Online Decision Making | Jingruo Sun et.al. | 2412.09594 | null |
2024-12-12 | A novel ML-fuzzy control system for optimizing PHEV fuel efficiency and extending electric range under diverse driving conditions | Mehrdad Raeesi et.al. | 2412.09499 | null |
2024-12-12 | Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles | Xi Lin et.al. | 2412.09466 | link |
2024-12-12 | Probabilistic digital twins for geotechnical design and construction | Dafydd Cotoarbă et.al. | 2412.09432 | null |
2024-12-12 | Slope Considered Online Nonlinear Trajectory Planning with Differential Energy Model for Autonomous Driving | Zhaofeng Tian et.al. | 2412.09424 | null |
2024-12-12 | Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer | Adam Labiosa et.al. | 2412.09417 | null |
2024-12-12 | Multimodal Sentiment Analysis based on Video and Audio Inputs | Antonio Fernandez et.al. | 2412.09317 | null |
2024-12-12 | LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation | Yijun Liu et.al. | 2412.09237 | null |
2024-12-12 | MMD-OPT : Maximum Mean Discrepancy Based Sample Efficient Collision Risk Minimization for Autonomous Driving | Basant Sharma et.al. | 2412.09121 | null |
2024-12-12 | Reconfigurable Intelligent Surface for Internet of Robotic Things | Wanli Ni et.al. | 2412.09117 | null |
2024-12-12 | Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Zhenni Bi et.al. | 2412.09078 | link |
2024-12-12 | DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving | Hao Lu et.al. | 2412.09043 | link |
2024-12-12 | Beyond forecast leaderboards: Measuring individual model importance based on contribution to ensemble accuracy | Minsu Kim et.al. | 2412.08916 | link |
2024-12-12 | AI-assisted Knowledge Discovery in Biomedical Literature to Support Decision-making in Precision Oncology | Ting He et.al. | 2412.08900 | null |
2024-12-12 | Words of War: Exploring the Presidential Rhetorical Arsenal with Deep Learning | Wyatt Scott et.al. | 2412.08868 | null |
2024-12-12 | On the Precise Asymptotics and Refined Regret of the Variance-Aware UCB Algorithm | Yuxuan Han et.al. | 2412.08843 | null |
2024-12-12 | EMATO: Energy-Model-Aware Trajectory Optimization for Autonomous Driving | Zhaofeng Tian et.al. | 2412.08830 | null |
2024-12-11 | Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Prajwal Koirala et.al. | 2412.08794 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | GenPlan: Generative sequence models as adaptive planners | Akash Karthikeyan et.al. | 2412.08565 | link |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-11 | MaestroMotif: Skill Design from Artificial Intelligence Feedback | Martin Klissarov et.al. | 2412.08542 | null |
2024-12-11 | Enhancing Interpretability Through Loss-Defined Classification Objective in Structured Latent Spaces | Daniel Geissler et.al. | 2412.08515 | null |
2024-12-11 | Detecting Conversational Mental Manipulation with Intent-Aware Prompting | Jiayuan Ma et.al. | 2412.08414 | link |
2024-12-11 | Pysical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement | Mengqi Lei et.al. | 2412.08345 | link |
2024-12-11 | Task-specific Self-body Controller Acquisition by Musculoskeletal Humanoids: Application to Pedal Control in Autonomous Driving | Kento Kawaharazuka et.al. | 2412.08270 | null |
2024-12-11 | Neural Observation Field Guided Hybrid Optimization of Camera Placement | Yihan Cao et.al. | 2412.08266 | link |
2024-12-11 | Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors | Ramy A. Zeineldin et.al. | 2412.08240 | null |
2024-12-11 | DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization | Phan Phuong Mai Chau et.al. | 2412.08196 | null |
2024-12-11 | Diversity Drives Fairness: Ensemble of Higher Order Mutants for Intersectional Fairness of Machine Learning Software | Zhenpeng Chen et.al. | 2412.08167 | null |
2024-12-11 | Learn How to Query from Unlabeled Data Streams in Federated Learning | Yuchang Sun et.al. | 2412.08138 | link |
2024-12-11 | Using Large Language Models for Parametric Shape Optimization | Xinxin Zhang et.al. | 2412.08072 | null |
2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
2024-12-11 | Quantum-Cognitive Neural Networks: Assessing Confidence and Uncertainty with Human Decision-Making Simulations | Milan Maksimovic et.al. | 2412.08010 | null |
2024-12-11 | Survey on Human-Vehicle Interactions and AI Collaboration for Optimal Decision-Making in Automated Driving | Abu Jafar Md Muzahid et.al. | 2412.08005 | null |
2024-12-11 | Accurate Prediction of Temperature Indicators in Eastern China Using a Multi-Scale CNN-LSTM-Attention model | Jiajiang Shen et.al. | 2412.07997 | null |
2024-12-10 | A Monadic Calculus with Episodic Flows | Sotirios Henning et.al. | 2412.07939 | null |
2024-12-10 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768 | null |
2024-12-10 | Predictive Modeling of Homeless Service Assignment: A Representation Learning Approach | Khandker Sadia Rahman et.al. | 2412.07747 | null |
2024-12-10 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-10 | Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Jonas Nüßlein et.al. | 2412.07686 | null |
2024-12-10 | Automating Business Intelligence Requirements with Generative AI and Semantic Search | Nimrod Busany et.al. | 2412.07668 | null |
2024-12-10 | Swarm Behavior Cloning | Jonas Nüßlein et.al. | 2412.07617 | null |
2024-12-10 | Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Jiaqi Fan et.al. | 2412.07518 | link |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-10 | A Robust Sustainability Assessment Methodology for Aircraft Parts: Application to a Fuselage Panel | Aikaterini A. Anagnostopoulou et.al. | 2412.07421 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-10 | Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency | Yael Itzhakev et.al. | 2412.07326 | null |
2024-12-10 | HARP: Hesitation-Aware Reframing in Transformer Inference Pass | Romain Storaï et.al. | 2412.07282 | link |
2024-12-10 | Human-Computer Interaction and Human-AI Collaboration in Advanced Air Mobility: A Comprehensive Review | Fatma Yamac Sagirli et.al. | 2412.07241 | null |
2024-12-10 | Epidemiological Model Calibration via Graybox Bayesian Optimization | Puhua Niu et.al. | 2412.07193 | null |
2024-12-10 | Effective Reward Specification in Deep Reinforcement Learning | Julien Roy et.al. | 2412.07177 | null |
2024-12-10 | Fast Occupancy Network | Mingjie Lu et.al. | 2412.07163 | null |
2024-12-09 | A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Yichen Li et.al. | 2412.07057 | null |
2024-12-09 | GenAI4UQ: A Software for Inverse Uncertainty Quantification Using Conditional Generative Models | Ming Fan et.al. | 2412.07026 | link |
2024-12-09 | Creating a Cooperative AI Policymaking Platform through Open Source Collaboration | Aiden Lewington et.al. | 2412.06936 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-09 | 3D Graph Attention Networks for High Fidelity Pediatric Glioma Segmentation | Harish Thangaraj et.al. | 2412.06743 | null |
2024-12-09 | Digital Transformation in the Water Distribution System based on the Digital Twins Concept | MohammadHossein Homaei et.al. | 2412.06694 | link |
2024-12-09 | Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone | Max Sobol Mark et.al. | 2412.06685 | null |
2024-12-09 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | Generalized Design of Basket Trials with P-value Combination Test | Heng Zhou et.al. | 2412.06622 | null |
2024-12-09 | Prediction of Occluded Pedestrians in Road Scenes using Human-like Reasoning: Insights from the OccluRoads Dataset | Melo Castillo Angie Nataly et.al. | 2412.06549 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2024-12-09 | Towards Civic Digital Twins: Co-Design the Citizen-Centric Future of Bologna | Massimiliano Luca et.al. | 2412.06328 | null |
2024-12-09 | World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Mingliang Zhai et.al. | 2412.06324 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Towards a Comprehensive Framework for Cyber-Incident Response Decision Support in Smart Grids | Omer Sen et.al. | 2412.06254 | null |
2024-12-09 | LLMs as Debate Partners: Utilizing Genetic Algorithms and Adversarial Search for Adaptive Arguments | Prakash Aryan et.al. | 2412.06229 | link |
2024-12-09 | Discrete-Time Distribution Steering using Monte Carlo Tree Search | Alexandros E. Tzikas et.al. | 2412.06220 | link |
2024-12-09 | Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Fei Yu et.al. | 2412.06208 | null |
2024-12-09 | Conservative Contextual Bandits: Beyond Linear Representations | Rohan Deb et.al. | 2412.06165 | null |
2024-12-09 | AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | Zonglin Meng et.al. | 2412.06142 | null |
2024-12-09 | HSDA: High-frequency Shuffle Data Augmentation for Bird’s-Eye-View Map Segmentation | Calvin Glisson et.al. | 2412.06127 | link |
2024-12-08 | Multifidelity Uncertainty Quantification for Ice Sheet Simulations | Nicole Aretz et.al. | 2412.06110 | link |
2024-12-06 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-06 | Reinforcement Learning: An Overview | Kevin Murphy et.al. | 2412.05265 | null |
2024-12-06 | Uncertainty Quantification for Transformer Models for Dark-Pattern Detection | Javier Muñoz et.al. | 2412.05251 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-06 | Explingo: Explaining AI Predictions using Large Language Models | Alexandra Zytek et.al. | 2412.05145 | link |
2024-12-06 | A Parametric, Second-Order Cone Representable Model of Fairness for Decision-Making Problems | Kaarthik Sundar et.al. | 2412.05143 | null |
2024-12-06 | Constructing optimal treatment length strategies to maximize quality-adjusted lifetimes | Hao Sun et.al. | 2412.05108 | null |
2024-12-06 | Integrating Semantic Communication and Human Decision-Making into an End-to-End Sensing-Decision Framework | Edgar Beck et.al. | 2412.05103 | null |
2024-12-06 | Backdooring Outlier Detection Methods: A Novel Attack Approach | ZeinabSadat Taghavi et.al. | 2412.05010 | null |
2024-12-06 | Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Ryota Nonomura et.al. | 2412.04937 | link |
2024-12-06 | Nonmyopic Global Optimisation via Approximate Dynamic Programming | Filippo Airaldi et.al. | 2412.04882 | link |
2024-12-06 | Self-Organizing Complex Networks with AI-Driven Adaptive Nodes for Optimized Connectivity and Energy Efficiency | Azra Seyyedi et.al. | 2412.04874 | null |
2024-12-06 | Using Machine Learning to Discover Parsimonious and Physically-Interpretable Representations of Catchment-Scale Rainfall-Runoff Dynamics | Yuan-Heng Wang et.al. | 2412.04845 | null |
2024-12-06 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-06 | Automatic Prediction of Stroke Treatment Outcomes: Latest Advances and Perspectives | Zeynel A. Samak et.al. | 2412.04812 | null |
2024-12-06 | Question Answering for Decisionmaking in Green Building Design: A Multimodal Data Reasoning Method Driven by Large Language Models | Yihui Li et.al. | 2412.04741 | null |
2024-12-05 | Multiclass Post-Earthquake Building Assessment Integrating Optical and SAR Satellite Imagery, Ground Motion, and Soil Data with Transformers | Deepank Singh et.al. | 2412.04664 | null |
2024-12-05 | Fairness-aware Principal Component Analysis for Mortality Forecasting and Annuity Pricing | Fei Huang et.al. | 2412.04663 | null |
2024-12-05 | Game-Theoretic Foundations for Cyber Resilience Against Deceptive Information Attacks in Intelligent Transportation Systems | Ya-Ting Yang et.al. | 2412.04627 | null |
2024-12-05 | Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Yuanhui Huang et.al. | 2412.04384 | link |
2024-12-05 | Sensor-Driven Predictive Vehicle Maintenance and Routing Problem with Time Windows | Iman Kazemian et.al. | 2412.04350 | null |
2024-12-05 | Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles | Ke Sun et.al. | 2412.04341 | null |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | YOLO-CCA: A Context-Based Approach for Traffic Sign Detection | Linfeng Jiang et.al. | 2412.04289 | link |
2024-12-05 | Deep Causal Inference for Point-referenced Spatial Data with Continuous Treatments | Ziyang Jiang et.al. | 2412.04285 | link |
2024-12-05 | On Extrapolation of Treatment Effects in Multiple-Cutoff Regression Discontinuity Designs | Yuta Okamoto et.al. | 2412.04265 | null |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-05 | Towards Comprehensive Legislative Requirements for Cyber Physical Systems Testing in the European Union | Guillaume Nguyen et.al. | 2412.04132 | null |
2024-12-05 | Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Xiaowen Ye et.al. | 2412.04074 | null |
2024-12-05 | AI4EF: Artificial Intelligence for Energy Efficiency in the Building Sector | Alexandros Menelaos Tzortzis et.al. | 2412.04045 | null |
2024-12-05 | Considerations Influencing Offense-Defense Dynamics From Artificial Intelligence | Giulio Corsi et.al. | 2412.04029 | null |
2024-12-05 | A Model of the Sidewalk Salsa | Olger Siebinga et.al. | 2412.04023 | null |
2024-12-05 | Computing diverse pair of solutions for tractable SAT | Tatsuya Gima et.al. | 2412.04016 | null |
2024-12-05 | Quality Control in Open-Ended Crowdsourcing: A Survey | Lei Chai et.al. | 2412.03991 | null |
2024-12-05 | UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time | Lars Schmarje et.al. | 2412.03986 | null |
2024-12-05 | Electronic Health Records-Based Data-Driven Diabetes Knowledge Unveiling and Risk Prognosis | Huadong Pang et.al. | 2412.03961 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-05 | A Unified Framework for Evaluating the Effectiveness and Enhancing the Transparency of Explainable AI Methods in Real-World Applications | Md. Ariful Islam et.al. | 2412.03884 | null |
2024-12-05 | Learning Based MPC for Autonomous Driving Using a Low Dimensional Residual Model | Yaoyu Li et.al. | 2412.03874 | null |
2024-12-04 | Streaming Detection of Queried Event Start | Cristobal Eyzaguirre et.al. | 2412.03567 | link |
2024-12-04 | FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes | Lue Fan et.al. | 2412.03566 | null |
2024-12-04 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | From Words to Workflows: Automating Business Processes | Laura Minkova et.al. | 2412.03446 | null |
2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
2024-12-04 | Governance as a complex, networked, democratic, satisfiability problem | Laurent Hébert-Dufresne et.al. | 2412.03421 | null |
2024-12-04 | Learning Semantic Association Rules from Internet of Things Data | Erkan Karabulut et.al. | 2412.03417 | link |
2024-12-04 | Risk-aware Classification via Uncertainty Quantification | Murat Sensoy et.al. | 2412.03391 | null |
2024-12-04 | AI-Driven Day-to-Day Route Choice | Leizhen Wang et.al. | 2412.03338 | link |
2024-12-04 | Are Explanations Helpful? A Comparative Analysis of Explainability Methods in Skin Lesion Classifiers | Rosa Y. G. Paccotacya-Yanque et.al. | 2412.03166 | link |
2024-12-04 | LLM-Twin: A Generated-Persona Approach for Survey Pre-Testing | Sunwoong Kim et.al. | 2412.03162 | null |
2024-12-04 | LEP-QNN: Loan Eligibility Prediction Using Quantum Neural Networks | Nouhaila Innan et.al. | 2412.03158 | null |
2024-12-04 | Hybrid deep learning-based strategy for the hepatocellular carcinoma cancer grade classification of H&E stained liver histopathology images | Ajinkya Deshpande et.al. | 2412.03084 | null |
2024-12-04 | Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Francesc Wilhelmi et.al. | 2412.03076 | null |
2024-12-04 | A Survey of Wireless Sensing Security from a Role-Based View: Victim, Weapon, and Shield | Ruixu Geng et.al. | 2412.03064 | link |
2024-12-04 | Lightweight Stochastic Video Prediction via Hybrid Warping | Kazuki Kotoyori et.al. | 2412.03061 | null |
2024-12-04 | Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies | Junchao Fan et.al. | 2412.03051 | null |
2024-12-04 | Data Acquisition for Improving Model Fairness using Reinforcement Learning | Jahid Hasan et.al. | 2412.03009 | null |
2024-12-04 | Data-driven Koopman Operator-based Prediction and Control Using Model Averaging | Daisuke Uchida et.al. | 2412.02984 | null |
2024-12-03 | Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving | Yupeng Zheng et.al. | 2412.02689 | link |
2024-12-03 | Wasserstein Markets for Differentially-Private Data | Saurab Chhachhi et.al. | 2412.02609 | link |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | link |
2024-12-03 | Semantic Tokens in Retrieval Augmented Generation | Joel Suro et.al. | 2412.02563 | null |
2024-12-03 | Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Sebastian Hirt et.al. | 2412.02423 | null |
2024-12-03 | OMENN: One Matrix to Explain Neural Networks | Adam Wróbel et.al. | 2412.02399 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Social patch foraging theory in an egalitarian group | Lisa Blum Moyse et.al. | 2412.02381 | null |
2024-12-03 | Use of surrogate endpoints in health technology assessment: a review of selected NICE technology appraisals in oncology | Lorna Wheaton et.al. | 2412.02380 | null |
2024-12-03 | Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions | Eerik Alamikkotervo et.al. | 2412.02370 | link |
2024-12-03 | Step-by-Step Guidance to Differential Anemia Diagnosis with Real-World Data and Deep Reinforcement Learning | Lillian Muyama et.al. | 2412.02273 | link |
2024-12-03 | Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Maximilian Schenke et.al. | 2412.02264 | null |
2024-12-03 | Selective Reviews of Bandit Problems in AI via a Statistical View | Pengjie Zhou et.al. | 2412.02251 | null |
2024-12-03 | An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction | Yaxin Liang et.al. | 2412.02211 | null |
2024-12-03 | DataLab: A Unifed Platform for LLM-Powered Business Intelligence | Luoxuan Weng et.al. | 2412.02205 | null |
2024-12-03 | Self-Supervised Learning-Based Path Planning and Obstacle Avoidance Using PPO and B-Splines in Unknown Environments | Shahab Shokouhi et.al. | 2412.02176 | null |
2024-12-03 | Underload: Defending against Latency Attacks for Object Detectors on Edge Devices | Tianyi Wang et.al. | 2412.02171 | null |
2024-12-03 | CausalMob: Causal Human Mobility Prediction with LLMs-derived Human Intentions toward Public Events | Xiaojie Yang et.al. | 2412.02155 | link |
2024-12-03 | Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals | Harrison Delecki et.al. | 2412.02154 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | Collective decision-making with heterogeneous biases: Role of network topology and susceptibility | Yunus Sevinchan et.al. | 2411.19829 | null |
2024-11-29 | A Multi-Loss Strategy for Vehicle Trajectory Prediction: Combining Off-Road, Diversity, and Directional Consistency Losses | Ahmad Rahimi et.al. | 2411.19747 | link |
2024-11-29 | Graph Neural Networks for Heart Failure Prediction on an EHR-Based Patient Similarity Graph | Heloisa Oss Boll et.al. | 2411.19742 | link |
2024-11-29 | The Streetscape Application Services Stack (SASS): Towards a Distributed Sensing Architecture for Urban Applications | Navid Salami Pargoo et.al. | 2411.19714 | null |
2024-11-29 | RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents | Shi Zifeng et.al. | 2411.19639 | null |
2024-11-29 | AdvFuzz: Finding More Violations Caused by the EGO Vehicle in Simulation Testing by Adversarial NPC Vehicles | You Lu et.al. | 2411.19567 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation | Yang Lv et.al. | 2411.19526 | null |
2024-11-29 | Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Tian Yu et.al. | 2411.19443 | link |
2024-11-28 | Mapping Public Perception of Artificial Intelligence: Expectations, Risk-Benefit Tradeoffs, and Value As Determinants for Societal Acceptance | Philipp Brauner et.al. | 2411.19356 | null |
2024-11-28 | UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation | Yichong Lu et.al. | 2411.19292 | null |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning | Jianming Pan et.al. | 2411.19285 | null |
2024-11-28 | On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.19274 | null |
2024-11-28 | Contrastive representations of high-dimensional, structured treatments | Oriol Corcoll Andreu et.al. | 2411.19245 | null |
2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
2024-11-28 | Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints | Pekka Malo et.al. | 2411.19193 | null |
2024-11-28 | Per-event Uncertainty Quantification for Flow Cytometry using Calibration Beads | Prajakta Bedekar et.al. | 2411.19191 | null |
2024-11-27 | Collective decision making by embodied neural agents | Nicolas Coucke et.al. | 2411.18498 | link |
2024-11-27 | Bhirkuti’s Test of Bias Acceptance: Examining in Psychometric Simulations | Aneel Bhusal et.al. | 2411.18481 | null |
2024-11-27 | An End-to-End Smart Predict-then-Optimize Framework for Vehicle Relocation Problems in Large-Scale Vehicle Crowd Sensing | Xinyu Wang et.al. | 2411.18432 | null |
2024-11-27 | Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields | Leonhard Rist et.al. | 2411.18415 | null |
2024-11-27 | Two-Timescale Digital Twin Assisted Model Interference and Retraining over Wireless Network | Jiayi Cong et.al. | 2411.18329 | null |
2024-11-27 | Learning optimal objective values for MILP | Lara Scavuzzo et.al. | 2411.18321 | link |
2024-11-27 | MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement | Xiwei Deng et.al. | 2411.18309 | null |
2024-11-27 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Visual Adversarial Attack on Vision-Language Models for Autonomous Driving | Tianyuan Zhang et.al. | 2411.18275 | null |
2024-11-27 | Dynamic Retail Pricing via Q-Learning – A Reinforcement Learning Framework for Enhanced Revenue Management | Mohit Apte et.al. | 2411.18261 | null |
2024-11-27 | From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Zizhao Li et.al. | 2411.18207 | link |
2024-11-27 | Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning | Di Zhang et.al. | 2411.18203 | null |
2024-11-27 | SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment | Jie Wang et.al. | 2411.18162 | null |
2024-11-27 | Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models | Jingming Liu et.al. | 2411.18142 | null |
2024-11-27 | Edge-Assisted Accelerated Cooperative Sensing for CAVs: Task Placement and Resource Allocation | Yuxuan Wang et.al. | 2411.18129 | null |
2024-11-27 | A Machine Learning-based Framework towards Assessment of Decision-Makers’ Biases | Wanxue Dong et.al. | 2411.18122 | null |
2024-11-27 | Large Scale Evaluation of Deep Learning-based Explainable Solar Flare Forecasting Models with Attribution-based Proximity Analysis | Temitope Adeyeha et.al. | 2411.18070 | null |
2024-11-27 | Heterogeneous Relationships of Subjects and Shapelets for Semi-supervised Multivariate Series Classification | Mingsen Du et.al. | 2411.18043 | null |
2024-11-27 | FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2411.18013 | null |
2024-11-26 | Stealthy Multi-Task Adversarial Attacks | Jiacheng Guo et.al. | 2411.17936 | null |
2024-11-26 | Explainable AI for Classifying UTI Risk Groups Using a Real-World Linked EHR and Pathology Lab Dataset | Yujie Dai et.al. | 2411.17645 | null |
2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | null |
2024-11-26 | Belief patterns with information processing | Federico Vaccari et.al. | 2411.17597 | null |
2024-11-26 | What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics | Jordan J. Bird et.al. | 2411.17593 | null |
2024-11-26 | Decision making in stochastic extensive form II: Stochastic extensive forms and games | E. Emanuel Rapsch et.al. | 2411.17587 | null |
2024-11-26 | Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence | Ross O’Driscoll et.al. | 2411.17585 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-11-26 | AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments | Haitham S. Al-Sinani et.al. | 2411.17539 | null |
2024-11-26 | HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17530 | null |
2024-11-26 | Confidence-Aware Deep Learning for Load Plan Adjustments in the Parcel Service Industry | Thomas Bruys et.al. | 2411.17502 | null |
2024-11-26 | A Graph Neural Network deep-dive into successful counterattacks | Joris Bekkers et.al. | 2411.17450 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | LHPF: Look back the History and Plan for the Future in Autonomous Driving | Sheng Wang et.al. | 2411.17253 | null |
2024-11-26 | DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance | Shahriar Soudeep et.al. | 2411.17251 | null |
2024-11-26 | Fault Localization from the Semantic Code Search Perspective | Yihao Qin et.al. | 2411.17230 | null |
2024-11-26 | Interval-based validation of a nonlinear estimator | Maël Godard et.al. | 2411.17215 | null |
2024-11-26 | Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors | Peijin Jia et.al. | 2411.17161 | null |
2024-11-26 | Fast, Precise Thompson Sampling for Bayesian Optimization | David Sweet et.al. | 2411.17071 | null |
2024-11-26 | Conformalised Conditional Normalising Flows for Joint Prediction Regions in time series | Eshant English et.al. | 2411.17042 | null |
2024-11-25 | Explainable AI Approach using Near Misses Analysis | Eran Kaufman et.al. | 2411.16895 | null |
2024-11-25 | Winning opinion: Following Your Friends’ Advice or That of Their Friends? | Francisco J. Muñoz et.al. | 2411.16671 | null |
2024-11-25 | CatNet: Effective FDR Control in LSTM with Gaussian Mirrors and SHAP Feature Importance | Jiaan Han et.al. | 2411.16666 | null |
2024-11-25 | Large Language Model-based Decision-making for COLREGs and the Control of Autonomous Surface Vehicles | Klinsmann Agyei et.al. | 2411.16587 | link |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Responsible forecasting: identifying and typifying forecasting harms | Bahman Rostami-Tabar et.al. | 2411.16531 | null |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | null |
2024-11-25 | A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation | M. M. A. Valiuddin et.al. | 2411.16370 | null |
2024-11-25 | Monocular Lane Detection Based on Deep Learning: A Survey | Xin He et.al. | 2411.16316 | link |
2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | link |
2024-11-25 | FinML-Chain: A Blockchain-Integrated Dataset for Enhanced Financial Machine Learning | Jingfeng Chen et.al. | 2411.16277 | null |
2024-11-25 | Efficient pooling of predictions via kernel embeddings | Sam Allen et.al. | 2411.16246 | null |
2024-11-25 | Interpreting Object-level Foundation Models via Visual Precision Search | Ruoyu Chen et.al. | 2411.16198 | link |
2024-11-25 | The Critical Canvas–How to regain information autonomy in the AI era | Dong Chen et.al. | 2411.16193 | null |
2024-11-25 | Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks | Zhuoyuan Yu et.al. | 2411.16134 | link |
2024-11-25 | End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning | Mahmoud M. Kishky et.al. | 2411.16131 | null |
2024-11-25 | Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion | Jongseong Bae et.al. | 2411.16129 | null |
2024-11-25 | Ensemble Learning via Knowledge Transfer for CTR Prediction | Honghao Li et.al. | 2411.16122 | link |
2024-11-25 | DP-CDA: An Algorithm for Enhanced Privacy Preservation in Dataset Synthesis Through Randomized Mixing | Utsab Saha et.al. | 2411.16121 | null |
2024-11-25 | Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks | Rui Zuo et.al. | 2411.16120 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-22 | Enhancing Autonomous Driving Safety through World Model-Based Predictive Navigation and Adaptive Learning Algorithms for 5G Wireless Applications | Hong Ding et.al. | 2411.15042 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | Optimization Strategies for Parallel Computation of Skylines | Paolo Ciaccia et.al. | 2411.14968 | null |
2024-11-22 | LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation | Zhenwei Yang et.al. | 2411.14927 | null |
2024-11-22 | Exploring Kolmogorov-Arnold Networks for Interpretable Time Series Classification | Irina Barašin et.al. | 2411.14904 | link |
2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | link |
2024-11-22 | Jovis: A Visualization Tool for PostgreSQL Query Optimizer | Yoojin Choi et.al. | 2411.14788 | null |
2024-11-22 | Resolution-Agnostic Transformer-based Climate Downscaling | Declan Curran et.al. | 2411.14774 | null |
2024-11-22 | TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior | Sen Yang et.al. | 2411.14751 | null |
2024-11-22 | Universal and Context-Independent Triggers for Precise Control of LLM Outputs | Jiashuo Liang et.al. | 2411.14738 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | Enhancing GeoAI and location encoding with spatial point pattern statistics: A Case Study of Terrain Feature Classification | Sizhe Wang et.al. | 2411.14560 | null |
2024-11-21 | Combining missing data imputation and internal validation in clinical risk prediction models | Junhui Mi et.al. | 2411.14542 | link |
2024-11-21 | GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Tianbin Li et.al. | 2411.14522 | link |
2024-11-21 | Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio et.al. | 2411.14520 | null |
2024-11-21 | Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think! | Rong Gu et.al. | 2411.14375 | null |
2024-11-21 | Formal Simulation and Visualisation of Hybrid Programs | Pedro Mendes et.al. | 2411.14365 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI | Natenaile Asmamaw Shiferaw et.al. | 2411.14254 | link |
2024-11-21 | Natural Language Reinforcement Learning | Xidong Feng et.al. | 2411.14251 | link |
2024-11-21 | Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data | Paul Fergus et.al. | 2411.14219 | null |
2024-11-21 | Grand Challenges in the Verification of Autonomous Systems | Kevin Leahy et.al. | 2411.14155 | null |
2024-11-21 | Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling | Daehoon Gwak et.al. | 2411.14042 | link |
2024-11-21 | Dual-Arm Telerobotic Platform for Robotic Hotbox Operations for Nuclear Waste Disposition in EM Sites | Joong-Ku Lee et.al. | 2411.13994 | null |
2024-11-21 | Market Making without Regret | Nicolò Cesa-Bianchi et.al. | 2411.13993 | null |
2024-11-21 | FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai et.al. | 2411.13979 | link |
2024-11-21 | Breadboarding the European Moon Rover System: discussion and results of the analogue field test campaign | Cristina Luna et.al. | 2411.13978 | null |
2024-11-21 | ICODE: Modeling Dynamical Systems with Extrinsic Input Information | Zhaoyi Li et.al. | 2411.13914 | null |
2024-11-21 | Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning | Song Jiang et.al. | 2411.13904 | null |
2024-11-21 | Trajectory Tracking Using Frenet Coordinates with Deep Deterministic Policy Gradient | Tongzhou Jiang et.al. | 2411.13885 | null |
2024-11-21 | Interactive and Expressive Code-Augmented Planning with Large Language Models | Anthony Z. Liu et.al. | 2411.13826 | null |
2024-11-21 | Dynamic spatial interaction models for a leader’s resource allocation and followers’ multiple activities | Hanbat Jeong et.al. | 2411.13810 | null |
2024-11-21 | MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Ruiyuan Gao et.al. | 2411.13807 | null |
2024-11-21 | A Survey on Adversarial Robustness of LiDAR-based Machine Learning Perception in Autonomous Vehicles | Junae Kim et.al. | 2411.13778 | null |
2024-11-20 | Exploring Large Language Models for Climate Forecasting | Yang Wang et.al. | 2411.13724 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models | Chanseo Lee et.al. | 2411.13518 | null |
2024-11-20 | Disentangling Memory and Reasoning Ability in Large Language Models | Mingyu Jin et.al. | 2411.13504 | link |
2024-11-20 | Neural machine translation of seismic waves for petrophysical inversion | José Cunha Teixeira et.al. | 2411.13491 | null |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes | Muqsit Azeem et.al. | 2411.13365 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | A Resource Efficient Fusion Network for Object Detection in Bird’s-Eye View using Camera and Raw Radar Data | Kavin Chandrasekaran et.al. | 2411.13311 | link |
2024-11-20 | A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) | Antonino Visalli et.al. | 2411.13203 | link |
2024-11-20 | Guided Object-Oriented Development | Harrie Passier et.al. | 2411.13200 | null |
2024-11-20 | Quantitative Fairness – A Framework For The Design Of Equitable Cybernetic Societies | Kevin Riehl et.al. | 2411.13184 | null |
2024-11-20 | YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Thomas Pöllabauer et.al. | 2411.13149 | link |
2024-11-20 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Zhi Luo et.al. | 2411.13116 | null |
2024-11-20 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving | Hao Zhou et.al. | 2411.13076 | null |
2024-11-20 | MEGL: Multimodal Explanation-Guided Learning | Yifei Zhang et.al. | 2411.13053 | null |
2024-11-20 | Study of Group III-V Waveguides on Sapphire Platform for Photonic Integrated Circuits | Manoj Kumar Shah et.al. | 2411.13035 | null |
2024-11-20 | Hierarchical Diffusion Policy: manipulation trajectory generation via contact guidance | Dexin Wang et.al. | 2411.12982 | link |
2024-11-20 | LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement | Siwen Jiao et.al. | 2411.12980 | null |
2024-11-19 | Dimensions of Generative AI Evaluation Design | P. Alex Dow et.al. | 2411.12709 | null |
2024-11-19 | OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data | Yiwen Lu et.al. | 2411.12674 | null |
2024-11-19 | Smart Predict-then-Optimize Method with Dependent Data: Risk Bounds and Calibration of Autoregression | Jixian Liu et.al. | 2411.12653 | null |
2024-11-19 | DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Vinay Kumar Sankarapu et.al. | 2411.12643 | link |
2024-11-19 | M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction | Luoxi Zhang et.al. | 2411.12635 | link |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-19 | Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph | Ziyang Chen et.al. | 2411.12426 | link |
2024-11-19 | A general modeling and simulation framework for dynamic vehicle routing | Markó Horváth et.al. | 2411.12406 | link |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | Could Humans Outshine AI in Visual Data Analysis? | Ratanond Koonchanok et.al. | 2411.12299 | null |
2024-11-19 | A Survey of Medical Vision-and-Language Applications and Their Techniques | Qi Chen et.al. | 2411.12195 | link |
2024-11-19 | Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines | Siyu Wang et.al. | 2411.12183 | link |
2024-11-19 | Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation | Zhuangwei Zhuang et.al. | 2411.12177 | link |
2024-11-19 | SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks | Yongyan Wen et.al. | 2411.12173 | null |
2024-11-18 | Coverage-Constrained Human-AI Cooperation with Multiple Experts | Zheng Zhang et.al. | 2411.11976 | null |
2024-11-19 | Generative World Explorer | Taiming Lu et.al. | 2411.11844 | null |
2024-11-18 | Exploring the Requirements of Clinicians for Explainable AI Decision Support Systems in Intensive Care | Jeffrey N. Clark et.al. | 2411.11774 | null |
2024-11-18 | Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang et.al. | 2411.11697 | null |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | On the Incorporation of Stability Constraints into Sequential Operational Scheduling | Wangkun Xu et.al. | 2411.11652 | null |
2024-11-18 | ST-Tree with Interpretability for Multivariate Time Series Classification | Mingsen Du et.al. | 2411.11620 | link |
2024-11-18 | VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation | Bangguo Yu et.al. | 2411.11609 | null |
2024-11-18 | Transformer networks for Heavy flavor jet tagging | A. Hammad et.al. | 2411.11519 | null |
2024-11-18 | Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Théophile Champion et.al. | 2411.11511 | null |
2024-11-18 | SignEye: Traffic Sign Interpretation from Vehicle First-Person View | Chuang Yang et.al. | 2411.11507 | null |
2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet | Marnix Suilen et.al. | 2411.11451 | null |
2024-11-18 | Deliberative XAI: How Explanations Impact Understanding and Decision-Making of AI Novices in Collective and Individual Settings | Timothée Schmude et.al. | 2411.11449 | null |
2024-11-18 | Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing | Navita Goyal et.al. | 2411.11437 | null |
2024-11-18 | Cross-Patient Pseudo Bags Generation and Curriculum Contrastive Learning for Imbalanced Multiclassification of Whole Slide Image | Yonghuang Wu et.al. | 2411.11262 | null |
2024-11-18 | DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Tianyi Yan et.al. | 2411.11252 | link |
2024-11-17 | DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images | Zhen Yuan et.al. | 2411.11190 | null |
2024-11-17 | Integrated Ising Model with global inhibition for decision making | Olga Tapinova et.al. | 2411.11143 | null |
2024-11-17 | Financial News-Driven LLM Reinforcement Learning for Portfolio Management | Ananya Unnikrishnan et.al. | 2411.11059 | null |
2024-11-15 | Emotion Detection in Reddit: Comparative Study of Machine Learning and Deep Learning Techniques | Maliheh Alaeddini et.al. | 2411.10328 | null |
2024-11-15 | Moving Forward: A Review of Autonomous Driving Software and Hardware Systems | Xu Wang et.al. | 2411.10291 | null |
2024-11-15 | From Score-Driven to Value-Sharing: Understanding Chinese Family Use of AI to Support Decision Making of College Applications | Si Chen et.al. | 2411.10280 | null |
2024-11-15 | Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Hossein Hassani et.al. | 2411.10268 | null |
2024-11-15 | Artificial Intelligence in Pediatric Echocardiography: Exploring Challenges, Opportunities, and Clinical Applications with Explainable AI and Federated Learning | Mohammed Yaseen Jabarulla et.al. | 2411.10255 | null |
2024-11-15 | Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach | Abdullah Abdullah et.al. | 2411.10254 | null |
2024-11-15 | Learning Generalizable 3D Manipulation With 10 Demonstrations | Yu Ren et.al. | 2411.10203 | link |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks | Marco Matarese et.al. | 2411.10176 | null |
2024-11-15 | Imagine-2-Drive: High-Fidelity World Modeling in CARLA for Autonomous Vehicles | Anant Garg et.al. | 2411.10171 | null |
2024-11-15 | Better Safe Than Sorry: Enhancing Arbitration Graphs for Safe and Robust Autonomous Decision-Making | Piotr Spieker et.al. | 2411.10170 | link |
2024-11-15 | Adapting the Biological SSVEP Response to Artificial Neural Networks | Emirhan Böge et.al. | 2411.10084 | null |
2024-11-15 | Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Shota Yamazaki et.al. | 2411.09971 | null |
2024-11-15 | Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving | Tian Niu et.al. | 2411.09887 | null |
2024-11-15 | Fair Secretaries with Unfair Predictions | Eric Balkanski et.al. | 2411.09854 | null |
2024-11-14 | Robustness Assessment of Static Structures for Efficient Object Handling | Philippe Nadeau et.al. | 2411.09810 | null |
2024-11-14 | Fair Resource Allocation in Weakly Coupled Markov Decision Processes | Xiaohui Tu et.al. | 2411.09804 | null |
2024-11-14 | Modular Fault Diagnosis Framework for Complex Autonomous Driving Systems | Stefan Orf et.al. | 2411.09643 | null |
2024-11-14 | The Moral Foundations Weibo Corpus | Renjie Cao et.al. | 2411.09612 | null |
2024-11-14 | Expert Study on Interpretable Machine Learning Models with Missing Data | Lena Stempfle et.al. | 2411.09591 | null |
2024-11-14 | An Approach to Twinning and Mining Collaborative Network of Construction Projects | Jia-Rui Lin et.al. | 2411.09486 | null |
2024-11-14 | Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches | Carlos J. Costa et.al. | 2411.09313 | null |
2024-11-14 | LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Zhenshi Li et.al. | 2411.09301 | link |
2024-11-14 | SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI | Spencer Giddens et.al. | 2411.09178 | link |
2024-11-14 | Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging | Bo Wang et.al. | 2411.09176 | null |
2024-11-13 | A probabilistic reduced-order modeling framework for patient-specific cardio-mechanical analysis | Robin Willems et.al. | 2411.08822 | null |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Yifei Jin et.al. | 2411.08767 | null |
2024-11-13 | Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces | Arabinda Ghosh et.al. | 2411.08754 | null |
2024-11-13 | Polymetis:Large Language Modeling for Multiple Material Domains | Chao Huang et.al. | 2411.08728 | null |
2024-11-13 | High-resolution optical and acoustic remote sensing datasets of the Puck Lagoon, Southern Baltic | Łukasz Janowski et.al. | 2411.08712 | null |
2024-11-13 | TRACE: Transformer-based Risk Assessment for Clinical Evaluation | Dionysis Christopoulos et.al. | 2411.08701 | link |
2024-11-13 | UniMat: Unifying Materials Embeddings through Multi-modal Learning | Janghoon Ock et.al. | 2411.08664 | null |
2024-11-13 | Robot See, Robot Do: Imitation Reward for Noisy Financial Environments | Sven Goluža et.al. | 2411.08637 | null |
2024-11-13 | Zero-shot capability of SAM-family models for bone segmentation in CT scans | Caroline Magg et.al. | 2411.08629 | null |
2024-11-13 | An Empirical Examination of the Evaluative AI Framework | Jaroslaw Kornowicz et.al. | 2411.08583 | null |
2024-11-13 | TimeLess: A Vision for the Next Generation of Software Development | Zeeshan Rasheed et.al. | 2411.08507 | null |
2024-11-13 | Towards Objective and Unbiased Decision Assessments with LLM-Enhanced Hierarchical Attention Networks | Junhua Liu et.al. | 2411.08504 | link |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | Learning Dynamic Cognitive Map with Autonomous Navigation | Daria de Tinguy et.al. | 2411.08447 | link |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-13 | Hybrid Vector Auto Regression and Neural Network Model for Order Flow Imbalance Prediction in High Frequency Trading | Abdul Rahman et.al. | 2411.08382 | link |
2024-11-13 | A Fuzzy Reinforcement LSTM-based Long-term Prediction Model for Fault Conditions in Nuclear Power Plants | Siwei Li et.al. | 2411.08370 | null |
2024-11-13 | How Transit Countries Become Refugee Destinations: Insights from Central and Eastern Europe | Liliana Harding et.al. | 2411.08350 | null |
2024-11-13 | TowerDebias: A Novel Debiasing Method based on the Tower Property | Norman Matloff et.al. | 2411.08297 | null |
2024-11-12 | Investigating the Effectiveness of Explainability Methods in Parkinson’s Detection from Speech | Eleonora Mancini et.al. | 2411.08013 | null |
2024-11-12 | Optimal Control of Mechanical Ventilators with Learned Respiratory Dynamics | Isaac Ronald Ward et.al. | 2411.07971 | link |
2024-11-12 | Learning Memory Mechanisms for Decision Making through Demonstrations | William Yue et.al. | 2411.07954 | link |
2024-11-12 | CryptoLLM: Unleashing the Power of Prompted LLMs for SmartQnA and Classification of Crypto Posts | Aniket Deroy et.al. | 2411.07917 | null |
2024-11-12 | Evidential time-to-event prediction model with well-calibrated uncertainty estimation | Ling Huang et.al. | 2411.07853 | null |
2024-11-12 | Impact of R&D and AI Investments on Economic Growth and Credit Rating | Davit Gondauri et.al. | 2411.07817 | null |
2024-11-12 | PatchCTG: Patch Cardiotocography Transformer for Antepartum Fetal Health Monitoring | M. Jaleed Khan et.al. | 2411.07796 | link |
2024-11-12 | ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction | Dubing Chen et.al. | 2411.07725 | link |
2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-12 | xCG: Explainable Cell Graphs for Survival Prediction in Non-Small Cell Lung Cancer | Marvin Sextro et.al. | 2411.07643 | link |
2024-11-12 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-12 | Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Raed Al Kontar et.al. | 2411.07523 | null |
2024-11-11 | Towards a criteria-based approach to selecting human-AI interaction mode | Jessica Irons et.al. | 2411.07406 | null |
2024-11-11 | Advancements in Constitutive Model Calibration: Leveraging the Power of Full-Field DIC Measurements and In-Situ Load Path Selection for Reliable Parameter Inference | Denielle Ricciardi et.al. | 2411.07310 | null |
2024-11-11 | RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration | Young-Min Cho et.al. | 2411.07161 | null |
2024-11-12 | OCMDP: Observation-Constrained Markov Decision Process | Taiyi Wang et.al. | 2411.07087 | null |
2024-11-11 | HeteroSample: Meta-path Guided Sampling for Heterogeneous Graph Representation Learning | Ao Liu et.al. | 2411.07022 | null |
2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | null |
2024-11-11 | Cancer-Answer: Empowering Cancer Care with Advanced Large Language Models | Aniket Deroy et.al. | 2411.06946 | null |
2024-11-11 | Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks | Guilherme Ramos et.al. | 2411.06880 | link |
2024-11-11 | Classification of residential and non-residential buildings based on satellite data using deep learning | Jai G Singla et.al. | 2411.06879 | null |
2024-11-11 | Multi-Modal interpretable automatic video captioning | Antoine Hanna-Asaad et.al. | 2411.06872 | null |
2024-11-11 | Learning Interpretable Network Dynamics via Universal Neural Symbolic Regression | Jiao Hu et.al. | 2411.06833 | null |
2024-11-11 | Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Aditya Soni et.al. | 2411.06815 | null |
2024-11-11 | AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Yujia Zhou et.al. | 2411.06805 | link |
2024-11-11 | Large-scale moral machine experiment on large language models | Muhammad Shahrul Zaim bin Ahmad et.al. | 2411.06790 | link |
2024-11-11 | Model Partition and Resource Allocation for Split Learning in Vehicular Edge Networks | Lu Yu et.al. | 2411.06773 | null |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-10 | SequentialSamplingModels.jl: Simulating and Evaluating Cognitive Models of Response Times in Julia | Kianté Fernandez et.al. | 2411.06631 | null |
2024-11-10 | Towards Graph Neural Network Surrogates Leveraging Mechanistic Expert Knowledge for Pandemic Response | Agatha Schmidt et.al. | 2411.06500 | null |
2024-11-10 | ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? | Canyu Chen et.al. | 2411.06469 | null |
2024-11-10 | Mastering NIM and Impartial Games with Weak Neural Networks: An AlphaZero-inspired Multi-Frame Approach | Søren Riis et.al. | 2411.06403 | null |
2024-11-10 | Local vs. Global Models for Hierarchical Forecasting | Zhao Yingjie et.al. | 2411.06394 | null |
2024-11-10 | Regret Minimization and Statistical Inference in Online Decision Making with High-dimensional Covariates | Congyuan Duan et.al. | 2411.06329 | null |
2024-11-08 | GazeSearch: Radiology Findings Search Benchmark | Trong Thang Pham et.al. | 2411.05780 | link |
2024-11-08 | Multi-armed Bandits with Missing Outcome | Ilia Mahrooghi et.al. | 2411.05661 | link |
2024-11-08 | WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making | Zhilong Zhang et.al. | 2411.05619 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential Equations | Hamed Karami et.al. | 2411.05371 | link |
2024-11-08 | Stochastic games of parental vaccination decision making and bounded rationality | Andras Balogh et.al. | 2411.05369 | null |
2024-11-08 | Agricultural Landscape Understanding At Country-Scale | Radhika Dua et.al. | 2411.05359 | null |
2024-11-08 | LLM-PySC2: Starcraft II learning environment for Large Language Models | Zongyuan Li et.al. | 2411.05348 | link |
2024-11-08 | Differentiable Calibration of Inexact Stochastic Simulation Models via Kernel Score Minimization | Ziwei Su et.al. | 2411.05315 | null |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-08 | Decoding Report Generators: A Cyclic Vision-Language Adapter for Counterfactual Explanations | Yingying Fang et.al. | 2411.05261 | null |
2024-11-07 | Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning | Inko Bovenzi et.al. | 2411.05237 | null |
2024-11-07 | Bootstrap Pettitt test for detecting change point in hydroclimatological data: a case study for Itaipu hydroelectric plant in Brazil | Luiza Chiarelli Conte et.al. | 2411.05233 | null |
2024-11-07 | AGE2HIE: Transfer Learning from Brain Age to Predicting Neurocognitive Outcome for Infant Brain Injury | Rina Bao et.al. | 2411.05188 | null |
2024-11-07 | Inverse Transition Learning: Learning Dynamics from Demonstrations | Leo Benac et.al. | 2411.05174 | null |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | Yanjun Gao et.al. | 2411.04962 | null |
2024-11-07 | Orbit: A Framework for Designing and Evaluating Multi-objective Rankers | Chenyang Yang et.al. | 2411.04798 | null |
2024-11-07 | Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Xuewen Han et.al. | 2411.04788 | link |
2024-11-07 | From CNN to ConvRNN: Adapting Visualization Techniques for Time-Series Anomaly Detection | Fabien Poirier et.al. | 2411.04707 | null |
2024-11-07 | Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning | Zhiyu Shao et.al. | 2411.04672 | link |
2024-11-07 | IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Clémence Grislain et.al. | 2411.04653 | link |
2024-11-07 | DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models | Zijian Zhang et.al. | 2411.04649 | null |
2024-11-07 | Bayesian reconstruction of sparse raster-scanned mid-infrared optoacoustic signals enables fast, label-free chemical microscopy | Constantin Berger et.al. | 2411.04648 | null |
2024-11-07 | Dynamic Detection of Relevant Objectives and Adaptation to Preference Drifts in Interactive Evolutionary Multi-Objective Optimization | Seyed Mahdi Shavarani et.al. | 2411.04547 | null |
2024-11-07 | Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity | Robby Costales et.al. | 2411.04466 | link |
2024-11-07 | GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection | Prashank Kadam et.al. | 2411.04459 | null |
2024-11-07 | Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera | Yu Hu et.al. | 2411.04413 | null |
2024-11-07 | LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Yeong-Seung Baek et.al. | 2411.04351 | null |
2024-11-07 | Survival of the Notable: Gender Asymmetry in Wikipedia Collective Deliberations | Khandaker Tasnim Huq et.al. | 2411.04340 | null |
2024-11-07 | CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models | Jierui Li et.al. | 2411.04329 | null |
2024-11-06 | Multimodal Structure-Aware Quantum Data Processing | Hala Hawashin et.al. | 2411.04242 | link |
2024-11-06 | Using Linked Micromaps for Evidence-Based Policy | Randall Powers et.al. | 2411.04211 | link |
2024-11-06 | A Capacitated Collection-and-Delivery-Point Location Problem with Random Utility Maximizing Customers | David Pinzon Ulloa et.al. | 2411.04200 | null |
2024-11-06 | A Comparative Study of Deep Reinforcement Learning for Crop Production Management | Joseph Balderas et.al. | 2411.04106 | null |
2024-11-06 | Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability | Bharat Chandra Yalavarthi et.al. | 2411.04008 | null |
2024-11-06 | Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging | Yuan Bi et.al. | 2411.04004 | link |
2024-11-06 | Fine-tuning – a Transfer Learning approach | Joseph Arul Raj et.al. | 2411.03941 | null |
2024-11-06 | A Causal Framework for Precision Rehabilitation | R. James Cotton et.al. | 2411.03919 | null |
2024-11-06 | AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Yizhe Huang et.al. | 2411.03865 | link |
2024-11-06 | A Comparative Study of Recent Large Language Models on Generating Hospital Discharge Summaries for Lung Cancer Patients | Yiming Li et.al. | 2411.03805 | null |
2024-11-06 | Navigating the landscape of multimodal AI in medicine: a scoping review on technical challenges and clinical applications | Daan Schouten et.al. | 2411.03782 | null |
2024-11-06 | Human-in-the-Loop Feature Selection Using Interpretable Kolmogorov-Arnold Network-based Double Deep Q-Network | Md Abrar Jahin et.al. | 2411.03740 | null |
2024-11-06 | Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures | Felix Tempel et.al. | 2411.03714 | link |
2024-11-06 | Generalized Trusted Multi-view Classification Framework with Hierarchical Opinion Aggregation | Long Shi et.al. | 2411.03713 | link |
2024-11-06 | Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving | Depanshu Sani et.al. | 2411.03702 | null |
2024-11-06 | OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2411.03696 | null |
2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
2024-11-06 | Evaluating Moral Beliefs across LLMs through a Pluralistic Framework | Xuelin Liu et.al. | 2411.03665 | link |
2024-11-06 | RTify: Aligning Deep Neural Networks with Human Behavioral Decisions | Yu-Ang Cheng et.al. | 2411.03630 | link |
2024-11-06 | Hiring as Exploration | Danielle Li et.al. | 2411.03616 | null |
2024-11-06 | Can Robotic Cues Manipulate Human Decisions? Exploring Consensus Building via Bias-Controlled Non-linear Opinion Dynamics and Robotic Eye Gaze Mediated Interaction in Human-Robot Teaming | Rajul Kumar et.al. | 2411.03581 | null |
2024-11-06 | Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions | Arunkumar Rathinam et.al. | 2411.03576 | null |
2024-11-05 | Digital Twin for Autonomous Surface Vessels: Enabler for Safe Maritime Navigation | Daniel Menges et.al. | 2411.03465 | null |
2024-11-05 | Causal Responsibility Attribution for Human-AI Collaboration | Yahang Qi et.al. | 2411.03275 | link |
2024-11-05 | Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI | Ruwan Wickramarachchi et.al. | 2411.03225 | null |
2024-11-05 | GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Temitope Akinboyewa et.al. | 2411.03205 | link |
2024-11-05 | Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care | Christel Sirocchi et.al. | 2411.03105 | link |
2024-11-05 | Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge | Bin Huang et.al. | 2411.02999 | null |
2024-11-05 | Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning | Yang Zhao et.al. | 2411.02983 | null |
2024-11-05 | Region-Guided Attack on the Segment Anything Model (SAM) | Xiaoliang Liu et.al. | 2411.02974 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-05 | Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery | Mohammad Kakooei et.al. | 2411.02935 | link |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | A new family of ladder operators for macroscopic systems, with applications | Fabio Bagarello et.al. | 2411.02879 | null |
2024-11-05 | Safety Verification for Evasive Collision Avoidance in Autonomous Vehicles with Enhanced Resolutions | Aliasghar Arab et.al. | 2411.02706 | null |
2024-11-04 | Geometry of naturalistic object representations in recurrent neural network models of working memory | Xiaoxuan Lei et.al. | 2411.02685 | null |
2024-11-04 | Visually Analyze SHAP Plots to Diagnose Misclassifications in ML-based Intrusion Detection | Maraz Mia et.al. | 2411.02670 | null |
2024-11-04 | Designing and Evaluating Sampling Strategies for Multiple-Forecast Visualization (MFV) | Ruishi Zou et.al. | 2411.02576 | null |
2024-11-04 | Enhancing Risk Assessment in Transformers with Loss-at-Risk Functions | Jinghan Zhang et.al. | 2411.02558 | null |
2024-11-04 | Imagining and building wise machines: The centrality of AI metacognition | Samuel G. B. Johnson et.al. | 2411.02478 | null |
2024-11-04 | Energy-Aware Dynamic Neural Inference | Marcello Bullo et.al. | 2411.02471 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | Federated GNNs for EEG-Based Stroke Assessment | Andrea Protani et.al. | 2411.02286 | null |
2024-11-04 | Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage | Eric Pilling et.al. | 2411.02211 | null |
2024-11-04 | Learning Multiple Initial Solutions to Optimization Problems | Elad Sharony et.al. | 2411.02158 | link |
2024-11-04 | Optimizing AoI at Query in Multiuser Wireless Uplink Networks: A Whittle Index Approach | Jingwei Liu et.al. | 2411.02108 | null |
2024-11-04 | Amortized Bayesian Experimental Design for Decision-Making | Daolang Huang et.al. | 2411.02064 | link |
2024-11-04 | Probability of Error Analysis for NOMA Systems in Rayleigh Fading Channels: Enabling IoT in Civil Engineering | Amr Abdelbari et.al. | 2411.01977 | null |
2024-11-04 | The Certainty Ratio $C_ρ$ : a novel metric for assessing the reliability of classifier predictions | Jesus S. Aguilar-Ruiz et.al. | 2411.01973 | null |
2024-11-04 | Advancing DeFi Analytics: Efficiency Analysis with Decentralized Exchanges Comparison Service | Evgenii Onishchuk et.al. | 2411.01950 | null |
2024-11-04 | Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy | Xinlin Wang et.al. | 2411.01928 | null |
2024-11-04 | Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations | Michael Kurenkov et.al. | 2411.01909 | null |
2024-11-04 | Towards the Industrial Metaverse: A Game-Based VR Application for Fire Drill and Evacuation Training for Ships and Shipbuilding | Musaab H. Hamed-Ahmed et.al. | 2411.01895 | null |
2024-11-04 | Causal Discovery and Classification Using Lempel-Ziv Complexity | Dhruthi et.al. | 2411.01881 | link |
2024-11-04 | Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification | Kapilan Balagopalan et.al. | 2411.01808 | null |
2024-11-03 | Nash equilibria in four-strategy quantum game extensions of the Prisoner’s Dilemma | Piotr Frąckiewicz et.al. | 2411.01711 | null |
2024-11-03 | Understanding the decision-making process of choice modellers | Gabriel Nova et.al. | 2411.01704 | null |
2024-11-03 | Co-clustering for Federated Recommender System | Xinrui He et.al. | 2411.01690 | link |
2024-11-03 | ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Salman Khan et.al. | 2411.01683 | link |
2024-11-03 | Autoformulation of Mathematical Optimization Models Using LLMs | Nicolás Astorga et.al. | 2411.01679 | null |
2024-11-03 | Know Where You’re Uncertain When Planning with Multimodal Foundation Models: A Formal Framework | Neel P. Bhatt et.al. | 2411.01639 | null |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Attention is All You Need to Optimize Wind Farm Operations and Maintenance | Iman Kazemian et.al. | 2410.24052 | null |
2024-10-31 | Representative Social Choice: From Learning Theory to AI Alignment | Tianyi Qiu et.al. | 2410.23953 | null |
2024-10-31 | Responsible Retrieval Augmented Generation for Climate Decision Making from Documents | Matyas Juhasz et.al. | 2410.23902 | null |
2024-10-31 | Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Liyi Chen et.al. | 2410.23875 | link |
2024-10-31 | Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map | Xinyuan Chang et.al. | 2410.23780 | null |
2024-10-31 | Features characterizing safe aerial-aquatic robots | Andrea Giordano et.al. | 2410.23722 | null |
2024-10-31 | Automatically Learning Hybrid Digital Twins of Dynamical Systems | Samuel Holt et.al. | 2410.23691 | link |
2024-10-31 | Coach Reservation for Groups Requests | Carlos H. Cardonha et.al. | 2410.23542 | null |
2024-10-30 | Development and Comparative Analysis of Machine Learning Models for Hypoxemia Severity Triage in CBRNE Emergency Scenarios Using Physiological and Demographic Data from Medical-Grade Devices | Santino Nanini et.al. | 2410.23503 | null |
2024-10-30 | Venire: A Machine Learning-Guided Panel Review System for Community Content Moderation | Vinay Koshy et.al. | 2410.23448 | null |
2024-10-30 | Estimating Neural Network Robustness via Lipschitz Constant and Architecture Sensitivity | Abulikemu Abuduweili et.al. | 2410.23382 | null |
2024-10-30 | OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Hongbo Zhao et.al. | 2410.23278 | null |
2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-31 | Enhancing Autonomous Driving Safety Analysis with Generative AI: A Comparative Study on Automated Hazard and Risk Assessment | Alireza Abbaspour et.al. | 2410.23207 | null |
2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback | Qinqing Zheng et.al. | 2410.23022 | link |
2024-10-31 | DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data | Hanyang Chen et.al. | 2410.22938 | link |
2024-11-01 | Multi-Agent Large Language Models for Conversational Task-Solving | Jonas Becker et.al. | 2410.22932 | null |
2024-10-30 | Self-optimization in distributed manufacturing systems using Modular State-based Stackelberg Games | Steve Yuwono et.al. | 2410.22912 | null |
2024-10-30 | YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif et.al. | 2410.22898 | null |
2024-10-30 | A Graph-Based Model for Vehicle-Centric Data Sharing Ecosystem | Haiyue Yuan et.al. | 2410.22897 | null |
2024-10-30 | Reliability Assessment of Information Sources Based on Random Permutation Set | Juntao Xu et.al. | 2410.22772 | null |
2024-10-30 | Self-Driving Car Racing: Application of Deep Reinforcement Learning | Florentiana Yuwono et.al. | 2410.22766 | null |
2024-10-30 | A Game-Theoretic Approach for Security Control Selection | Dylan Léveillé et.al. | 2410.22762 | null |
2024-10-30 | SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh et.al. | 2410.22752 | null |
2024-10-30 | Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets | Andoni Cortés et.al. | 2410.22748 | null |
2024-10-30 | Clustering Computer Mouse Tracking Data with Informed Hierarchical Shrinkage Partition Priors | Ziyi Song et.al. | 2410.22675 | link |
2024-10-30 | CoGS: Model Agnostic Causality Constrained Counterfactual Explanations using goal-directed ASP | Sopam Dasgupta et.al. | 2410.22615 | null |
2024-10-29 | Pre-Trained Vision Models as Perception Backbones for Safety Filters in Autonomous Driving | Yuxuan Yang et.al. | 2410.22585 | null |
2024-10-29 | Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Jaekyeom Kim et.al. | 2410.22552 | null |
2024-10-29 | An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion | Minghao Ning et.al. | 2410.22314 | link |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-29 | Fourier Head: Helping Large Language Models Learn Complex Probability Distributions | Nate Gillman et.al. | 2410.22269 | null |
2024-10-29 | MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation | Ovais Iqbal Shah et.al. | 2410.22223 | null |
2024-10-29 | Democratizing Reward Design for Personal and Representative Value-Alignment | Carter Blair et.al. | 2410.22203 | null |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-10-29 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | link |
2024-10-29 | Markov Stochastic Choice | Kremena Valkanova et.al. | 2410.22001 | null |
2024-10-29 | ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Yuetao Li et.al. | 2410.21955 | link |
2024-10-29 | On the Robustness of Adversarial Training Against Uncertainty Attacks | Emanuele Ledda et.al. | 2410.21952 | link |
2024-10-29 | Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation | Halil Utku Unlu et.al. | 2410.21926 | null |
2024-10-29 | Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation | Hong-fu Chou et.al. | 2410.21916 | null |
2024-10-29 | Bayesian Stability Selection and Inference on Inclusion Probabilities | Mahdi Nouraie et.al. | 2410.21914 | link |
2024-10-29 | Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms | Feifei Zhao et.al. | 2410.21882 | null |
2024-10-29 | Enhanced Survival Prediction in Head and Neck Cancer Using Convolutional Block Attention and Multimodal Data Fusion | Aiman Farooq et.al. | 2410.21831 | null |
2024-10-30 | First-in-human spinal cord tumor imaging with fast adaptive focus tracking robotic-OCT | Bin He et.al. | 2410.21809 | null |
2024-10-29 | SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Yubin Hu et.al. | 2410.21739 | null |
2024-10-30 | Enhancing Safety and Robustness of Vision-Based Controllers via Reachability Analysis | Kaustav Chakraborty et.al. | 2410.21736 | null |
2024-10-28 | Adaptive Self-Calibration for Minimalistic Collective Perception by Imperfect Robot Swarms | Khai Yi Chin et.al. | 2410.21546 | link |
2024-10-28 | Bayesian Regression for Predicting Subscription to Bank Term Deposits in Direct Marketing Campaigns | Muhammad Farhan Tanvir et.al. | 2410.21539 | null |
2024-10-28 | Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Xiang Wei et.al. | 2410.21240 | null |
2024-10-28 | Belief in the Machine: Investigating Epistemological Blind Spots of Language Models | Mirac Suzgun et.al. | 2410.21195 | link |
2024-10-28 | Towards Human-centered Design of Explainable Artificial Intelligence (XAI): A Survey of Empirical Studies | Shuai Ma et.al. | 2410.21183 | null |
2024-10-28 | coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM | Emiliano Höss et.al. | 2410.21149 | link |
2024-10-28 | Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments | Marharyta Domnich et.al. | 2410.21131 | link |
2024-10-28 | CloudHeatMap: Heatmap-Based Monitoring for Large-Scale Cloud Systems | Sarah Sohana et.al. | 2410.21092 | link |
2024-10-28 | Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving | Jiyao Wang et.al. | 2410.21086 | null |
2024-10-28 | Exploring the Reliability of Foundation Model-Based Frontier Selection in Zero-Shot Object Goal Navigation | Shuaihang Yuan et.al. | 2410.21037 | null |
2024-10-28 | Edge Perception: Intelligent Wireless Sensing at Network Edge | Yuanhao Cui et.al. | 2410.21017 | null |
2024-10-28 | A Review of Graph-Powered Data Quality Applications for IoT Monitoring Sensor Networks | Pau Ferrer-Cid et.al. | 2410.21006 | null |
2024-10-28 | Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering | Zhilin Zhang et.al. | 2410.21000 | null |
2024-10-28 | BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment | Mehdi Hosseinzadeh et.al. | 2410.20969 | null |
2024-10-28 | Active Legibility in Multiagent Reinforcement Learning | Yanyu Liu et.al. | 2410.20954 | null |
2024-10-28 | On Spatio-Temporal Stochastic Frontier Models | Elisa Fusco et.al. | 2410.20915 | null |
2024-10-28 | Explainability in AI Based Applications: A Framework for Comparing Different Techniques | Arne Grobrugge et.al. | 2410.20873 | null |
2024-10-28 | Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation | Jaechang Kim et.al. | 2410.20811 | null |
2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et.al. | 2410.20790 | null |
2024-10-27 | Language Models And A Second Opinion Use Case: The Pocket Professional | David Noever et.al. | 2410.20636 | null |
2024-10-27 | Toward Conditional Distribution Calibration in Survival Prediction | Shi-ang Qi et.al. | 2410.20579 | link |
2024-10-27 | Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations | Eduardo C. Garrido-Merchán et.al. | 2410.20550 | link |
2024-10-25 | Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | Yinglun Xu et.al. | 2410.19705 | null |
2024-10-25 | Optimizing Hearthstone Agents using an Evolutionary Algorithm | Pablo García-Sánchez et.al. | 2410.19681 | link |
2024-10-25 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Multi-modal Motion Prediction using Temporal Ensembling with Learning-based Aggregation | Kai-Yin Hong et.al. | 2410.19606 | null |
2024-10-25 | AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design | Francisco Erivaldo Fernandes Junior et.al. | 2410.19528 | link |
2024-10-25 | COR-MP: Conservation of Resources Model for Maneuver Planning | Karim Essalmi et.al. | 2410.19510 | null |
2024-10-25 | Robust Time Series Causal Discovery for Agent-Based Model Validation | Gene Yu et.al. | 2410.19412 | null |
2024-10-25 | Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Reachsak Ly et.al. | 2410.19262 | null |
2024-10-25 | Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models | Shuchen Meng et.al. | 2410.19241 | null |
2024-10-25 | Designing LLM-Agents with Personalities: A Psychometric Approach | Muhua Huang et.al. | 2410.19238 | null |
2024-10-24 | Context-Aware Trajectory Anomaly Detection | Haoji Hu et.al. | 2410.19136 | null |
2024-10-24 | Learning to Look: Seeking Information for Decision Making via Policy Factorization | Shivin Dass et.al. | 2410.18964 | null |
2024-10-24 | Context is Key: A Benchmark for Forecasting with Essential Textual Information | Andrew Robert Williams et.al. | 2410.18959 | link |
2024-10-24 | From Efficiency to Equity: Measuring Fairness in Preference Learning | Shreeyash Gowaikar et.al. | 2410.18841 | null |
2024-10-24 | Large Generative AI Models meet Open Networks for 6G: Integration, Platform, and Monetization | Peizheng Li et.al. | 2410.18790 | null |
2024-10-24 | A Joint Representation Using Continuous and Discrete Features for Cardiovascular Diseases Risk Prediction on Chest CT Scans | Minfeng Xu et.al. | 2410.18610 | link |
2024-10-24 | Learning Transparent Reward Models via Unsupervised Feature Selection | Daulet Baimukashev et.al. | 2410.18608 | null |
2024-10-24 | Aligning CodeLLMs with Direct Preference Optimization | Yibo Miao et.al. | 2410.18585 | null |
2024-10-24 | Resilience-based post disaster recovery optimization for infrastructure system via Deep Reinforcement Learning | Huangbin Liang et.al. | 2410.18577 | null |
2024-10-24 | Zero-shot Object Navigation with Vision-Language Models Reasoning | Congcong Wen et.al. | 2410.18570 | null |
2024-10-24 | Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning | Lachlan Mares et.al. | 2410.18462 | null |
2024-10-23 | Augmenting Training Data with Vector-Quantized Variational Autoencoder for Classifying RF Signals | Srihari Kamesh Kompella et.al. | 2410.18283 | null |
2024-10-23 | Real-Time Integrated Learning and Decision-Making for Asset Networks | Peter Verleijsdonk et.al. | 2410.18246 | null |
2024-10-23 | Characterising Open Source Co-opetition in Company-hosted Open Source Software Projects: The Cases of PyTorch, TensorFlow, and Transformers | Cailean Osborne et.al. | 2410.18241 | null |
2024-10-23 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | link |
2024-10-23 | WorldSimBench: Towards Video Generation Models as World Simulators | Yiran Qin et.al. | 2410.18072 | null |
2024-10-25 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-23 | Lightweight Neural App Control | Filippos Christianos et.al. | 2410.17883 | null |
2024-10-23 | Identifiable Representation and Model Learning for Latent Dynamic Systems | Congxi Zhang et.al. | 2410.17882 | null |
2024-10-23 | ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting | Shaofei Cai et.al. | 2410.17856 | link |
2024-10-23 | Exploiting Text-Image Latent Spaces for the Description of Visual Concepts | Laines Schmalwasser et.al. | 2410.17832 | null |
2024-10-23 | PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation | Feiyan Feng et.al. | 2410.17812 | null |
2024-10-23 | e-Values for Real-Time Residential Electricity Demand Forecast Model Selection | Fabian Backhaus et.al. | 2410.17800 | null |
2024-10-23 | Pointer: An Energy-Efficient ReRAM-based Point Cloud Recognition Accelerator with Inter-layer and Intra-layer Optimizations | Qijun Zhang et.al. | 2410.17782 | null |
2024-10-23 | Learning Versatile Skills with Curriculum Masking | Yao Tang et.al. | 2410.17744 | link |
2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et.al. | 2410.17734 | null |
2024-10-23 | Longitudinal Causal Image Synthesis | Yujia Li et.al. | 2410.17691 | link |
2024-10-23 | Integrating Large Language Models for UAV Control in Simulated Environments: A Modular Interaction Approach | Abhishek Phadke et.al. | 2410.17602 | null |
2024-10-23 | Predicting Company Growth by Econophysics informed Machine Learning | Ruyi Tao et.al. | 2410.17587 | null |
2024-10-23 | Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads | Xinwen Zhu et.al. | 2410.17576 | link |
2024-10-23 | Bridging Swarm Intelligence and Reinforcement Learning | Karthik Soma et.al. | 2410.17517 | null |
2024-10-23 | Detecting fake review buyers using network structure: Direct evidence from Amazon | Sherry He et.al. | 2410.17507 | null |
2024-10-23 | Learning Fair and Preferable Allocations through Neural Network | Ryota Maruo et.al. | 2410.17500 | null |
2024-10-22 | Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Dongsu Lee et.al. | 2410.17373 | null |
2024-10-22 | Literature Meets Data: A Synergistic Approach to Hypothesis Generation | Haokun Liu et.al. | 2410.17309 | link |
2024-10-22 | Hierarchical Upper Confidence Bounds for Constrained Online Learning | Ali Baheri et.al. | 2410.17216 | null |
2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et.al. | 2410.17144 | null |
2024-10-22 | Trustworthy XAI and Application | MD Abdullah Al Nasim et.al. | 2410.17139 | null |
2024-10-22 | Impact of Cognitive Dissonance on Social Hysteresis: Insights fromthe Expressed and Private Opinions Model | Kamińska Barbara et.al. | 2410.16934 | null |
2024-10-22 | EnvBridge: Bridging Diverse Environments with Cross-Environment Knowledge Transfer for Embodied AI | Tomoyuki Kagaya et.al. | 2410.16919 | null |
2024-10-22 | Distribution of Responsibility During the Usage of AI-Based Exoskeletons for Upper Limb Rehabilitation | Huaxi et.al. | 2410.16887 | null |
2024-10-22 | Contrasting Attitudes Towards Current and Future AI Applications for Computerised Interpretation of ECG: A Clinical Stakeholder Interview Study | Lukas Hughes-Noehrer et.al. | 2410.16879 | null |
2024-10-22 | Pedestrian motion prediction evaluation for urban autonomous driving | Dmytro Zabolotnii et.al. | 2410.16864 | link |
2024-10-22 | Dynamic graph neural networks for enhanced volatility prediction in financial markets | Pulikandala Nithish Kumar et.al. | 2410.16858 | null |
2024-10-22 | Safe Load Balancing in Software-Defined-Networking | Lam Dinh et.al. | 2410.16846 | null |
2024-10-22 | Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization | Sindhu Nair et.al. | 2410.16842 | null |
2024-10-22 | SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition | Jiaqi Chen et.al. | 2410.16746 | link |
2024-10-22 | Efficient Scheduling of Vehicular Tasks on Edge Systems with Green Energy and Battery Storage | Suvarthi Sarkar et.al. | 2410.16724 | null |
2024-10-22 | Resource-Efficient Sensor Fusion via System-Wide Dynamic Gated Neural Networks | Chetna Singhal et.al. | 2410.16723 | null |
2024-10-22 | SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments | Jumman Hossain et.al. | 2410.16686 | null |
2024-10-22 | Improving Causal Reasoning in Large Language Models: A Survey | Siheng Xiong et.al. | 2410.16676 | link |
2024-10-22 | Convex Markov Games: A Framework for Fairness, Imitation, and Creativity in Multi-Agent Learning | Ian Gemp et.al. | 2410.16600 | null |
2024-10-22 | Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models | Hongcheng Ding et.al. | 2410.16589 | null |
2024-10-21 | How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making? | Kenza Benkirane et.al. | 2410.16574 | link |
2024-10-21 | Raising the Stakes: Performance Pressure Improves AI-Assisted Decision Making | Nikita Haduong et.al. | 2410.16560 | null |
2024-10-21 | Reflection-Bench: probing AI intelligence with reflection | Lingyu Li et.al. | 2410.16270 | link |
2024-10-22 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving | Alexander Krentsel et.al. | 2410.16227 | null |
2024-10-21 | CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking | Nishat Raihan et.al. | 2410.16211 | null |
2024-10-22 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency | Aidan Boyd et.al. | 2410.16115 | null |
2024-10-21 | Fine-Tuning LLMs for Reliable Medical Question-Answering Services | Ali Anaissi et.al. | 2410.16088 | null |
2024-10-21 | A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Yue Deng et.al. | 2410.16024 | link |
2024-10-21 | Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization | Yugandhar Reddy Gogireddy et.al. | 2410.15962 | null |
2024-10-21 | Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles | Zhengming Wang et.al. | 2410.15912 | link |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-21 | Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images | Yiming Li et.al. | 2410.15879 | null |
2024-10-21 | WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction | Heng Zhai et.al. | 2410.15792 | null |
2024-10-21 | High-Fidelity Transfer of Functional Priors for Wide Bayesian Neural Networks by Learning Activations | Marcin Sendera et.al. | 2410.15777 | link |
2024-10-21 | Generalizing Motion Planners with Mixture of Experts for Autonomous Driving | Qiao Sun et.al. | 2410.15774 | link |
2024-10-21 | Solving Sparse \& High-Dimensional-Output Regression via Compression | Renyuan Li et.al. | 2410.15762 | null |
2024-10-21 | Learning-to-Defer for Extractive Question Answering | Montreuil Yannis et.al. | 2410.15761 | null |
2024-10-21 | SPARC: Prediction-Based Safe Control for Coupled Controllable and Uncontrollable Agents with Conformal Predictions | Shuqi Wang et.al. | 2410.15660 | null |
2024-10-21 | How to Find the Exact Pareto Front for Multi-Objective MDPs? | Yining Li et.al. | 2410.15557 | null |
2024-10-21 | A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM | ByungOk Han et.al. | 2410.15549 | null |
2024-10-18 | Enhancing AI Accessibility in Veterinary Medicine: Linking Classifiers and Electronic Health Records | Chun Yin Kong et.al. | 2410.14625 | null |
2024-10-18 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et.al. | 2410.14612 | null |
2024-10-18 | Towards Unsupervised Validation of Anomaly-Detection Models | Lihi Idan et.al. | 2410.14579 | null |
2024-10-18 | Spectral Representations for Accurate Causal Uncertainty Quantification with Gaussian Processes | Hugh Dance et.al. | 2410.14483 | null |
2024-10-18 | From Simple to Complex: Knowledge Transfer in Safe and Efficient Reinforcement Learning for Autonomous Driving | Rongliang Zhou et.al. | 2410.14468 | null |
2024-10-18 | Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning | Boning Zhang et.al. | 2410.14390 | null |
2024-10-18 | A Model Checker for Natural Strategic Ability | Marco Aruta et.al. | 2410.14374 | null |
2024-10-18 | Assistive AI for Augmenting Human Decision-making | Natabara Máté Gyöngyössy et.al. | 2410.14353 | null |
2024-10-18 | Continuous models combining slacks-based measures of efficiency and super-efficiency | Vicente J. Bolos et.al. | 2410.14303 | null |
2024-10-18 | Optimizing Collaborative Robotics since Pre-Deployment via Cyber-Physical Systems’ Digital Twins | Christian Cella et.al. | 2410.14298 | null |
2024-10-18 | Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning | Gaurav Arora et.al. | 2410.14235 | null |
2024-10-18 | LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs | Yujun Zhou et.al. | 2410.14182 | null |
2024-10-18 | XForecast: Evaluating Natural Language Explanations for Time Series Forecasting | Taha Aksu et.al. | 2410.14180 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | Auto Detecting Cognitive Events Using Machine Learning on Pupillary Data | Quang Dang et.al. | 2410.14174 | null |
2024-10-17 | Interpreting Inflammation Prediction Model via Tag-based Cohort Explanation | Fanyu Meng et.al. | 2410.14082 | null |
2024-10-17 | Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning | Bryan L. M. de Oliveira et.al. | 2410.14038 | link |
2024-10-17 | Recurrent Neural Goodness-of-Fit Test for Time Series | Aoran Zhang et.al. | 2410.13986 | link |
2024-10-17 | FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline | Kuldeep Singh et.al. | 2410.13959 | null |
2024-10-17 | Identifying High Consideration E-Commerce Search Queries | Zhiyu Chen et.al. | 2410.13951 | null |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864 | link |
2024-10-17 | MobA: A Two-Level Agent System for Efficient Mobile Task Automation | Zichen Zhu et.al. | 2410.13757 | link |
2024-10-17 | Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores | Minxing Zheng et.al. | 2410.13735 | null |
2024-10-17 | The Subtlety of Optimal Paternalism in a Population with Bounded Rationality | Charles F. Manski et.al. | 2410.13658 | null |
2024-10-17 | A Sequential Game Framework for Target Tracking | Daniel Leal et.al. | 2410.13587 | null |
2024-10-17 | Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation | Kuan-Ying Lee et.al. | 2410.13585 | null |
2024-10-17 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | RGB to Hyperspectral: Spectral Reconstruction for Enhanced Surgical Imaging | Tobias Czempiel et.al. | 2410.13570 | null |
2024-10-17 | Interactive Navigation with Adaptive Non-prehensile Mobile Manipulation | Cunxi Dai et.al. | 2410.13418 | null |
2024-10-17 | Accurate Checkerboard Corner Detection under Defoucs | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-17 | Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval | Ingeol Baek et.al. | 2410.13339 | null |
2024-10-17 | Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning | Minseok Choi et.al. | 2410.13274 | null |
2024-10-17 | FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling | Jintao Zhang et.al. | 2410.13253 | link |
2024-10-17 | Annealed Stein Variational Gradient Descent for Improved Uncertainty Estimation in Full-Waveform Inversion | Miguel Corrales et.al. | 2410.13249 | link |
2024-10-17 | Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation | Hyungjoo Chae et.al. | 2410.13232 | link |
2024-10-17 | LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch | Caigao Jiang et.al. | 2410.13213 | link |
2024-10-17 | Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations | Aryan Shrivastava et.al. | 2410.13204 | link |
2024-10-16 | Future of Algorithmic Organization: Large-Scale Analysis of Decentralized Autonomous Organizations (DAOs) | Tanusree Sharma et.al. | 2410.13095 | null |
2024-10-16 | Double-Bayesian Learning | Stefan Jaeger et.al. | 2410.12984 | null |
2024-10-16 | Multi-modal graph neural networks for localized off-grid weather forecasting | Qidong Yang et.al. | 2410.12938 | link |
2024-10-16 | Machine Learning-Augmented Ontology-Based Data Access for Renewable Energy Data | Marco Calautti et.al. | 2410.12734 | null |
2024-10-16 | Best-Worst Disaggregation: An approach to the preference disaggregation problem | Matteo Brunelli et.al. | 2410.12678 | null |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-16 | Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control | Koen de Vos et.al. | 2410.12651 | null |
2024-10-16 | Rethinking Visual Counterfactual Explanations Through Region Constraint | Bartlomiej Sobieski et.al. | 2410.12591 | link |
2024-10-16 | Self-DenseMobileNet: A Robust Framework for Lung Nodule Classification using Self-ONN and Stacking-based Meta-Classifier | Md. Sohanur Rahman et.al. | 2410.12584 | null |
2024-10-16 | STRUX: An LLM for Decision-Making with Structured Explanations | Yiming Lu et.al. | 2410.12583 | null |
2024-10-16 | Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Sihao Wu et.al. | 2410.12568 | null |
2024-10-16 | Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Stelios Triantafyllou et.al. | 2410.12539 | link |
2024-10-16 | Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL | Jared Joselowitz et.al. | 2410.12491 | null |
2024-10-16 | SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Loris Gaven et.al. | 2410.12481 | null |
2024-10-16 | ConLUX: Concept-Based Local Unified Explanations | Junhao Liu et.al. | 2410.12439 | null |
2024-10-16 | Conformity in Large Language Models | Xiaochen Zhu et.al. | 2410.12428 | null |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance | Yaxi Lu et.al. | 2410.12361 | link |
2024-10-16 | TPFL: A Trustworthy Personalized Federated Learning Framework via Subjective Logic | Jinqian Chen et.al. | 2410.12316 | null |
2024-10-16 | Consistency Calibration: Improving Uncertainty Calibration via Consistency among Perturbed Neighbors | Linwei Tao et.al. | 2410.12295 | null |
2024-10-16 | Implementation of EMR System in Indonesian Health Facilities: Benefits and Constraints | Rasyid Juliansyah et.al. | 2410.12226 | null |
2024-10-16 | Sparse Prototype Network for Explainable Pedestrian Behavior Prediction | Yan Feng et.al. | 2410.12195 | link |
2024-10-16 | ExoTST: Exogenous-Aware Temporal Sequence Transformer for Time Series Prediction | Kshitij Tayal et.al. | 2410.12184 | null |
2024-10-15 | Technical Report of 1:10 Scale Autonomous Vehicle Robot | Amirhossein Kheiri Holighi et.al. | 2410.11746 | null |
2024-10-15 | MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models | Pei Wang et.al. | 2410.11710 | link |
2024-10-15 | Fully-discrete provably Lyapunov consistent discretizations for convection-diffusion-reaction PDE systems | Rasha Al Jahdali et.al. | 2410.11669 | null |
2024-10-15 | Black-box Uncertainty Quantification Method for LLM-as-a-Judge | Nico Wagner et.al. | 2410.11594 | null |
2024-10-15 | A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction | Zhouheng Li et.al. | 2410.11570 | link |
2024-10-15 | Effect modification and non-collapsibility leads to conflicting treatment decisions: a review of marginal and conditional estimands and recommendations for decision-making | David M. Phillippo et.al. | 2410.11438 | null |
2024-10-15 | DODT: Enhanced Online Decision Transformer Learning through Dreamer’s Actor-Critic Trajectory Forecasting | Eric Hanchen Jiang et.al. | 2410.11359 | null |
2024-10-15 | DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Jaehyun Park et.al. | 2410.11338 | null |
2024-10-15 | Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Yunho Kim et.al. | 2410.11324 | null |
2024-10-15 | Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Gaoyang Pang et.al. | 2410.11316 | null |
2024-10-15 | Process Reward Model with Q-Value Rankings | Wendi Li et.al. | 2410.11287 | link |
2024-10-15 | Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Jiayu Chen et.al. | 2410.11234 | link |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-14 | Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts | Sharon Levy et.al. | 2410.11084 | link |
2024-10-14 | SGUQ: Staged Graph Convolution Neural Network for Alzheimer’s Disease Diagnosis using Multi-Omics Data | Liang Tao et.al. | 2410.11046 | link |
2024-10-14 | Persistent Topological Features in Large Language Models | Yuri Gardinazzi et.al. | 2410.11042 | link |
2024-10-14 | ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | Jing Liang et.al. | 2410.11019 | null |
2024-10-14 | 6G RIS-aided Single-LEO Localization with Slow and Fast Doppler Effects | Sharief Saleh et.al. | 2410.11010 | null |
2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | link |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null |
2024-10-14 | Towards Calibrated Losses for Adversarial Robust Reject Option Classification | Vrund Shah et.al. | 2410.10736 | link |
2024-10-14 | Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems | Ran Wei et.al. | 2410.10653 | null |
2024-10-14 | Echo State Networks for Spatio-Temporal Area-Level Data | Zhenhua Wang et.al. | 2410.10641 | null |
2024-10-14 | Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty | John Mern et.al. | 2410.10610 | null |
2024-10-14 | Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Juan Sebastian Rojas et.al. | 2410.10578 | null |
2024-10-14 | Words to Wheels: Vision-Based Autonomous Driving Understanding Human Language Instructions Using Foundation Models | Chanhoe Ryu et.al. | 2410.10577 | null |
2024-10-14 | When Precedents Clash | Cecilia Di Florio et.al. | 2410.10567 | null |
2024-10-14 | Graph Classification Gaussian Processes via Hodgelet Spectral Features | Mathieu Alain et.al. | 2410.10546 | null |
2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | link |
2024-10-15 | Ada-K Routing: Boosting the Efficiency of MoE-based LLMs | Tongtian Yue et.al. | 2410.10456 | null |
2024-10-14 | QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader et.al. | 2410.10449 | null |
2024-10-14 | In-Materia Speech Recognition | Mohamadreza Zolfagharinejad et.al. | 2410.10434 | null |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-15 | Improved Depth Estimation of Bayesian Neural Networks | Bart van Erp et.al. | 2410.10395 | link |
2024-10-14 | MentalGLM Series: Explainable Large Language Models for Mental Health Analysis on Chinese Social Media | Wei Zhai et.al. | 2410.10323 | link |
2024-10-14 | Preliminary Evaluation of an Ultrasound-Guided Robotic System for Autonomous Percutaneous Intervention | Pratima Mohan et.al. | 2410.10299 | null |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-11 | Variance reduction combining pre-experiment and in-experiment data | Zhexiao Lin et.al. | 2410.09027 | null |
2024-10-11 | Learning Representations of Instruments for Partial Identification of Treatment Effects | Jonas Schweisthal et.al. | 2410.08976 | link |
2024-10-11 | Transferable Belief Model on Quantum Circuits | Qianli Zhou et.al. | 2410.08949 | null |
2024-10-11 | DiffPO: A causal diffusion model for learning distributions of potential outcomes | Yuchen Ma et.al. | 2410.08924 | null |
2024-10-11 | Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving | Zijiang Yan et.al. | 2410.08854 | null |
2024-10-11 | Online Learning for Intelligent Thermal Management of Interference-coupled and Passively Cooled Base Stations | Zhanwei Yu et.al. | 2410.08799 | null |
2024-10-11 | Integrating Expert Judgment and Algorithmic Decision Making: An Indistinguishability Framework | Rohan Alur et.al. | 2410.08783 | link |
2024-10-11 | VideoSAM: Open-World Video Segmentation | Pinxue Guo et.al. | 2410.08781 | null |
2024-10-11 | Causal machine learning for predicting treatment outcomes | Stefan Feuerriegel et.al. | 2410.08770 | null |
2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et.al. | 2410.08739 | null |
2024-10-11 | Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models | Yunchao Wang et.al. | 2410.08723 | null |
2024-10-11 | Impact of Surface Reflections in Maritime Obstacle Detection | Samed Yalçın et.al. | 2410.08713 | link |
2024-10-11 | Opacity Enforcement by Edit Functions Under Incomparable Observations | Wei Duan et.al. | 2410.08471 | null |
2024-10-11 | AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion | Yuting Xie et.al. | 2410.08453 | null |
2024-10-11 | JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles | Dom Nasrabadi et.al. | 2410.08442 | null |
2024-10-10 | Can LLMs advance democratic values? | Seth Lazar et.al. | 2410.08418 | null |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | Large Legislative Models: Towards Efficient AI Policymaking in Economic Simulations | Henry Gasztowtt et.al. | 2410.08345 | link |
2024-10-10 | Towards Foundation Models for Mixed Integer Linear Programming | Sirui Li et.al. | 2410.08288 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | Mars: Situated Inductive Reasoning in an Open-World Environment | Xiaojuan Tang et.al. | 2410.08126 | null |
2024-10-10 | A Generative AI Technique for Synthesizing a Digital Twin for U.S. Residential Solar Adoption and Generation | Aparna Kishore et.al. | 2410.08098 | null |
2024-10-10 | Gaussian Process Thompson Sampling via Rootfinding | Taiwo A. Adebiyi et.al. | 2410.08071 | null |
2024-10-10 | Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread | David Kerkmann et.al. | 2410.08050 | link |
2024-10-10 | Harmonic Oscillator based Particle Swarm Optimization | Yury Chernyak et.al. | 2410.08043 | null |
2024-10-10 | APOLLO: A GPT-based tool to detect phishing emails and generate explanations that warn users | Giuseppe Desolda et.al. | 2410.07997 | null |
2024-10-10 | Octopus Inspired Optimization Algorithm: Multi-Level Structures and Parallel Computing Strategies | Xu Wang et.al. | 2410.07968 | link |
2024-10-10 | Eco-driving Incentive Mechanisms for Mitigating Emissions in Urban Transportation | M. Umar B. Niazi et.al. | 2410.07952 | null |
2024-10-10 | AI Surrogate Model for Distributed Computing Workloads | David K. Park et.al. | 2410.07940 | null |
2024-10-10 | Offline Hierarchical Reinforcement Learning via Inverse Optimization | Carolin Schmidt et.al. | 2410.07933 | null |
2024-10-10 | Decision-Aware Predictive Model Selection for Workforce Allocation | Eric G. Stratman et.al. | 2410.07932 | null |
2024-10-10 | Efficient Reinforcement Learning with Large Language Model Priors | Xue Yan et.al. | 2410.07927 | null |
2024-10-10 | Understanding Human Activity with Uncertainty Measure for Novelty in Graph Convolutional Networks | Hao Xing et.al. | 2410.07917 | null |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | Autonomous Vehicles Path Planning under Temporal Logic Specifications | Akshay Dhonthi et.al. | 2410.07845 | null |
2024-10-10 | Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses | Pranav Senthilkumar et.al. | 2410.07826 | null |
2024-10-10 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-10-10 | Give Me a Choice: The Consequences of Restricting Choices Through AI-Support for Perceived Autonomy, Motivational Variables, and Decision Performance | Cedric Faas et.al. | 2410.07728 | null |
2024-10-10 | Autonomous Driving in Unstructured Environments: How Far Have We Come? | Chen Min et.al. | 2410.07701 | link |
2024-10-09 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-09 | Identifying and Addressing Delusions for Target-Directed Decision-Making | Mingde Zhao et.al. | 2410.07096 | link |
2024-10-09 | Optimizing Estimators of Squared Calibration Errors in Classification | Sebastian G. Gruber et.al. | 2410.07014 | null |
2024-10-09 | Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models | Daniel Albert et.al. | 2410.06932 | null |
2024-10-09 | How hard can it be? Quantifying MITRE attack campaigns with attack trees and cATM logic | Stefano M. Nicoletti et.al. | 2410.06692 | null |
2024-10-09 | $β$ -calibration of Language Model Confidence Scores for Generative QA | Putra Manggala et.al. | 2410.06615 | null |
2024-10-09 | Decentralized Clinical Trials in the Era of Real-World Evidence: A Statistical Perspective | Jie Chen et.al. | 2410.06591 | null |
2024-10-09 | Use of Real-World Data and Real-World Evidence in Rare Disease Drug Development: A Statistical Perspective | Jie Chen et.al. | 2410.06586 | null |
2024-10-09 | Challenges and Possible Strategies to Address Them in Rare Disease Drug Development: A Statistical Perspective | Jie Chen et.al. | 2410.06585 | null |
2024-10-10 | When Does Interference Matter? Decision-Making in Platform Experiments | Ramesh Johari et.al. | 2410.06580 | null |
2024-10-09 | Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare | Pardis Sadat Zahraei et.al. | 2410.06566 | null |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-09 | Overcoming Autoware-Ubuntu Incompatibility in Autonomous Driving Systems-Equipped Vehicles: Lessons Learned | Dada Zhang et.al. | 2410.06492 | null |
2024-10-09 | Flipping-based Policy for Chance-Constrained Markov Decision Processes | Xun Shen et.al. | 2410.06474 | null |
2024-10-09 | Modeling chaotic Lorenz ODE System using Scientific Machine Learning | Sameera S Kashyap et.al. | 2410.06452 | null |
2024-10-08 | Biased AI can Influence Political Decision-Making | Jillian Fisher et.al. | 2410.06415 | null |
2024-10-08 | BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis | Christopher Klammer et.al. | 2410.06410 | link |
2024-10-08 | Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots | Milad Farjadnasab et.al. | 2410.06372 | link |
2024-10-08 | HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian Aid | Hemank Lamba et.al. | 2410.06370 | link |
2024-10-10 | Context-Aware Command Understanding for Tabletop Scenarios | Paul Gajewski et.al. | 2410.06355 | null |
2024-10-07 | LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation | Zhijie Wang et.al. | 2410.05191 | null |
2024-10-07 | ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | Yuelyu Ji et.al. | 2410.05168 | null |
2024-10-07 | Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability | Fan Chen et.al. | 2410.05117 | null |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | HE-Drive: Human-Like End-to-End Driving with Vision Language Models | Junming Wang et.al. | 2410.05051 | null |
2024-10-07 | Real-time Ship Recognition and Georeferencing for the Improvement of Maritime Situational Awareness | Borja Carrillo Perez et.al. | 2410.04946 | null |
2024-10-07 | PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion | Sijie Wang et.al. | 2410.04939 | link |
2024-10-07 | Why am I seeing this: Democratizing End User Auditing for Online Content Recommendations | Chaoran Chen et.al. | 2410.04917 | null |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-07 | WTCL-Dehaze: Rethinking Real-world Image Dehazing via Wavelet Transform and Contrastive Learning | Divine Joseph Appiah et.al. | 2410.04762 | null |
2024-10-07 | Driving with Regulation: Interpretable Decision-Making for Autonomous Vehicles with Retrieval-Augmented Reasoning via LLM | Tianhui Cai et.al. | 2410.04759 | null |
2024-10-07 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-07 | Does the Infamous Pie Chart Really Hurt Decision-Making in the Real World? Assessing the Role of Visualization in High-Level Academic Decisions | Yixuan Li et.al. | 2410.04686 | null |
2024-10-06 | VISTA: A Visual and Textual Attention Dataset for Interpreting Multimodal Models | Harshit et.al. | 2410.04609 | null |
2024-10-06 | CardioAI: A Multimodal AI-based System to Support Symptom Monitoring and Risk Detection of Cancer Treatment-Induced Cardiotoxicity | Siyi Wu et.al. | 2410.04592 | null |
2024-10-06 | Ranking Policy Learning via Marketplace Expected Value Estimation From Observational Data | Ehsan Ebrahimzadeh et.al. | 2410.04568 | null |
2024-10-06 | Bisimulation metric for Model Predictive Control | Yutaka Shimizu et.al. | 2410.04553 | link |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-06 | A Reinforcement Learning Engine with Reduced Action and State Space for Scalable Cyber-Physical Optimal Response | Shining Sun et.al. | 2410.04518 | null |
2024-10-06 | Two-fund separation under hyperbolically distributed returns and concave utility function | Nuerxiati Abudurexiti et.al. | 2410.04459 | null |
2024-10-04 | Minimax-optimal trust-aware multi-armed bandits | Changxiao Cai et.al. | 2410.03651 | null |
2024-10-04 | Open-World Reinforcement Learning over Long Short-Term Imagination | Jiajian Li et.al. | 2410.03618 | link |
2024-10-04 | A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development | Jesper Knapp et.al. | 2410.03580 | null |
2024-10-04 | MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation | Hongcheng Wang et.al. | 2410.03488 | null |
2024-10-04 | Predictive Coding for Decision Transformer | Tung M. Luu et.al. | 2410.03408 | link |
2024-10-04 | Make Interval Bound Propagation great again | Patryk Krukowski et.al. | 2410.03373 | link |
2024-10-04 | SELU: Self-Learning Embodied MLLMs in Unknown Environments | Boyu Li et.al. | 2410.03303 | null |
2024-10-04 | Deliberate Reasoning for LLMs as Structure-aware Planning with Accurate World Model | Siheng Xiong et.al. | 2410.03136 | null |
2024-10-04 | Spatial-aware decision-making with ring attractors in reinforcement learning systems | Marcos Negre Saura et.al. | 2410.03119 | null |
2024-10-04 | Strategic Insights from Simulation Gaming of AI Race Dynamics | Ross Gruetzemacher et.al. | 2410.03092 | null |
2024-10-04 | MetaOOD: Automatic Selection of OOD Detection Models | Yuehan Qin et.al. | 2410.03074 | null |
2024-10-03 | Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory | Alexander Levine et.al. | 2410.03016 | link |
2024-10-03 | Harm Ratio: A Novel and Versatile Fairness Criterion | Soroush Ebadian et.al. | 2410.02977 | null |
2024-10-03 | Acoustic signaling enables collective perception and control in active matter systems | Alexander Ziepke et.al. | 2410.02940 | null |
2024-10-03 | ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Ahmad Elawady et.al. | 2410.02751 | link |
2024-10-03 | DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life | Yu Ying Chiu et.al. | 2410.02683 | null |
2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | null |
2024-10-03 | Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Hanrong Zhang et.al. | 2410.02644 | link |
2024-10-03 | Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking | Fabian Herzog et.al. | 2410.02638 | link |
2024-10-03 | Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning | Olivier Lepel et.al. | 2410.02605 | null |
2024-10-03 | Expected Maximin Fairness in Max-Cut and other Combinatorial Optimization Problems | Jad Salem et.al. | 2410.02589 | null |
2024-10-03 | Spontaneous Symmetry Breaking, Group Decision Making and Beyond 1. Echo Chambers and Random Polarization | Serge Galam et.al. | 2410.02582 | null |
2024-10-03 | ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Zixiang Wang et.al. | 2410.02551 | null |
2024-10-03 | Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language | Anthony Costarelli et.al. | 2410.02472 | link |
2024-10-03 | Behavior Trees in Functional Safety Supervisors for Autonomous Vehicles | Carlos Conejo et.al. | 2410.02469 | link |
2024-10-03 | Aggregation of Constrained Crowd Opinions for Urban Planning | Akanksha Das et.al. | 2410.02454 | null |
2024-10-03 | Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks | Junlin Hou et.al. | 2410.02331 | null |
2024-10-03 | Selection Guidelines for Geographical SMR Protocols: A Communication Pattern-based Latency Modeling Approach | Kohya Shiozaki et.al. | 2410.02295 | null |
2024-10-03 | Perfect Counterfactuals in Imperfect Worlds: Modelling Noisy Implementation of Actions in Sequential Algorithmic Recourse | Yueqing Xuan et.al. | 2410.02273 | null |
2024-10-03 | End-to-end Driving in High-Interaction Traffic Scenarios with Reinforcement Learning | Yueyuan Li et.al. | 2410.02253 | null |
2024-10-03 | Probabilistic road classification in historical maps using synthetic data and deep learning | Dominik J. Mühlematter et.al. | 2410.02250 | link |
2024-10-03 | SEAL: SEmantic-Augmented Imitation Learning via Language Model | Chengyang Gu et.al. | 2410.02231 | null |
2024-10-03 | Measuring, Evaluating and Improving Logical Consistency in Large Language Models | Yinhong Liu et.al. | 2410.02205 | null |
2024-10-03 | Remember and Recall: Associative-Memory-based Trajectory Prediction | Hang Guo et.al. | 2410.02201 | null |
2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796 | null |
2024-10-02 | DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning | Yebowen Hu et.al. | 2410.01772 | null |
2024-10-02 | Decision-Focused Uncertainty Quantification | Santiago Cortes-Gomez et.al. | 2410.01767 | link |
2024-10-02 | Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning | Xingrui Gu et.al. | 2410.01739 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-10-02 | Moral Alignment for LLM Agents | Elizaveta Tennant et.al. | 2410.01639 | null |
2024-10-02 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-02 | AI-Native Network Digital Twin for Intelligent Network Management in 6G | Wen Wu et.al. | 2410.01584 | null |
2024-10-02 | Uncertainty quantification in neutron and gamma time correlation measurements | Paul Lartaud et.al. | 2410.01522 | null |
2024-10-02 | One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability | Gabriel Kasmi et.al. | 2410.01482 | null |
2024-10-02 | Adaptive teachers for amortized samplers | Minsu Kim et.al. | 2410.01432 | link |
2024-10-02 | Regularized e-processes: anytime valid inference with knowledge-based efficiency gains | Ryan Martin et.al. | 2410.01427 | null |
2024-10-02 | CSLens: Towards Better Deploying Charging Stations via Visual Analytics – A Coupled Networks Perspective | Yutian Zhang et.al. | 2410.01384 | null |
2024-10-02 | MARLens: Understanding Multi-agent Reinforcement Learning for Traffic Signal Control via Visual Analytics | Yutian Zhang et.al. | 2410.01364 | null |
2024-10-02 | Detecting Viral Social Events through Censored Observation with Deep Survival Analysis | Maryam Ramezani et.al. | 2410.01320 | null |
2024-10-02 | FanCric : Multi-Agentic Framework for Crafting Fantasy 11 Cricket Teams | Mohit Bhatnagar et.al. | 2410.01307 | null |
2024-10-02 | What Did I Say Again? Relating User Needs to Search Outcomes in Conversational Commerce | Kevin Schott et.al. | 2410.01291 | null |
2024-10-02 | Uncertainty-aware Human Mobility Modeling and Anomaly Detection | Haomin Wen et.al. | 2410.01281 | null |
2024-10-02 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et.al. | 2410.01225 | link |
2024-10-02 | An uncertainty-aware Digital Shadow for underground multimodal CO2 storage monitoring | Abhinav Prakash Gahlot et.al. | 2410.01218 | null |
2024-09-30 | Maia-2: A Unified Model for Human-AI Alignment in Chess | Zhenwei Tang et.al. | 2409.20553 | link |
2024-09-30 | Best Practices for Responsible Machine Learning in Credit Scoring | Giovani Valdrighi et.al. | 2409.20536 | link |
2024-09-30 | End-to-End Conformal Calibration for Optimization Under Uncertainty | Christopher Yeh et.al. | 2409.20534 | link |
2024-09-30 | Quantifying Metrics for Wildfire Ignition Risk from Geographic Data in Power Shutoff Decision-Making | Ryan Piansky et.al. | 2409.20511 | null |
2024-09-30 | Online Decision Deferral under Budget Constraints | Mirabel Reid et.al. | 2409.20489 | link |
2024-09-30 | The Secretary Problem with Predicted Additive Gap | Alexander Braun et.al. | 2409.20460 | null |
2024-09-30 | Sufficient and Necessary Explanations (and What Lies in Between) | Beepul Bharti et.al. | 2409.20427 | null |
2024-09-30 | Conformal Prediction for Dose-Response Models with Continuous Treatments | Jarne Verhaeghe et.al. | 2409.20412 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation | Tillmann Rheude et.al. | 2409.20287 | link |
2024-09-30 | Learning to Ground Existentially Quantified Goals | Martin Funkquist et.al. | 2409.20259 | null |
2024-09-30 | Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning | Junlin Lu et.al. | 2409.20258 | link |
2024-09-30 | Feature Extractor or Decision Maker: Rethinking the Role of Visual Encoders in Visuomotor Policies | Ruiyu Wang et.al. | 2409.20248 | null |
2024-09-30 | Customized Information and Domain-centric Knowledge Graph Construction with Large Language Models | Frank Wawrzik et.al. | 2409.20010 | null |
2024-10-01 | OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity | Junming Wang et.al. | 2409.19987 | null |
2024-09-30 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | link |
2024-09-30 | Data-driven decision-making under uncertainty with entropic risk measure | Utsav Sadana et.al. | 2409.19926 | null |
2024-10-01 | On The Planning Abilities of OpenAI’s o1 Models: Feasibility, Optimality, and Generalizability | Kevin Wang et.al. | 2409.19924 | link |
2024-09-30 | ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Ezra Karger et.al. | 2409.19839 | link |
2024-09-29 | Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning | Shreyas Muthusamy et.al. | 2409.19829 | null |
2024-09-27 | LML: Language Model Learning a Dataset for Data-Augmented Prediction | Praneeth Vadlapati et.al. | 2409.18957 | link |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Moldable Development Patterns | Oscar Nierstrasz et.al. | 2409.18811 | null |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | Enhancing Explainability in Multimodal Large Language Models Using Ontological Context | Jihen Amara et.al. | 2409.18753 | null |
2024-09-27 | Renewal equations for vector-borne diseases | Cathal Mills et.al. | 2409.18726 | null |
2024-09-27 | The Craft of Selective Prediction: Towards Reliable Case Outcome Classification – An Empirical Study on European Court of Human Rights Cases | T. Y. S. S. Santosh et.al. | 2409.18645 | null |
2024-09-27 | Incorporating Precedents for Legal Judgement Prediction on European Court of Human Rights Cases | T. Y. S. S. Santosh et.al. | 2409.18644 | null |
2024-09-27 | DP-SCC-PL:Differentially Private Decentralized Byzantine-Resilient Stochastic Optimization via Self-Centered Clipping Under Polyak-Łojasiewicz Condition | Jinhui Hu et.al. | 2409.18632 | null |
2024-09-27 | Unsupervised Cognition | Alfredo Ibias et.al. | 2409.18624 | null |
2024-09-27 | Analysis of Truncated Singular Value Decomposition for Koopman Operator-Based Lane Change Model | Chinnawut Nantabut et.al. | 2409.18586 | null |
2024-09-27 | Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen | Miguel Costa et.al. | 2409.18574 | link |
2024-09-27 | BoT-Drive: Hierarchical Behavior and Trajectory Planning for Autonomous Driving using POMDPs | Xuanjin Jin et.al. | 2409.18411 | null |
2024-09-27 | Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network | Lei Li et.al. | 2409.18399 | null |
2024-09-27 | ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data | Shiyi He et.al. | 2409.18386 | null |
2024-09-27 | Robo-CSK-Organizer: Commonsense Knowledge to Organize Detected Objects for Multipurpose Robots | Rafael Hidalgo et.al. | 2409.18385 | null |
2024-09-27 | A model-constrained Discontinuous Galerkin Network (DGNet) for Compressible Euler Equations with Out-of-Distribution Generalization | Hai Van Nguyen et.al. | 2409.18371 | null |
2024-09-26 | Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Zhenghao Peng et.al. | 2409.18343 | null |
2024-09-26 | Does End-to-End Autonomous Driving Really Need Perception Tasks? | Peidong Li et.al. | 2409.18341 | link |
2024-09-26 | Spatial Visibility and Temporal Dynamics: Revolutionizing Field of View Prediction in Adaptive Point Cloud Video Streaming | Chen Li et.al. | 2409.18236 | null |
2024-09-26 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | HARMONIC: A Framework for Explanatory Cognitive Robots | Sanjay Oruganti et.al. | 2409.18037 | null |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning | Song Wang et.al. | 2409.18026 | null |
2024-09-26 | Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Jialin Li et.al. | 2409.18000 | null |
2024-09-26 | A Decision-Making Method in Polyhedral Convex Set Optimization | Andreas Löhne et.al. | 2409.17998 | null |
2024-09-26 | Adaptive Stream Processing on Edge Devices through Active Inference | Boris Sedlak et.al. | 2409.17937 | null |
2024-09-26 | PhantomLiDAR: Cross-modality Signal Injection Attacks against LiDAR | Zizhi Jin et.al. | 2409.17907 | null |
2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | null |
2024-09-26 | CASPFormer: Trajectory Prediction from BEV Images with Deformable Attention | Harsh Yadav et.al. | 2409.17790 | null |
2024-09-26 | AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking | Shiqi Sun et.al. | 2409.17728 | null |
2024-09-26 | Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning | Siyi Lu et.al. | 2409.17659 | null |
2024-09-26 | Intervention strategies for misinformation sharing on social media: A bibliometric analysis | Juanita Zainudin et.al. | 2409.17637 | null |
2024-09-27 | Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception | Jie Jia et.al. | 2409.17618 | null |
2024-09-26 | Good Data Is All Imitation Learning Needs | Amir Samadi et.al. | 2409.17605 | null |
2024-09-26 | Planned behavior, perceptual biases, and the dynamics of collective action | Alice C Schwarze et.al. | 2409.17573 | null |
2024-09-26 | Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs | Deniz Gündüz et.al. | 2409.17557 | null |
2024-09-26 | GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent | Hongtai Zeng et.al. | 2409.17500 | link |
2024-09-26 | How Do Observational Astronomers Learn to Inspect Imaging Data | Hugo Walsh et.al. | 2409.17468 | null |
2024-09-25 | Learning with Dynamics: Autonomous Regulation of UAV Based Communication Networks with Dynamic UAV Crew | Ran Zhang et.al. | 2409.17139 | null |
2024-09-25 | Enhancing robot reliability for health-care facilities by means of Human-Aware Navigation Planning | Olga E. Sorokoletova et.al. | 2409.17131 | null |
2024-09-25 | On-orbit Servicing for Spacecraft Collision Avoidance With Autonomous Decision Making | Susmitha Patnala et.al. | 2409.17125 | null |
2024-09-25 | Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer | Benji Peng et.al. | 2409.17120 | null |
2024-09-25 | Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion | Vineet Punyamoorty et.al. | 2409.16950 | null |
2024-09-25 | Quantifying Visual Properties of GAM Shape Plots: Impact on Perceived Cognitive Load and Interpretability | Sven Kruschel et.al. | 2409.16870 | null |
2024-09-25 | The Role of Language Models in Modern Healthcare: A Comprehensive Review | Amna Khalid et.al. | 2409.16860 | null |
2024-09-25 | Dispute resolution in legal mediation with quantitative argumentation | Xiao Chi et.al. | 2409.16854 | null |
2024-09-25 | Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability | Carlos E. Luis et.al. | 2409.16824 | null |
2024-09-25 | PeerArg: Argumentative Peer Review with LLMs | Purin Sukpanichnant et.al. | 2409.16813 | null |
2024-09-25 | Spacewalker: Traversing Representation Spaces for Fast Interactive Exploration and Annotation of Unstructured Data | Lukas Heine et.al. | 2409.16793 | link |
2024-09-25 | MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu et.al. | 2409.16686 | null |
2024-09-25 | Skyeyes: Ground Roaming using Aerial View Images | Zhiyuan Gao et.al. | 2409.16685 | null |
2024-09-25 | An Integrated Machine Learning and Deep Learning Framework for Credit Card Approval Prediction | Kejian Tong et.al. | 2409.16676 | null |
2024-09-25 | Stochastic Shortest Path Problem with Failure Probability | Ritsusamuel Otsubo et.al. | 2409.16672 | null |
2024-09-26 | Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models | Alexander Popov et.al. | 2409.16663 | null |
2024-09-25 | Examining the Rat in the Tunnel: Interpretable Multi-Label Classification of Tor-based Malware | Ishan Karunanayake et.al. | 2409.16639 | null |
2024-09-25 | Optimized Monte Carlo Tree Search for Enhanced Decision Making in the FrozenLake Environment | Esteban Aldana Guerra et.al. | 2409.16620 | null |
2024-09-25 | CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models | Xin Jing et.al. | 2409.16619 | null |
2024-09-25 | EMIT- Event-Based Masked Auto Encoding for Irregular Time Series | Hrishikesh Patel et.al. | 2409.16554 | link |
2024-09-18 | Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | Arslan Chaudhry et.al. | 2409.12180 | null |
2024-09-18 | Publishing Instincts: An Exploration-Exploitation Framework for Studying Academic Publishing Behavior and “Home Venues” | Teddy Lazebnik et.al. | 2409.12158 | null |
2024-09-18 | Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference | Najmeh Forouzandehmehr et.al. | 2409.12150 | null |
2024-09-18 | Pareto Data Framework: Steps Towards Resource-Efficient Decision Making Using Minimum Viable Data (MVD) | Tashfain Ahmed et.al. | 2409.12112 | null |
2024-09-18 | Unveiling the Black Box: Independent Functional Module Evaluation for Bird’s-Eye-View Perception Model | Ludan Zhang et.al. | 2409.11969 | null |
2024-09-18 | Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics | Malte Schneevogt et.al. | 2409.11820 | null |
2024-09-18 | Conformal Prediction for Manifold-based Source Localization with Gaussian Processes | Vadim Rozenfeld et.al. | 2409.11804 | null |
2024-09-18 | Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic | Zhe Yu et.al. | 2409.11780 | null |
2024-09-18 | RopeBEV: A Multi-Camera Roadside Perception Network in Bird’s-Eye-View | Jinrang Jia et.al. | 2409.11706 | null |
2024-09-18 | From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving | Xu Han et.al. | 2409.11694 | null |
2024-09-18 | Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach | Abeer Alshehri et.al. | 2409.11675 | null |
2024-09-18 | Blockchain-Enabled IoV: Secure Communication and Trustworthy Decision-Making | Jingyi Sun et.al. | 2409.11621 | null |
2024-09-17 | Exploring Dimensions of Expertise in AR-Guided Psychomotor Tasks | Steven Yoo et.al. | 2409.11599 | null |
2024-09-17 | Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning | Qingqing Wang et.al. | 2409.11576 | null |
2024-09-17 | Balancing Optimality and Diversity: Human-Centered Decision Making through Generative Curation | Michael Lingzhi Li et.al. | 2409.11535 | null |
2024-09-17 | Leveraging AI-Generated Emotional Self-Voice to Nudge People towards their Ideal Selves | Cathy Mengying Fang et.al. | 2409.11531 | null |
2024-09-17 | Partially Observable Contextual Bandits with Linear Payoffs | Sihan Zeng et.al. | 2409.11521 | null |
2024-09-17 | Beyond Algorithmic Fairness: A Guide to Develop and Deploy Ethical AI-Enabled Decision-Support Tools | Rosemarie Santa Gonzalez et.al. | 2409.11489 | null |
2024-09-24 | Consensus decision making on a complete graph: complex behaviour from simple assumptions | P. Sarkanych et.al. | 2409.11475 | null |
2024-09-17 | UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta et.al. | 2409.11403 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem | M. Esat Kalfaoglu et.al. | 2409.11325 | null |
2024-09-17 | Navigating Process Mining: A Case study using pm4py | Ali Jlidi et.al. | 2409.11294 | null |
2024-09-17 | Cost-informed dimensionality reduction for structural digital twin technologies | Aidan J. Hughes et.al. | 2409.11236 | null |
2024-09-18 | High-Order Evolving Graphs for Enhanced Representation of Traffic Dynamics | Aditya Humnabadkar et.al. | 2409.11206 | null |
2024-09-17 | Optimization of Rulebooks via Asymptotically Representing Lexicographic Hierarchies for Autonomous Vehicles | Matteo Penlington et.al. | 2409.11199 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | null |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images | Jieyun Bai et.al. | 2409.10980 | null |
2024-09-17 | Beyond Rationality: Unveiling the Role of Animal Spirits and Inflation Extrapolation in Central Bank Communication of the US | Arpan Chakraborty et.al. | 2409.10938 | null |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-17 | DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation | Shiwei Feng et.al. | 2409.10832 | null |
2024-09-16 | NaviQAte: Functionality-Guided Web Application Navigation | Mobina Shahbandeh et.al. | 2409.10741 | null |
2024-09-16 | Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making | Som Sagar et.al. | 2409.10733 | link |
2024-09-16 | Aligning Judgment Using Task Context and Explanations to Improve Human-Recommender System Performance | Divya Srivastava et.al. | 2409.10717 | null |
2024-09-16 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-16 | Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning | Daniel Flögel et.al. | 2409.10655 | null |
2024-09-16 | Development of Data Evaluation Benchmark for Data Wrangling Recommendation System | Yuqing Wang et.al. | 2409.10635 | null |
2024-09-16 | MusicLIME: Explainable Multimodal Music Understanding | Theodoros Sotirou et.al. | 2409.10496 | link |
2024-09-16 | Radar Teach and Repeat: Architecture and Initial Field Testing | Xinyuan Qiao et.al. | 2409.10491 | link |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-16 | Quantile Fourier regressions for decision making under uncertainty | Arash Khojaste et.al. | 2409.10455 | null |
2024-09-16 | Stretchable Arduinos embedded in soft robots | Stephanie J. Woodman et.al. | 2409.10333 | link |
2024-09-16 | DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving | Songning Lai et.al. | 2409.10330 | null |
2024-09-16 | InfoDisent: Explainability of Image Classification Models by Information Disentanglement | Łukasz Struski et.al. | 2409.10329 | null |
2024-09-16 | Fairness, not Emotion, Drives Socioeconomic Decision Making | Rudra Mukhopadhyay et.al. | 2409.10322 | null |
2024-09-16 | SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation | Benjamin Stoler et.al. | 2409.10320 | link |
2024-09-16 | A Note on Piecewise Affine Decision Rules for Robust, Stochastic, and Data-Driven Optimization | Simon Thomä et.al. | 2409.10295 | link |
2024-09-16 | ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | Jiahao Yuan et.al. | 2409.10289 | link |
2024-09-16 | Questioning AI: Promoting Decision-Making Autonomy Through Reflection | Simon WS Fischer et.al. | 2409.10250 | null |
2024-09-16 | Robust Bird’s Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-16 | LLMs for clinical risk prediction | Mohamed Rezk et.al. | 2409.10191 | null |
2024-09-16 | ExelMap: Explainable Element-based HD-Map Change Detection and Update | Lena Wild et.al. | 2409.10178 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | AALF: Almost Always Linear Forecasting | Matthias Jakobs et.al. | 2409.10142 | link |
2024-09-16 | Advancing Towards a Marine Digital Twin Platform: Modeling the Mar Menor Coastal Lagoon Ecosystem in the South Western Mediterranean | Yu Ye et.al. | 2409.10134 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-16 | LeGEND: A Top-Down Approach to Scenario Generation of Autonomous Driving Systems Assisted by Large Language Models | Shuncheng Tang et.al. | 2409.10066 | link |
2024-09-13 | Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis | Xiaoyu Chu et.al. | 2409.08949 | link |
2024-09-13 | Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Xuchen Li et.al. | 2409.08887 | null |
2024-09-13 | Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Jialu Tang et.al. | 2409.08788 | null |
2024-09-13 | Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Yunus Bilge Kurt et.al. | 2409.08769 | link |
2024-09-13 | GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction | Siyu Li et.al. | 2409.08688 | link |
2024-09-13 | xTED: Cross-Domain Policy Adaptation via Diffusion-Based Trajectory Editing | Haoyi Niu et.al. | 2409.08687 | link |
2024-09-13 | Agile Decision-Making and Safety-Critical Motion Planning for Emergency Autonomous Vehicles | Yiming Shu et.al. | 2409.08665 | null |
2024-09-13 | Optimizing Item-based Marketing Promotion Efficiency in C2C Marketplace with Dynamic Sequential Coupon Allocation Framework | Jie Yang et.al. | 2409.08609 | null |
2024-09-13 | Common revenue allocation in DMUs with two stages based on DEA cross-efficiency and cooperative game | Xinyu Wang et.al. | 2409.08502 | null |
2024-09-12 | An Experimental Study of Competitive Market Behavior Through LLMs | Jingru Jia et.al. | 2409.08357 | null |
2024-09-13 | The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting | Ashwini Gundappa et.al. | 2409.08253 | null |
2024-09-12 | How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games | Gokce Dayanikli et.al. | 2409.08235 | null |
2024-09-12 | Model Ensemble for Brain Tumor Segmentation in Magnetic Resonance Imaging | Daniel Capellán-Martín et.al. | 2409.08232 | link |
2024-09-12 | Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Xiang Huo et.al. | 2409.08132 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-12 | Value of Communication: Data-Driven Topology Optimization for Distributed Linear Cyber-Physical Systems | Michael Nestor et.al. | 2409.08116 | null |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | link |
2024-09-12 | LED: Light Enhanced Depth Estimation at Night | Simon de Moreau et.al. | 2409.08031 | link |
2024-09-12 | Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols | Charlie Griffin et.al. | 2409.07985 | link |
2024-09-12 | WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Jingwen Tong et.al. | 2409.07964 | link |
2024-09-12 | On an optimization model for firefighting helicopter planning | Marta Rodríguez Barreiro et.al. | 2409.07937 | null |
2024-09-12 | Conformal Distributed Remote Inference in Sensor Networks Under Reliability and Communication Constraints | Meiyi Zhu et.al. | 2409.07902 | null |
2024-09-12 | Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes | Ming Li et.al. | 2409.07843 | null |
2024-09-12 | ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable | Yuan Yin et.al. | 2409.07830 | link |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-12 | ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation | Shiwei Feng et.al. | 2409.07774 | link |
2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
2024-09-12 | Attack End-to-End Autonomous Driving through Module-Wise Noise | Lu Wang et.al. | 2409.07706 | null |
2024-09-11 | Gaussian Process Upper Confidence Bounds in Distributed Point Target Tracking over Wireless Sensor Networks | Xingchi Liu et.al. | 2409.07652 | null |
2024-09-11 | A Survey of Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Guiliang Liu et.al. | 2409.07569 | link |
2024-09-11 | Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation | Luo Ji et.al. | 2409.07416 | null |
2024-09-11 | Dynamic Bayesian Networks, Elicitation and Data Embedding for Secure Environments | Kieran Drury et.al. | 2409.07389 | null |
2024-09-11 | Multi-source Stable Variable Importance Measure via Adversarial Machine Learning | Zitao Wang et.al. | 2409.07380 | null |
2024-09-11 | Policy consequences of the new neuroeconomic framework | A. David Redish et.al. | 2409.07373 | null |
2024-09-11 | The Role of Explainable AI in Revolutionizing Human Health Monitoring | Abdullah Alharthi et.al. | 2409.07347 | null |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving | Tianyuan Zhang et.al. | 2409.07321 | null |
2024-09-11 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-11 | Behavioral Cloning Models Reality Check for Autonomous Driving | Mustafa Yildirim et.al. | 2409.07218 | null |
2024-09-11 | Quantum Monte Carlo methods for Newsvendor problem with Multiple Unreliable Suppliers | Monit Sharma et.al. | 2409.07183 | null |
2024-09-11 | Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations | Gaia Romana De Paolis et.al. | 2409.07100 | null |
2024-09-11 | A Novel Voting System for Medical Catalogues in National Health Insurance | Xingyuan Liang et.al. | 2409.07057 | null |
2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702 | null |
2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
2024-09-10 | Designing Resource Allocation Tools to Promote Fair Allocation: Do Visualization and Information Framing Matter? | Arnav Verma et.al. | 2409.06688 | null |
2024-09-10 | Memory and Personality in Ideological Polarization: The Politico-physics of Mnemomatter | Shengkai Li et.al. | 2409.06660 | null |
2024-09-10 | Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception | Xiang Zhang et.al. | 2409.06584 | null |
2024-09-10 | MAGDA: Multi-agent guideline-driven diagnostic assistance | David Bani-Harouni et.al. | 2409.06351 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | Towards Robust Uncertainty-Aware Incomplete Multi-View Classification | Mulin Chen et.al. | 2409.06270 | null |
2024-09-10 | UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised | Tao Ni et.al. | 2409.06197 | null |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-10 | HierLLM: Hierarchical Large Language Model for Question Recommendation | Yuxuan Liu et.al. | 2409.06177 | null |
2024-09-09 | Coarse Descriptions and Cautious Preferences | Evan Piermont et.al. | 2409.06054 | null |
2024-09-09 | Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting | Gianmarco Genalti et.al. | 2409.05980 | null |
2024-09-09 | Predicting Electricity Consumption with Random Walks on Gaussian Processes | Chloé Hashimoto-Cullen et.al. | 2409.05934 | null |
2024-09-09 | A Framework for Evaluating PM2.5 Forecasts from the Perspective of Individual Decision Making | Renato Berlinghieri et.al. | 2409.05866 | link |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-09 | An Introduction to Quantum Reinforcement Learning (QRL) | Samuel Yen-Chi Chen et.al. | 2409.05846 | null |
2024-09-09 | Vision-Driven 2D Supervised Fine-Tuning Framework for Bird’s Eye View Perception | Lei He et.al. | 2409.05834 | null |
2024-09-09 | Limits on the computational expressivity of non-equilibrium biophysical processes | Carlos Floyd et.al. | 2409.05827 | null |
2024-09-09 | Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors | Jiaqi Liu et.al. | 2409.05712 | null |
2024-09-09 | Quantum Volunteer’s Dilemma | Dax Enshan Koh et.al. | 2409.05708 | null |
2024-09-09 | Replay Consolidation with Label Propagation for Continual Object Detection | Riccardo De Monte et.al. | 2409.05650 | null |
2024-09-09 | Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Zhao Shan et.al. | 2409.05622 | null |
2024-09-09 | StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation | Muraleekrishna Gopinathan et.al. | 2409.05593 | null |
2024-09-09 | Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning | Arda Sarp Yenicesu et.al. | 2409.05586 | link |
2024-09-10 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-09 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | link |
2024-09-09 | Common or specific source, features or scores; it is all a matter of information | Aafko Boonstra et.al. | 2409.05403 | null |
2024-09-09 | Diagnostic Reasoning in Natural Language: Computational Model and Application | Nils Dycke et.al. | 2409.05367 | null |
2024-09-09 | Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping | Shuang Zeng et.al. | 2409.05352 | null |
2024-09-09 | ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions | Furqan Ahmed Shaik et.al. | 2409.05327 | null |
2024-09-09 | Developing Trajectory Planning with Behavioral Cloning and Proximal Policy Optimization for Path-Tracking and Static Obstacle Nudging | Mingyan Zhou et.al. | 2409.05289 | link |
2024-09-08 | Sliding-Window Thompson Sampling for Non-Stationary Settings | Marco Fiandri et.al. | 2409.05181 | null |
2024-09-08 | Enhancing the Performance of Multi-Vehicle Navigation in Unstructured Environments using Hard Sample Mining | Yining Ma et.al. | 2409.05119 | link |
2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | null |
2024-09-06 | Evaluating Fairness in Transaction Fraud Models: Fairness Metrics, Bias Audits, and Challenges | Parameswaran Kamalaruban et.al. | 2409.04373 | null |
2024-09-06 | A naive aggregation algorithm for improving generalization in a class of learning problems | Getachew K Befekadu et.al. | 2409.04352 | null |
2024-09-06 | Active learning for regression in engineering populations: A risk-informed approach | Daniel R. Clarkson et.al. | 2409.04328 | null |
2024-09-06 | Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields | Felix Herrmann et.al. | 2409.04306 | null |
2024-09-06 | SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation Algorithms | Inmo Jang et.al. | 2409.04230 | link |
2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | null |
2024-09-06 | Algorithmic Collusion Without Threats | Eshwar Ram Arunachaleswaran et.al. | 2409.03956 | link |
2024-09-05 | DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment | Kangtong Mo et.al. | 2409.03930 | null |
2024-09-05 | Understanding Fairness Metrics in Recommender Systems: A Healthcare Perspective | Veronica Kecki et.al. | 2409.03893 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization | Federico Berto et.al. | 2409.03811 | link |
2024-09-05 | A Deep Generative Learning Approach for Two-stage Adaptive Robust Optimization | Aron Brenner et.al. | 2409.03731 | null |
2024-09-05 | A Fused Large Language Model for Predicting Startup Success | Abdurahman Maarouf et.al. | 2409.03668 | null |
2024-09-05 | Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift | Fabian Diet et.al. | 2409.03543 | null |
2024-09-05 | Distributionally Robust Optimisation with Bayesian Ambiguity Sets | Charita Dellaporta et.al. | 2409.03492 | null |
2024-09-05 | Neural HD Map Generation from Multiple Vectorized Tiles Locally Produced by Autonomous Vehicles | Miao Fan et.al. | 2409.03445 | null |
2024-09-05 | F3T: A soft tactile unit with 3D force and temperature mathematical decoupling ability for robots | Xiong Yang et.al. | 2409.03421 | null |
2024-09-06 | CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks | Yongxin Deng et.al. | 2409.03381 | null |
2024-09-05 | YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving | Jingyu Zhang et.al. | 2409.03320 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration | Jeremy Qin et.al. | 2409.03225 | link |
2024-09-05 | InfraLib: Enabling Reinforcement Learning and Decision Making for Large Scale Infrastructure Management | Pranay Thangeda et.al. | 2409.03167 | null |
2024-09-05 | Autonomous Drifting Based on Maximal Safety Probability Learning | Hikaru Hoshino et.al. | 2409.03160 | link |
2024-09-05 | Non-stationary and Sparsely-correlated Multi-output Gaussian Process with Spike-and-Slab Prior | Wang Xinming et.al. | 2409.03149 | null |
2024-09-04 | Developing, Analyzing, and Evaluating Self-Drive Algorithms Using Drive-by-Wire Electric Vehicles | Beñat Froemming-Aldanondo et.al. | 2409.03114 | link |
2024-09-04 | Explainable AI for computational pathology identifies model limitations and tissue biomarkers | Jakub R. Kaczmarzyk et.al. | 2409.03080 | link |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-04 | Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Moein Shahiki Tash et.al. | 2409.02836 | null |
2024-09-04 | Towards Edge-Based Data Lake Architecture for Intelligent Transportation System | Danilo Fernandes et.al. | 2409.02808 | null |
2024-09-04 | Beyond Nash Equilibrium: Achieving Bayesian Perfect Equilibrium with Belief Update Fictitious Play | Qi Ju et.al. | 2409.02706 | link |
2024-09-04 | Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Constantin Waubert de Puiseau et.al. | 2409.02697 | null |
2024-09-04 | The Role of Artificial Intelligence and Machine Learning in Software Testing | Ahmed Ramadan et.al. | 2409.02693 | null |
2024-09-04 | Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch et.al. | 2409.02676 | null |
2024-09-04 | PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation | Aneta Pawelec et.al. | 2409.02617 | null |
2024-09-04 | AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation | Jinfeng Xu et.al. | 2409.02580 | link |
2024-09-05 | Assembling the Puzzle: Exploring Collaboration and Data Sensemaking in Nursing Practices for Remote Patient Monitoring | Mihnea Calota et.al. | 2409.02579 | null |
2024-09-04 | How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations | Florian Blume et.al. | 2409.02566 | null |
2024-09-04 | Want a Ride? Attitudes Towards Autonomous Driving and Behavior in Autonomous Vehicles | Enrico Del Re et.al. | 2409.02556 | null |
2024-09-05 | A Sequential Decision-Making Model for Perimeter Identification | Ayal Taitler et.al. | 2409.02549 | null |
2024-09-04 | A Joint Time and Energy-Efficient Federated Learning-based Computation Offloading Method for Mobile Edge Computing | Anwesha Mukherjee et.al. | 2409.02548 | null |
2024-09-04 | Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Zhiyuan Li et.al. | 2409.02522 | null |
2024-09-04 | TLD: A Vehicle Tail Light signal Dataset and Benchmark | Jinhao Chai et.al. | 2409.02508 | null |
2024-09-04 | eRSS-RAMP: A Rule-Adherence Motion Planner Based on Extended Responsibility-Sensitive Safety for Autonomous Driving | Pengfei Lin et.al. | 2409.02503 | null |
2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | null |
2024-09-04 | TASAR: Transferable Attack on Skeletal Action Recognition | Yunfeng Diao et.al. | 2409.02483 | link |
2024-08-30 | Dual-criterion Dose Finding Designs Based on Dose-Limiting Toxicity and Tolerability | Yunlong Yang et.al. | 2408.17392 | null |
2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | null |
2024-08-30 | An Integer Linear Programming Model for Earth Observation Missions | Vincenzo Basco et.al. | 2408.17288 | null |
2024-08-30 | How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception | Mert Keser et.al. | 2408.17222 | null |
2024-08-30 | NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar | Runwei Guan et.al. | 2408.17207 | null |
2024-08-30 | Modelling Growth, Remodelling and Damage of a Thick-walled Fibre-reinforced Artery with Active Response: Application to Cerebral Vasospasm and Treatment | Giulia Pederzani et.al. | 2408.17206 | null |
2024-08-30 | Towards Symbolic XAI – Explanation Through Human Understandable Logical Relationships Between Features | Thomas Schnake et.al. | 2408.17198 | null |
2024-09-03 | Controllable Edge-Type-Specific Interpretation in Multi-Relational Graph Neural Networks for Drug Response Prediction | Xiaodi Li et.al. | 2408.17129 | link |
2024-08-30 | A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation | Camila Martinez Parra et.al. | 2408.17113 | null |
2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | link |
2024-08-30 | Reasoning AI Performance Degradation in 6G Networks with Large Language Models | Liming Huang et.al. | 2408.17097 | null |
2024-08-30 | PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics | Zhengru Fang et.al. | 2408.17047 | link |
2024-08-30 | Tonal Cognition in Sonification: Exploring the Needs of Practitioners in Sonic Interaction Design | Minsik Choi et.al. | 2408.17012 | null |
2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | link |
2024-08-29 | Enhancing Autism Spectrum Disorder Early Detection with the Parent-Child Dyads Block-Play Protocol and an Attention-enhanced GCN-xLSTM Hybrid Deep Learning Framework | Xiang Li et.al. | 2408.16924 | null |
2024-08-29 | Auricular Vagus Nerve Stimulation for Enhancing Remote Pilot Training and Operations | William J. Tyler et.al. | 2408.16755 | null |
2024-08-29 | Enhanced forecasting of stock prices based on variational mode decomposition, PatchTST, and adaptive scale-weighted layer | Xiaorui Xue et.al. | 2408.16707 | null |
2024-08-29 | RoboMNIST: A Multimodal Dataset for Multi-Robot Activity Recognition Using WiFi Sensing, Video, and Audio | Kian Behzad et.al. | 2408.16703 | link |
2024-08-29 | A Catalog of Fairness-Aware Practices in Machine Learning Engineering | Gianmario Voria et.al. | 2408.16683 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-29 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-08-29 | CooTest: An Automated Testing Approach for V2X Communication Systems | An Guo et.al. | 2408.16470 | link |
2024-08-29 | Consensus Planning with Primal, Dual, and Proximal Agents | Alvaro Maggiar et.al. | 2408.16462 | null |
2024-08-29 | BEVal: A Cross-dataset Evaluation Study of BEV Segmentation Models for Autononomous Driving | Manuel Alejandro Diaz-Zapata et.al. | 2408.16322 | link |
2024-08-29 | Passenger hazard perception based on EEG signals for highly automated driving vehicles | Ashton Yu Xuan Tan et.al. | 2408.16315 | null |
2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | link |
2024-08-28 | Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models | Roderick Seow et.al. | 2408.16147 | null |
2024-08-28 | EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao et.al. | 2408.16090 | link |
2024-08-28 | Logic-Enhanced Language Model Agents for Trustworthy Social Simulations | Agnieszka Mensfelt et.al. | 2408.16081 | link |
2024-08-28 | WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration | Yao Zhang et.al. | 2408.15978 | null |
2024-08-28 | SLAM2REF: Advancing Long-Term Mapping with 3D LiDAR and Reference Map Integration for Precise 6-DoF Trajectory Estimation and Map Extension | Miguel Arturo Vega Torres et.al. | 2408.15948 | link |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | FlowAct: A Proactive Multimodal Human-robot Interaction System with Continuous Flow of Perception and Modular Action Sub-systems | Timothée Dhaussy et.al. | 2408.15864 | null |
2024-08-28 | Network transferability of adversarial patches in real-time object detection | Jens Bayer et.al. | 2408.15833 | link |
2024-08-28 | Emulating Brain-like Rapid Learning in Neuromorphic Edge Computing | Kenneth Stewart et.al. | 2408.15800 | link |
2024-08-28 | LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models | Jiayi Gui et.al. | 2408.15778 | link |
2024-08-28 | Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | Zherong Zhang et.al. | 2408.15750 | null |
2024-08-28 | Comparing diversity, negativity, and stereotypes in Chinese-language AI technologies: a case study on Baidu, Ernie and Qwen | Geng Liu et.al. | 2408.15696 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-08-28 | Correlation-Adjusted Simultaneous Testing for Ultra High-dimensional Grouped Data | Iris Ivy Gauran et.al. | 2408.15623 | null |
2024-08-28 | Latent Relationship Mining of Glaucoma Biomarkers: a TRI-LSTM based Deep Learning | Cheng Huang et.al. | 2408.15555 | null |
2024-08-28 | Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems | Farzaneh Dehghani et.al. | 2408.15550 | null |
2024-08-28 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | link |
2024-08-28 | MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning | Yifu Yuan et.al. | 2408.15501 | null |
2024-08-28 | PersonalizedUS: Interpretable Breast Cancer Risk Assessment with Local Coverage Uncertainty Quantification | Alek Fröhlich et.al. | 2408.15458 | null |
2024-08-27 | Understanding GNNs for Boolean Satisfiability through Approximation Algorithms | Jan Hůla et.al. | 2408.15418 | null |
2024-08-27 | Panoptic Perception for Autonomous Driving: A Survey | Yunge Li et.al. | 2408.15388 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | Using LLMs for Explaining Sets of Counterfactual Examples to Final Users | Arturo Fredes et.al. | 2408.15133 | link |
2024-08-27 | T-FAKE: Synthesizing Thermal Images for Facial Landmarking | Philipp Flotho et.al. | 2408.15127 | link |
2024-08-27 | Subgroup Analysis via Model-based Rule Forest | I-Ling Cheng et.al. | 2408.15057 | null |
2024-08-27 | Cross-subject Brain Functional Connectivity Analysis for Multi-task Cognitive State Evaluation | Jun Chen et.al. | 2408.15018 | null |
2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | link |
2024-08-27 | Unsupervised-to-Online Reinforcement Learning | Junsu Kim et.al. | 2408.14785 | null |
2024-08-27 | Optimization model for electric aircraft tow tractors considering operator coalition | Dan-Wen Bao et.al. | 2408.14748 | null |
2024-08-26 | Artificial Intelligence in Landscape Architecture: A Survey | Yue Xing et.al. | 2408.14700 | null |
2024-08-26 | Enhancing Neural Network Interpretability Through Conductance-Based Information Plane Analysis | Jaouad Dabounou et.al. | 2408.14681 | null |
2024-08-26 | Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web | Kate Lin et.al. | 2408.14636 | link |
2024-08-26 | EVINCE: Optimizing Adversarial LLM Dialogues via Conditional Statistics and Information Theory | Edward Y. Chang et.al. | 2408.14575 | null |
2024-08-26 | Aiding Humans in Financial Fraud Decision Making: Toward an XAI-Visualization Framework | Angelos Chatzimparmpas et.al. | 2408.14552 | null |
2024-08-26 | Taxicab distance based best-worst method for multi-criteria decision-making: An analytical approach | Harshit Ratandhara et.al. | 2408.14452 | null |
2024-08-26 | Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving | Yu Yang et.al. | 2408.14197 | null |
2024-08-26 | EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection | Pengyu Li et.al. | 2408.14189 | null |
2024-08-26 | DynamicRouteGPT: A Real-Time Multi-Vehicle Dynamic Navigation Framework Based on Large Language Models | Ziai Zhou et.al. | 2408.14185 | null |
2024-08-26 | Dynamic Pricing for Electric Vehicle Charging | Arun Kumar Kalakanti et.al. | 2408.14169 | null |
2024-08-26 | Crowd-Calibrator: Can Annotator Disagreement Inform Calibration in Subjective Tasks? | Urja Khurana et.al. | 2408.14141 | null |
2024-08-26 | Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search | Shuo Yang et.al. | 2408.14000 | null |
2024-08-26 | FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation | Daixun Li et.al. | 2408.13980 | null |
2024-08-26 | Speeding Ticket: Unveiling the Energy and Emission Burden of AI-Accelerated Distributed and Decentralized Power Dispatch Models | Meiyi Li et.al. | 2408.13968 | null |
2024-08-25 | Optimizing Luxury Vehicle Dealership Networks: A Graph Neural Network Approach to Site Selection | Luca Silvano Carocci et.al. | 2408.13961 | link |
2024-08-27 | Time Series Analysis for Education: Methods, Applications, and Future Directions | Shengzhong Mao et.al. | 2408.13960 | link |
2024-08-25 | Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2408.13950 | null |
2024-08-25 | CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction | Guangya Wan et.al. | 2408.13940 | null |
2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | null |
2024-08-25 | Making Large Language Models Better Planners with Reasoning-Decision Alignment | Zhijian Huang et.al. | 2408.13890 | null |
2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | null |
2024-08-25 | Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Duy Khoa Pham et.al. | 2408.13808 | null |
2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
2024-08-25 | CV-MOS: A Cross-View Model for Motion Segmentation | Xiaoyu Tang et.al. | 2408.13790 | link |
2024-08-25 | Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion | Xu Zhang et.al. | 2408.13744 | link |
2024-08-23 | Temporal Fairness in Decision Making Problems | Manuel R. Torres et.al. | 2408.13208 | null |
2024-08-23 | Causal machine learning for sustainable agroecosystems | Vasileios Sitokonstantinou et.al. | 2408.13155 | null |
2024-08-23 | Interpretable breast cancer classification using CNNs on mammographic images | Ann-Kristin Balve et.al. | 2408.13154 | link |
2024-08-23 | Analysis of child development facts and myths using text mining techniques and classification models | Mehedi Tajrian et.al. | 2408.13091 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Fair Pairs: Fairness-Aware Ranking Recovery from Pairwise Comparisons | Georg Ahnert et.al. | 2408.13034 | link |
2024-08-23 | Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Adnan Haider et.al. | 2408.13008 | null |
2024-08-23 | Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates | Hui Wei et.al. | 2408.13006 | link |
2024-08-23 | MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries | Mohamed Elgaar et.al. | 2408.12980 | link |
2024-08-23 | iSee: Advancing Multi-Shot Explainable AI Using Case-based Recommendations | Anjana Wijekoon et.al. | 2408.12941 | null |
2024-08-23 | ml_edm package: a Python toolkit for Machine Learning based Early Decision Making | Aurélien Renault et.al. | 2408.12925 | link |
2024-08-23 | Structural Representation Learning and Disentanglement for Evidential Chinese Patent Approval Prediction | Jinzhi Shan et.al. | 2408.12852 | null |
2024-08-23 | Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence | Purushothaman Natarajan et.al. | 2408.12837 | link |
2024-08-23 | Courteous MPC for Autonomous Driving with CBF-inspired Risk Assessment | Yanze Zhang et.al. | 2408.12822 | null |
2024-08-23 | VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models | Purushothaman Natarajan et.al. | 2408.12808 | link |
2024-08-23 | A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Shuo Yang et.al. | 2408.12805 | null |
2024-08-22 | Does Spatial Information Improve Influenza Forecasting? | Gabrielle Thivierge et.al. | 2408.12722 | link |
2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | null |
2024-08-22 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-22 | A Monte Carlo Tree Search approach to QAOA: finding a needle in the haystack | Andoni Agirre et.al. | 2408.12648 | null |
2024-08-22 | The Importance of Cognitive Biases in the Recommendation Ecosystem | Markus Schedl et.al. | 2408.12492 | null |
2024-08-22 | Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition | Bozheng Li et.al. | 2408.12475 | null |
2024-08-22 | Multi-Knowledge Fusion Network for Time Series Representation Learning | Sagar Srinivas Sakhinana et.al. | 2408.12423 | null |
2024-08-22 | Advancing Strategic Planning and Dynamic Control of Complex Projects | L. G. Teuber et.al. | 2408.12422 | null |
2024-08-22 | Multi-Source Knowledge-Based Hybrid Neural Framework for Time Series Representation Learning | Sagar Srinivas Sakhinana et.al. | 2408.12409 | null |
2024-08-22 | Enhancing Uncertainty Communication in Time Series Predictions: Insights and Recommendations | Apoorva Karagappa et.al. | 2408.12365 | null |
2024-08-22 | Graph Retrieval Augmented Trustworthiness Reasoning | Ying Zhu et.al. | 2408.12333 | link |
2024-08-22 | Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection | Tamás Matuszka et.al. | 2408.12322 | null |
2024-08-22 | A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model | Shuo Yang et.al. | 2408.12190 | null |
2024-08-22 | A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems | Shuo Yang et.al. | 2408.12187 | null |
2024-08-22 | DRExplainer: Quantifiable Interpretability in Drug Response Prediction with Directed Graph Convolutional Network | Haoyuan Shi et.al. | 2408.12139 | link |
2024-08-22 | Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Woo Kyung Kim et.al. | 2408.12110 | null |
2024-08-22 | Enhancing Sampling Protocol for Robust Point Cloud Classification | Chongshou Li et.al. | 2408.12062 | null |
2024-08-21 | Reasoning and Tools for Human-Level Forecasting | Elvis Hsieh et.al. | 2408.12036 | null |
2024-08-21 | Let Community Rules Be Reflected in Online Content Moderation | Wangjiaxuan Xin et.al. | 2408.12035 | null |
2024-08-21 | Sentiment and Emotion-aware Multi-criteria Fuzzy Group Decision Making System | Adilet Yerkin et.al. | 2408.11976 | null |
2024-08-21 | Valuing an Engagement Surface using a Large Scale Dynamic Causal Model | Abhimanyu Mukerji et.al. | 2408.11967 | null |
2024-08-21 | Decoding SEC Actions: Enforcement Trends through Analyzing Blockchain litigation using LLM-based Thematic Factor Mapping | Junliang Luo et.al. | 2408.11961 | null |
2024-08-21 | Decoding Pedestrian Stress on Urban Streets using Electrodermal Activity Monitoring in Virtual Immersive Reality | Mohsen Nazemi et.al. | 2408.11769 | null |
2024-08-21 | Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction | CJ Finnegan et.al. | 2408.11740 | null |
2024-08-21 | Explainable Deep Learning Framework for Human Activity Recognition | Yiran Huang et.al. | 2408.11552 | null |
2024-08-21 | MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering | Yonglin Tian et.al. | 2408.11464 | null |
2024-08-21 | Probabilistic Medical Predictions of Large Language Models | Bowen Gu et.al. | 2408.11316 | null |
2024-08-21 | Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models | Sepehr Kamahi et.al. | 2408.11252 | link |
2024-08-20 | Optimal Guarantees for Online Selection Over Time | Sebastian Perez-Salazar et.al. | 2408.11224 | null |
2024-08-20 | Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images | Josh Goldman et.al. | 2408.11160 | null |
2024-08-20 | Experimentation, deployment and monitoring Machine Learning models: Approaches for applying MLOps | Diego Nogare et.al. | 2408.11112 | null |
2024-08-20 | ISLES’24: Improving final infarct prediction in ischemic stroke using multimodal imaging and clinical data | Ezequiel de la Rosa et.al. | 2408.10966 | null |
2024-08-20 | Conformalized Interval Arithmetic with Symmetric Calibration | Rui Luo et.al. | 2408.10939 | link |
2024-08-20 | Enhancing End-to-End Autonomous Driving Systems Through Synchronized Human Behavior Data | Yiqun Duan et.al. | 2408.10908 | null |
2024-08-20 | Leveraging LLMs for the Quality Assurance of Software Requirements | Sebastian Lubos et.al. | 2408.10886 | null |
2024-08-20 | Open 3D World in Autonomous Driving | Xinlong Cheng et.al. | 2408.10880 | null |
2024-08-20 | Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network | Kristoffer Christensen et.al. | 2408.10783 | null |
2024-08-20 | Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model | Aliza Subedi et.al. | 2408.10733 | null |
2024-08-20 | Towards reliable real-time trajectory optimization | Fatemeh Rastgar et.al. | 2408.10731 | null |
2024-08-20 | On NVD Users’ Attitudes, Experiences, Hopes and Hurdles | Julia Wunder et.al. | 2408.10695 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series | Udo Schlegel et.al. | 2408.10628 | null |
2024-08-20 | Safety Metric Aware Trajectory Repairing for Automated Driving | Kailin Tong et.al. | 2408.10622 | null |
2024-08-20 | MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation | Jintao Cheng et.al. | 2408.10602 | link |
2024-08-20 | Constrained Behavior Cloning for Robotic Learning | Wensheng Liang et.al. | 2408.10568 | null |
2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | null |
2024-08-20 | Approximate Estimation of High-dimension Execution Skill for Dynamic Agents in Continuous Domains | Delma Nieves-Rivera et.al. | 2408.10512 | null |
2024-08-20 | An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Xinlang Yue et.al. | 2408.10479 | null |
2024-08-19 | System-Level Design Space Exploration for High-Level Synthesis under End-to-End Latency Constraints | Yuchao Liao et.al. | 2408.10431 | null |
2024-08-19 | Real-Time Digital Twin Platform: A Case Study on Core Network Selection in Aeronautical Ad-Hoc Networks | Lal Verda Cakir et.al. | 2408.10409 | null |
2024-08-19 | Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy | Jialin Dong et.al. | 2408.10391 | null |
2024-08-19 | FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Zhengchao Huang et.al. | 2408.10072 | link |
2024-08-19 | Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models | Jiao Chen et.al. | 2408.09972 | null |
2024-08-19 | Control by Adding Players to Change or Maintain the Shapley-Shubik or the Penrose-Banzhaf Power Index in Weighted Voting Games Is Complete for NP^PP | Joanna Kaczmarek et.al. | 2408.09953 | null |
2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | link |
2024-08-19 | Automated Vehicle Driver Monitoring Dataset from Real-World Scenarios | Mohamed Sabry et.al. | 2408.09833 | null |
2024-08-19 | GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making | Arsham Gholamzadeh Khoee et.al. | 2408.09785 | null |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-19 | Optimal Replenishment Strategy for Satellite Constellation with Dual Supply Modes | Taehyun Sung et.al. | 2408.09696 | null |
2024-08-19 | Continuous-Time Dynamic Decision Making with Costly Information | Christoph Knochenhauer et.al. | 2408.09693 | null |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-19 | BLADE: Benchmarking Language Model Agents for Data-Driven Science | Ken Gu et.al. | 2408.09667 | link |
2024-08-19 | Contextual Bandits for Unbounded Context Distributions | Puning Zhao et.al. | 2408.09655 | null |
2024-08-18 | Experimental Design For Causal Inference Through An Optimization Lens | Jinglong Zhao et.al. | 2408.09607 | null |
2024-08-18 | Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication | Tengyang Gong et.al. | 2408.09602 | null |
2024-08-18 | Sample-Optimal Large-Scale Optimal Subset Selection | Zaile Li et.al. | 2408.09537 | null |
2024-08-18 | Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework | Chengkai Xu et.al. | 2408.09468 | null |
2024-08-18 | In-Memory Learning Automata Architecture using Y-Flash Cell | Omar Ghazal et.al. | 2408.09456 | null |
2024-08-18 | Retina-inspired Object Motion Segmentation | Victoria Clerico et.al. | 2408.09454 | null |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-18 | Value-Enriched Population Synthesis: Integrating a Motivational Layer | Alba Aguilera et.al. | 2408.09407 | link |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862 | link |
2024-08-16 | HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis | Zhi-Bo Liu et.al. | 2408.08847 | link |
2024-08-16 | Shapley Marginal Surplus for Strong Models | Daniel de Marchi et.al. | 2408.08845 | null |
2024-08-16 | Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models | Lin Zhao et.al. | 2408.08813 | null |
2024-08-16 | Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors | Felipe A. Csaszar et.al. | 2408.08811 | null |
2024-08-16 | PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors | Rongxuan Wang et.al. | 2408.08802 | null |
2024-08-16 | A Transparency Paradox? Investigating the Impact of Explanation Specificity and Autonomous Vehicle Perceptual Inaccuracies on Passengers | Daniel Omeiza et.al. | 2408.08785 | null |
2024-08-16 | Multi-task Learning Approach for Intracranial Hemorrhage Prognosis | Miriam Cobo et.al. | 2408.08784 | link |
2024-08-16 | Beyond Proportional Individual Guarantees for Binary Perpetual Voting | Yotam Gafni et.al. | 2408.08767 | null |
2024-08-16 | SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning | Sascha Marton et.al. | 2408.08761 | link |
2024-08-16 | SE-SGformer: A Self-Explainable Signed Graph Transformer for Link Sign Prediction | Lu Li et.al. | 2408.08754 | link |
2024-08-16 | Quantifying the Effectiveness of Student Organization Activities using Natural Language Processing | Lyberius Ennio F. Taruc et.al. | 2408.08694 | null |
2024-08-16 | Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm | Hongcheng Liu et.al. | 2408.08693 | link |
2024-08-16 | A survey on secure decentralized optimization and learning | Changxin Liu et.al. | 2408.08628 | null |
2024-08-16 | RPLUW/M: Enabling RPL on the Internet of Underwater Things | Mohammadhossein Homaei et.al. | 2408.08607 | null |
2024-08-16 | S-RAF: A Simulation-Based Robustness Assessment Framework for Responsible Autonomous Driving | Daniel Omeiza et.al. | 2408.08584 | link |
2024-08-16 | AgentSimulator: An Agent-based Approach for Data-driven Business Process Simulation | Lukas Kirchdorfer et.al. | 2408.08571 | link |
2024-08-16 | Multilevel Graph Reinforcement Learning for Consistent Cognitive Decision-making in Heterogeneous Mixed Autonomy | Xin Gao et.al. | 2408.08516 | null |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-16 | The Limitations of Model Retraining in the Face of Performativity | Anmol Kabra et.al. | 2408.08499 | null |
2024-08-15 | Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model | Jin Wang et.al. | 2408.08282 | null |
2024-08-15 | A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Zhihao Lin et.al. | 2408.08242 | null |
2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | null |
2024-08-15 | Confidence-weighted integration of human and machine judgments for superior decision-making | Felipe Yáñez et.al. | 2408.08083 | link |
2024-08-15 | A Survey on Integrated Sensing, Communication, and Computation | Dingzhu Wen et.al. | 2408.08074 | null |
2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | link |
2024-08-15 | Capturing the Complexity of Human Strategic Decision-Making with Machine Learning | Jian-Qiao Zhu et.al. | 2408.07865 | null |
2024-08-14 | From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction | Sadra Zargarzadeh et.al. | 2408.07806 | null |
2024-08-14 | NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri Retrieval | Giuseppe De Gregorio et.al. | 2408.07785 | null |
2024-08-14 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Nimeesha Chan et.al. | 2408.07773 | link |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | A Nested Graph Reinforcement Learning-based Decision-making Strategy for Eco-platooning | Xin Gao et.al. | 2408.07578 | null |
2024-08-14 | Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments | Seungjun Han et.al. | 2408.07531 | null |
2024-08-14 | LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Fan Yang et.al. | 2408.07422 | null |
2024-08-14 | The Restaurant Meal Delivery Problem with Ghost Kitchens | Gal Neria et.al. | 2408.07417 | null |
2024-08-14 | Risk Occupancy: A New and Efficient Paradigm through Vehicle-Road-Cloud Collaboration | Jiaxing Chen et.al. | 2408.07367 | null |
2024-08-14 | Towards Few-shot Self-explaining Graph Neural Networks | Jingyu Peng et.al. | 2408.07340 | link |
2024-08-14 | Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs | Minxia Chen et.al. | 2408.07305 | null |
2024-08-14 | NL2OR: Solve Complex Operations Research Problems Using Natural Language Inputs | Junxuan Li et.al. | 2408.07272 | null |
2024-08-13 | Neural embedding of beliefs reveals the role of relative dissonance in human decision-making | Byunghwee Lee et.al. | 2408.07237 | link |
2024-08-13 | Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents | Pranav Putta et.al. | 2408.07199 | null |
2024-08-13 | Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision | Tsung-Shan Yang et.al. | 2408.07018 | null |
2024-08-14 | Automatic Feature Recognition and Dimensional Attributes Extraction From CAD Models for Hybrid Additive-Subtractive Manufacturing | Muhammad Tayyab Khan et.al. | 2408.06891 | null |
2024-08-13 | Geotree of Geodetector: An Anatomy of Knowledge Diffusion of a Novel Statistic | Yuting Liang et.al. | 2408.06839 | null |
2024-08-13 | FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving | Yutao Zhu et.al. | 2408.06832 | null |
2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | null |
2024-08-13 | Adaptive Data Quality Scoring Operations Framework using Drift-Aware Mechanism for Industrial Applications | Firas Bayram et.al. | 2408.06724 | null |
2024-08-13 | MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs’ Cooperative Decision-Making | Yicheng Guo et.al. | 2408.06656 | link |
2024-08-13 | Dynamic Pricing of Electric Vehicle Charging Station Alliances Under Information Asymmetry | Zeyu Liu et.al. | 2408.06645 | null |
2024-08-13 | A lightweight YOLOv5-FFM model for occlusion pedestrian detection | Xiangjie Luo et.al. | 2408.06633 | null |
2024-08-13 | IFShip: A Large Vision-Language Model for Interpretable Fine-grained Ship Classification via Domain Knowledge-Enhanced Instruction Tuning | Mingning Guo et.al. | 2408.06631 | link |
2024-08-14 | OpenEP: Open-Ended Future Event Prediction | Yong Guan et.al. | 2408.06578 | null |
2024-08-13 | Value of Information and Reward Specification in Active Inference and POMDPs | Ran Wei et.al. | 2408.06542 | null |
2024-08-12 | Hierarchical in-Context Reinforcement Learning with Hindsight Modular Reflections for Planning | Chuanneng Sun et.al. | 2408.06520 | null |
2024-08-12 | Decentralized Cooperation in Heterogeneous Multi-Agent Reinforcement Learning via Graph Neural Network-Based Intrinsic Motivation | Jahir Sadik Monon et.al. | 2408.06503 | link |
2024-08-12 | Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models | Yen-Che Hsiao et.al. | 2408.06458 | link |
2024-08-12 | Finding Patterns in Ambiguity: Interpretable Stress Testing in the Decision~Boundary | Inês Gomes et.al. | 2408.06302 | link |
2024-08-12 | A Digital Twin Framework Utilizing Machine Learning for Robust Predictive Maintenance: Enhancing Tire Health Monitoring | Vispi Karkaria et.al. | 2408.06220 | null |
2024-08-12 | IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI | Yash Rampuria et.al. | 2408.06113 | null |
2024-08-12 | Building Decision Making Models Through Language Model Regime | Yu Zhang et.al. | 2408.06087 | null |
2024-08-12 | Sequential sampling without comparison to boundary through model-free reinforcement learning | Jamal Esmaily et.al. | 2408.06080 | null |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in Games | Chiu-Chou Lin et.al. | 2408.06051 | link |
2024-08-12 | Exploring and Learning Structure: Active Inference Approach in Navigational Agents | Daria de Tinguy et.al. | 2408.05982 | null |
2024-08-12 | Match Point AI: A Novel AI Framework for Evaluating Data-Driven Tennis Strategies | Carlo Nübel et.al. | 2408.05960 | link |
2024-08-12 | Statistically Optimal Uncertainty Quantification for Expensive Black-Box Models | Shengyi He et.al. | 2408.05887 | null |
2024-08-12 | Multi-Agent Deep Reinforcement Learning Framework for Wireless MAC Protocol Design and Optimization | Navid Keshtiarast et.al. | 2408.05884 | null |
2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
2024-08-11 | Root Cause Attribution of Delivery Risks via Causal Discovery with Reinforcement Learning | Shi Bo et.al. | 2408.05860 | null |
2024-08-11 | Egocentric Vision Language Planning | Zhirui Fang et.al. | 2408.05802 | null |
2024-08-11 | Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots | Victor Augusto Kich et.al. | 2408.05744 | link |
2024-08-11 | ICSFuzz: Collision Detector Bug Discovery in Autonomous Driving Simulators | Weiwei Fu et.al. | 2408.05694 | null |
2024-08-10 | Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation | Hanqiu Chen et.al. | 2408.05617 | link |
2024-08-10 | Meta Clustering of Neural Bandits | Yikun Ban et.al. | 2408.05586 | null |
2024-08-10 | What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon | Utkarsh Tiwari et.al. | 2408.05562 | link |
2024-08-10 | S-SIRUS: an explainability algorithm for spatial regression Random Forest | Luca Patelli et.al. | 2408.05537 | link |
2024-08-09 | Modeling Transit in a Fully Integrated Agent-Based Framework: Methodology and Large-Scale Application | Omer Verbas et.al. | 2408.05176 | null |
2024-08-09 | Cautious Calibration in Binary Classification | Mari-Liis Allikivi et.al. | 2408.05120 | link |
2024-08-09 | Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection | Xincheng Pang et.al. | 2408.05107 | null |
2024-08-09 | Evaluating Layout Dimensionalities in PC+VR Asymmetric Collaborative Decision Making | Daniel Enriquez et.al. | 2408.05105 | null |
2024-08-09 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | link |
2024-08-09 | Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery | Long Bai et.al. | 2408.04958 | link |
2024-08-12 | Unleashing Artificial Cognition: Integrating Multiple AI Systems | Muntasir Adnan et.al. | 2408.04910 | link |
2024-08-09 | CTE-MLO: Continuous-time and Efficient Multi-LiDAR Odometry with Localizability-aware Point Cloud Sampling | Hongming Shen et.al. | 2408.04901 | link |
2024-08-09 | VLM-MPC: Vision Language Foundation Model (VLM)-Guided Model Predictive Controller (MPC) for Autonomous Driving | Keke Long et.al. | 2408.04821 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing – A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Eliminating Backdoors in Neural Code Models via Trigger Inversion | Weisong Sun et.al. | 2408.04683 | null |
2024-08-08 | Field Testing and Detection of Camera Interference for Autonomous Driving | Ki Beom Park et.al. | 2408.04524 | null |
2024-08-08 | Model-Based Transfer Learning for Contextual Reinforcement Learning | Jung-Hoon Cho et.al. | 2408.04498 | link |
2024-08-08 | Multi-Objective LQR with Linear Scalarization | Ali Jadbabaie et.al. | 2408.04488 | null |
2024-08-09 | Achieving Robust Data-driven Contextual Decision Making in a Data Augmentation Way | Zhaoen Li et.al. | 2408.04469 | null |
2024-08-08 | Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic | Yuting Wang et.al. | 2408.04447 | null |
2024-08-08 | Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Simon Dima et.al. | 2408.04385 | null |
2024-08-08 | Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform | Daniel Vargas et.al. | 2408.04195 | null |
2024-08-08 | The Data Addition Dilemma | Judy Hanwen Shen et.al. | 2408.04154 | link |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives | Aida Afshar et.al. | 2408.04046 | link |
2024-08-07 | How Well Can Vision Language Models See Image Details? | Chenhui Gou et.al. | 2408.03940 | null |
2024-08-07 | MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems | Renzhi Wang et.al. | 2408.03892 | null |
2024-08-07 | GAIA – A Large Language Model for Advanced Power Dispatch | Yuheng Cheng et.al. | 2408.03847 | null |
2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | link |
2024-08-07 | Methodological Explainability Evaluation of an Interpretable Deep Learning Model for Post-Hepatectomy Liver Failure Prediction Incorporating Counterfactual Explanations and Layerwise Relevance Propagation: A Prospective In Silico Trial | Xian Zhong et.al. | 2408.03771 | null |
2024-08-07 | Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification | Georgia Sovatzidi et.al. | 2408.03745 | null |
2024-08-07 | MS-Mapping: An Uncertainty-Aware Large-Scale Multi-Session LiDAR Mapping System | Xiangcheng Hu et.al. | 2408.03723 | link |
2024-08-07 | Asynchronous Credit Assignment Framework for Multi-Agent Reinforcement Learning | Yongheng Liang et.al. | 2408.03692 | null |
2024-08-07 | AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging | Senkang Hu et.al. | 2408.03624 | null |
2024-08-07 | DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba | Chengran Yuan et.al. | 2408.03601 | null |
2024-08-07 | Clinical Challenges and AI Opportunities in Decision-Making for Cancer Treatment-Induced Cardiotoxicity | Siyi Wu et.al. | 2408.03586 | null |
2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | link |
2024-08-06 | Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles | Nazish Tahir et.al. | 2408.03435 | null |
2024-08-06 | Probabilistic Scores of Classifiers, Calibration is not Enough | Agathe Fernandes Machado et.al. | 2408.03421 | link |
2024-08-07 | Adversarial Safety-Critical Scenario Generation using Naturalistic Human Driving Priors | Kunkun Hao et.al. | 2408.03200 | null |
2024-08-06 | RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning | Jiapeng Zhu et.al. | 2408.03195 | link |
2024-08-06 | Integrated Intention Prediction and Decision-Making with Spectrum Attention Net and Proximal Policy Optimization | Xiao Zhou et.al. | 2408.03191 | null |
2024-08-06 | QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction | Siddhant Dutta et.al. | 2408.03088 | null |
2024-08-06 | Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning | Zixiang Wang et.al. | 2408.03084 | null |
2024-08-06 | Considerations on free-surface detachment and bed entrainment of fluvial plastics | Matthias Kramer et.al. | 2408.03081 | null |
2024-08-06 | SCOPE: A Synthetic Multi-Modal Dataset for Collective Perception Including Physical-Correct Weather Conditions | Jörg Gamerdinger et.al. | 2408.03065 | null |
2024-08-06 | Social Behavior as a Key to Learning-based Multi-Agent Pathfinding Dilemmas | Chengyang He et.al. | 2408.03063 | null |
2024-08-06 | Uniqueness Analysis of Controllability Scores and Their Application to Brain Networks | Kazuhiro Sato et.al. | 2408.03023 | null |
2024-08-06 | Cross-cultural analysis of pedestrian group behaviour influence on crossing decisions in interactions with autonomous vehicles | Sergio Martín Serrano et.al. | 2408.03003 | null |
2024-08-06 | Accuracy and Consistency of LLMs in the Registered Dietitian Exam: The Impact of Prompt Engineering and Knowledge Retrieval | Iman Azimi et.al. | 2408.02964 | link |
2024-08-06 | Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Yifan Zhu et.al. | 2408.02949 | null |
2024-08-06 | Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti et.al. | 2408.02938 | null |
2024-08-06 | Compromising Embodied Agents with Contextual Backdoor Attacks | Aishan Liu et.al. | 2408.02882 | null |
2024-08-05 | On The Stability of Moral Preferences: A Problem with Computational Elicitation Methods | Kyle Boerstler et.al. | 2408.02862 | null |
2024-08-05 | Nash Equilibrium in Games on Graphs with Incomplete Preferences | Abhishek N. Kulkarni et.al. | 2408.02860 | null |
2024-08-05 | SiCo: A Size-Controllable Virtual Try-On Approach for Informed Decision-Making | Sherry X. Chen et.al. | 2408.02803 | link |
2024-08-05 | LLM economicus? Mapping the Behavioral Biases of LLMs via Utility Theory | Jillian Ross et.al. | 2408.02784 | null |
2024-08-05 | Enhancing Medical Learning and Reasoning Systems: A Boxology-Based Comparative Analysis of Design Patterns | Chi Him Ng et.al. | 2408.02709 | null |
2024-08-05 | From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Haolin Jin et.al. | 2408.02479 | null |
2024-08-05 | An Integrated Approach to Importance Sampling and Machine Learning for Efficient Monte Carlo Estimation of Distortion Risk Measures in Black Box Models | Sören Bettels et.al. | 2408.02401 | null |
2024-08-05 | Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts | Andong Tan et.al. | 2408.02265 | null |
2024-08-05 | Is Large Language Model Good at Database Knob Tuning? A Comprehensive Experimental Evaluation | Yiyan Li et.al. | 2408.02213 | null |
2024-08-04 | SPINEX-TimeSeries: Similarity-based Predictions with Explainable Neighbors Exploration for Time Series and Forecasting Problems | Ahmed Z Naser et.al. | 2408.02159 | null |
2024-08-04 | Model Hijacking Attack in Federated Learning | Zheng Li et.al. | 2408.02131 | null |
2024-08-04 | Value-Based Rationales Improve Social Experience: A Multiagent Simulation Study | Sz-Ting Tzeng et.al. | 2408.02117 | null |
2024-08-04 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | null |
2024-08-04 | Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response | Dipo Dunsin et.al. | 2408.01999 | null |
2024-08-04 | Optimal and efficient text counterfactuals using Graph Neural Networks | Dimitris Lymperopoulos et.al. | 2408.01969 | link |
2024-08-04 | Bilateral Trade Flow Prediction by Gravity-informed Graph Auto-encoder | Naoto Minakawa et.al. | 2408.01938 | null |
2024-08-03 | Impact of Major Health Events on Pharmaceutical Stocks: A Comprehensive Analysis Using Macroeconomic and Market Indicators | Morteza Maleki et.al. | 2408.01883 | null |
2024-08-03 | ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification | Mridula Vijendran et.al. | 2408.01827 | link |
2024-08-03 | STDA: Spatio-Temporal Dual-Encoder Network Incorporating Driver Attention to Predict Driver Behaviors Under Safety-Critical Scenarios | Dongyang Xu et.al. | 2408.01774 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-03 | Self-Emotion Blended Dialogue Generation in Social Simulation Agents | Qiang Zhang et.al. | 2408.01633 | null |
2024-08-03 | A Comparative Analysis of Wealth Index Predictions in Africa between three Multi-Source Inference Models | Márton Karsai et.al. | 2408.01631 | link |
2024-08-03 | Weighted Brier Score – an Overall Summary Measure for Risk Prediction Models with Clinical Utility Consideration | Kehao Zhu et.al. | 2408.01626 | null |
2024-08-03 | Data-Driven Machine Learning Approaches for Predicting In-Hospital Sepsis Mortality | Arseniy Shumilov et.al. | 2408.01612 | null |
2024-08-02 | Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder | Matan Atad et.al. | 2408.01571 | link |
2024-08-02 | NeuralBeta: Estimating Beta Using Deep Learning | Yuxin Liu et.al. | 2408.01387 | null |
2024-08-02 | A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes | Vito Mengers et.al. | 2408.01322 | link |
2024-08-02 | PsybORG+: Cognitive Modeling for Triggering and Detection of Cognitive Biases of Advanced Persistent Threats | Shuo Huang et.al. | 2408.01310 | null |
2024-08-02 | A Decision-driven Methodology for Designing Uncertainty-aware AI Self-Assessment | Gregory Canal et.al. | 2408.01301 | null |
2024-08-02 | Assessing Robustness of Machine Learning Models using Covariate Perturbations | Arun Prakash R et.al. | 2408.01300 | null |
2024-08-02 | The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | Hannah Chen et.al. | 2408.01285 | null |
2024-08-02 | Metareasoning in uncertain environments: a meta-BAMDP framework | Prakhar Godara et.al. | 2408.01253 | null |
2024-08-02 | Game Theory Based Community-Aware Opinion Dynamics | Shanfan Zhang et.al. | 2408.01196 | link |
2024-08-02 | A Short-Term Planning Framework for the Operation of Tanker-Based Water Distribution Systems in Urban Areas | Abhilasha Maheshwari et.al. | 2408.01184 | null |
2024-08-02 | CommonUppRoad: A Framework of Formal Modelling, Verifying, Learning, and Visualisation of Autonomous Vehicles | Rong Gu et.al. | 2408.01093 | null |
2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | null |
2024-08-02 | MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Xiangbo Gao et.al. | 2408.01037 | link |
2024-08-02 | Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Yang Luo et.al. | 2408.01000 | link |
2024-08-02 | A Quantal Response Analysis of Defender-Attacker Sequential Security Games | Md Reya Shad Azim et.al. | 2408.00964 | null |
2024-08-01 | Generalisation of Total Uncertainty in AI: A Theoretical Study | Keivan Shariatmadar et.al. | 2408.00946 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-02 | Reinforcement Learning applied to Insurance Portfolio Pursuit | Edward James Young et.al. | 2408.00713 | link |
2024-08-01 | Future of Artificial Intelligence in Agile Software Development | Mariyam Mahboob et.al. | 2408.00703 | null |
2024-08-01 | Learning in Multi-Objective Public Goods Games with Non-Linear Utilities | Nicole Orzan et.al. | 2408.00682 | null |
2024-08-01 | Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | Xiaoyi Liu et.al. | 2408.00636 | null |
2024-08-01 | MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Xiangyuan Peng et.al. | 2408.00565 | null |
2024-08-01 | Spatial Weather, Socio-Economic and Political Risks in Probabilistic Load Forecasting | Monika Zimmermann et.al. | 2408.00507 | null |
2024-08-01 | Explainable Emotion Decoding for Human and Computer Vision | Alessio Borriero et.al. | 2408.00493 | null |
2024-08-01 | An Operational Scheduling Framework for Tanker-based Water Distribution System under Uncertainty | Abhilasha Maheshwari et.al. | 2408.00431 | null |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-01 | Enabling Next-Generation V2X Perception: Wireless Rigid Body Localization and Tracking | Niclas Führling et.al. | 2408.00349 | null |
2024-08-01 | RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Zhe Huang et.al. | 2408.00257 | link |
2024-08-01 | Joint Vehicle Connection and Beamforming Optimization in Digital Twin Assisted Integrated Sensing and Communication Vehicular Networks | Weihang Ding et.al. | 2408.00248 | null |
2024-08-02 | Bringing Data into the Conversation: Adapting Content from Business Intelligence Dashboards for Threaded Collaboration Platforms | Hyeok Kim et.al. | 2408.00242 | null |
2024-08-01 | Invariant Discovery of Features Across Multiple Length Scales: Applications in Microscopy and Autonomous Materials Characterization | Aditya Raghavan et.al. | 2408.00229 | null |
2024-08-01 | Load Balancing in Federated Learning | Alireza Javani et.al. | 2408.00217 | null |
2024-07-31 | Areas of Improvement for Autonomous Vehicles: A Machine Learning Analysis of Disengagement Reports | Tyler Ward et.al. | 2408.00051 | null |
2024-07-31 | Algorithms for Collaborative Machine Learning under Statistical Heterogeneity | Seok-Ju Hahn et.al. | 2408.00050 | null |
2024-07-31 | Coordinating Decisions via Quantum Telepathy | Dawei Ding et.al. | 2407.21723 | null |
2024-07-31 | An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification | Aswini Kumar Patra et.al. | 2407.21666 | null |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-07-31 | Voxel Scene Graph for Intracranial Hemorrhage | Antoine P. Sanner et.al. | 2407.21580 | link |
2024-08-01 | Analysis of Functional Insufficiencies and Triggering Conditions to Improve the SOTIF of an MPC-based Trajectory Planner | Mirko Conrad et.al. | 2407.21569 | null |
2024-07-31 | Interpreting and learning voice commands with a Large Language Model for a robot system | Stanislau Stankevich et.al. | 2407.21512 | null |
2024-07-31 | Mitral Regurgitation Recogniton based on Unsupervised Out-of-Distribution Detection with Residual Diffusion Amplification | Zhe Liu et.al. | 2407.21497 | null |
2024-07-31 | KemenkeuGPT: Leveraging a Large Language Model on Indonesia’s Government Financial Data and Regulations to Enhance Decision Making | Gilang Fajar Febrian et.al. | 2407.21459 | null |
2024-07-31 | Cost-Effective Hallucination Detection for LLMs | Simon Valentin et.al. | 2407.21424 | null |
2024-07-31 | Pathology Foundation Models | Mieko Ochi et.al. | 2407.21317 | null |
2024-07-31 | Who should I trust? A Visual Analytics Approach for Comparing Net Load Forecasting Models | Kaustav Bhattacharjee et.al. | 2407.21299 | null |
2024-07-31 | SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Peiru Zheng et.al. | 2407.21293 | null |
2024-07-30 | Towards an Integrated Performance Framework for Fire Science and Management Workflows | H. Ahmed et.al. | 2407.21231 | null |
2024-07-30 | Algorithm-Assisted Decision Making and Racial Disparities in Housing: A Study of the Allegheny Housing Assessment Tool | Lingwei Cheng et.al. | 2407.21209 | null |
2024-07-30 | Deduction Game Framework and Information Set Entropy Search | Fandi Meng et.al. | 2407.21178 | null |
2024-07-30 | Extending choice assessments to choice functions: An algorithm for computing the natural extension | Arne Decadt et.al. | 2407.21164 | null |
2024-07-30 | Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving | Bernard Lange et.al. | 2407.21126 | null |
2024-07-30 | Zero Shot Health Trajectory Prediction Using Transformer | Pawel Renc et.al. | 2407.21124 | link |
2024-07-30 | Integrating Agent-Based and Compartmental Models for Infectious Disease Modeling: A Novel Hybrid Approach | Inan Bostanci et.al. | 2407.20993 | null |
2024-07-30 | From Feature Importance to Natural Language Explanations Using LLMs with RAG | Sule Tekkesinoglu et.al. | 2407.20990 | link |
2024-07-30 | Learning Ordinality in Semantic Segmentation | Rafael Cristino et.al. | 2407.20959 | null |
2024-07-30 | Non-linear inhibitory responses enhance performance in collective decision-making | David March-Pons et.al. | 2407.20927 | null |
2024-07-30 | How to Choose a Reinforcement-Learning Algorithm | Fabian Bongratz et.al. | 2407.20917 | null |
2024-07-30 | Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S | Guangjin Pan et.al. | 2407.20852 | null |
2024-07-30 | Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception | Jiawei Shao et.al. | 2407.20748 | null |
2024-07-30 | Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization | Michael Kölle et.al. | 2407.20739 | null |
2024-07-30 | Practices and Strategies in Responsive Thematic Map Design: A Report from Design Workshops with Experts | Sarah Schöttler et.al. | 2407.20735 | null |
2024-07-30 | Scene-Specific Trajectory Sets: Maximizing Representation in Motion Forecasting | Abhishek Vivekanandan et.al. | 2407.20732 | null |
2024-07-30 | Exploring Loss Landscapes through the Lens of Spin Glass Theory | Hao Liao et.al. | 2407.20724 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-30 | Powerful A/B-Testing Metrics and Where to Find Them | Olivier Jeunen et.al. | 2407.20665 | null |
2024-07-30 | Enhancing Agricultural Machinery Management through Advanced LLM Integration | Emily Johnson et.al. | 2407.20588 | null |
2024-07-30 | Laplace approximation for Bayesian variable selection via Le Cam’s one-step procedure | Tianrui Hou et.al. | 2407.20580 | null |
2024-07-30 | DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations | Jiageng Zhu et.al. | 2407.20553 | null |
2024-07-30 | Evaluating Fairness in Black-box Algorithmic Markets: A Case Study of Ride Sharing in Chicago | Yuhan Liu et.al. | 2407.20522 | null |
2024-07-29 | Domain Adaptable Prescriptive AI Agent for Enterprise | Piero Orderique et.al. | 2407.20447 | null |
2024-07-29 | Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World | Hari Prasad et.al. | 2407.20383 | null |
2024-07-29 | SAPG: Split and Aggregate Policy Gradients | Jayesh Singla et.al. | 2407.20230 | null |
2024-07-29 | Time series forecasting with high stakes: A field study of the air cargo industry | Abhinav Garg et.al. | 2407.20192 | null |
2024-07-29 | An Interpretable Rule Creation Method for Black-Box Models based on Surrogate Trees – SRules | Mario Parrón Verdasco et.al. | 2407.20070 | null |
2024-07-29 | Collision Probability Distribution Estimation via Temporal Difference Learning | Thomas Steinecker et.al. | 2407.20000 | link |
2024-07-29 | Private and Secure Fuzzy Name Matching | Harsh Kasyap et.al. | 2407.19979 | null |
2024-07-29 | Hydrodynamics of pulsating active liquids | Tirthankar Banerjee et.al. | 2407.19955 | null |
2024-07-29 | AOTree: Aspect Order Tree-based Model for Explainable Recommendation | Wenxin Zhao et.al. | 2407.19937 | null |
2024-07-29 | Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning | Leen Kweider et.al. | 2407.19860 | null |
2024-07-29 | Evolution of cooperation in the public goods game with Q-learning | Guozhong Zheng et.al. | 2407.19851 | null |
2024-07-29 | Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios | Camilla Bignotti et.al. | 2407.19760 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | Towards Detecting IoT Event Spoofing Attacks Using Time-Series Classification | Uzma Maroof et.al. | 2407.19662 | null |
2024-07-29 | AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias | Sribala Vidyadhari Chinta et.al. | 2407.19655 | null |
2024-07-29 | “A Good Bot Always Knows Its Limitations”: Assessing Autonomous System Decision-making Competencies through Factorized Machine Self-confidence | Brett Israelsen et.al. | 2407.19631 | link |
2024-07-28 | Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload | Limin Ma et.al. | 2407.19517 | null |
2024-07-28 | EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024 | Letian Shi et.al. | 2407.19510 | link |
2024-07-28 | HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments | Waqas Ali et.al. | 2407.19463 | null |
2024-07-28 | Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Weiliang Chen et.al. | 2407.19428 | null |
2024-07-28 | The influence of Automated Decision-Making systems in the context of street-level bureaucrats’ practices | Manuel Portela et.al. | 2407.19427 | null |
2024-07-28 | Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Dong Chen et.al. | 2407.19405 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces | Seunghyeop Nam et.al. | 2407.18892 | null |
2024-07-26 | Agent-Based Insight into Eco-Choices: Simulating the Fast Fashion Shift | Daria Soboleva et.al. | 2407.18814 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-07-26 | Set risk measures | Marcelo Righi et.al. | 2407.18687 | null |
2024-07-26 | Reinforcement Learning for Sustainable Energy: A Survey | Koen Ponse et.al. | 2407.18597 | null |
2024-07-26 | PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning | Fangze Lin et.al. | 2407.18569 | link |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-26 | Socially efficient mechanism on the minimum budget | Hirota Kinoshita et.al. | 2407.18515 | null |
2024-07-26 | Design Spaces and How Software Designers Use Them: a sampler | Mary Shaw et.al. | 2407.18502 | null |
2024-07-26 | Gaussian Lane Keeping: A Robust Prediction Baseline | David Isele et.al. | 2407.18451 | null |
2024-07-26 | Impact of Recurrent Neural Networks and Deep Learning Frameworks on Real-time Lightweight Time Series Anomaly Detection | Ming-Chang Lee et.al. | 2407.18439 | null |
2024-07-25 | Adversarial Robust Decision Transformer: Enhancing Robustness of RvS via Minimax Returns-to-go | Xiaohang Tang et.al. | 2407.18414 | link |
2024-07-25 | Large Language Model Integrated Healthcare Cyber-Physical Systems Architecture | Malithi Wanniarachchi Kankanamge et.al. | 2407.18407 | null |
2024-07-25 | Phase transition in a kinetic mean-field game model of inertial self-propelled agents | Piyush Grover et.al. | 2407.18400 | null |
2024-07-25 | Galaxy Mergers in UNIONS – I: A Simulation-driven Hybrid Deep Learning Ensemble for Pure Galaxy Merger Classification | Leonardo Ferreira et.al. | 2407.18396 | null |
2024-07-25 | Automated Ensemble Multimodal Machine Learning for Healthcare | Fergus Imrie et.al. | 2407.18227 | null |
2024-07-25 | Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Samuel Yen-Chi Chen et.al. | 2407.18202 | null |
2024-07-25 | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Julia Hindel et.al. | 2407.18145 | null |
2024-07-25 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
2024-07-25 | ECG Arrhythmia Detection Using Disease-specific Attention-based Deep Learning Model | Linpeng Jin et.al. | 2407.18033 | null |
2024-07-25 | Network Inversion of Convolutional Neural Nets | Pirzada Suhail et.al. | 2407.18002 | null |
2024-07-25 | StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory | Zhiheng Li et.al. | 2407.17905 | link |
2024-07-25 | Financial Statement Analysis with Large Language Models | Alex Kim et.al. | 2407.17866 | null |
2024-07-26 | MDS-ED: Multimodal Decision Support in the Emergency Department – a Benchmark Dataset for Diagnoses and Deterioration Prediction in Emergency Medicine | Juan Miguel Lopez Alcaraz et.al. | 2407.17856 | link |
2024-07-25 | Long-term Fairness in Ride-Hailing Platform | Yufan Kang et.al. | 2407.17839 | null |
2024-07-25 | Image Segmentation via Divisive Normalization: dealing with environmental diversity | Pablo Hernández-Cámara et.al. | 2407.17829 | null |
2024-07-25 | CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions | Haicheng Liao et.al. | 2407.17757 | null |
2024-07-25 | Control Informed Design of the IAC Autonomous Racecar for Operation at the Dynamic Envelope | Qilun Zhu et.al. | 2407.17737 | null |
2024-07-25 | Enhancing Agent Learning through World Dynamics Modeling | Zhiyuan Sun et.al. | 2407.17695 | link |
2024-07-24 | Towards Neural Network based Cognitive Models of Dynamic Decision-Making by Humans | Changyu Chen et.al. | 2407.17622 | link |
2024-07-24 | Toward human-centered shared autonomy AI paradigms for human-robot teaming in healthcare | Reza Abiri et.al. | 2407.17464 | null |
2024-07-24 | Hidden or Inferred: Fair Learning-To-Rank with Unknown Demographics | Oluseun Olulana et.al. | 2407.17459 | link |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | Five reasons against assuming a data-generating distribution in Machine Learning | Benedikt Höltgen et.al. | 2407.17395 | null |
2024-07-24 | Causal modelling without counterfactuals and individualised effects | Benedikt Höltgen et.al. | 2407.17385 | null |
2024-07-24 | Gradient-based inference of abstract task representations for generalization in neural networks | Ali Hummos et.al. | 2407.17356 | null |
2024-07-25 | Enhanced Deep Learning Methodologies and MRI Selection Techniques for Dementia Diagnosis in the Elderly Population | Nikolaos Ntampakis et.al. | 2407.17324 | null |
2024-07-24 | Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches | Chenxing Zhao et.al. | 2407.17312 | null |
2024-07-24 | An MDP-Based Approach for Distribution System Control with PV Generation and Battery Storage | Robert Sosnowski et.al. | 2407.17257 | null |
2024-07-24 | Testing Large Language Models on Driving Theory Knowledge and Skills for Connected Autonomous Vehicles | Zuoyin Tang et.al. | 2407.17211 | null |
2024-07-24 | Semantic Vehicle-to-Everything (V2X) Communications Towards 6G | Tengfei Lyu et.al. | 2407.17186 | null |
2024-07-24 | Generalized Ordinal Priority Approach for Multi-Attribute Decision-Making under Incomplete Preference Information | Renlong Wang et.al. | 2407.17099 | null |
2024-07-24 | NewsUnfold: Creating a News-Reading Application That Indicates Linguistic Media Bias and Collects Feedback | Smi Hinterreiter et.al. | 2407.17045 | null |
2024-07-24 | Applications of Multi-Agent Deep Reinforcement Learning Communication in Network Management: A Survey | Yue Pi et.al. | 2407.17030 | null |
2024-07-25 | Simulation in discrete choice models evaluation: SDCM, a simulation tool for performance evaluation of DCMs | Amirreza Talebi et.al. | 2407.17014 | null |
2024-07-24 | Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images | Dooseop Choi et.al. | 2407.17003 | link |
2024-07-24 | Toward an Integrated Decision Making Framework for Optimized Stroke Diagnosis with DSA and Treatment under Uncertainty | Nur Ahmad Khatim et.al. | 2407.16962 | link |
2024-07-23 | On the Separability of Vector-Valued Risk Measures | Çağın Ararat et.al. | 2407.16878 | null |
2024-07-23 | Trust Your Gut: Comparing Human and Machine Inference from Noisy Visualizations | Ratanond Koonchanok et.al. | 2407.16871 | null |
2024-07-23 | SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees | Tianyu Shi et.al. | 2407.16857 | null |
2024-07-24 | A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Adrian Remonda et.al. | 2407.16680 | link |
2024-07-23 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving | Anam Manzoor et.al. | 2407.16647 | null |
2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses | Haojun Yu et.al. | 2407.16634 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | Can time series forecasting be automated? A benchmark and analysis | Anvitha Thirthapura Sreedhara et.al. | 2407.16445 | null |
2024-07-23 | Evaluating Uncertainties in Electricity Markets via Machine Learning and Quantum Computing | Shuyang Zhu et.al. | 2407.16404 | null |
2024-07-23 | Cleaning Robots in Public Spaces: A Survey and Proposal for Benchmarking Based on Stakeholders Interviews | Raphael Memmesheimer et.al. | 2407.16393 | null |
2024-07-23 | PhenoFlow: A Human-LLM Driven Visual Analytics System for Exploring Large and Complex Stroke Datasets | Jaeyoung Kim et.al. | 2407.16329 | null |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Youqian Zhang et.al. | 2407.16327 | null |
2024-07-23 | MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Florian Felten et.al. | 2407.16312 | link |
2024-07-23 | Optimizing Robotic Manipulation with Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning | Yujian Dong et.al. | 2407.16306 | link |
2024-07-23 | On the Use of Immersive Digital Technologies for Designing and Operating UAVs | Yousef Emami et.al. | 2407.16288 | null |
2024-07-23 | When, Where, and What? An Novel Benchmark for Accident Anticipation and Localization with Large Language Models | Haicheng Liao et.al. | 2407.16277 | null |
2024-07-23 | Identifiable latent bandits: Combining observational data and exploration for personalized healthcare | Ahmet Zahid Balcıoğlu et.al. | 2407.16239 | null |
2024-07-23 | Strategy and Skill Learning for Physics-based Table Tennis Animation | Jiashun Wang et.al. | 2407.16210 | null |
2024-07-23 | LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera | Yukai Ma et.al. | 2407.16197 | null |
2024-07-23 | Advanced AI Framework for Enhanced Detection and Assessment of Abdominal Trauma: Integrating 3D Segmentation with 2D CNN and RNN Models | Liheng Jiang et.al. | 2407.16165 | null |
2024-07-23 | Diffusion Models as Optimizers for Efficient Planning in Offline RL | Renming Huang et.al. | 2407.16142 | link |
2024-07-22 | MILAN: Milli-Annotations for Lidar Semantic Segmentation | Nermin Samet et.al. | 2407.15797 | null |
2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | null |
2024-07-22 | Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels | Zhuorui Ye et.al. | 2407.15786 | null |
2024-07-22 | CrashEventLLM: Predicting System Crashes with Large Language Models | Priyanka Mudgal et.al. | 2407.15716 | null |
2024-07-22 | Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps | Rabbia Asghar et.al. | 2407.15675 | null |
2024-07-22 | DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving | Jiahang Tu et.al. | 2407.15661 | link |
2024-07-22 | Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN | Norman Becker et.al. | 2407.15656 | null |
2024-07-22 | Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models | Joy He-Yueya et.al. | 2407.15645 | link |
2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
2024-07-22 | Towards a Universal Evaluation Model for Careful and Competent Autonomous Driving | Kethan Reddy et.al. | 2407.15596 | null |
2024-07-22 | Empowering Agile-Based Generative Software Development through Human-AI Teamwork | Sai Zhang et.al. | 2407.15568 | link |
2024-07-22 | Interpretable Concept-Based Memory Reasoning | David Debot et.al. | 2407.15527 | link |
2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null |
2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | link |
2024-07-21 | Explaining Decisions of Agents in Mixed-Motive Games | Maayan Orner et.al. | 2407.15255 | null |
2024-07-21 | Decoding Multilingual Moral Preferences: Unveiling LLM’s Biases Through the Moral Machine Experiment | Karina Vida et.al. | 2407.15184 | link |
2024-07-20 | Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Dylan J. Foster et.al. | 2407.15007 | null |
2024-07-20 | A Measure for Level of Autonomy Based on Observable System Behavior | Jason M. Pittman et.al. | 2407.14975 | null |
2024-07-20 | (Non-)Commutative Aggregation | Yuzhao Yang et.al. | 2407.14959 | null |
2024-07-20 | CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation | Chen Wei et.al. | 2407.14949 | link |
2024-07-19 | Quantifying the value of positive transfer: An experimental case study | Aidan J. Hughes et.al. | 2407.14342 | null |
2024-07-19 | Complementary Learning for Real-World Model Failure Detection | Daniel Bogdoll et.al. | 2407.14306 | link |
2024-07-19 | Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning | Nihal Acharya Adde et.al. | 2407.14262 | null |
2024-07-19 | KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang et.al. | 2407.14239 | null |
2024-07-19 | Domain Adaptation for Industrial Time-series Forecasting via Counterfactual Inference | Chao Min et.al. | 2407.14214 | null |
2024-07-19 | Achieving Well-Informed Decision-Making in Drug Discovery: A Comprehensive Calibration Study using Neural Network-Based Structure-Activity Models | Hannah Rosa Friesacher et.al. | 2407.14185 | link |
2024-07-19 | Integrated Push-and-Pull Update Model for Goal-Oriented Effective Communication | Pouya Agheli et.al. | 2407.14092 | null |
2024-07-19 | Data Guards: Challenges and Solutions for Fostering Trust in Data | Nicole Sultanum et.al. | 2407.14042 | null |
2024-07-19 | Causal Inference with Complex Treatments: A Survey | Yingrong Wang et.al. | 2407.14022 | link |
2024-07-19 | A trustworthy blockchain-based energy trading scheme for V2G operations in distributed power grids via integrated scheduling and trading framework | Yunwang Chen et.al. | 2407.13988 | null |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-07-18 | Unmasking Social Bots: How Confident Are We? | James Giroux et.al. | 2407.13929 | link |
2024-07-18 | PRAGyan – Connecting the Dots in Tweets | Rahul Ravi et.al. | 2407.13909 | null |
2024-07-18 | A review of handcrafted and deep radiomics in neurological diseases: transitioning from oncology to clinical neuroimaging | Elizaveta Lavrova et.al. | 2407.13813 | null |
2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | null |
2024-07-18 | Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management | Yoontae Hwang et.al. | 2407.13751 | null |
2024-07-18 | Managing Risk using Rolling Forecasts in Energy-Limited and Stochastic Energy Systems | Thomas Mortimer et.al. | 2407.13626 | null |
2024-07-18 | The Storage Location Assignment and Picker Routing Problem: A Generic Branch-Cut-and-Price Algorithm | Thibault Prunet et.al. | 2407.13570 | link |
2024-07-19 | Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation | Guido Maria D’Amely di Melendugno et.al. | 2407.13567 | link |
2024-07-18 | Fundamental Visual Navigation Algorithms: Indirect Sequential, Biased Diffusive, & Direct Pathing | Patrick Govoni et.al. | 2407.13535 | null |
2024-07-19 | Mask2Map: Vectorized HD Map Construction Using Bird’s Eye View Segmentation Masks | Sehwan Choi et.al. | 2407.13517 | link |
2024-07-18 | Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios | Qingfan Wang et.al. | 2407.13480 | null |
2024-07-18 | Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations | Yue Yao et.al. | 2407.13431 | link |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-18 | Why do you cite? An investigation on citation intents and decision-making classification processes | Lorenzo Paolini et.al. | 2407.13329 | null |
2024-07-18 | CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Junying Chen et.al. | 2407.13301 | link |
2024-07-18 | $μ$ Drive: User-Controlled Autonomous Driving | Kun Wang et.al. | 2407.13201 | null |
2024-07-18 | Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation | Yingru Li et.al. | 2407.13195 | link |
2024-07-18 | Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement | Yulin He et.al. | 2407.13155 | null |
2024-07-19 | PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods | WooJae Jeon et.al. | 2407.13146 | null |
2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
2024-07-18 | PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving | Jiyuan Fu et.al. | 2407.13111 | link |
2024-07-18 | On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems | Siyu Wang et.al. | 2407.13091 | null |
2024-07-17 | Fighting Sampling Bias: A Framework for Training and Evaluating Credit Scoring Models | Nikita Kozodoi et.al. | 2407.13009 | null |
2024-07-17 | AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Zhaorun Chen et.al. | 2407.12784 | link |
2024-07-17 | Bayesian spatial functional data clustering: applications in disease surveillance | Ruiman Zhong et.al. | 2407.12633 | null |
2024-07-17 | Continuous reasoning for adaptive container image distribution in the cloud-edge continuum | Damiano Azzolini et.al. | 2407.12605 | link |
2024-07-17 | Policies Grow on Trees: Model Checking Families of MDPs | Roman Andriushchenko et.al. | 2407.12552 | null |
2024-07-17 | Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving | Yuqi Dai et.al. | 2407.12491 | null |
2024-07-17 | What’s Distributive Justice Got to Do with It? Rethinking Algorithmic Fairness from the Perspective of Approximate Justice | Corinna Hertweck et.al. | 2407.12488 | null |
2024-07-17 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models | Thao Minh Nguyen Phan et.al. | 2407.12309 | null |
2024-07-16 | CLUE: Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation | Xianzhong Ding et.al. | 2407.12195 | link |
2024-07-16 | Satisficing Exploration for Deep Reinforcement Learning | Dilip Arumugam et.al. | 2407.12185 | null |
2024-07-16 | Exploration Unbound | Dilip Arumugam et.al. | 2407.12178 | null |
2024-07-16 | Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent | Karolis Jucys et.al. | 2407.12161 | null |
2024-07-16 | Predicting Emotion Intensity in Polish Political Texts: Comparing Supervised Models and Large Language Models in a Resource-Poor Language | Hubert Plisiecki et.al. | 2407.12141 | link |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-16 | Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Olga Zatsarynna et.al. | 2407.11954 | link |
2024-07-16 | Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain | Marco Huber et.al. | 2407.11941 | null |
2024-07-16 | InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback | Haishuo Fang et.al. | 2407.11843 | null |
2024-07-16 | MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Xiaoshuai Hao et.al. | 2407.11682 | null |
2024-07-16 | Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures | Guoliang You et.al. | 2407.11644 | null |
2024-07-16 | Rethinking Fair Graph Neural Networks from Re-balancing | Zhixun Li et.al. | 2407.11624 | link |
2024-07-16 | DRL-based Joint Resource Scheduling of eMBB and URLLC in O-RAN | Rana M. Sohaib et.al. | 2407.11558 | null |
2024-07-16 | How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models | Yin Jou Huang et.al. | 2407.11549 | link |
2024-07-16 | Generally-Occurring Model Change for Robust Counterfactual Explanations | Ao Xu et.al. | 2407.11426 | null |
2024-07-16 | Incremental high average-utility itemset mining: survey and challenges | Jing Chen et.al. | 2407.11425 | null |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-16 | InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Yinzhu Quan et.al. | 2407.11384 | link |
2024-07-17 | Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li et.al. | 2407.11382 | null |
2024-07-16 | Adaptive Environment-Aware Robotic Arm Reaching Based on a Bio-Inspired Neurodynamical Computational Framework | Dimitrios Chatziparaschis et.al. | 2407.11377 | null |
2024-07-16 | Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain | Hyeon Bae Kim et.al. | 2407.11375 | link |
2024-07-16 | Continuity Preserving Online CenterLine Graph Learning | Yunhui Han et.al. | 2407.11337 | link |
2024-07-15 | Novel Approach for Predicting the Air Quality Index of Megacities through Attention-Enhanced Deep Multitask Spatiotemporal Learning | Harun Khan et.al. | 2407.11283 | null |
2024-07-15 | Intelligent Cross-Organizational Process Mining: A Survey and New Perspectives | Yiyuan Yang et.al. | 2407.11280 | null |
2024-07-15 | CICAPT-IIOT: A provenance-based APT attack dataset for IIoT environment | Erfan Ghiasvand et.al. | 2407.11278 | null |
2024-07-15 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | link |
2024-07-15 | Enhancing Cyber Security through Predictive Analytics: Real-Time Threat Detection and Response | Muhammad Danish et.al. | 2407.10864 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent | Yannick Metz et.al. | 2407.10791 | null |
2024-07-15 | The Missing Link: Allocation Performance in Causal Machine Learning | Unai Fischer-Abaigar et.al. | 2407.10779 | null |
2024-07-15 | Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Alessandro Montenegro et.al. | 2407.10775 | null |
2024-07-15 | Multi-Objective Optimization and Multi-Criteria Decision-Making Approach to Design Multi-Tubular Packed-Bed Membrane Reactor in Oxidative Dehydrogenation of Ethane | Seyed Reza Nabavi et.al. | 2407.10774 | null |
2024-07-15 | Globally-Constrained Decentralized Optimization with Variable Coupling | Dandan Wang et.al. | 2407.10770 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | XEQ Scale for Evaluating XAI Experience Quality Grounded in Psychometric Theory | Anjana Wijekoon et.al. | 2407.10662 | null |
2024-07-15 | Exploring incentive strategies and predicting development trends for new energy vehicles | Tao Jin et.al. | 2407.10611 | null |
2024-07-15 | Leveraging Hybrid Intelligence Towards Sustainable and Energy-Efficient Machine Learning | Daniel Geissler et.al. | 2407.10580 | null |
2024-07-15 | Understanding the Dependence of Perception Model Competency on Regions in an Image | Sara Pohland et.al. | 2407.10543 | link |
2024-07-15 | Communication- and Computation-Efficient Distributed Decision-Making in Multi-Robot Networks | Zirui Xu et.al. | 2407.10382 | link |
2024-07-14 | Mapping the Scholarship of Dark Pattern Regulation: A Systematic Review of Concepts, Regulatory Paradigms, and Solutions from an Interdisciplinary Perspective | Weiwei Yi et.al. | 2407.10340 | null |
2024-07-14 | Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Yuchen Yang et.al. | 2407.10299 | link |
2024-07-14 | Next-Generation 6G Networks: Deploying Cybertwin Technology for Enhanced Healthcare Solutions | Alinafe Kaliwo et.al. | 2407.10292 | null |
2024-07-14 | Towards detailed and interpretable hybrid modeling of continental-scale bird migration | Fiona Lippert et.al. | 2407.10259 | null |
2024-07-14 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | link |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475 | null |
2024-07-12 | TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety | Sandeep Thalapanane et.al. | 2407.09466 | null |
2024-07-12 | Neuroevolution of Decentralized Decision-Making in N-Bead Swimmers Leads to Scalable and Robust Collective Locomotion | Benedikt Hartl et.al. | 2407.09438 | null |
2024-07-12 | Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses | Marios Constantinides et.al. | 2407.09322 | link |
2024-07-12 | Sample size for developing a prediction model with a binary outcome: targeting precise individual risk estimates to improve clinical decisions and fairness | Richard D Riley et.al. | 2407.09293 | null |
2024-07-12 | Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Thuy Ngoc Nguyen et.al. | 2407.09281 | null |
2024-07-12 | GNN with Model-based RL for Multi-agent Systems | Hanxiao Chen et.al. | 2407.09249 | null |
2024-07-12 | Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network | Shun Kotoku et.al. | 2407.09124 | null |
2024-07-12 | KUNPENG: An Embodied Large Model for Intelligent Maritime | Naiyao Wang et.al. | 2407.09048 | link |
2024-07-12 | Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision | Zahra Rahmani et.al. | 2407.09004 | null |
2024-07-12 | Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control | Sicong Jiang et.al. | 2407.08964 | null |
2024-07-12 | Bora: Biomedical Generalist Video Generation Model | Weixiang Sun et.al. | 2407.08944 | null |
2024-07-12 | Deep Attention Driven Reinforcement Learning (DAD-RL) for Autonomous Vehicle Decision-Making in Dynamic Environment | Jayabrata Chowdhury et.al. | 2407.08932 | link |
2024-07-11 | DeepCodeProbe: Towards Understanding What Models Trained on Code Learn | Vahid Majdinasab et.al. | 2407.08890 | link |
2024-07-11 | Generalizable Physics-informed Learning for Stochastic Safety-critical Systems | Zhuoyuan Wang et.al. | 2407.08868 | null |
2024-07-11 | Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans | Edward Wang et.al. | 2407.08650 | link |
2024-07-11 | A Review of Nine Physics Engines for Reinforcement Learning Research | Michael Kaup et.al. | 2407.08590 | null |
2024-07-11 | MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps | Hang Wu et.al. | 2407.08561 | null |
2024-07-11 | BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight | Hang Wu et.al. | 2407.08526 | null |
2024-07-11 | Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents | Haoyi Xiong et.al. | 2407.08516 | null |
2024-07-11 | Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning | Shulin Song et.al. | 2407.08458 | link |
2024-07-11 | CLEO: Continual Learning of Evolving Ontologies | Shishir Muralidhara et.al. | 2407.08411 | null |
2024-07-11 | Specialist vision-language models for clinical ophthalmology | Robbie Holland et.al. | 2407.08410 | link |
2024-07-11 | Data-Driven Model Predictive Control for Autonomous Vehicle Steering | Jiarui Zhang et.al. | 2407.08401 | null |
2024-07-11 | Accurate Cooperative Localization Utilizing LiDAR-equipped Roadside Infrastructure for Autonomous Driving | Yuze Jiang et.al. | 2407.08384 | null |
2024-07-11 | WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving | Jannik Zürn et.al. | 2407.08280 | link |
2024-07-11 | Explainability of Sub-Field Level Crop Yield Prediction using Remote Sensing | Hiba Najjar et.al. | 2407.08274 | null |
2024-07-11 | Efficient Reinforcement Learning On Passive RRAM Crossbar Array | Arjun Tyagi et.al. | 2407.08242 | null |
2024-07-11 | CoGS: Causality Constrained Counterfactual Explanations using goal-directed ASP | Sopam Dasgupta et.al. | 2407.08179 | null |
2024-07-10 | NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving | Donghyun Kim et.al. | 2407.08073 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | When to Accept Automated Predictions and When to Defer to Human Judgment? | Daniel Sikar et.al. | 2407.07821 | null |
2024-07-10 | The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others | Daniel Sikar et.al. | 2407.07818 | null |
2024-07-11 | Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard | Oguzhan Topsakal et.al. | 2407.07796 | link |
2024-07-10 | LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jörg Gamerdinger et.al. | 2407.07740 | null |
2024-07-10 | Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control | Elahe Delavari et.al. | 2407.07684 | null |
2024-07-10 | Why should we ever automate moral decision making? | Vincent Conitzer et.al. | 2407.07671 | null |
2024-07-10 | Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning | Dake Zhang et.al. | 2407.07631 | null |
2024-07-10 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-10 | Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles | Dongfang Guo et.al. | 2407.07510 | null |
2024-07-10 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
2024-07-10 | CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias | Jiacheng Shen et.al. | 2407.07454 | link |
2024-07-10 | Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Bhagyashree Puranik et.al. | 2407.07350 | link |
2024-07-11 | FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification | Doanh C. Bui et.al. | 2407.07340 | link |
2024-07-10 | Event-Aided Time-to-Collision Estimation for Autonomous Driving | Jinghang Li et.al. | 2407.07324 | null |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | The mouth speaks as much as the eyes: Free-ranging dogs depend on inner facial features for human recognition | Rohan Sarkar et.al. | 2407.07192 | null |
2024-07-09 | Can Learned Optimization Make Reinforcement Learning Less Difficult? | Alexander David Goldie et.al. | 2407.07082 | link |
2024-07-09 | Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | Haicheng Liao et.al. | 2407.07020 | null |
2024-07-09 | End-To-End Causal Effect Estimation from Unstructured Natural Language Data | Nikita Dhawan et.al. | 2407.07018 | null |
2024-07-09 | Explainable AI for Enhancing Efficiency of DL-based Channel Estimation | Abdul Karim Gizzini et.al. | 2407.07009 | null |
2024-07-09 | Learning to Complement and to Defer to Multiple Users | Zheng Zhang et.al. | 2407.07003 | link |
2024-07-09 | Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge | Sriram Yenamandra et.al. | 2407.06939 | null |
2024-07-09 | Efficiency of the convex hull of the columns of certain triple perturbed consistent matrices | Susana Furtado et.al. | 2407.06878 | null |
2024-07-08 | A Mamba-based Siamese Network for Remote Sensing Change Detection | Jay N. Paranjape et.al. | 2407.06839 | link |
2024-07-09 | MDP Geometry, Normalization and Value Free Solvers | Arsenii Mustafin et.al. | 2407.06712 | null |
2024-07-09 | Integrating Clinical Knowledge into Concept Bottleneck Models | Winnie Pang et.al. | 2407.06600 | link |
2024-07-10 | FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Yangyang Yu et.al. | 2407.06567 | null |
2024-07-09 | Exploring the Causality of End-to-End Autonomous Driving | Jiankun Li et.al. | 2407.06546 | link |
2024-07-09 | Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications | Maoxin Ji et.al. | 2407.06518 | link |
2024-07-09 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-09 | Economic span selection of bridge based on deep reinforcement learning | Leye Zhang et.al. | 2407.06507 | link |
2024-07-09 | Not all explicit cues help communicate: Pedestrians’ perceptions, fixations, and decisions toward automated vehicles with varied appearance | Wei Lyu et.al. | 2407.06505 | null |
2024-07-10 | Optimal Decision Making Through Scenario Simulations Using Large Language Models | Sumedh Rasal et.al. | 2407.06486 | null |
2024-07-10 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Jianuo Huang et.al. | 2407.06317 | null |
2024-07-10 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-08 | Real Space Imaging of Field-Driven Decision-Making in Nanomagnetic Galton Boards | Hanu Arava et.al. | 2407.06130 | null |
2024-07-08 | Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning | Yadong Zhang et.al. | 2407.06112 | null |
2024-07-08 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
2024-07-08 | How to Add Baskets to an Ongoing Basket Trial with Information Borrowing | Libby Daniells et.al. | 2407.06069 | link |
2024-07-08 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Sarah Elmahdy et.al. | 2407.06016 | null |
2024-07-08 | Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation Experiments | Luka Kovačević et.al. | 2407.06015 | link |
2024-07-08 | Towards A Comprehensive Visual Saliency Explanation Framework for AI-based Face Recognition Systems | Yuhang Lu et.al. | 2407.05983 | null |
2024-07-08 | Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding | Aaron Lohner et.al. | 2407.05910 | null |
2024-07-08 | Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation | Jiaqi Chen et.al. | 2407.05890 | null |
2024-07-08 | Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition | Yaozong Gan et.al. | 2407.05814 | null |
2024-07-08 | MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles | Sushil Sharma et.al. | 2407.05811 | null |
2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | null |
2024-07-08 | Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports | Yutong Zhang et.al. | 2407.05758 | null |
2024-07-08 | BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Yumeng Zhang et.al. | 2407.05679 | link |
2024-07-08 | MSTF: Multiscale Transformer for Incomplete Trajectory Prediction | Zhanwen Liu et.al. | 2407.05671 | null |
2024-07-08 | Explainable Image Recognition via Enhanced Slot-attention Based Classifier | Bowen Wang et.al. | 2407.05616 | null |
2024-07-08 | GenFollower: Enhancing Car-Following Prediction with Large Language Models | Xianda Chen et.al. | 2407.05611 | null |
2024-07-08 | Cost-Efficient Computation Offloading in SAGIN: A Deep Reinforcement Learning and Perception-Aided Approach | Yulan Gao et.al. | 2407.05571 | null |
2024-07-05 | DCZNMaker: A Web-based Application for Multi-Attribute Utilities Analysis | Adrienne Kline et.al. | 2407.04655 | null |
2024-07-05 | Multiple stage stochastic linear programming with multiple objectives: flexible decision making | Andreas H. Hamel et.al. | 2407.04602 | null |
2024-07-05 | Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | Shumaila Javaid et.al. | 2407.04581 | null |
2024-07-05 | Graph Reinforcement Learning in Power Grids: A Survey | Mohamed Hassouna et.al. | 2407.04522 | null |
2024-07-05 | Leveraging Graph Structures to Detect Hallucinations in Large Language Models | Noa Nonkes et.al. | 2407.04485 | link |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games | Nathan Herr et.al. | 2407.04467 | null |
2024-07-05 | Nash epidemics | Simon K. Schnyder et.al. | 2407.04366 | null |
2024-07-05 | AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Petr Anokhin et.al. | 2407.04363 | link |
2024-07-05 | Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing | Tong Wang et.al. | 2407.04359 | null |
2024-07-05 | MobileFlow: A Multimodal LLM For Mobile GUI Agent | Songqin Nong et.al. | 2407.04346 | null |
2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | null |
2024-07-05 | Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling | Jiawei Xu et.al. | 2407.04285 | null |
2024-07-05 | WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning | Yiheng Li et.al. | 2407.04281 | link |
2024-07-05 | Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey | Han Wang et.al. | 2407.04277 | null |
2024-07-04 | Quantifying Prediction Consistency Under Model Multiplicity in Tabular LLMs | Faisal Hamman et.al. | 2407.04173 | null |
2024-07-04 | ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Ahmed Masry et.al. | 2407.04172 | link |
2024-07-04 | Annotating Control-Flow Graphs for Formalized Test Coverage Criteria | Sean Kauffman et.al. | 2407.04144 | null |
2024-07-04 | Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving | Sergio. Martín Serrano et.al. | 2407.04070 | null |
2024-07-04 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | link |
2024-07-03 | Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks | Mintae Kim et.al. | 2407.03280 | null |
2024-07-03 | Streaming Large-Scale Electron Microscopy Data to a Supercomputing Facility | Samuel S. Welborn et.al. | 2407.03215 | null |
2024-07-03 | Tail calibration of probabilistic forecasts | Sam Allen et.al. | 2407.03167 | link |
2024-07-03 | xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN | Hakan Erdol et.al. | 2407.03068 | null |
2024-07-03 | Predictions and Decision Making for Resilient Intelligent Sustainable Energy Systems | Martin Braun et.al. | 2407.03021 | null |
2024-07-03 | VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu et.al. | 2407.03000 | null |
2024-07-04 | Timely Requesting for Time-Critical Content Users in Decentralized F-RANs | Xingran Chen et.al. | 2407.02930 | null |
2024-07-03 | Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving | Yipin Guo et.al. | 2407.02878 | null |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-03 | Optimization of End-to-End AoI in Edge-Enabled Vehicular Fog Systems: A Dueling-DQN Approach | Seifu Birhanu Tadele et.al. | 2407.02815 | null |
2024-07-03 | Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu et.al. | 2407.02797 | link |
2024-07-03 | DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing | Hoa T. Nguyen et.al. | 2407.02748 | null |
2024-07-04 | The path towards contact-based physical human-robot interaction | Mohammad Farajtabar et.al. | 2407.02664 | null |
2024-07-02 | ResearchBot: Bridging the Gap between Academic Research and Practical Programming Communities | Sahar Farzanehpour et.al. | 2407.02643 | null |
2024-07-02 | D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions | Hareem Nisar et.al. | 2407.02604 | null |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-07-02 | Diffusion Models for Tabular Data Imputation and Synthetic Data Generation | Mario Villaizán-Vallelado et.al. | 2407.02549 | null |
2024-07-02 | AXIAL: Attention-based eXplainability for Interpretable Alzheimer’s Localized Diagnosis using 2D CNNs on 3D MRI brain scans | Gabriele Lozupone et.al. | 2407.02418 | link |
2024-07-02 | Multilingual Trolley Problems for Language Models | Zhijing Jin et.al. | 2407.02273 | link |
2024-07-02 | Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots | JiaQi Luo et.al. | 2407.02197 | null |
2024-07-02 | I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry | Wenlu Yu et.al. | 2407.02190 | link |
2024-07-02 | Distributional Regression U-Nets for the Postprocessing of Precipitation Ensemble Forecasts | Romain Pic et.al. | 2407.02125 | link |
2024-07-02 | Automated Knowledge Graph Learning in Industrial Processes | Lolitta Ammann et.al. | 2407.02106 | null |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-07-02 | LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection | Yansong Gong et.al. | 2407.02061 | null |
2024-07-02 | Revolutionising Role-Playing Games with ChatGPT | Rita Stampfl et.al. | 2407.02048 | null |
2024-07-03 | ViG-Bias: Visually Grounded Bias Discovery and Mitigation | Badr-Eddine Marani et.al. | 2407.01996 | link |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | Jianan Zhang et.al. | 2407.01956 | null |
2024-07-02 | CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications | Yupeng Cao et.al. | 2407.01953 | null |
2024-07-02 | LDP: A Local Diffusion Planner for Efficient Robot Navigation and Collision Avoidance | Wenhao Yu et.al. | 2407.01950 | null |
2024-07-02 | Probabilistic 3D Correspondence Prediction from Sparse Unsegmented Images | Krithika Iyer et.al. | 2407.01931 | null |
2024-07-02 | Securing Distributed Network Digital Twin Systems Against Model Poisoning Attacks | Zifan Zhang et.al. | 2407.01917 | null |
2024-07-02 | Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | Fanzeng Xia et.al. | 2407.01887 | null |
2024-07-01 | An Efficient and Sybil Attack Resistant Voting Mechanism | Jeremias Lenzi et.al. | 2407.01844 | null |
2024-07-01 | Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY) | Amirhossein Parsi et.al. | 2407.01769 | null |
2024-07-01 | Predicting Trust Dynamics with Dynamic SEM in Human-AI Cooperation | Sota Kaneko et.al. | 2407.01752 | null |
2024-06-28 | Futility analyses for the MCP-Mod methodology based on longitudinal models | Björn Bornkamp et.al. | 2406.19965 | null |
2024-06-28 | Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems | Fabian Kreß et.al. | 2406.19913 | null |
2024-06-28 | Evaluating potential landing sites for the Artemis III mission using a multi-criteria decision making approach | Eloy Peña-Asensio et.al. | 2406.19863 | null |
2024-06-28 | Operator World Models for Reinforcement Learning | Pietro Novelli et.al. | 2406.19861 | link |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | LCSim: A Large-Scale Controllable Traffic Simulator | Yuheng Zhang et.al. | 2406.19781 | link |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction | Akash Awasthi et.al. | 2406.19686 | null |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-07-02 | Practical Power System Inertia Monitoring Based on Pumped Storage Hydropower Operation Signature | Hongyu Li et.al. | 2406.19627 | null |
2024-06-28 | Multimodal Data Integration for Precision Oncology: Challenges and Future Directions | Huajun Zhou et.al. | 2406.19611 | null |
2024-06-27 | Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and cooper applications | Yoav Nahshon et.al. | 2406.19509 | null |
2024-06-27 | Multi-agent Cooperative Games Using Belief Map Assisted Training | Qinwei Huang et.al. | 2406.19477 | link |
2024-06-27 | TTP-Based Cyber Resilience Index: A Probabilistic Quantitative Approach to Measure Defence Effectiveness Against Cyber Attacks | Lampis Alevizos et.al. | 2406.19374 | null |
2024-06-27 | The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui et.al. | 2406.19307 | null |
2024-06-28 | FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Shubhankar Singh et.al. | 2406.19237 | null |
2024-06-27 | Think Step by Step: Chain-of-Gesture Prompting for Error Detection in Robotic Surgical Videos | Zhimin Shao et.al. | 2406.19217 | link |
2024-06-27 | CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen et.al. | 2406.19131 | link |
2024-06-27 | Evidential Concept Embedding Models: Towards Reliable Concept Explanations for Skin Disease Diagnosis | Yibo Gao et.al. | 2406.19130 | link |
2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | null |
2024-06-27 | Fine-tuned network relies on generic representation to solve unseen cognitive task | Dongyan Lin et.al. | 2406.18926 | null |
2024-06-27 | The Rise of Artificial Intelligence in Educational Measurement: Opportunities and Ethical Challenges | Okan Bulut et.al. | 2406.18900 | null |
2024-06-27 | Sequential three-way group decision-making for double hierarchy hesitant fuzzy linguistic term set | Nanfang Luo et.al. | 2406.18884 | null |
2024-06-27 | From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions | Trenton Chang et.al. | 2406.18865 | link |
2024-06-27 | Predicting the duration of traffic incidents for Sydney greater metropolitan area using machine learning methods | Artur Grigorev et.al. | 2406.18861 | link |
2024-06-28 | The Impact of Feature Representation on the Accuracy of Photonic Neural Networks | Mauricio Gomes de Queiroz et.al. | 2406.18757 | link |
2024-06-26 | Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Emanuel Figetakis et.al. | 2406.18741 | null |
2024-06-26 | Petal-X: Human-Centered Visual Explanations to Improve Cardiovascular Risk Communication | Diego Rojo et.al. | 2406.18690 | null |
2024-06-26 | A Zero Auxiliary Knowledge Membership Inference Attack on Aggregate Location Data | Vincent Guan et.al. | 2406.18671 | null |
2024-06-26 | Mental Modeling of Reinforcement Learning Agents by Language Models | Wenhao Lu et.al. | 2406.18505 | null |
2024-06-26 | Complexity Aversion | Yuan Gu et.al. | 2406.18463 | null |
2024-06-27 | XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Hao Li et.al. | 2406.18360 | null |
2024-06-26 | Kolmogorov-Arnold Graph Neural Networks | Gianluca De Carlo et.al. | 2406.18354 | null |
2024-06-26 | Octo-planner: On-device Language Model for Planner-Action Agents | Wei Chen et.al. | 2406.18082 | null |
2024-06-26 | On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations | Yaqian Hao et.al. | 2406.18065 | null |
2024-06-26 | Multi-step Knowledge Retrieval and Inference over Unstructured Data | Aditya Kalyanpur et.al. | 2406.17987 | null |
2024-06-25 | Emerging AI-based weather prediction models as downscaling tools | Nikolay Koldunov et.al. | 2406.17977 | null |
2024-06-25 | Unbiasing on the Fly: Explanation-Guided Human Oversight of Machine Learning System Decisions | Hussaini Mamman et.al. | 2406.17906 | null |
2024-06-25 | Analysis of the Causes of Car Accidents in the United States of America in 2023: Gauge People Understanding of Data Visualisation | Hamoud Alhazmi et.al. | 2406.17872 | link |
2024-06-25 | End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation | Mingzhe Guo et.al. | 2406.17680 | null |
2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | link |
2024-06-25 | Querying Labeled Time Series Data with Scenario Programs | Devan Shanker et.al. | 2406.17627 | null |
2024-06-26 | Enhancing Explainability of Knowledge Learning Paths: Causal Knowledge Networks | Yuang Wei et.al. | 2406.17518 | null |
2024-06-25 | Robust Pareto Design of GaN HEMTs for Millimeter-Wave Applications | Rafael Perez Martinez et.al. | 2406.17337 | null |
2024-06-25 | Task Adaptation in Industrial Human-Robot Interaction: Leveraging Riemannian Motion Policies | Mike Allenspach et.al. | 2406.17333 | null |
2024-06-25 | The State-Action-Reward-State-Action Algorithm in Spatial Prisoner’s Dilemma Game | Lanyu Yang et.al. | 2406.17326 | null |
2024-06-25 | Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He et.al. | 2406.17274 | link |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Large Language Models are Interpretable Learners | Ruochen Wang et.al. | 2406.17224 | link |
2024-06-25 | VR-based Blockchain-enabled Data Visualization Framework For Manufacturing Industry | Nitol Saha et.al. | 2406.17207 | null |
2024-06-25 | Model Checking of vGOAL | Yi Yang et.al. | 2406.17206 | null |
2024-06-24 | Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors | Vikas Yadav et.al. | 2406.17163 | null |
2024-06-24 | Integrating Generative AI with Network Digital Twins for Enhanced Network Operations | Kassi Muhammad et.al. | 2406.17112 | null |
2024-06-24 | Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making | Vivek Myers et.al. | 2406.17098 | link |
2024-06-24 | Boosting Bitcoin Minute Trend Prediction Using the Separation Index | Zeinab Shahsafdari et.al. | 2406.17083 | null |
2024-06-24 | Large Language Models Assume People are More Rational than We Really are | Ryan Liu et.al. | 2406.17055 | link |
2024-06-26 | Fair game: Urban free-ranging dogs balance resource use and risk aversion at seasonal fairs | Sourabh Biswas et.al. | 2406.17004 | null |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-06-24 | ShanghaiTech Mapping Robot is All You Need: Robot System for Collecting Universal Ground Vehicle Datasets | Bowen Xu et.al. | 2406.16713 | null |
2024-06-24 | Hacking a surrogate model approach to XAI | Alexander Wilhelm et.al. | 2406.16626 | null |
2024-06-24 | QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds | Ye Wang et.al. | 2406.16578 | null |
2024-06-24 | Differentiable Distributionally Robust Optimization Layers | Xutao Ma et.al. | 2406.16571 | link |
2024-06-24 | Conditional Bayesian Quadrature | Zonghao Chen et.al. | 2406.16530 | link |
2024-06-24 | UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin et.al. | 2406.16382 | null |
2024-06-24 | What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation | Michal Golovanevsky et.al. | 2406.16320 | link |
2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link |
2024-06-23 | Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA | Zehuan Zhang et.al. | 2406.16198 | link |
2024-06-23 | Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Kshitij Bhatta et.al. | 2406.16191 | null |
2024-06-23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et.al. | 2406.16072 | link |
2024-06-23 | Entropy-driven decision-making dynamics sheds light on the emergence of the “paradox of choice” | Manish Gupta et.al. | 2406.16051 | null |
2024-06-23 | Imperfect-Recall Games: Equilibrium Concepts and Their Complexity | Emanuel Tewolde et.al. | 2406.15970 | null |
2024-06-22 | LLM-Powered Explanations: Unraveling Recommendations Through Subgraph Reasoning | Guangsi Shi et.al. | 2406.15859 | null |
2024-06-22 | Learning Abstract World Model for Value-preserving Planning with Options | Rafael Rodriguez-Sanchez et.al. | 2406.15850 | null |
2024-06-22 | CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal et.al. | 2406.15823 | null |
2024-06-22 | Privacy Implications of Explainable AI in Data-Driven Systems | Fatima Ezzeddine et.al. | 2406.15789 | null |
2024-06-22 | ISS-Scenario: Scenario-based Testing in CARLA | Renjue Li et.al. | 2406.15777 | link |
2024-06-21 | PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images | Parastoo Sotoudeh Sharifi et.al. | 2406.15685 | link |
2024-06-21 | NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Daniel Dauner et.al. | 2406.15349 | link |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-21 | Multimodal Deformable Image Registration for Long-COVID Analysis Based on Progressive Alignment and Multi-perspective Loss | Jiahua Li et.al. | 2406.15172 | null |
2024-06-21 | A Unified Framework for Input Feature Attribution Analysis | Jingyi Sun et.al. | 2406.15085 | null |
2024-06-21 | KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning | Jiahan Chen et.al. | 2406.15073 | null |
2024-06-21 | Colorful Priority $k$ -Supplier | Chandra Chekuri et.al. | 2406.14984 | null |
2024-06-21 | Autonomous Decision Making for Air Taxi Networks | Alex Vesel et.al. | 2406.14832 | link |
2024-06-20 | ImageFlowNet: Forecasting Multiscale Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images | Chen Liu et.al. | 2406.14794 | link |
2024-06-20 | Active Learning for Fair and Stable Online Allocations | Riddhiman Bhattacharya et.al. | 2406.14784 | null |
2024-06-20 | Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach | Mehran Berahman et.al. | 2406.14766 | null |
2024-06-20 | Risk thresholds for frontier AI | Leonie Koessler et.al. | 2406.14713 | null |
2024-06-20 | Preferential Multi-Objective Bayesian Optimization | Raul Astudillo et.al. | 2406.14699 | null |
2024-06-20 | Advantage Alignment Algorithms | Juan Agustin Duque et.al. | 2406.14662 | null |
2024-06-20 | ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights | Gabriel Sarch et.al. | 2406.14596 | null |
2024-06-20 | Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Hao et.al. | 2406.14593 | link |
2024-06-21 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Chuqiao Zong et.al. | 2406.14537 | link |
2024-06-20 | Energy Mapping of Existing Building Stock in Cambridge using Energy Performance Certificates and Thermal Infrared Imagery | Yinglong He et.al. | 2406.14520 | null |
2024-06-20 | FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding | Mingkun Wang et.al. | 2406.14422 | null |
2024-06-20 | PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions | Sihan Ma et.al. | 2406.14367 | null |
2024-06-20 | iWISDM: Assessing instruction following in multimodal models at scale | Xiaoxuan Lei et.al. | 2406.14343 | link |
2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | null |
2024-06-20 | Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers | Harald Semmelrock et.al. | 2406.14325 | null |
2024-06-21 | E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion | Ke Wang et.al. | 2406.14250 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-21 | REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability | Shuang Ao et.al. | 2406.14214 | link |
2024-06-20 | Tractable Equilibrium Computation in Markov Games through Risk Aversion | Eric Mazumdar et.al. | 2406.14156 | null |
2024-06-20 | Self-Attention in Transformer Networks Explains Monkeys’ Gaze Pattern in Pac-Man Game | Zhongqiao Lin et.al. | 2406.14100 | null |
2024-06-20 | GTP-UDrive: Unified Game-Theoretic Trajectory Planner and Decision-Maker for Autonomous Driving in Mixed Traffic Environments | Nouhed Naidja et.al. | 2406.14077 | null |
2024-06-20 | Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Xinbo Zhao et.al. | 2406.14054 | null |
2024-06-20 | MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models | Zhongshen Zeng et.al. | 2406.13975 | null |
2024-06-20 | CityBench: Evaluating the Capabilities of Large Language Model as World Model | Jie Feng et.al. | 2406.13945 | link |
2024-06-20 | A Decision-Making GPT Model Augmented with Entropy Regularization for Autonomous Vehicles | Jiaqi Liu et.al. | 2406.13908 | null |
2024-06-20 | The Use of Multimodal Large Language Models to Detect Objects from Thermal Images: Transportation Applications | Huthaifa I. Ashqar et.al. | 2406.13898 | null |
2024-06-19 | Combining Combined Forecasts: a Network Approach | Marcos R. Fernandes et.al. | 2406.13749 | null |
2024-06-18 | Scalable Rule Lists Learning with Sampling | Leonardo Pellegrina et.al. | 2406.12803 | link |
2024-06-18 | Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly | Siddhant Shete et.al. | 2406.12698 | null |
2024-06-18 | Investigating the Role of Explainability and AI Literacy in User Compliance | Niklas Kühl et.al. | 2406.12660 | null |
2024-06-18 | Ask-before-Plan: Proactive Language Agents for Real-World Planning | Xuan Zhang et.al. | 2406.12639 | link |
2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | link |
2024-06-18 | PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers | Myeonghwa Lee et.al. | 2406.12430 | link |
2024-06-18 | Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models | David Bergström et.al. | 2406.12423 | null |
2024-06-18 | UAV-based Intelligent Information Systems on Winter Road Safety for Autonomous Vehicles | Siva Ariram et.al. | 2406.12370 | null |
2024-06-18 | A framework for developing a knowledge management platform | Marie Lisandra Zepeda Mendoza et.al. | 2406.12313 | null |
2024-06-19 | Is Your HD Map Constructor Reliable under Sensor Corruptions? | Xiaoshuai Hao et.al. | 2406.12214 | null |
2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | null |
2024-06-18 | Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno et.al. | 2406.12165 | link |
2024-06-17 | Efficient Sequential Decision Making with Large Language Models | Dingyang Chen et.al. | 2406.12125 | null |
2024-06-19 | Computing in the Life Sciences: From Early Algorithms to Modern AI | Samuel A. Donkor et.al. | 2406.12108 | link |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Grade Score: Quantifying LLM Performance in Option Selection | Dmitri Iourovitski et.al. | 2406.12043 | link |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | Online Pareto-Optimal Decision-Making for Complex Tasks using Active Inference | Peter Amorese et.al. | 2406.11984 | null |
2024-06-17 | Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2406.11941 | null |
2024-06-17 | Optimal Transport-Assisted Risk-Sensitive Q-Learning | Zahra Shahrooei et.al. | 2406.11774 | null |
2024-06-18 | CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning | Huaiguang Cai et.al. | 2406.11730 | link |
2024-06-17 | A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving | Yang Lou et.al. | 2406.11707 | null |
2024-06-17 | Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs | Min Hua et.al. | 2406.11653 | null |
2024-06-17 | Statistical Evolution of ODI Cricket: Analyzing Performance Trends and Effect Sizes | Pratik Mullick et.al. | 2406.11652 | null |
2024-06-17 | GRID-FAST: A Grid-based Intersection Detection for Fast Semantic Topometric Mapping | Scott Fredriksson et.al. | 2406.11635 | null |
2024-06-17 | Multistability of Small Zero-One Reaction Networks | Yue Jiao et.al. | 2406.11586 | link |
2024-06-17 | Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Vaneet Aggarwal et.al. | 2406.11481 | null |
2024-06-17 | Calibrating Where It Matters: Constrained Temperature Scaling | Stephen McKenna et.al. | 2406.11456 | null |
2024-06-17 | Can AI with High Reasoning Ability Replicate Human-like Decision Making in Economic Experiments? | Ayato Kitadai et.al. | 2406.11426 | null |
2024-06-17 | Predictive Probabilities Made Simple: A Fast and Accurate Method for Clinical Trial Decision Making | Joe Marion et.al. | 2406.11406 | null |
2024-06-17 | Uncertainties in ROC (Receiver Operating Characteristic) Curves Derived from Counting Data | M. P. Fewell et.al. | 2406.11396 | null |
2024-06-17 | Unveiling Assumptions: Exploring the Decisions of AI Chatbots and Human Testers | Francisco Gomes de Oliveira Neto et.al. | 2406.11339 | null |
2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | link |
2024-06-17 | Development of an Adaptive Multi-Domain Artificial Intelligence System Built using Machine Learning and Expert Systems Technologies | Jeremy Straub et.al. | 2406.11272 | null |
2024-06-17 | Learning Iterative Reasoning through Energy Diffusion | Yilun Du et.al. | 2406.11179 | null |
2024-06-17 | Unanimity of two selves in decision making | Pierre Bardier et.al. | 2406.11166 | null |
2024-06-17 | Model Adaptation for Time Constrained Embodied Control | Jaehyun Song et.al. | 2406.11128 | null |
2024-06-16 | Not All Bias is Bad: Balancing Rational Deviations and Cognitive Biases in Large Language Model Reasoning | Liman Wang et.al. | 2406.10999 | link |
2024-06-18 | City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization | Zihao Jiao et.al. | 2406.10958 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report | Zhongyu Yang et.al. | 2406.10125 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | Global Crop-Specific Fertilization Dataset from 1961-2019 | Fernando Coello et.al. | 2406.10001 | link |
2024-06-14 | SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Aldi Piroli et.al. | 2406.09945 | null |
2024-06-14 | CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions | Mingyu Derek Ma et.al. | 2406.09923 | link |
2024-06-14 | Globally Optimal GNSS Multi-Antenna Lever Arm Calibration | Thomas Wodtko et.al. | 2406.09866 | null |
2024-06-14 | LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Grigor Bezirganyan et.al. | 2406.09864 | link |
2024-06-14 | Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments | Zhenrui Yue et.al. | 2406.09815 | null |
2024-06-14 | A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion | Kailai Sun et.al. | 2406.09792 | link |
2024-06-14 | Road to Serenity: Individual Variations in the Efficacy of Unobtrusive Respiratory Guidance for Driving Stress Regulation | A. J. Bequet et.al. | 2406.09777 | null |
2024-06-14 | Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology | Haowei Yang et.al. | 2406.09773 | null |
2024-06-14 | Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Xiaojun Bi et.al. | 2406.09755 | null |
2024-06-14 | MoME: Mixture of Multimodal Experts for Cancer Survival Prediction | Conghao Xiong et.al. | 2406.09696 | link |
2024-06-13 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-13 | Towards Domain Adaptive Neural Contextual Bandits | Ziyan Wang et.al. | 2406.09564 | null |
2024-06-13 | Finite-Agent Stochastic Differential Games on Large Graphs: I. The Linear-Quadratic Case | Ruimeng Hu et.al. | 2406.09523 | null |
2024-06-13 | CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making | Zibin Dong et.al. | 2406.09509 | link |
2024-06-13 | Fair Data Generation via Score-based Diffusion Model | Yujie Lin et.al. | 2406.09495 | null |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | Active Inference Meeting Energy-Efficient Control of Parallel and Identical Machines | Yavar Taheri Yeganeh et.al. | 2406.09322 | link |
2024-06-13 | A tutorial on fairness in machine learning in healthcare | Jianhui Gao et.al. | 2406.09307 | null |
2024-06-13 | General Bayesian Predictive Synthesis | Masahiro Kato et.al. | 2406.09254 | null |
2024-06-13 | Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Kaavya Rekanar et.al. | 2406.09203 | null |
2024-06-13 | Auto-Vocabulary Segmentation for LiDAR Points | Weijie Wei et.al. | 2406.09126 | link |
2024-06-13 | Beyond Recommendations: From Backward to Forward AI Support of Pilots’ Decision-Making Process | Zelun Tony Zhang et.al. | 2406.08959 | null |
2024-06-13 | Beyond the Calibration Point: Mechanism Comparison in Differential Privacy | Georgios Kaissis et.al. | 2406.08918 | null |
2024-06-13 | CIMRL: Combining IMitiation and Reinforcement Learning for Safe Autonomous Driving | Jonathan Booher et.al. | 2406.08878 | null |
2024-06-13 | Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization | Sumin Zhang et.al. | 2406.08855 | null |
2024-06-13 | Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Georg Goldenits et.al. | 2406.08854 | null |
2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | link |
2024-06-13 | Interpretable Temporal Class Activation Representation for Audio Spoofing Detection | Menglu Li et.al. | 2406.08825 | link |
2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | link |
2024-06-13 | Mathematical models for off-ball scoring prediction in basketball | Rikako Kono et.al. | 2406.08749 | link |
2024-06-13 | UruBots Autonomous Cars Team One Description Paper for FIRA 2024 | Pablo Moraes et.al. | 2406.08745 | null |
2024-06-12 | Defining a Reference Architecture for Edge Systems in Highly-Uncertain Environments | Kevin Pitstick et.al. | 2406.08583 | null |
2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481 | link |
2024-06-12 | PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations | Daniel Coelho et.al. | 2406.08421 | link |
2024-06-12 | LaneCPP: Continuous 3D Lane Detection using Physical Priors | Maximilian Pittner et.al. | 2406.08381 | null |
2024-06-12 | Utilizing Navigation Path to Generate Target Point for Enhanced End-to-End Autonomous Driving Planning | Yuanhua Shen et.al. | 2406.08349 | null |
2024-06-12 | Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework | Ruibo Tu et.al. | 2406.08311 | link |
2024-06-12 | The Importance of Positional Encoding Initialization in Transformers for Relational Reasoning | Takuya Ito et.al. | 2406.08272 | null |
2024-06-12 | Valeo4Cast: A Modular Approach to End-to-End Forecasting | Yihong Xu et.al. | 2406.08113 | link |
2024-06-12 | Conference Proceedings of The European DAO Workshop 2024 | Florian Spychiger et.al. | 2406.08110 | null |
2024-06-13 | CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems | Qianli Wang et.al. | 2406.08101 | link |
2024-06-12 | LVBench: An Extreme Long Video Understanding Benchmark | Weihan Wang et.al. | 2406.08035 | link |
2024-06-12 | Deep reinforcement learning with positional context for intraday trading | Sven Goluža et.al. | 2406.08013 | null |
2024-06-12 | Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning | Yizhe Huang et.al. | 2406.08002 | null |
2024-06-12 | Asymptotically Optimal Regret for Black-Box Predict-then-Optimize | Samuel Tan et.al. | 2406.07866 | null |
2024-06-12 | Are Objective Explanatory Evaluation metrics Trustworthy? An Adversarial Analysis | Prithwijit Chowdhury et.al. | 2406.07820 | null |
2024-06-11 | “It answers questions that I didn’t know I had”: Ph.D. Students’ Evaluation of an Information Sharing Knowledge Graph | Stanislava Gardasevic et.al. | 2406.07730 | null |
2024-06-11 | Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions | Leonardo Cotta et.al. | 2406.07685 | null |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482 | link |
2024-06-11 | Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling | Denis Blessing et.al. | 2406.07423 | link |
2024-06-11 | Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy | Xiaohan Huang et.al. | 2406.07404 | null |
2024-06-11 | Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Di Zhang et.al. | 2406.07394 | link |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381 | null |
2024-06-11 | EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning | Yijun Hao et.al. | 2406.07342 | null |
2024-06-11 | Capacity Credit Evaluation of Generalized Energy Storage Considering Endogenous Uncertainty | Ning Qi et.al. | 2406.07338 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296 | link |
2024-06-11 | Optimal policy design for decision problems under social influence | Valentina Breschi et.al. | 2406.07282 | null |
2024-06-11 | Towards Human-AI Collaboration in Healthcare: Guided Deferral Systems with Large Language Models | Joshua Strong et.al. | 2406.07212 | null |
2024-06-11 | Bilevel optimization with sustainability perspective: a survey on applications | Giulia Caselli et.al. | 2406.07184 | null |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
2024-06-11 | SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures | Christina Giannoula et.al. | 2406.06900 | null |
2024-06-10 | Satisficing Exploration in Bandit Optimization | Qing Feng et.al. | 2406.06802 | null |
2024-06-10 | An Elliptic Kernel Unsupervised Autoencoder-Graph Convolutional Network Ensemble Model for Hyperspectral Unmixing | Estefania Alfaro-Mejia et.al. | 2406.06742 | null |
2024-06-10 | Long-Term Fairness Inquiries and Pursuits in Machine Learning: A Survey of Notions, Methods, and Challenges | Usman Gohar et.al. | 2406.06736 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation | Mohidul Haque Mridul et.al. | 2406.06500 | null |
2024-06-10 | Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang et.al. | 2406.06485 | null |
2024-06-10 | Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain | Brian Hu et.al. | 2406.06435 | link |
2024-06-10 | Hybrid Video Anomaly Detection for Anomalous Scenarios in Autonomous Driving | Daniel Bogdoll et.al. | 2406.06423 | null |
2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | null |
2024-06-10 | DualAD: Disentangling the Dynamic and Static World for End-to-End Driving | Simon Doll et.al. | 2406.06264 | null |
2024-06-10 | Data Augmentation in Earth Observation: A Diffusion Model Approach | Tiago Sousa et.al. | 2406.06218 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128 | null |
2024-06-10 | Sim-To-Real Transfer for Visual Reinforcement Learning of Deformable Object Manipulation for Robot-Assisted Surgery | Paul Maria Scheikl et.al. | 2406.06092 | null |
2024-06-10 | Algorithms for Multi-Criteria Decision-Making and Efficiency Analysis Problems | Fuh-Hwa Franklin Liu et.al. | 2406.06090 | null |
2024-06-10 | Text Analysis of ETDs in ProQuest Dissertations and Theses (PQDT) Global (2016-2018) | Manika Lamba et.al. | 2406.06076 | null |
2024-06-10 | Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review | Hafez Ghaemi et.al. | 2406.06041 | null |
2024-06-10 | Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context | Jingru Jia et.al. | 2406.05972 | null |
2024-06-09 | Hello Again! LLM-powered Personalized Agent for Long-term Dialogue | Hao Li et.al. | 2406.05925 | link |
2024-06-09 | Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Zhiyuan Cheng et.al. | 2406.05857 | link |
2024-06-09 | BOSC: A toolbox for aerial imagery mapping | Ricard Durall et.al. | 2406.05833 | link |
2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | null |
2024-06-09 | Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study | Babak Javadi et.al. | 2406.05803 | null |
2024-06-09 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800 | null |
2024-06-09 | Numerical solution of a PDE arising from prediction with expert advice | Jeff Calder et.al. | 2406.05754 | link |
2024-06-07 | Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning | Subhojyoti Mukherjee et.al. | 2406.05064 | null |
2024-06-07 | Digital Twins of the EM Environment: Benchmark for Ray Launching Models | Michele Zhu et.al. | 2406.05042 | link |
2024-06-07 | Online Frequency Scheduling by Learning Parallel Actions | Anastasios Giovanidis et.al. | 2406.05041 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | Beyond Data, Towards Sustainability: A Sydney Case Study on Urban Digital Twins | Ammar Sohail et.al. | 2406.04902 | null |
2024-06-07 | Dynamic prediction of death risk given a renewal hospitalization process | Telmo J. Pérez-Izquierdo et.al. | 2406.04849 | link |
2024-06-07 | Fragile Model Watermarking: A Comprehensive Survey of Evolution, Characteristics, and Classification | Zhenzhe Gao et.al. | 2406.04809 | null |
2024-06-07 | Predictive Dynamic Fusion | Bing Cao et.al. | 2406.04802 | link |
2024-06-07 | SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals | Ruihan Yang et.al. | 2406.04784 | null |
2024-06-07 | EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V | Qianmin Du et.al. | 2406.04705 | null |
2024-06-06 | Tangent differential privacy | Lexing Ying et.al. | 2406.04535 | null |
2024-06-06 | Step Out and Seek Around: On Warm-Start Training with Incremental Data | Maying Shen et.al. | 2406.04484 | null |
2024-06-06 | Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Yuan Sun et.al. | 2406.04481 | null |
2024-06-06 | Everywhere & Nowhere: Envisioning a Computing Continuum for Science | Manish Parashar et.al. | 2406.04480 | null |
2024-06-06 | MoralBench: Moral Evaluation of LLMs | Jianchao Ji et.al. | 2406.04428 | link |
2024-06-06 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-06-06 | Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks | Tristan Cinquin et.al. | 2406.04317 | null |
2024-06-06 | Do Language Models Understand Morality? Towards a Robust Detection of Moral Content | Luana Bulla et.al. | 2406.04143 | link |
2024-06-06 | Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster | Agostina Calabrese et.al. | 2406.04106 | link |
2024-06-06 | Leveraging automatic strategy discovery to teach people how to select better projects | Lovis Heindrich et.al. | 2406.04082 | link |
2024-06-06 | A Road-Map for Transferring Software Engineering methods for Model-Based Early V&V of Behaviour to Systems Engineering | Johan Cederbladh et.al. | 2406.04037 | null |
2024-06-06 | Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents | Yoann Poupart et.al. | 2406.04028 | link |
2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | link |
2024-06-06 | Memorization in deep learning: A survey | Jiaheng Wei et.al. | 2406.03880 | null |
2024-06-06 | Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Xiaosong Jia et.al. | 2406.03877 | link |
2024-06-06 | Small area estimation with generalized random forests: Estimating poverty rates in Mexico | Nicolas Frink et.al. | 2406.03861 | null |
2024-06-06 | Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As | Eden Avnat et.al. | 2406.03855 | null |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-06 | Views about ChatGPT: Are human decision making and human learning necessary? | Eiji Yamamura et.al. | 2406.03823 | null |
2024-06-06 | Bayesian generalized method of moments applied to pseudo-observations in survival analysis | Léa Orsini et.al. | 2406.03821 | link |
2024-06-06 | POAM: Probabilistic Online Attentive Mapping for Efficient Robotic Information Gathering | Weizhe Chen et.al. | 2406.03669 | link |
2024-06-05 | Ensembling Portfolio Strategies for Long-Term Investments: A Distribution-Free Preference Framework for Decision-Making and Algorithms | Duy Khanh Lam et.al. | 2406.03652 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611 | link |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-05 | Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts | Dominik Scheuble et.al. | 2406.03461 | null |
2024-06-05 | RemixTape: Enriching Narratives about Metrics with Semantic Alignment and Contextual Recommendation | Matthew Brehmer et.al. | 2406.03415 | null |
2024-06-05 | What Matters in Hierarchical Search for Combinatorial Reasoning Problems? | Michał Zawalski et.al. | 2406.03361 | link |
2024-06-05 | The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games | Mikhail Mozikov et.al. | 2406.03299 | null |
2024-06-05 | Prompt-based Visual Alignment for Zero-shot Policy Transfer | Haihan Gao et.al. | 2406.03250 | null |
2024-06-05 | Challenges and Considerations in the Evaluation of Bayesian Causal Discovery | Amir Mohammad Karimi Mamaghan et.al. | 2406.03209 | null |
2024-06-05 | Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Qutub Syed et.al. | 2406.03188 | null |
2024-06-05 | Missci: Reconstructing Fallacies in Misrepresented Science | Max Glockner et.al. | 2406.03181 | link |
2024-06-06 | Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation | Marvin Schmitt et.al. | 2406.03154 | null |
2024-06-05 | Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Eliraz Orfaig et.al. | 2406.03129 | null |
2024-06-05 | Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Han Li et.al. | 2406.03105 | link |
2024-06-05 | Task-Oriented Wireless Communications for Collaborative Perception in Intelligent Unmanned Systems | Sheng Zhou et.al. | 2406.03086 | null |
2024-06-05 | “Give Me an Example Like This”: Episodic Active Reinforcement Learning from Demonstrations | Muhan Hou et.al. | 2406.03069 | link |
2024-06-05 | Efficient Exploration of the Rashomon Set of Rule Set Models | Martino Ciaperoni et.al. | 2406.03059 | null |
2024-06-05 | Correlation of Software-in-the-Loop Simulation with Physical Testing for Autonomous Driving | Zhennan Fei et.al. | 2406.03040 | null |
2024-06-05 | Analyzing the Influence of Training Samples on Explanations | André Artelt et.al. | 2406.03012 | null |
2024-06-05 | Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models | Sheng-Lun Wei et.al. | 2406.03009 | null |
2024-06-05 | DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Yidong Huang et.al. | 2406.03008 | link |
2024-06-05 | Simplification of Risk Averse POMDPs with Performance Guarantees | Yaacov Pariente et.al. | 2406.03000 | null |
2024-06-04 | Enhancing predictive imaging biomarker discovery through treatment effect analysis | Shuhan Xiao et.al. | 2406.02534 | link |
2024-06-04 | How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? | Tianchi Liu et.al. | 2406.02483 | null |
2024-06-04 | A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies | Md Mirajul Islam et.al. | 2406.02450 | null |
2024-06-04 | Out-of-Distribution Runtime Adaptation with Conformalized Neural Network Ensembles | Polo Contreras et.al. | 2406.02436 | null |
2024-06-04 | Decoupling of neural network calibration measures | Dominik Werner Wolf et.al. | 2406.02411 | null |
2024-06-04 | XRec: Large Language Models for Explainable Recommendation | Qiyao Ma et.al. | 2406.02377 | link |
2024-06-04 | Label-wise Aleatoric and Epistemic Uncertainty Quantification | Yusuf Sale et.al. | 2406.02354 | link |
2024-06-04 | Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation | Ruijing Cui et.al. | 2406.02310 | null |
2024-06-04 | Enabling Decision-Making with the Modified Causal Forest: Policy Trees for Treatment Assignment | Hugo Bodory et.al. | 2406.02241 | null |
2024-06-04 | Towards an Extensible Model-Based Digital Twin Framework for Space Launch Vehicles | Ran Wei et.al. | 2406.02222 | null |
2024-06-04 | Rectifying Reinforcement Learning for Reward Matching | Haoran He et.al. | 2406.02213 | null |
2024-06-04 | Radar Spectra-Language Model for Automotive Scene Parsing | Mariia Pushkareva et.al. | 2406.02158 | null |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-04 | Why Would You Suggest That? Human Trust in Language Model Responses | Manasi Sharma et.al. | 2406.02018 | null |
2024-06-04 | Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning | Jiahang Cao et.al. | 2406.02013 | link |
2024-06-05 | Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models | Samuel M. Bateman et.al. | 2406.01961 | null |
2024-06-04 | Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning | Ricardo B. Grando et.al. | 2406.01952 | null |
2024-06-04 | Orthogonal Causal Calibration | Justin Whitehouse et.al. | 2406.01933 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-04 | Large Language Model-Enabled Multi-Agent Manufacturing Systems | Jonghan Lim et.al. | 2406.01893 | null |
2024-05-31 | Designing for Fairness in Human-Robot Interactions | Houston Claure et.al. | 2405.21044 | null |
2024-05-31 | G-Transformer for Conditional Average Potential Outcome Estimation over Time | Konstantin Hess et.al. | 2405.21012 | link |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | null |
2024-05-31 | Goal-Oriented Sensor Reporting Scheduling for Non-linear Dynamic System Monitoring | Prasoon Raghuwanshi et.al. | 2405.20983 | null |
2024-05-31 | Unravelling the Use of Digital Twins to Assist Decision- and Policy-Making in Smart Cities | Lucy Temple et.al. | 2405.20916 | null |
2024-05-31 | Pursuing Overall Welfare in Federated Learning through Sequential Decision Making | Seok-Ju Hahn et.al. | 2405.20821 | link |
2024-05-31 | ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments | Sören Schleibaum et.al. | 2405.20705 | link |
2024-05-31 | A flexible numerical tool for large dynamic DC networks | Erwin Luesink et.al. | 2405.20704 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | link |
2024-05-31 | In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | Sili Huang et.al. | 2405.20692 | link |
2024-05-31 | Searching for internal symbols underlying deep learning | Jung H. Lee et.al. | 2405.20605 | null |
2024-05-31 | Class-Based Time Series Data Augmentation to Mitigate Extreme Class Imbalance for Solar Flare Prediction | Junzhi Wen et.al. | 2405.20590 | null |
2024-05-30 | Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | Davide Corsi et.al. | 2405.20534 | link |
2024-05-30 | Probabilities of Causation for Continuous and Vector Variables | Yuta Kawakami et.al. | 2405.20487 | null |
2024-05-30 | Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning | Dimitris Bertsimas et.al. | 2405.20486 | null |
2024-05-30 | Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibrium | Rohit Konda et.al. | 2405.20426 | null |
2024-05-30 | Learning 3D Robotics Perception using Inductive Priors | Muhammad Zubair Irshad et.al. | 2405.20364 | null |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-30 | Low-rank and sparse approximations for contact mechanics | Kiran Sagar Kollepara et.al. | 2405.20211 | null |
2024-05-31 | Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations | Zilin Ma et.al. | 2405.20195 | null |
2024-05-30 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion | Angel Villar-Corrales et.al. | 2405.19921 | link |
2024-05-30 | Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | Hengkai Tan et.al. | 2405.19885 | null |
2024-05-30 | From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | Jianliang He et.al. | 2405.19883 | null |
2024-05-30 | Developing a Comprehensive Measurement Tool for Assessing the Rate of BIM Adoption in the Construction Industry | Mohammed Abdulsalam Alsofiani et.al. | 2405.19755 | null |
2024-05-30 | GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis | Boming Zhao et.al. | 2405.19745 | null |
2024-05-30 | Learning Task-relevant Sequence Representations via Intrinsic Dynamics Characteristics in Reinforcement Learning | Dayang Liang et.al. | 2405.19736 | link |
2024-05-30 | Generalized Bayesian Nash Equilibrium with Continuous Type and Action Spaces | Yuan Tao et.al. | 2405.19721 | null |
2024-05-31 | Autonomous Driving with Spiking Neural Networks | Rui-Jie Zhu et.al. | 2405.19687 | link |
2024-05-30 | Texture-guided Coding for Deep Features | Lei Xiong et.al. | 2405.19669 | null |
2024-05-30 | Reconciling Model Multiplicity for Downstream Decision Making | Ally Yalei Du et.al. | 2405.19667 | null |
2024-05-31 | SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | Wenchao Sun et.al. | 2405.19620 | link |
2024-05-29 | Distributed Online Planning for Min-Max Problems in Networked Markov Games | Alexandros E. Tzikas et.al. | 2405.19570 | link |
2024-05-29 | Participation in the age of foundation models | Harini Suresh et.al. | 2405.19479 | null |
2024-05-29 | Posterior Sampling via Autoregressive Generation | Kelly W Zhang et.al. | 2405.19466 | null |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice | Jian-Qiao Zhu et.al. | 2405.19313 | null |
2024-05-29 | Real-Time Environment Condition Classification for Autonomous Vehicles | Marco Introvigne et.al. | 2405.19305 | link |
2024-05-29 | Towards Next-Generation Urban Decision Support Systems through AI-Powered Generation of Scientific Ontology using Large Language Models – A Case in Optimizing Intermodal Freight Transportation | Jose Tupayachi et.al. | 2405.19255 | null |
2024-05-29 | Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning | Hanye Zhao et.al. | 2405.19189 | link |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Learning Interpretable Scheduling Algorithms for Data Processing Clusters | Zhibo Hu et.al. | 2405.19131 | null |
2024-05-29 | Early Detection of Critical Urban Events using Mobile Phone Network Data | Pierre Lemaire et.al. | 2405.19125 | link |
2024-05-29 | Can Graph Learning Improve Task Planning? | Xixi Wu et.al. | 2405.19119 | link |
2024-05-29 | Quantum Optimal Control of Squeezing in Cavity Optomechanics | Anton Halaski et.al. | 2405.19070 | null |
2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | link |
2024-05-29 | Distributed Management of Fluctuating Energy Resources in Dynamic Networked Systems | Xiaotong Cheng et.al. | 2405.19015 | null |
2024-05-29 | Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning | Zijiang Yan et.al. | 2405.18984 | null |
2024-05-29 | DecomCAM: Advancing Beyond Saliency Maps through Decomposition and Integration | Yuguang Yang et.al. | 2405.18882 | link |
2024-05-29 | On Fairness Concerns in the Blockchain Ecosystem | Johnnatan Messias Peixoto Afonso et.al. | 2405.18876 | null |
2024-05-29 | SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | Yiming Cui et.al. | 2405.18857 | null |
2024-05-29 | LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping | Nikhil Gosala et.al. | 2405.18852 | null |
2024-05-29 | SFANet: Spatial-Frequency Attention Network for Weather Forecasting | Jiaze Wang et.al. | 2405.18849 | null |
2024-05-29 | FDQN: A Flexible Deep Q-Network Framework for Game Automation | Prabhath Reddy Gujavarthy et.al. | 2405.18761 | link |
2024-05-29 | Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation | Jiawei Fu et.al. | 2405.18757 | null |
2024-05-28 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning | Somnath Kumar et.al. | 2405.18358 | null |
2024-05-28 | Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal et.al. | 2405.18348 | null |
2024-05-28 | Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Zhi Zheng et.al. | 2405.18209 | link |
2024-05-28 | LLM experiments with simulation: Large Language Model Multi-Agent System for Process Simulation Parametrization in Digital Twins | Yuchen Xia et.al. | 2405.18092 | link |
2024-05-28 | Towards Dialogues for Joint Human-AI Reasoning and Value Alignment | Elfia Bezou-Vrakatseli et.al. | 2405.18073 | null |
2024-05-28 | MULi-Ev: Maintaining Unperturbed LiDAR-Event Calibration | Mathieu Cocheteux et.al. | 2405.18021 | null |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection | Zhengji Li et.al. | 2405.17905 | null |
2024-05-28 | Data-Driven Predictive Control and MPC: Do we achieve optimality? | Akhil S Anand et.al. | 2405.17892 | null |
2024-05-28 | Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree | Lang Feng et.al. | 2405.17879 | link |
2024-05-28 | Ai.llude: Encouraging Rewriting AI-Generated Text to Support Creative Expression | David Zhou et.al. | 2405.17843 | null |
2024-05-28 | LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding | Yutong Wang et.al. | 2405.17794 | link |
2024-05-28 | Online Analytic Exemplar-Free Continual Learning with Large Models for Imbalanced Autonomous Driving Task | Huiping Zhuang et.al. | 2405.17779 | link |
2024-05-27 | OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | Allen Nie et.al. | 2405.17708 | null |
2024-05-27 | Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments | Saeedeh Ghanadbashi et.al. | 2405.17691 | null |
2024-05-27 | Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach | Shisheng Hu et.al. | 2405.17664 | null |
2024-05-27 | Robust Perception and Navigation of Autonomous Surface Vehicles in Challenging Environments | Mingi Jeong et.al. | 2405.17657 | null |
2024-05-27 | The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model | Geraldo Xexéo et.al. | 2405.17637 | null |
2024-05-27 | GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang et.al. | 2405.17429 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | Zhuoling Li et.al. | 2405.17424 | null |
2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | link |
2024-05-27 | MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Hao Dong et.al. | 2405.17419 | link |
2024-05-27 | Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Shenyuan Gao et.al. | 2405.17398 | link |
2024-05-27 | BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Zikang Zhou et.al. | 2405.17372 | null |
2024-05-27 | Rethinking Transformers in Solving POMDPs | Chenhao Lu et.al. | 2405.17358 | link |
2024-05-27 | Exploring and steering the moral compass of Large Language Models | Alejandro Tlaie et.al. | 2405.17345 | link |
2024-05-27 | Leveraging Offline Data in Linear Latent Bandits | Chinmaya Kausik et.al. | 2405.17324 | null |
2024-05-27 | Towards Accurate Ego-lane Identification with Early Time Series Classification | Yuchuan Jin et.al. | 2405.17270 | null |
2024-05-27 | “Pass the butter”: A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT | Haohua Que et.al. | 2405.17250 | null |
2024-05-27 | InsigHTable: Insight-driven Hierarchical Table Visualization with Reinforcement Learning | Guozheng Li et.al. | 2405.17229 | null |
2024-05-27 | Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools | Daniel Buschek et.al. | 2405.17217 | null |
2024-05-27 | CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control | Jingqing Ruan et.al. | 2405.17152 | link |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-27 | A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions | Nicole Neis et.al. | 2405.17080 | null |
2024-05-27 | Efficient mid-term forecasting of hourly electricity load using generalized additive models | Monika Zimmermann et.al. | 2405.17070 | link |
2024-05-27 | BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation | Chengxing Jia et.al. | 2405.17039 | null |
2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | null |
2024-05-24 | Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development | Pranab Sahoo et.al. | 2405.15766 | link |
2024-05-24 | An Adaptive Framework for Manipulator Skill Reproduction in Dynamic Environments | Ryan Donald et.al. | 2405.15711 | link |
2024-05-24 | SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction | Wei Wu et.al. | 2405.15677 | link |
2024-05-24 | Serving economic prosperity: economic impact assessments (EIA) on Earth observation-based services and tools by SERVIR | Reetwika Basu et.al. | 2405.15672 | null |
2024-05-24 | Predictive Uncertainty Quantification with Missing Covariates | Margaux Zaffran et.al. | 2405.15641 | null |
2024-05-24 | Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated Learning | Dario Fenoglio et.al. | 2405.15632 | link |
2024-05-24 | Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment | Hao Sun et.al. | 2405.15624 | null |
2024-05-24 | Online Changepoint Detection via Dynamic Mode Decomposition | Victor K. Khamesi et.al. | 2405.15576 | null |
2024-05-24 | Transformer-XL for Long Sequence Tasks in Robotic Learning from Demonstration | Gao Tianci et.al. | 2405.15562 | null |
2024-05-24 | Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making | Drago Plecko et.al. | 2405.15446 | null |
2024-05-24 | Decentralized Virtual Research Environment: Empowering Peer-to-Peer Trustworthy Data Sharing and Collaboration | Yuandou Wang et.al. | 2405.15392 | null |
2024-05-24 | Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate | Fan-Ming Luo et.al. | 2405.15384 | link |
2024-05-24 | Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection | Jun Liu et.al. | 2405.15370 | null |
2024-05-24 | Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Jianbiao Mei et.al. | 2405.15324 | link |
2024-05-24 | Trajectory-Based Multi-Objective Hyperparameter Optimization for Model Retraining | Wenyu Wang et.al. | 2405.15303 | null |
2024-05-24 | Learning Invariant Causal Mechanism from Vision-Language Models | Zeen Song et.al. | 2405.15289 | null |
2024-05-24 | 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Boyi Sun et.al. | 2405.15286 | link |
2024-05-24 | Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | Yuhang Liu et.al. | 2405.15274 | null |
2024-05-24 | iVideoGPT: Interactive VideoGPTs are Scalable World Models | Jialong Wu et.al. | 2405.15223 | link |
2024-05-24 | Computational analysis on a linkage between generalized logit dynamic and discounted mean field game | Hidekazu Yoshioka et.al. | 2405.15180 | null |
2024-05-23 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | Local Causal Discovery for Structural Evidence of Direct Discrimination | Jacqueline Maasch et.al. | 2405.14848 | link |
2024-05-23 | As an AI Language Model, “Yes I Would Recommend Calling the Police’’: Norm Inconsistency in LLM Decision-Making | Shomik Jain et.al. | 2405.14812 | null |
2024-05-23 | DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation | Jinxin Liu et.al. | 2405.14790 | link |
2024-05-23 | FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Hongyang Yang et.al. | 2405.14767 | link |
2024-05-23 | TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes | Yanping Fu et.al. | 2405.14747 | null |
2024-05-23 | Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View | Xuan Liu et.al. | 2405.14744 | null |
2024-05-23 | Iterative Causal Segmentation: Filling the Gap between Market Segmentation and Marketing Strategy | Kaihua Ding et.al. | 2405.14743 | null |
2024-05-23 | A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results | Karima Makhlouf et.al. | 2405.14725 | null |
2024-05-23 | Learning-Based Intermittent CSI Estimation with Adaptive Intervals in Integrated Sensing and Communication Systems | Jie Chen et.al. | 2405.14724 | null |
2024-05-23 | Decision-Focused Forecasting: Decision Losses for Multistage Optimisation | Egon Peršak et.al. | 2405.14719 | link |
2024-05-23 | CityGPT: Towards Urban IoT Learning, Analysis and Interaction with Multi-Agent System | Qinghua Guan et.al. | 2405.14691 | null |
2024-05-23 | PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services | Zheming Yang et.al. | 2405.14636 | null |
2024-05-23 | SE3D: A Framework For Saliency Method Evaluation In 3D Imaging | Mariusz Wiśniewski et.al. | 2405.14584 | link |
2024-05-23 | Explainable automatic industrial carbon footprint estimation from bank transaction classification using natural language processing | Jaime González-González et.al. | 2405.14505 | null |
2024-05-23 | Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment | Muhammad Sohail Danish et.al. | 2405.14497 | link |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-05-23 | Adaptive sampling with PIXL on the Mars Perseverance rover | Peter R. Lawson et.al. | 2405.14471 | null |
2024-05-23 | LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | Michelle Halbheer et.al. | 2405.14438 | link |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-05-21 | Strategic Deployment of Honeypots in Blockchain-based IoT Systems | Daniel Commey et.al. | 2405.12951 | null |
2024-05-21 | Hybrid PDE-ODE Models for Efficient Simulation of Infection Spread in Epidemiology | Kristina Maier et.al. | 2405.12938 | link |
2024-05-21 | Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel et.al. | 2405.12933 | null |
2024-05-21 | The implications of state aggregation in deteriorating Markov Decision Processes with optimal threshold policies | Madeleine Pollack et.al. | 2405.12912 | null |
2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | null |
2024-05-21 | SmartFlow: Robotic Process Automation using LLMs | Arushi Jain et.al. | 2405.12842 | null |
2024-05-21 | Consumer lying in online reviews: recent evidence | Shawn Berry et.al. | 2405.12743 | null |
2024-05-21 | Multimodal video analysis for crowd anomaly detection using open access tourism cameras | Alejandro Dionis-Ros et.al. | 2405.12708 | null |
2024-05-21 | A Multimodal Learning-based Approach for Autonomous Landing of UAV | Francisco Neves et.al. | 2405.12681 | null |
2024-05-21 | Towards an AI/ML-defined Radio for Wi-Fi: Overview, Challenges, and Roadmap | Boris Bellalta et.al. | 2405.12675 | null |
2024-05-21 | TempoScale: A Cloud Workloads Prediction Approach Integrating Short-Term and Long-Term Information | Linfeng Wen et.al. | 2405.12635 | link |
2024-05-21 | Asymptotic Properties of Matthews Correlation Coefficient | Yuki Itaya et.al. | 2405.12622 | link |
2024-05-21 | Efficient modeling of sub-kilometer surface wind with Gaussian processes and neural networks | Francesco Zanetta et.al. | 2405.12614 | null |
2024-05-21 | Ergodic Unobservable MDPs: Decidability of Approximation | Krishnendu Chatterjee et.al. | 2405.12583 | null |
2024-05-21 | Active Object Detection with Knowledge Aggregation and Distillation from Large Models | Dejie Yang et.al. | 2405.12509 | link |
2024-05-21 | CLRKDNet: Speeding up Lane Detection with Knowledge Distillation | Weiqing Qi et.al. | 2405.12503 | link |
2024-05-21 | GASE: Graph Attention Sampling with Edges Fusion for Solving Vehicle Routing Problems | Zhenwei Wang et.al. | 2405.12475 | null |
2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | null |
2024-05-20 | A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback | Kihyun Kim et.al. | 2405.12421 | null |
2024-05-20 | Conformal Counterfactual Inference under Hidden Confounding | Zonghao Chen et.al. | 2405.12387 | null |
2024-05-20 | Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | Sebastian Bruch et.al. | 2405.12207 | link |
2024-05-20 | Robust VAR Capability Curve of DER with Uncertain Renewable Generation | Aditya Shankar Kar et.al. | 2405.12184 | null |
2024-05-20 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-20 | PATE: Proximity-Aware Time series anomaly Evaluation | Ramin Ghorbani et.al. | 2405.12096 | link |
2024-05-20 | Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | Yang Dai et.al. | 2405.12094 | link |
2024-05-20 | Safe by Design Autonomous Driving Systems | Marius Bozga et.al. | 2405.11995 | null |
2024-05-20 | Tutorial on Silicon Photonics Integrated Platform Fiber Edge Coupling | Sergey S. Avdeev et.al. | 2405.11980 | null |
2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
2024-05-20 | Social norm dynamics in a behavioral epidemic model on multiplex networks | Christos Charalambous et.al. | 2405.11887 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-05-20 | Efficient Multi-agent Reinforcement Learning by Planning | Qihan Liu et.al. | 2405.11778 | link |
2024-05-20 | Configurable Mirror Descent: Towards a Unification of Decision Making | Pengdeng Li et.al. | 2405.11746 | link |
2024-05-20 | Estimating optimal tailored active surveillance strategy under interval censoring | Muxuan Liang et.al. | 2405.11720 | null |
2024-05-20 | QComp: A QSAR-Based Data Completion Framework for Drug Discovery | Bingjia Yang et.al. | 2405.11703 | link |
2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | link |
2024-05-21 | Interpretable Machine Learning Enhances Disease Prognosis: Applications on COVID-19 and Onward | Jinzhi Shen et.al. | 2405.11672 | null |
2024-05-19 | Auto-Platoon : Freight by example | Tharun V. Puthanveettil et.al. | 2405.11659 | link |
2024-05-19 | URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images | Zoey Chen et.al. | 2405.11656 | null |
2024-05-19 | Movie Revenue Prediction using Machine Learning Models | Vikranth Udandarao et.al. | 2405.11651 | link |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-17 | Strategic control for a Boltzmann like decision-making model | Luis Guillermo Venegas-Pineda et.al. | 2405.10915 | null |
2024-05-17 | Contestable AI needs Computational Argumentation | Francesco Leofante et.al. | 2405.10729 | null |
2024-05-17 | Challenging the Human-in-the-loop in Algorithmic Decision-making | Sebastian Tschiatschek et.al. | 2405.10706 | null |
2024-05-17 | Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation | Yannis Spyridis et.al. | 2405.10702 | null |
2024-05-17 | Pragmatic Communication for Remote Control of Finite-State Markov Processes | Pietro Talli et.al. | 2405.10672 | null |
2024-05-17 | GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision | Xin Tan et.al. | 2405.10591 | null |
2024-05-17 | Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track | Xiaoshuai Hao et.al. | 2405.10567 | null |
2024-05-17 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-17 | Guidelines for evaluation of complex multi agent test scenarios | Ana Isabel Garcia Guerra et.al. | 2405.10526 | null |
2024-05-16 | Tell me more: Intent Fulfilment Framework for Enhancing User Experiences in Conversational XAI | Anjana Wijekoon et.al. | 2405.10446 | null |
2024-05-16 | Monitizer: Automating Design and Evaluation of Neural Network Monitors | Muqsit Azeem et.al. | 2405.10350 | null |
2024-05-16 | Stochastic Q-learning for Large Discrete Action Spaces | Fares Fourati et.al. | 2405.10310 | null |
2024-05-16 | Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Yu Gui et.al. | 2405.10301 | link |
2024-05-17 | Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Yuexiang Zhai et.al. | 2405.10292 | null |
2024-05-16 | Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention | Tobias Demmler et.al. | 2405.10134 | null |
2024-05-16 | Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review | Xinyu Zhang et.al. | 2405.10132 | null |
2024-05-16 | When Large Language Model Meets Optimization | Sen Huang et.al. | 2405.10098 | null |
2024-05-16 | Optimizing Search and Rescue UAV Connectivity in Challenging Terrain through Multi Q-Learning | Mohammed M. H. Qazzaz et.al. | 2405.10042 | null |
2024-05-16 | $Δ\text{-}{\rm OPE}$ : Off-Policy Estimation with Pairs of Policies | Olivier Jeunen et.al. | 2405.10024 | link |
2024-05-16 | Solving the enigma: Deriving optimal explanations of deep networks | Michail Mamalakis et.al. | 2405.10008 | link |
2024-05-16 | A Unified Deep Transfer Learning Model for Accurate IoT Localization in Diverse Environments | Abdullahi Isa Ahmed et.al. | 2405.09960 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | null |
2024-05-16 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | null |
2024-05-16 | Collision Avoidance Metric for 3D Camera Evaluation | Vage Taamazyan et.al. | 2405.09755 | link |
2024-05-15 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Guo Yachan et.al. | 2405.09682 | null |
2024-05-15 | Challenges and opportunities for digital twins in precision medicine: a complex systems perspective | Manlio De Domenico et.al. | 2405.09649 | null |
2024-05-15 | DemOpts: Fairness corrections in COVID-19 case prediction models | Naman Awasthi et.al. | 2405.09483 | null |
2024-05-15 | Facilitating Opinion Diversity through Hybrid NLP Approaches | Michiel van der Meer et.al. | 2405.09439 | null |
2024-05-15 | The Unfairness of $\varepsilon$ -Fairness | Tolulope Fadina et.al. | 2405.09360 | null |
2024-05-15 | Multi-Source Conformal Inference Under Distribution Shift | Yi Liu et.al. | 2405.09331 | link |
2024-05-15 | Reinforcement Learning-Based Framework for the Intelligent Adaptation of User Interfaces | Daniel Gaspar-Figueiredo et.al. | 2405.09255 | null |
2024-05-15 | CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | Dechen Gao et.al. | 2405.09111 | link |
2024-05-15 | Explainable AI for Ship Collision Avoidance: Decoding Decision-Making Processes and Behavioral Intentions | Hitoshi Yoshioka et.al. | 2405.09081 | link |
2024-05-15 | Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving | Ross Greer et.al. | 2405.09049 | null |
2024-05-15 | Deep Learning in Earthquake Engineering: A Comprehensive Review | Yazhou Xie et.al. | 2405.09021 | null |
2024-05-14 | Contextual Emotion Recognition using Large Vision Language Models | Yasaman Etesam et.al. | 2405.08992 | null |
2024-05-14 | Bird’s-Eye View to Street-View: A Survey | Khawlah Bajbaa et.al. | 2405.08961 | null |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-14 | The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition | Lingdong Kong et.al. | 2405.08816 | null |
2024-05-14 | Ambiguous Annotations: When is a Pedestrian not a Pedestrian? | Luisa Schwirten et.al. | 2405.08794 | null |
2024-05-14 | Beyond the Black Box: Do More Complex Models Provide Superior XAI Explanations? | Mateusz Cedro et.al. | 2405.08658 | null |
2024-05-14 | vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement | Yiwen Zhu et.al. | 2405.08638 | null |
2024-05-15 | Learning Decision Policies with Instrumental Variables through Double Machine Learning | Daqian Shao et.al. | 2405.08498 | link |
2024-05-14 | Work-in-Progress: Crash Course: Can (Under Attack) Autonomous Driving Beat Human Drivers? | Francesco Marchiori et.al. | 2405.08466 | null |
2024-05-14 | Large-Scale Metric Computation in Online Controlled Experiment Platform | Tao Xiong et.al. | 2405.08411 | null |
2024-05-14 | Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach | Yaju Liu et.al. | 2405.08328 | null |
2024-05-14 | Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments | Ke Liu et.al. | 2405.08298 | null |
2024-05-14 | Airport Delay Prediction with Temporal Fusion Transformers | Ke Liu et.al. | 2405.08293 | null |
2024-05-14 | VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons | Zhen Chen et.al. | 2405.08272 | null |
2024-05-13 | Factors Shaping Financial Success: A Deep Dive into Influencing Variables | Michael Zhou et.al. | 2405.08233 | null |
2024-05-13 | Community detection in bipartite signed networks is highly dependent on parameter choice | Elena Candellone et.al. | 2405.08203 | link |
2024-05-13 | Optimizing Task Scheduling in Heterogeneous Computing Environments: A Comparative Analysis of CPU, GPU, and ASIC Platforms Using E2C Simulator | Ali Mohammadjafari et.al. | 2405.08187 | null |
2024-05-13 | Do Bayesian imaging methods report trustworthy probabilities? | David Y. W. Thong et.al. | 2405.08179 | null |
2024-05-13 | Equivariant Deep Learning of Mixed-Integer Optimal Control Solutions for Vehicle Decision Making and Motion Planning | Rudolf Reiter et.al. | 2405.08122 | null |
2024-05-13 | SPIN: Simultaneous Perception, Interaction and Navigation | Shagun Uppal et.al. | 2405.07991 | null |
2024-05-13 | A Generalist Learner for Multifaceted Medical Image Interpretation | Hong-Yu Zhou et.al. | 2405.07988 | null |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
2024-05-13 | Fast Computation of Superquantile-Constrained Optimization Through Implicit Scenario Reduction | Jake Roth et.al. | 2405.07965 | link |
2024-05-13 | AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | Samuel Schmidgall et.al. | 2405.07960 | null |
2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | null |
2024-05-13 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al. | 2405.07865 | link |
2024-05-13 | Collective Decision-Making on Task Allocation Feasibility | Samratul Fuady et.al. | 2405.07799 | null |
2024-05-13 | Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | Silvia Tulli et.al. | 2405.07773 | null |
2024-05-13 | Waste Factor and Waste Figure: A Unified Theory for Modeling and Analyzing Wasted Power in Radio Access Networks for Improved Sustainability | Theodore S. Rappaport et.al. | 2405.07710 | null |
2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | null |
2024-05-13 | Evaluating the Explainable AI Method Grad-CAM for Breath Classification on Newborn Time Series Data | Camelia Oprea et.al. | 2405.07590 | null |
2024-05-13 | MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving | Yiqun Duan et.al. | 2405.07573 | null |
2024-05-13 | Safety-Aware Human-Lead Vehicle Platooning by Proactively Reacting to Uncertain Human Behaving | Jia Hu et.al. | 2405.07556 | null |
2024-05-13 | Prompt-based Code Completion via Multi-Retrieval Augmented Generation | Hanzhuo Tan et.al. | 2405.07530 | null |
2024-05-13 | Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation | Aaditya Prasad et.al. | 2405.07503 | null |
2024-05-12 | CaFA: Global Weather Forecasting with Factorized Attention on Sphere | Zijie Li et.al. | 2405.07395 | link |
2024-05-12 | Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Fatema Tuj Johora Faria et.al. | 2405.07338 | link |
2024-05-12 | Computational analysis of US Congressional speeches reveals a shift from evidence to intuition | Segun Taofeek Aroyehun et.al. | 2405.07323 | null |
2024-05-12 | Enhancing Decision-Making in Optimization through LLM-Assisted Inference: A Neural Networks Perspective | Gaurav Singh et.al. | 2405.07212 | null |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Hierarchical Learned Risk-Aware Planning Framework for Human Driving Modeling | Nathan Ludlow et.al. | 2405.06578 | null |
2024-05-10 | Good Things Come in Trees: Emotion and Context Aware Behaviour Trees for Ethical Robotic Decision-Making | Paige Tuttösí et.al. | 2405.06543 | null |
2024-05-10 | Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks | Haifa Alrdahi et.al. | 2405.06499 | null |
2024-05-10 | Autonomous Driving with a Deep Dual-Model Solution for Steering and Braking Control | Ana Petra Jukić et.al. | 2405.06473 | null |
2024-05-10 | Residual-based Attention Physics-informed Neural Networks for Efficient Spatio-Temporal Lifetime Assessment of Transformers Operated in Renewable Power Plants | Ibai Ramirez et.al. | 2405.06443 | null |
2024-05-10 | Building Trust in AI-Driven Decision Making for Cyber-Physical Systems (CPS): A Comprehensive Review | Rahul Umesh Mhapsekar et.al. | 2405.06347 | null |
2024-05-10 | FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based Optimization | Zhiyuan Ning et.al. | 2405.06312 | link |
2024-05-10 | Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach | Amira Guesmi et.al. | 2405.06278 | null |
2024-05-10 | XAI4LLM. Let Machine Learning Models and LLMs Collaborate for Enhanced In-Context Learning in Healthcare | Fatemeh Nazary et.al. | 2405.06270 | null |
2024-05-10 | Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | Yunqian Fan et.al. | 2405.06264 | null |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | A Survey on Visualization Approaches in Political Science for Social and Political Factors: Progress to Date and Future Opportunities | Dongyun Han et.al. | 2405.05947 | null |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-05-09 | Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection | Atefeh Mahdavi et.al. | 2405.05836 | null |
2024-05-09 | Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning | Artem Lykov et.al. | 2405.05824 | link |
2024-05-09 | Optimal Baseline Corrections for Off-Policy Contextual Bandits | Shashank Gupta et.al. | 2405.05736 | link |
2024-05-09 | TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy | Meixu Chen et.al. | 2405.05674 | null |
2024-05-09 | Emerging Optimization Problems for Distribution in Same-day Delivery | Yuanyuan Li et.al. | 2405.05620 | null |
2024-05-09 | Towards Robust Physical-world Backdoor Attacks on Lane Detection | Xinwei Zhang et.al. | 2405.05553 | null |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-09 | Multi-Scale Dilated Convolution Network for Long-Term Time Series Forecasting | Feifei Li et.al. | 2405.05499 | null |
2024-05-09 | Advancing Head and Neck Cancer Survival Prediction via Multi-Label Learning and Deep Model Interpretation | Meixu Chen et.al. | 2405.05488 | null |
2024-05-09 | Design of Targeted Community-Based Resource Allocation in the Presence of Vaccine Hesitancy via a Data-Driven Compartmental Stochastic Optimization Model | Hieu Bui et.al. | 2405.05487 | null |
2024-05-09 | Topological bifurcations in a mean-field game | Ali Akbar Rezaei Lori et.al. | 2405.05473 | null |
2024-05-08 | Mitigating Exaggerated Safety in Large Language Models | Ruchi Bhalani et.al. | 2405.05418 | null |
2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
2024-05-08 | Clustering Retail Products Based on Customer Behaviour | Vladimír Holý et.al. | 2405.05218 | null |
2024-05-08 | A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective | Huaiyuan Xu et.al. | 2405.05173 | link |
2024-05-08 | DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds | Zeyu Han et.al. | 2405.05131 | null |
2024-05-08 | Novel Actor-Critic Algorithm for Robust Decision Making of CAV under Delays and Loss of V2X Data | Zine el abidine Kherroubi et.al. | 2405.05072 | null |
2024-05-08 | Designing Skill-Compatible AI: Methodologies and Frameworks in Chess | Karim Hamade et.al. | 2405.05066 | link |
2024-05-08 | Impact of Tone-Aware Explanations in Recommender Systems | Ayano Okoso et.al. | 2405.05061 | null |
2024-05-08 | Quantum Circuit Ansatz: Abstraction and Reuse of Quantum Algorithm Design | Xiaoyu Guo et.al. | 2405.05021 | null |
2024-05-08 | Overcoming Anchoring Bias: The Potential of AI and XAI-based Decision Support | Felix Haag et.al. | 2405.04972 | null |
2024-05-08 | Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models | Zhengxing Lan et.al. | 2405.04909 | null |
2024-05-07 | Enhancing Organizational Performance: Harnessing AI and NLP for User Feedback Analysis in Product Development | Tian Tian et.al. | 2405.04692 | null |
2024-05-07 | ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | Albert Bou et.al. | 2405.04657 | link |
2024-05-07 | New allometric models for the USA create a step-change in forest carbon estimation, modeling, and mapping | Lucas K. Johnson et.al. | 2405.04507 | null |
2024-05-07 | TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters | Jonathan Wilder Lavington et.al. | 2405.04491 | null |
2024-05-07 | POV Learning: Individual Alignment of Multimodal Models using Human Perception | Simon Werner et.al. | 2405.04443 | null |
2024-05-07 | Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement Learning | Paola Soto et.al. | 2405.04441 | null |
2024-05-07 | Designing the Network Intelligence Stratum for 6G Networks | Paola Soto et.al. | 2405.04432 | null |
2024-05-07 | Mathematical Modeling of $^{18}$F-Fluoromisonidazole ($^{18}$ F-FMISO) Radiopharmaceutical Transport in Vascularized Solid Tumors | Mohammad Amin Abazari et.al. | 2405.04418 | null |
2024-05-09 | Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty Estimation | Pei Liu et.al. | 2405.04405 | link |
2024-05-07 | Efficient Online Set-valued Classification with Bandit Feedback | Zhou Wang et.al. | 2405.04393 | null |
2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous Driving | Wei-Bin Kou et.al. | 2405.04146 | null |
2024-05-07 | Policy Learning with a Language Bottleneck | Megha Srivastava et.al. | 2405.04118 | link |
2024-05-07 | ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios | Dingrui Wang et.al. | 2405.04100 | null |
2024-05-07 | Counterfactual and Semifactual Explanations in Abstract Argumentation: Formal Foundations, Complexity and Computation | Gianvincenzo Alfano et.al. | 2405.04081 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | Uncovering implementable dormant pruning decisions from three different stakeholder perspectives | Deanna Flynn et.al. | 2405.04030 | null |
2024-05-07 | Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI | Rikathi Pal et.al. | 2405.04023 | null |
2024-05-07 | Certified Policy Verification and Synthesis for MDPs under Distributional Reach-avoidance Properties | S. Akshay et.al. | 2405.04015 | null |
2024-05-07 | Deep Event-based Object Detection in Autonomous Driving: A Survey | Bingquan Zhou et.al. | 2405.03995 | null |
2024-05-07 | Unified End-to-End V2X Cooperative Autonomous Driving | Zhiwei Li et.al. | 2405.03971 | null |
2024-05-06 | Anti-Heroes: An Ethics-focused Method for Responsible Designer Intentions | Shikha Mehta et.al. | 2405.03674 | null |
2024-05-06 | RoboCar: A Rapidly Deployable Open-Source Platform for Autonomous Driving Research | Mehdi Testouri et.al. | 2405.03572 | link |
2024-05-06 | Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | Zheng Zhu et.al. | 2405.03520 | link |
2024-05-06 | Uncertainty of Supply Chains: Risk and Ambiguity | d’Artis Kancs et.al. | 2405.03451 | null |
2024-05-06 | The high dimensional psychological profile and cultural bias of ChatGPT | Hang Yuan et.al. | 2405.03387 | null |
2024-05-06 | Enhancing Q-Learning with Large Language Model Heuristics | Xiefeng Wu et.al. | 2405.03341 | null |
2024-05-06 | Functional Equivalence with NARS | Robert Johansson et.al. | 2405.03340 | null |
2024-05-06 | Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review | Harry Robertshaw et.al. | 2405.03305 | null |
2024-05-06 | End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability | Hinrikus Wolf et.al. | 2405.03262 | null |
2024-05-06 | Anchored Answers: Unravelling Positional Bias in GPT-2’s Multiple-Choice Questions | Ruizhe Li et.al. | 2405.03205 | link |
2024-05-05 | High Order Reasoning for Time Critical Recommendation in Evidence-based Medicine | Manjiang Yu et.al. | 2405.03010 | null |
2024-05-05 | MERIT: Multi-view Evidential learning for Reliable and Interpretable liver fibrosis sTaging | Yuanye Liu et.al. | 2405.02918 | null |
2024-05-05 | SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection | Kassaw Abraham Mulat et.al. | 2405.02906 | null |
2024-05-05 | Region-specific Risk Quantification for Interpretable Prognosis of COVID-19 | Zhusi Zhong et.al. | 2405.02815 | link |
2024-05-04 | Sub-goal Distillation: A Method to Improve Small Language Agents | Maryam Hashemzadeh et.al. | 2405.02749 | link |
2024-05-04 | Grouping predictors via network-wide metrics | Brandon Woosuk Park et.al. | 2405.02715 | null |
2024-05-04 | Ambush strategy enhances organisms’ performance in rock-paper-scissors games | R. Barbalho et.al. | 2405.02674 | null |
2024-05-04 | Interpretable Multi-View Clustering | Mudi Jiang et.al. | 2405.02644 | null |
2024-05-04 | Accelerating Autonomy: Insights from Pro Racers in the Era of Autonomous Racing - An Expert Interview Study | Frederik Werner et.al. | 2405.02620 | link |
2024-05-04 | MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning | Joshua Chesser et.al. | 2405.02605 | null |
2024-05-03 | Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs | Elika Bozorgi et.al. | 2405.02240 | null |
2024-05-03 | Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes | Sang Bin Moon et.al. | 2405.02188 | null |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-03 | Multi-Objective Recommendation via Multivariate Policy Learning | Olivier Jeunen et.al. | 2405.02141 | link |
2024-05-03 | Learning from Evolution: Improving Collective Decision-Making Mechanisms using Insights from Evolutionary Robotics | Tanja Katharina Kaiser et.al. | 2405.02133 | null |
2024-05-03 | Argumentative Large Language Models for Explainable and Contestable Decision-Making | Gabriel Freedman et.al. | 2405.02079 | link |
2024-05-03 | Sampling to Achieve the Goal: An Age-aware Remote Markov Decision Process | Aimin Li et.al. | 2405.02042 | link |
2024-05-03 | Obstacle Avoidance of Autonomous Vehicles: An LPVMPC with Scheduling Trust Region | Maryam Nezami et.al. | 2405.02030 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | M ${^2}$ Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Yingshuang Zou et.al. | 2405.02004 | null |
2024-05-03 | Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | Patrick Saux et.al. | 2405.01994 | null |
2024-05-03 | Conformal Prediction for Natural Language Processing: A Survey | Margarida M. Campos et.al. | 2405.01976 | null |
2024-05-03 | Unleashing the Power of AI: Transforming Marketing Decision-Making in Heavy Machinery with Machine Learning, Radar Chart Simulation, and Markov Chain Analysis | Tian Tian et.al. | 2405.01913 | null |
2024-05-03 | Transforming Investment Strategies and Strategic Decision-Making: Unveiling a Novel Methodology for Enhanced Performance and Risk Management in Financial Markets | Tian Tian et.al. | 2405.01892 | null |
2024-05-03 | Explainable Risk Classification in Financial Reports | Xue Wen Tan et.al. | 2405.01881 | null |
2024-05-03 | SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning | Qian Long et.al. | 2405.01839 | null |
2024-05-03 | Non-linear Welfare-Aware Strategic Learning | Tian Xie et.al. | 2405.01810 | link |
2024-05-03 | Algorithmic Decision-Making under Agents with Persistent Improvement | Tian Xie et.al. | 2405.01807 | link |
2024-05-02 | Large Language Models for UAVs: Current State and Pathways to the Future | Shumaila Javaid et.al. | 2405.01745 | null |
2024-05-02 | Explainability Guided Adversarial Evasion Attacks on Malware Detectors | Kshitiz Aryal et.al. | 2405.01728 | null |
2024-05-02 | Multi-Space Alignments Towards Universal LiDAR Segmentation | Youquan Liu et.al. | 2405.01538 | link |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-05-02 | Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models | Raymond Fok et.al. | 2405.01501 | null |
2024-05-02 | A Basic Overview of Various Stochastic Approaches to Financial Modeling With Examples | Aashrit Cunchala et.al. | 2405.01397 | null |
2024-05-02 | An Advanced Framework for Ultra-Realistic Simulation and Digital Twinning for Autonomous Vehicles | Yuankai He et.al. | 2405.01328 | null |
2024-05-02 | MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2405.01266 | null |
2024-05-02 | Causal Influence in Federated Edge Inference | Mert Kayaalp et.al. | 2405.01260 | null |
2024-05-02 | A Survey on Semantic Communication Networks: Architecture, Security, and Privacy | Shaolong Guo et.al. | 2405.01221 | null |
2024-05-02 | How A/B testing changes the dynamics of information spreading on a social network | Matteo Ottaviani et.al. | 2405.01165 | null |
2024-05-02 | Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection | Ahmad Khalil et.al. | 2405.01108 | link |
2024-05-02 | Poisoning Attacks on Federated Learning for Autonomous Driving | Sonakshi Garg et.al. | 2405.01073 | null |
2024-05-02 | Rare Collision Risk Estimation of Autonomous Vehicles with Multi-Agent Situation Awareness | Mahdieh Zaker et.al. | 2405.01011 | null |
2024-05-02 | Generative manufacturing systems using diffusion models and ChatGPT | Xingyu Li et.al. | 2405.00958 | null |
2024-05-02 | Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | Guojun Xiong et.al. | 2405.00950 | null |
2024-05-01 | DiL-NeRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media | Gregorios Katsios et.al. | 2405.00821 | link |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-03 | New Trends on the Systems Approach to Modeling SARS-CoV-2 Pandemics in a Globally Connected Planet | Giulia Bertaglia et.al. | 2405.00541 | null |
2024-05-01 | Design Implications for a Social and Collaborative Understanding of online Information Assessment Practices, Challenges and Heuristics | Vasilis Vlachokyriakos et.al. | 2405.00519 | null |
2024-05-01 | GAD-Generative Learning for HD Map-Free Autonomous Driving | Weijian Sun et.al. | 2405.00515 | link |
2024-05-01 | On the Relevance of Byzantine Robust Optimization Against Data Poisoning | Sadegh Farhadkhani et.al. | 2405.00491 | null |
2024-05-01 | RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models | Mohamed Manzour Hussien et.al. | 2405.00449 | null |
2024-05-01 | Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration | Masanari Kimura et.al. | 2405.00442 | null |
2024-05-01 | UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | Yucheng Shi et.al. | 2405.00410 | link |
2024-05-01 | Dual-Role AoI-based Incentive Mechanism for HD map Crowdsourcing | Wentao Ye et.al. | 2405.00353 | null |
2024-05-01 | A Self-explaining Neural Architecture for Generalizable Concept Learning | Sanchit Sinha et.al. | 2405.00349 | null |
2024-05-01 | Finding the white male: The prevalence and consequences of algorithmic gender and race bias in political Google searches | Tobias Rohrbach et.al. | 2405.00335 | null |
2024-05-01 | Reevaluating coexistence and stability in ecosystem networks to address ecological transients: methods and implications | Sarah A. Vollert et.al. | 2405.00333 | null |
2024-05-01 | Enhance Planning with Physics-informed Safety Controllor for End-to-end Autonomous Driving | Hang Zhou et.al. | 2405.00316 | null |
2024-05-01 | Social Life Simulation for Non-Cognitive Skills Learning | Zihan Yan et.al. | 2405.00273 | null |
2024-04-30 | SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations | Narayanan Elavathur Ranganatha et.al. | 2405.00250 | link |
2024-04-30 | Guiding Attention in End-to-End Driving Models | Diego Porres et.al. | 2405.00242 | link |
2024-04-30 | STT: Stateful Tracking with Transformers for Autonomous Driving | Longlong Jing et.al. | 2405.00236 | null |
2024-04-30 | Comparing Motion Distortion Between Vehicle Field Deployments | Nicolas Samson et.al. | 2405.00189 | null |
2024-04-30 | Heart Rate and Body Temperature Relationship in Children Admitted to PICU – A Machine Learning Approach | Emilie Lu et.al. | 2405.00180 | null |
2024-04-30 | Analyzing Transport Policies in Developing Countries with ABM | Kathleen Salazar-Serna et.al. | 2404.19745 | link |
2024-04-30 | Collaborative Control Method of Transit Signal Priority Based on Cooperative Game and Reinforcement Learning | Hao Qin et.al. | 2404.19683 | null |
2024-04-30 | The Drawback of Insight: Detailed Explanations Can Reduce Agreement with XAI | Sabid Bin Habib Pias et.al. | 2404.19629 | null |
2024-04-30 | Enhancing Deep Learning Model Explainability in Brain Tumor Datasets using Post-Heuristic Approaches | Konstantinos Pasvantis et.al. | 2404.19568 | null |
2024-04-30 | Choosing a consultant in a dynamic investment problem | Yuval Cornfeld et.al. | 2404.19507 | null |
2024-04-30 | The harms of class imbalance corrections for machine learning based prediction models: a simulation study | Alex Carriero et.al. | 2404.19494 | link |
2024-04-30 | Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making | Lei Zhuang et.al. | 2404.19403 | null |
2024-04-30 | Online Electricity Purchase for Data Center with Dynamic Virtual Battery from Flexibility Aggregation | Kekun Gao et.al. | 2404.19387 | null |
2024-04-30 | Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection | Zhanwei Zhang et.al. | 2404.19384 | null |
2024-04-30 | SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs | Zhigang Sun et.al. | 2404.19379 | link |
2024-04-30 | Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs | Soham Mitra et.al. | 2404.19341 | link |
2024-04-30 | G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction | Zhanwei Zhang et.al. | 2404.19330 | link |
2024-04-30 | Bias Mitigation via Compensation: A Reinforcement Learning Perspective | Nandhini Swaminathan et.al. | 2404.19256 | null |
2024-04-29 | Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Javier Antoran et.al. | 2404.19157 | null |
2024-04-29 | Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP | Sanjana Gautam et.al. | 2404.19071 | null |
2024-04-29 | Synthesizing the Born rule with reinforcement learning | Rodrigo S. Piera et.al. | 2404.19011 | null |
2024-04-29 | Detecting critical treatment effect bias in small subgroups | Piersilvio De Bartolomeis et.al. | 2404.18905 | link |
2024-04-29 | Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models | Xingyuan Zhang et.al. | 2404.18896 | link |
2024-04-29 | PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control | Jasper Hoffmann et.al. | 2404.18863 | null |
2024-04-29 | Safe Reach Set Computation via Neural Barrier Certificates | Alessandro Abate et.al. | 2404.18813 | null |
2024-04-29 | Three-state Opinion Dynamics for Financial Markets on Complex Networks | Bernardo J. Zubillaga et.al. | 2404.18709 | null |
2024-04-29 | Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots | Xi Xin et.al. | 2404.18702 | null |
2024-04-29 | Work Smarter…Not Harder: Efficient Minimization of Dependency Length in SOV Languages | Sidharth Ranjan et.al. | 2404.18684 | null |
2024-04-29 | LLMClean: Context-Aware Tabular Data Cleaning via LLM-Generated OFDs | Fabian Biester et.al. | 2404.18681 | null |
2024-04-29 | Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning | Annie Hu et.al. | 2404.18670 | null |
2024-04-29 | Uncertainty-boosted Robust Video Activity Anticipation | Zhaobo Qi et.al. | 2404.18648 | link |
2024-04-29 | CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Yunshuang Yuan et.al. | 2404.18617 | link |
2024-04-29 | Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing | Stefano Carlo Lambertenghi et.al. | 2404.18577 | link |
2024-04-29 | Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification | Ruben Grewal et.al. | 2404.18573 | link |
2024-04-29 | IncidentResponseGPT: Generating Traffic Incident Response Plans with Generative Artificial Intelligence | Artur Grigorev et.al. | 2404.18550 | null |
2024-04-29 | Reduced-Rank Multi-objective Policy Learning and Optimization | Ezinne Nwankwo et.al. | 2404.18490 | null |
2024-04-29 | MRIC: Model-Based Reinforcement-Imitation Learning with Mixture-of-Codebooks for Autonomous Driving Simulation | Baotian He et.al. | 2404.18464 | null |
2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
2024-04-28 | Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ) | Malur Narayan et.al. | 2404.18276 | null |
2024-04-28 | A General Causal Inference Framework for Cross-Sectional Observational Data | Yonghe Zhao et.al. | 2404.18197 | null |
2024-04-28 | Application and practice of AI technology in quantitative investment | Shuochen Bi et.al. | 2404.18184 | null |
2024-04-26 | The Role of Marketing in Public Policy Decision Making: The Case of Fuel Subsidy Removal in Nigeria | Salome O. Ighomereho et.al. | 2404.17551 | null |
2024-04-26 | CoCar NextGen: a Multi-Purpose Platform for Connected Autonomous Driving Research | Marc Heinrich et.al. | 2404.17550 | null |
2024-04-26 | A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment | Haicheng Liao et.al. | 2404.17520 | null |
2024-04-26 | Q-Learning to navigate turbulence without a map | Marco Rando et.al. | 2404.17495 | null |
2024-04-26 | Causally Abstracted Multi-armed Bandits | Fabio Massimo Zennaro et.al. | 2404.17493 | link |
2024-04-26 | A multi-agent model of hierarchical decision dynamics | Paul Kinsler et.al. | 2404.17477 | null |
2024-04-26 | Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection | Moussa Kassem Sbeyti et.al. | 2404.17427 | link |
2024-04-26 | Assessing the Potential of AI for Spatially Sensitive Nature-Related Financial Risks | Steven Reece et.al. | 2404.17369 | null |
2024-04-26 | On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System | Mohamed Roshdi et.al. | 2404.17350 | null |
2024-04-26 | Scene-Extrapolation: Generating Interactive Traffic Scenarios | Maximilian Zipfl et.al. | 2404.17224 | null |
2024-04-26 | Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving | C. Gong et.al. | 2404.17198 | null |
2024-04-26 | Online $\mathrm{L}^{\natural}$ -Convex Minimization | Ken Yokoyama et.al. | 2404.17158 | null |
2024-04-26 | On the Federated Learning Framework for Cooperative Perception | Zhenrong Zhang et.al. | 2404.17147 | null |
2024-04-25 | Defect Localization Using Region of Interest and Histogram-Based Enhancement Approaches in 3D-Printing | Md Manjurul Ahsan et.al. | 2404.17015 | null |
2024-04-25 | Evolve Cost-aware Acquisition Functions Using Large Language Models | Yiming Yao et.al. | 2404.16906 | link |
2024-04-25 | Harnessing Inferior Solutions For Superior Outcomes: Obtaining Robust Solutions From Quantum Algorithms | Pascal Halffmann et.al. | 2404.16784 | null |
2024-04-25 | SHINE: Social Homology Identification for Navigation in Crowded Environments | Diego Martinez-Baselga et.al. | 2404.16705 | null |
2024-04-25 | Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents | Giorgio Piatti et.al. | 2404.16698 | link |
2024-04-25 | Benchmarking Mobile Device Control Agents across Diverse Configurations | Juyong Lee et.al. | 2404.16660 | null |
2024-04-25 | T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients | Evandro S. Ortigossa et.al. | 2404.16495 | null |
2024-04-25 | CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations | Chen Wei et.al. | 2404.16482 | link |
2024-04-25 | DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference | Zhihao Shuai et.al. | 2404.16474 | null |
2024-04-25 | Label-Free Topic-Focused Summarization Using Query Augmentation | Wenchuan Mu et.al. | 2404.16411 | null |
2024-04-25 | ReZero: Boosting MCTS-based Algorithms by Just-in-Time and Speedy Reanalyze | Chunyu Xuan et.al. | 2404.16364 | link |
2024-04-25 | Unraveling cell-cell communication with NicheNet by inferring active ligands from transcriptomics data | Chananchida Sang-aram et.al. | 2404.16358 | null |
2024-04-25 | Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu et.al. | 2404.16356 | null |
2024-04-25 | Style Adaptation for Domain-adaptive Semantic Segmentation | Ting Li et.al. | 2404.16301 | null |
2024-04-25 | A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Yifan Zhao et.al. | 2404.16266 | link |
2024-04-24 | A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges | Melih Yazgan et.al. | 2404.16139 | null |
2024-04-24 | Cantor: Inspiring Multimodal Chain-of-Thought of MLLM | Timin Gao et.al. | 2404.16033 | null |
2024-04-24 | Learning Car-Following Behaviors Using Bayesian Matrix Normal Mixture Regression | Chengyuan Zhang et.al. | 2404.16023 | null |
2024-04-24 | Explainable AI models for predicting liquefaction-induced lateral spreading | Cheng-Hsi Hsiao et.al. | 2404.15959 | link |
2024-04-24 | Rechargeable UAV Trajectory Optimization for Real-Time Persistent Data Collection of Large-Scale Sensor Networks | Rui Wang et.al. | 2404.15761 | null |
2024-04-24 | Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning | Zuheng Kang et.al. | 2404.15704 | null |
2024-04-24 | Delay-Aware Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control with Model-based Stability Enhancement | Jiaqi Liu et.al. | 2404.15696 | null |
2024-04-24 | Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data | Aliaksei Vertsel et.al. | 2404.15604 | null |
2024-04-23 | CASPR: Automated Evaluation Metric for Contrastive Summarization | Nirupan Ananthamurugan et.al. | 2404.15565 | link |
2024-04-23 | Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction | Shili Sheng et.al. | 2404.15557 | null |
2024-04-23 | BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis | Shuhang Lin et.al. | 2404.15532 | link |
2024-04-23 | Deep Models for Multi-View 3D Object Recognition: A Review | Mona Alzahrani et.al. | 2404.15224 | null |
2024-04-23 | Evaluating Physician-AI Interaction for Cancer Management: Paving the Path towards Precision Oncology | Zeshan Hussain et.al. | 2404.15187 | null |
2024-04-23 | Bias patterns in the application of LLMs for clinical decision support: A comprehensive study | Raphael Poulain et.al. | 2404.15149 | link |
2024-04-23 | Using ARIMA to Predict the Expansion of Subscriber Data Consumption | Mike Wa Nkongolo et.al. | 2404.15095 | null |
2024-04-23 | Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It | Yuta Saito et.al. | 2404.15084 | null |
2024-04-23 | A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI | Seliem El-Sayed et.al. | 2404.15058 | null |
2024-04-23 | Conformal Predictive Systems Under Covariate Shift | Jef Jonkers et.al. | 2404.15018 | link |
2024-04-23 | OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang et.al. | 2404.15014 | null |
2024-04-23 | Vision Beyond Boundaries: An Initial Design Space of Domain-specific Large Vision Models in Human-robot Interaction | Yuchong Zhang et.al. | 2404.14965 | null |
2024-04-23 | Enhancing High-Speed Cruising Performance of Autonomous Vehicles through Integrated Deep Reinforcement Learning Framework | Jinhao Liang et.al. | 2404.14713 | null |
2024-04-23 | LaneCorrect: Self-supervised Lane Detection | Ming Nie et.al. | 2404.14671 | null |
2024-04-23 | Illuminating the Unseen: A Framework for Designing and Mitigating Context-induced Harms in Behavioral Sensing | Han Zhang et.al. | 2404.14665 | null |
2024-04-23 | AI Procurement Checklists: Revisiting Implementation in the Age of AI Governance | Tom Zick et.al. | 2404.14660 | null |
2024-04-23 | Uncertainty Quantification on Graph Learning: A Survey | Chao Chen et.al. | 2404.14642 | null |
2024-04-23 | Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Matthew Colwell et.al. | 2404.14635 | null |
2024-04-22 | A general framework for supporting economic feasibility of generator and storage energy systems through capacity and dispatch optimization | Saeed Azad et.al. | 2404.14583 | link |
2024-04-22 | Designing forecasting software for forecast users: Empowering non-experts to create and understand their own forecasts | Richard Stromer et.al. | 2404.14575 | null |
2024-04-22 | A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part I-Past and Future | Ke Li et.al. | 2404.14571 | null |
2024-04-22 | Exploring Algorithmic Explainability: Generating Explainable AI Insights for Personalized Clinical Decision Support Focused on Cannabis Intoxication in Young Adults | Tongze Zhang et.al. | 2404.14563 | null |
2024-04-22 | Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning | Zifan Zhang et.al. | 2404.14497 | null |
2024-04-22 | Analysing the interaction of expansion decisions by end customers and grid development in the context of a municipal energy system | Paul Maximilian Röhrig et.al. | 2404.14371 | null |
2024-04-22 | PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving | Jie Cheng et.al. | 2404.14327 | null |
2024-04-22 | Localization Based on MIMO Backscattering from Retro-Directive Antenna Arrays | Marina Lotti et.al. | 2404.14206 | null |
2024-04-22 | Unlawful Proxy Discrimination: A Framework for Challenging Inherently Discriminatory Algorithms | Hilde Weerts et.al. | 2404.14050 | null |
2024-04-22 | PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer | Rui She et.al. | 2404.14034 | null |
2024-04-22 | Collaborative Perception Datasets in Autonomous Driving: A Survey | Melih Yazgan et.al. | 2404.14022 | null |
2024-04-22 | Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation | Liwen Wang et.al. | 2404.13945 | null |
2024-04-22 | Open Datasets for Satellite Radio Resource Control | Husnain Shahid et.al. | 2404.13920 | null |
2024-04-22 | Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals | Qingyang Wu et.al. | 2404.13885 | null |
2024-04-22 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-21 | Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving | Shuyao Shi et.al. | 2404.13786 | null |
2024-04-21 | A Practical Multilevel Governance Framework for Autonomous and Intelligent Systems | Lukas D. Pöhler et.al. | 2404.13719 | null |
2024-04-21 | In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review | Lequn Chen et.al. | 2404.13673 | null |
2024-04-21 | Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments | Zirui Wang et.al. | 2404.13600 | null |
2024-04-20 | FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving | Ganesh Sistu et.al. | 2404.13443 | null |
2024-04-20 | Distribution Network Restoration: Resource Scheduling Considering Coupled Transportation-Power Networks | Harshal D. Kaushik et.al. | 2404.13422 | null |
2024-04-20 | On Modeling Multi-Criteria Decision Making with Uncertain Information using Probabilistic Rules | Shengxin Hong et.al. | 2404.13419 | null |
2024-04-20 | Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction | Quancheng Du et.al. | 2404.13378 | null |
2024-04-20 | Beyond Collaborative Filtering: A Relook at Task Formulation in Recommender Systems | Aixin Sun et.al. | 2404.13375 | null |
2024-04-20 | On Risk-Sensitive Decision Making Under Uncertainty | Chung-Han Hsieh et.al. | 2404.13371 | null |
2024-04-19 | Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction | Paulo Henrique dos Santos et.al. | 2404.13002 | null |
2024-04-19 | Private Agent-Based Modeling | Ayush Chopra et.al. | 2404.12983 | null |
2024-04-19 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | link |
2024-04-19 | Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Ross Greer et.al. | 2404.12856 | link |
2024-04-19 | Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet | Gazi Hasin Ishrak et.al. | 2404.12841 | null |
2024-04-19 | Open Datasets for AI-Enabled Radio Resource Control in Non-Terrestrial Networks | Husnain Shahid et.al. | 2404.12813 | null |
2024-04-19 | Algorithmic Changes Are Not Enough: Evaluating the Removal of Race Adjustment from the eGFR Equation | Marika M. Cusick et.al. | 2404.12812 | link |
2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
2024-04-19 | Camera Agnostic Two-Head Network for Ego-Lane Inference | Chaehyeon Song et.al. | 2404.12770 | null |
2024-04-19 | Immersive Analysis: Enhancing Material Inspection of X-Ray Computed Tomography Datasets in Augmented Reality | Alexander Gall et.al. | 2404.12751 | null |
2024-04-19 | Demonstration of quantum projective simulation on a single-photon-based quantum computer | Giacomo Franceschetto et.al. | 2404.12729 | null |
2024-04-19 | A Containerized Microservice Architecture for a ROS 2 Autonomous Driving Software: An End-to-End Latency Evaluation | Tobias Betz et.al. | 2404.12683 | null |
2024-04-19 | Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework | Sheng Wang et.al. | 2404.12624 | null |
2024-04-19 | Deep Reinforcement Learning-aided Transmission Design for Energy-efficient Link Optimization in Vehicular Communications | Zhengpeng Wang et.al. | 2404.12595 | null |
2024-04-19 | Multi-Objective Offloading Optimization in MEC and Vehicular-Fog Systems: A Distributed-TD3 Approach | Frezer Guteta Wakgra et.al. | 2404.12584 | null |
2024-04-19 | Just Like Me: The Role of Opinions and Personal Experiences in The Perception of Explanations in Subjective Decision-Making | Sharon Ferguson et.al. | 2404.12558 | null |
2024-04-19 | Variance-informed Rounding Uncertainty Analysis for Floating-point Statistical Models | Sahil Bhola et.al. | 2404.12556 | null |
2024-04-18 | State Discretization for Continuous-State MDPs in Infectious Disease Control | Suyanpeng Zhang et.al. | 2404.12540 | null |
2024-04-18 | TrACT: A Training Dynamics Aware Contrastive Learning Framework for Long-tail Trajectory Prediction | Junrui Zhang et.al. | 2404.12538 | null |
2024-04-18 | RoboDreamer: Learning Compositional World Models for Robot Imagination | Siyuan Zhou et.al. | 2404.12377 | null |
2024-04-18 | MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale | Xiaotang Gai et.al. | 2404.12372 | null |
2024-04-18 | Decision making in stochastic extensive form I: Stochastic decision forests | E. Emanuel Rapsch et.al. | 2404.12332 | null |
2024-04-18 | Reducing Bias in Pre-trained Models by Tuning while Penalizing Change | Niklas Penzel et.al. | 2404.12292 | null |
2024-04-18 | An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles | Jilan Samiuddin et.al. | 2404.12256 | null |
2024-04-18 | Privacy-Preserving UCB Decision Process Verification via zk-SNARKs | Xikun Jiang et.al. | 2404.12186 | null |
2024-04-18 | Stability Certificates for Receding Horizon Games | Sophie Hall et.al. | 2404.12165 | null |
2024-04-18 | The Neutrality Fallacy: When Algorithmic Fairness Interventions are (Not) Positive Action | Hilde Weerts et.al. | 2404.12143 | null |
2024-04-18 | Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing? | Rui Xu et.al. | 2404.12138 | null |
2024-04-18 | mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture | Wei Zhang et.al. | 2404.12135 | link |
2024-04-18 | Intelligence Education made in Europe | Lars Berger et.al. | 2404.12125 | null |
2024-04-18 | Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams | Pivithuru Thejan Amarasinghe et.al. | 2404.12076 | null |
2024-04-18 | emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information | Jimenez Eladio et.al. | 2404.12050 | null |
2024-04-18 | Cost and CO2 emissions co-optimisation of green hydrogen production in a grid-connected renewable energy system | Sleiman Farah et.al. | 2404.11995 | null |
2024-04-18 | S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles | Xiao Wang et.al. | 2404.11946 | null |
2024-04-18 | Toward Short-Term Glucose Prediction Solely Based on CGM Time Series | Ming Cheng et.al. | 2404.11924 | null |
2024-04-18 | JointPPO: Diving Deeper into the Effectiveness of PPO in Multi-Agent Reinforcement Learning | Chenxing Liu et.al. | 2404.11831 | null |
2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
2024-04-17 | Multimodal 3D Object Detection on Unseen Domains | Deepti Hegde et.al. | 2404.11764 | null |
2024-04-17 | Language Models Still Struggle to Zero-shot Reason about Time Series | Mike A. Merrill et.al. | 2404.11757 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-17 | Explainable Artificial Intelligence Techniques for Accurate Fault Detection and Diagnosis: A Review | Ahmed Maged et.al. | 2404.11597 | null |
2024-04-17 | Open-Ended Wargames with Large Language Models | Daniel P. Hogan et.al. | 2404.11446 | link |
2024-04-17 | Explainable Lung Disease Classification from Chest X-Ray Images Utilizing Deep Learning and XAI | Tanzina Taher Ifty et.al. | 2404.11428 | null |
2024-04-18 | SERENE: A Collusion Resilient Replication-based Verification Framework | Amir Esmaeili et.al. | 2404.11410 | null |
2024-04-17 | Pharmacokinetic Measurements in Dose Finding Model Guided by Escalation with Overdose Control | Arnab Kumar Maity et.al. | 2404.11406 | null |
2024-04-17 | Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness | Hangtao Zhang et.al. | 2404.11357 | null |
2024-04-17 | The dynamics of diversity on corporate boards | Matthias Raddant et.al. | 2404.11334 | null |
2024-04-17 | Towards Human Awareness in Robot Task Planning with Large Language Models | Yuchen Liu et.al. | 2404.11267 | null |
2024-04-17 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-17 | D-Aug: Enhancing Data Augmentation for Dynamic LiDAR Scenes | Jiaxing Zhao et.al. | 2404.11127 | null |
2024-04-17 | Reuse out-of-year data to enhance land cover mappingvia feature disentanglement and contrastive learning | Cassio F. Dantas et.al. | 2404.11114 | null |
2024-04-17 | Recommender Systems in Financial Trading: Using machine-based conviction analysis in an explainable AI investment framework | Alicia Vidler et.al. | 2404.11080 | null |
2024-04-17 | Do you need a DAO? | Henrik Axelsen et.al. | 2404.11076 | null |
2024-04-17 | Sky-GVIO: an enhanced GNSS/INS/Vision navigation with FCN-based sky-segmentation in urban canyon | Jingrong Wang et.al. | 2404.11070 | link |
2024-04-17 | Periodicity in New York State COVID-19 Hospitalizations Leveraged from the Variable Bandpass Periodic Block Bootstrap | Asmaa Ahmad et.al. | 2404.11006 | null |
2024-04-17 | How to deal with glare for improved perception of Autonomous Vehicles | Muhammad Z. Alam et.al. | 2404.10992 | null |
2024-04-17 | Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Wei Duan et.al. | 2404.10976 | link |
2024-04-17 | Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models | Jan-Philipp Fränken et.al. | 2404.10975 | link |
2024-04-16 | Human-Algorithm Collaborative Bayesian Optimization for Engineering Systems | Tom Savage et.al. | 2404.10949 | link |
2024-04-16 | N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2404.10740 | link |
2024-04-16 | PD-Insighter: A Visual Analytics System to Monitor Daily Actions for Parkinson’s Disease Treatment | Jade Kandel et.al. | 2404.10661 | null |
2024-04-16 | Towards free-response paradigm: a theory on decision-making in spiking neural networks | Zhichao Zhu et.al. | 2404.10599 | null |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-16 | PAKT: Perspectivized Argumentation Knowledge Graph and Tool for Deliberation Analysis (with Supplementary Materials) | Moritz Plenz et.al. | 2404.10570 | null |
2024-04-16 | Quantum Mechanics of Human Perception, Behaviour and Decision-Making: A Do-It-Yourself Model Kit for Modelling Optical Illusions and Opinion Formation in Social Networks | Ivan S. Maksymov et.al. | 2404.10554 | link |
2024-04-16 | Warm-Start Variational Quantum Policy Iteration | Nico Meyer et.al. | 2404.10546 | link |
2024-04-16 | LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System | Shijing Hu et.al. | 2404.10498 | null |
2024-04-16 | Would You Trust an AI Doctor? Building Reliable Medical Predictions with Kernel Dropout Uncertainty | Ubaid Azam et.al. | 2404.10483 | null |
2024-04-16 | AudioProtoPNet: An interpretable deep learning model for bird sound classification | René Heinrich et.al. | 2404.10420 | null |
2024-04-16 | Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery | Payal Varshney et.al. | 2404.10356 | null |
2024-04-16 | Application of Deep Learning Methods to Processing of Noisy Medical Video Data | Danil Afonchikov et.al. | 2404.10319 | null |
2024-04-16 | NeuroMorphix: A Novel Brain MRI Asymmetry-specific Feature Construction Approach For Seizure Recurrence Prediction | Soumen Ghosh et.al. | 2404.10290 | null |
2024-04-16 | PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network | Yuning Wang et.al. | 2404.10263 | null |
2024-04-16 | Rethinking Software Engineering in the Foundation Model Era: From Task-Driven AI Copilots to Goal-Driven AI Pair Programmers | Ahmed E. Hassan et.al. | 2404.10225 | null |
2024-04-16 | The Impact of Machine Learning on Society: An Analysis of Current Trends and Future Implications | Md Kamrul Hossain Siam et.al. | 2404.10204 | null |
2024-04-15 | Online Estimation via Offline Estimation: An Information-Theoretic Framework | Dylan J. Foster et.al. | 2404.10122 | null |
2024-04-15 | Explainable Light-Weight Deep Learning Pipeline for Improved Drought Stres | Aswini Kumar Patra et.al. | 2404.10073 | null |
2024-04-15 | Evaluating the Explainability of Attributes and Prototypes for a Medical Classification Model | Luisa Gallée et.al. | 2404.09917 | null |
2024-04-15 | Flow-Based Synthesis of Reactive Tests for Discrete Decision-Making Systems with Temporal Logic Specifications | Josefine B. Graebener et.al. | 2404.09888 | null |
2024-04-15 | Effective Reinforcement Learning Based on Structural Information Principles | Xianghua Zeng et.al. | 2404.09760 | link |
2024-04-15 | Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows | Georg Rabenstein et.al. | 2404.09657 | null |
2024-04-15 | SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction | Pin Tang et.al. | 2404.09502 | null |
2024-04-15 | Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Genjia Liu et.al. | 2404.09496 | link |
2024-04-15 | VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Bonan Ding et.al. | 2404.09431 | null |
2024-04-14 | SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation while Maintaining Stereo Constraint | Vasudha Venkatesan et.al. | 2404.09277 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-14 | A Reinforcement Learning Based Backfilling Strategy for HPC Batch Jobs | Elliot Kolker-Hicks et.al. | 2404.09264 | null |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-14 | Evaluating the efficacy of haptic feedback, 360° treadmill-integrated Virtual Reality framework and longitudinal training on decision-making performance in a complex search-and-shoot simulation | Akash K Rao et.al. | 2404.09147 | null |
2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | null |
2024-04-13 | Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Jia Gu et.al. | 2404.09043 | null |
2024-04-13 | Intention-Aware Control Based on Belief-Space Specifications and Stochastic Expansion | Zengjie Zhang et.al. | 2404.09037 | link |
2024-04-13 | An Agent-Based Model of Elephant Crop Raid Dynamics in the Periyar-Agasthyamalai Complex, India | Purathekandy Anjali et.al. | 2404.09024 | link |
2024-04-13 | Incremental Residual Concept Bottleneck Models | Chenming Shang et.al. | 2404.08978 | link |
2024-04-13 | MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes | Bor-Shiun Wang et.al. | 2404.08968 | link |
2024-04-13 | Understanding Multimodal Deep Neural Networks: A Concept Selection View | Chenming Shang et.al. | 2404.08964 | null |
2024-04-13 | Voting Participation and Engagement in Blockchain-Based Fan Tokens | Lennart Ante et.al. | 2404.08906 | null |
2024-04-12 | WROOM: An Autonomous Driving Approach for Off-Road Navigation | Dvij Kalaria et.al. | 2404.08855 | link |
2024-04-12 | A Typology of Decision-Making Tasks for Visualization | Camelia D. Brumar et.al. | 2404.08812 | null |
2024-04-12 | Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation | Brinnae Bent et.al. | 2404.08799 | link |
2024-04-12 | FusionPortableV2: A Unified Multi-Sensor Dataset for Generalized SLAM Across Diverse Platforms and Scalable Environments | Hexiang Wei et.al. | 2404.08563 | link |
2024-04-12 | Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery | Shiva Aryal et.al. | 2404.08511 | null |
2024-04-12 | Prescribing Optimal Health-Aware Operation for Urban Air Mobility with Deep Reinforcement Learning | Mina Montazeri et.al. | 2404.08497 | null |
2024-04-12 | Maturity of Vehicle Digital Twins: From Monitoring to Enabling Autonomous Driving | Robert Klar et.al. | 2404.08438 | null |
2024-04-12 | SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies | Maeghal Jain et.al. | 2404.08423 | null |
2024-04-12 | Collective Bayesian Decision-Making in a Swarm of Miniaturized Robots for Surface Inspection | Thiemen Siemensma et.al. | 2404.08390 | null |
2024-04-12 | Uncertainty Aware Tropical Cyclone Wind Speed Estimation from Satellite Data | Nils Lehmann et.al. | 2404.08325 | link |
2024-04-12 | Transfer Learning Study of Motion Transformer-based Trajectory Predictions | Lars Ullrich et.al. | 2404.08271 | null |
2024-04-12 | Enhancing Fairness and Performance in Machine Learning Models: A Multi-Task Learning Approach with Monte-Carlo Dropout and Pareto Optimality | Khadija Zanna et.al. | 2404.08230 | null |
2024-04-11 | Real-Time Detection and Analysis of Vehicles and Pedestrians using Deep Learning | Md Nahid Sadik et.al. | 2404.08081 | null |
2024-04-11 | VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning | Ming Cheng et.al. | 2404.08021 | null |
2024-04-11 | The Power of Properties: Uncovering the Influential Factors in Emotion Classification | Tim Büchner et.al. | 2404.07867 | null |
2024-04-11 | Sparse Laneformer | Ji Liu et.al. | 2404.07821 | null |
2024-04-12 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | Enhancing Valuation of Variable Annuities in Lévy Models with Stochastic Interest Rate | Ludovic Goudenège et.al. | 2404.07658 | null |
2024-04-11 | Homography Guided Temporal Fusion for Road Line and Marking Segmentation | Shan Wang et.al. | 2404.07626 | link |
2024-04-11 | International environmental treaties: An honest or a misguided effort | Reza Hafezi et.al. | 2404.07574 | null |
2024-04-11 | Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Marcel Hallgarten et.al. | 2404.07569 | link |
2024-04-11 | PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds | Weisheng Xu et.al. | 2404.07495 | link |
2024-04-11 | WESE: Weak Exploration to Strong Exploitation for LLM Agents | Xu Huang et.al. | 2404.07456 | null |
2024-04-11 | Data-Driven Portfolio Management for Motion Pictures Industry: A New Data-Driven Optimization Methodology Using a Large Language Model as the Expert | Mohammad Alipour-Vaezi et.al. | 2404.07434 | null |
2024-04-11 | Diversity’s Double-Edged Sword: Analyzing Race’s Effect on Remote Pair Programming Interactions | Shandler A. Mason et.al. | 2404.07427 | null |
2024-04-10 | Structured Reinforcement Learning for Media Streaming at the Wireless Edge | Archana Bura et.al. | 2404.07315 | null |
2024-04-10 | Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Vahid Balazadeh et.al. | 2404.07266 | link |
2024-04-10 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Machine learning-based similarity measure to forecast M&A from patent data | Giambattista Albora et.al. | 2404.07179 | link |
2024-04-10 | Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection | Linas Nasvytis et.al. | 2404.07099 | link |
2024-04-10 | Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Valentyn Boreiko et.al. | 2404.07045 | null |
2024-04-10 | LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models | Igor Tufanov et.al. | 2404.07004 | null |
2024-04-10 | Multi-Agent Soft Actor-Critic with Global Loss for Autonomous Mobility-on-Demand Fleet Control | Zeno Woywood et.al. | 2404.06975 | link |
2024-04-10 | A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks | Athanasios Karapantelakis et.al. | 2404.06946 | null |
2024-04-10 | SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving | Diankun Zhang et.al. | 2404.06892 | null |
2024-04-10 | RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds | Remco Royen et.al. | 2404.06863 | null |
2024-04-10 | Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks | Fulong Ma et.al. | 2404.06860 | null |
2024-04-10 | Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data | Aakash Kumar et.al. | 2404.06715 | null |
2024-04-09 | SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation | Waqwoya Abebe et.al. | 2404.06638 | link |
2024-04-09 | RoadBEV: Road Surface Reconstruction in Bird’s Eye View | Tong Zhao et.al. | 2404.06605 | link |
2024-04-09 | Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective | Victor-Alexandru Darvariu et.al. | 2404.06492 | null |
2024-04-09 | Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Valdecy Pereira et.al. | 2404.06370 | link |
2024-04-11 | HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Xiaolong Tang et.al. | 2404.06351 | link |
2024-04-09 | AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning | Senkang Hu et.al. | 2404.06345 | null |
2024-04-09 | Label-Efficient 3D Object Detection For Road-Side Units | Minh-Quan Dao et.al. | 2404.06256 | null |
2024-04-09 | Towards Autonomous Driving with Small-Scale Cars: A Survey of Recent Development | Dianzhao Li et.al. | 2404.06229 | null |
2024-04-09 | Intelligence and Motion Models of Continuum Robots: an Overview | Oxana Shamilyan et.al. | 2404.06171 | null |
2024-04-09 | Distributed Artificial Intelligence as a Means to Achieve Self-X-Functions for Increasing Resilience: the First Steps | Oxana Shamilyan et.al. | 2404.06159 | null |
2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
2024-04-09 | Passive None-line-of-sight imaging with arbitrary scene condition and detection pattern in small amount of prior data | Yunting Gui et.al. | 2404.06015 | null |
2024-04-09 | Feel-Good Thompson Sampling for Contextual Dueling Bandits | Xuheng Li et.al. | 2404.06013 | null |
2024-04-09 | Concept-Attention Whitening for Interpretable Skin Lesion Diagnosis | Junlin Hou et.al. | 2404.05997 | null |
2024-04-09 | Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Lillian Muyama et.al. | 2404.05913 | link |
2024-04-08 | ClusterRadar: an Interactive Web-Tool for the Multi-Method Exploration of Spatial Clusters Over Time | Lee Mason et.al. | 2404.05897 | link |
2024-04-08 | Model Predictive Control based Energy Management System for Home Energy Resiliency | Ninad Gaikwad et.al. | 2404.05873 | null |
2024-04-08 | Approaching Emergent Risks: An Exploratory Study into Artificial Intelligence Risk Management within Financial Organisations | Finlay McGee et.al. | 2404.05847 | null |
2024-04-08 | Attention-Driven Multi-Agent Reinforcement Learning: Enhancing Decisions with Expertise-Informed Tasks | Andre R Kuroswiski et.al. | 2404.05840 | null |
2024-04-09 | Dynamic Backtracking in GFlowNets: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms | Shuai Guo et.al. | 2404.05576 | null |
2024-04-08 | Evaluating Interventional Reasoning Capabilities of Large Language Models | Tejas Kasetty et.al. | 2404.05545 | null |
2024-04-08 | Decisioning Workshop 2023 | Mario Lezoche et.al. | 2404.05495 | null |
2024-04-08 | What Are the Odds? Improving the foundations of Statistical Model Checking | Tobias Meggendorfer et.al. | 2404.05424 | null |
2024-04-08 | Residual Chain Prediction for Autonomous Driving Path Planning | Liguo Zhou et.al. | 2404.05423 | null |
2024-04-08 | Logic-dependent emergence of multistability, hysteresis, and biphasic dynamics in a minimal positive feedback network with an autoloop | Akriti Srivastava et.al. | 2404.05379 | null |
2024-04-08 | A Max-Min-Max Algorithm for Large-Scale Robust Optimization | Kai Tu et.al. | 2404.05377 | null |
2024-04-08 | Human Detection from 4D Radar Data in Low-Visibility Field Conditions | Mikael Skog et.al. | 2404.05307 | null |
2024-04-08 | Detecting Every Object from Events | Haitian Zhang et.al. | 2404.05285 | link |
2024-04-08 | MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Xiahan Chen et.al. | 2404.05280 | null |
2024-04-08 | Fair Machine Guidance to Enhance Fair Decision Making in Biased People | Mingzhe Yang et.al. | 2404.05228 | null |
2024-04-08 | Maximally Forward-Looking Core Inflation | Philippe Goulet Coulombe et.al. | 2404.05209 | null |
2024-04-08 | GloSoFarID: Global multispectral dataset for Solar Farm IDentification in satellite imagery | Zhiyuan Yang et.al. | 2404.05180 | link |
2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159 | null |
2024-04-08 | UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather | Haimei Zhao et.al. | 2404.05145 | null |
2024-04-08 | Enhancing Clinical Efficiency through LLM: Discharge Note Generation for Cardiac Patients | HyoJe Jung et.al. | 2404.05144 | null |
2024-04-08 | Better Monocular 3D Detectors with LiDAR from the Past | Yurong You et.al. | 2404.05139 | link |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-07 | Dir-SPGLM: A Bayesian semiparametric GLM with data-driven reference distribution | Entejar Alam et.al. | 2404.05060 | null |
2024-04-07 | Percentile Criterion Optimization in Offline Reinforcement Learning | Elita A. Lobo et.al. | 2404.05055 | link |
2024-04-05 | Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology | Gaith Rjoub et.al. | 2404.04205 | null |
2024-04-05 | Exploring Probabilistic Models for Semi-supervised Learning | Jianfeng Wang et.al. | 2404.04199 | null |
2024-04-05 | You Can Use But Cannot Recognize: Preserving Visual Privacy in Deep Neural Networks | Qiushi Li et.al. | 2404.04098 | null |
2024-04-05 | The forgotten pillar of sustainability: development of the S-assessment tool to evaluate Organizational Social Sustainability | Alessandro Annarelli et.al. | 2404.04077 | null |
2024-04-05 | Bidirectional Human Interactive AI Framework for Social Robot Navigation | Tuba Girgin et.al. | 2404.04069 | null |
2024-04-05 | Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems | Apoorva Nalini Pradeep Kumar et.al. | 2404.03995 | null |
2024-04-05 | Modulation of metastable ensemble dynamics explains optimal coding at moderate arousal in auditory cortex | Lia Papadopoulos et.al. | 2404.03902 | null |
2024-04-05 | Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable AI | Maryam Ahmed et.al. | 2404.03892 | null |
2024-04-05 | Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration | Xudong Guo et.al. | 2404.03869 | null |
2024-04-05 | Scaling Motion Forecasting Models with Ensemble Distillation | Scott Ettinger et.al. | 2404.03843 | null |
2024-04-04 | An ExplainableFair Framework for Prediction of Substance Use Disorder Treatment Completion | Mary M. Lucas et.al. | 2404.03833 | null |
2024-04-04 | Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation | Elham Amin Mansour et.al. | 2404.03799 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-04 | AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent | Hanyu Lai et.al. | 2404.03648 | link |
2024-04-04 | Is CLIP the main roadblock for fine-grained open-world perception? | Lorenzo Bianchi et.al. | 2404.03539 | link |
2024-04-04 | Integrating Generative AI into Financial Market Prediction for Improved Decision Making | Chang Che et.al. | 2404.03523 | null |
2024-04-04 | Materials for High Temperature Digital Electronics | Dhiren K. Pradhan et.al. | 2404.03510 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-04 | Knowledge Distillation-Based Model Extraction Attack using Private Counterfactual Explanations | Fatima Ezzeddine et.al. | 2404.03348 | link |
2024-04-04 | Learning to Bid in Forward Electricity Markets Using a No-Regret Algorithm | Arega Getaneh Abate et.al. | 2404.03314 | null |
2024-04-04 | Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks | Xingran Chen et.al. | 2404.03227 | null |
2024-04-04 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
2024-04-04 | The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models | Noah Y. Siegel et.al. | 2404.03189 | null |
2024-04-03 | Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Navid Mahdian et.al. | 2404.03110 | link |
2024-04-03 | Composite Bayesian Optimization In Function Spaces Using NEON – Neural Epistemic Operator Networks | Leonardo Ferreira Guilhoto et.al. | 2404.03099 | null |
2024-04-03 | Data-Driven Goal Recognition Design for General Behavioral Agents | Robert Kasumba et.al. | 2404.03054 | null |
2024-04-03 | When Digital Twin Meets Generative AI: Intelligent Closed-Loop Network Management | Xinyu Huang et.al. | 2404.03025 | null |
2024-04-03 | Tricks from the Trade for Large-Scale Markdown Pricing: Heuristic Cut Generation for Lagrangian Decomposition | Robert Streeck et.al. | 2404.02996 | null |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | IEEE VIS Workshop on Visualization for Climate Action and Sustainability | Benjamin Bach et.al. | 2404.02743 | null |
2024-04-03 | Unsupervised Learning of Effective Actions in Robotics | Marko Zaric et.al. | 2404.02728 | link |
2024-04-03 | Towards detecting unanticipated bias in Large Language Models | Anna Kruspe et.al. | 2404.02650 | null |
2024-04-03 | On the Importance of Uncertainty in Decision-Making with Large Language Models | Nicolò Felicioni et.al. | 2404.02649 | null |
2024-04-03 | One Stack to Rule them All: To Drive Automated Vehicles, and Reach for the 4th level | Sven Ochs et.al. | 2404.02645 | null |
2024-04-04 | Vestibular schwannoma growth prediction from longitudinal MRI by time conditioned neural fields | Yunjie Chen et.al. | 2404.02614 | link |
2024-04-03 | Incremental Learning with Concept Drift Detection and Prototype-based Embeddings for Graph Stream Classification | Kleanthis Malialis et.al. | 2404.02572 | null |
2024-04-03 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
2024-04-03 | Task Agnostic Architecture for Algorithm Induction via Implicit Composition | Sahil J. Sindhi et.al. | 2404.02450 | null |
2024-04-03 | From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives | Shuxian Fan et.al. | 2404.02438 | null |
2024-04-03 | AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset | Dongsu Lee et.al. | 2404.02429 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410 | null |
2024-04-04 | CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation | Townim Faisal Chowdhury et.al. | 2404.02388 | link |
2024-04-02 | Attribution Regularization for Multimodal Paradigms | Sahiti Yerramilli et.al. | 2404.02359 | null |
2024-04-02 | From Delays to Densities: Exploring Data Uncertainty through Speech, Text, and Visualization | Chase Stokes et.al. | 2404.02317 | null |
2024-04-02 | OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment | Youshaa Murhij et.al. | 2404.02263 | link |
2024-04-02 | OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang et.al. | 2404.02227 | link |
2024-04-02 | FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning | Joel Niklaus et.al. | 2404.02127 | link |
2024-04-02 | Risk-Aware Real-Time Task Allocation for Stochastic Multi-Agent Systems under STL Specifications | Maico H. W. Engelaar et.al. | 2404.02111 | null |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-02 | A Survey on Large Language Model-Based Game Agents | Sihao Hu et.al. | 2404.02039 | link |
2024-04-02 | Enhancing Portfolio Optimization with Transformer-GAN Integration: A Novel Approach in the Black-Litterman Framework | Enmin Zhu et.al. | 2404.02029 | null |
2024-04-02 | DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning | Mengfei Du et.al. | 2404.01994 | link |
2024-04-02 | Heuristic Optimization of Amplifier Reconfiguration Process for Autonomous Driving Optical Networks | Qizhi Qiu et.al. | 2404.01949 | null |
2024-04-02 | Improving Bird’s Eye View Semantic Segmentation by Task Decomposition | Tianhao Zhao et.al. | 2404.01925 | link |
2024-04-02 | A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution | Bowen Ding et.al. | 2404.01921 | link |
2024-04-02 | Neuromorphic Split Computing with Wake-Up Radios: Architecture and Design via Digital Twinning | Jiechen Chen et.al. | 2404.01815 | null |
2024-04-02 | Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs | Ioanna Souvatzoglou et.al. | 2404.01757 | null |
2024-04-02 | Safe Interval RRT* for Scalable Multi-Robot Path Planning in Continuous Space | Joonyeol Sim et.al. | 2404.01752 | link |
2024-04-02 | Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder | Anass Bairouk et.al. | 2404.01750 | null |
2024-04-02 | Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation | Piyush Gupta et.al. | 2404.01746 | null |
2024-04-02 | Boosting Visual Recognition for Autonomous Driving in Real-world Degradations with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments | Duy-Tho Le et.al. | 2404.01686 | null |
2024-04-02 | Collaborative Optimization of Wireless Communication and Computing Resource Allocation based on Multi-Agent Federated Weighting Deep Reinforcement Learning | Junjie Wu et.al. | 2404.01638 | null |
2024-04-02 | Voice EHR: Introducing Multimodal Audio Data for Health | James Anibal et.al. | 2404.01620 | null |
2024-04-02 | Haina Storage: A Decentralized Secure Storage Framework Based on Improved Blockchain Structure | Zijian Zhou et.al. | 2404.01606 | link |
2024-04-02 | Language Model Guided Interpretable Video Action Reasoning | Ning Wang et.al. | 2404.01591 | null |
2024-03-29 | Localising the Seizure Onset Zone from Single-Pulse Electrical Stimulation Responses with a Transformer | Jamie Norris et.al. | 2403.20324 | link |
2024-03-29 | Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain | Burcu Sayin et.al. | 2403.20288 | link |
2024-03-29 | Optimal Policy Learning with Observational Data in Multi-Action Scenarios: Estimation, Risk Preference, and Potential Failures | Giovanni Cerulli et.al. | 2403.20250 | null |
2024-03-29 | A simple EEG-based decision tool for neonatal therapeutic hypothermia in hypoxic-ischemic encephalopathy | Marc Fiammante et.al. | 2403.20239 | null |
2024-03-29 | Enhancing Lithological Mapping with Spatially Constrained Bayesian Network (SCB-Net): An Approach for Field Data-Constrained Predictions with Uncertainty Evaluation | Victor Silva dos Santos et.al. | 2403.20195 | link |
2024-03-29 | Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning | Duzhen Zhang et.al. | 2403.20163 | null |
2024-03-29 | Conformal Prediction for Stochastic Decision-Making of PV Power in Electricity Markets | Yvet Renkema et.al. | 2403.20149 | null |
2024-03-29 | Application of Machine Learning Algorithms in Classifying Postoperative Success in Metabolic Bariatric Surgery: A Comprehensive Study | José Alberto Benítez-Andrades et.al. | 2403.20124 | null |
2024-03-29 | LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving | Pranjal Paul et.al. | 2403.20116 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-29 | Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Toshihiro Ota et.al. | 2403.19925 | link |
2024-03-29 | PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Ruining Yang et.al. | 2403.19893 | null |
2024-03-28 | Optimal regimes with limited resources | Aaron L. Sarvet et.al. | 2403.19842 | null |
2024-03-28 | Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Akshay Gopalkrishnan et.al. | 2403.19838 | link |
2024-03-28 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation | Qitian Ma et.al. | 2403.19826 | null |
2024-03-28 | A Digital Twin for Geological Carbon Storage with Controlled Injectivity | Abhinav Prakash Gahlot et.al. | 2403.19819 | null |
2024-03-28 | Human-compatible driving partners through data-regularized self-play reinforcement learning | Daphne Cornelisse et.al. | 2403.19648 | link |
2024-03-28 | In the driver’s mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles | Samir H. A. Mohammad et.al. | 2403.19637 | null |
2024-03-28 | Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction | Drew T. Nguyen et.al. | 2403.19605 | link |
2024-03-28 | Behavior Trees in Industrial Applications: A Case Study in Underground Explosive Charging | Mattias Hallen et.al. | 2403.19602 | null |
2024-03-28 | Swarm Characteristics Classification Using Neural Networks | Donald W. Peltier III et.al. | 2403.19572 | link |
2024-03-28 | Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization | Simon Idoko et.al. | 2403.19461 | link |
2024-03-28 | Transparent and Clinically Interpretable AI for Lung Cancer Detection in Chest X-Rays | Amy Rafferty et.al. | 2403.19444 | null |
2024-03-28 | SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control | Binyuan Huang et.al. | 2403.19438 | null |
2024-03-28 | Learning a Formally Verified Control Barrier Function in Stochastic Environment | Manan Tayal et.al. | 2403.19332 | link |
2024-03-28 | A Machine Learning Approach for Crop Yield and Disease Prediction Integrating Soil Nutrition and Weather Factors | Forkan Uddin Ahmed et.al. | 2403.19273 | null |
2024-03-28 | Evaluating Fair Feature Selection in Machine Learning for Healthcare | Md Rahat Shahriar Zawad et.al. | 2403.19165 | null |
2024-03-28 | Gamu Blue: A Practical Tool for Game Theory Security Equilibria | Ameer Taweel et.al. | 2403.19130 | link |
2024-03-28 | CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao et.al. | 2403.19104 | null |
2024-03-28 | GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving | Yunpeng Zhang et.al. | 2403.19098 | link |
2024-03-27 | GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning | Hsin-Jung Yang et.al. | 2403.19062 | null |
2024-03-27 | Ensuring Safe Autonomy: Navigating the Future of Autonomous Vehicles | Patrick Wolf et.al. | 2403.19006 | null |
2024-03-27 | LORD: Large Models based Opposite Reward Design for Autonomous Driving | Xin Ye et.al. | 2403.18965 | null |
2024-03-27 | 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation | Ehsan Latif et.al. | 2403.18778 | null |
2024-03-27 | Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Xintong Wang et.al. | 2403.18715 | link |
2024-03-27 | Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks | Levent Ögretmen et.al. | 2403.18643 | link |
2024-03-27 | Modeling Sustainable City Trips: Integrating CO2 Emissions, Popularity, and Seasonality into Tourism Recommender Systems | Ashmi Banerjee et.al. | 2403.18604 | null |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-27 | Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks | Tian Ye et.al. | 2403.18318 | null |
2024-03-27 | Manipulating Neural Path Planners via Slight Perturbations | Zikang Xiong et.al. | 2403.18256 | null |
2024-03-27 | From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Ergon Cugler de Moraes Silva et.al. | 2403.18219 | link |
2024-03-27 | Preference-Based Planning in Stochastic Environments: From Partially-Ordered Temporal Goals to Most Preferred Policies | Hazhar Rahmani et.al. | 2403.18212 | null |
2024-03-27 | Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Xuemin Hu et.al. | 2403.18209 | null |
2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | null |
2024-03-27 | Integrating urban digital twins with cloud-based geospatial dashboards for coastal resilience planning: A case study in Florida | Changjie Chen et.al. | 2403.18188 | null |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-27 | Empowering Data Mesh with Federated Learning | Haoyuan Li et.al. | 2403.17878 | link |
2024-03-26 | Counterfactual Fairness through Transforming Data Orthogonal to Bias | Shuyi Chen et.al. | 2403.17852 | null |
2024-03-26 | Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving | Axel Brunnbauer et.al. | 2403.17805 | link |
2024-03-27 | Query Refinement for Diverse Top- $k$ Selection | Felix S. Campbell et.al. | 2403.17786 | null |
2024-03-26 | LiDAR-Based Crop Row Detection Algorithm for Over-Canopy Autonomous Navigation in Agriculture Fields | Ruiji Liu et.al. | 2403.17774 | link |
2024-03-26 | Optimization-based Prompt Injection Attack to LLM-as-a-Judge | Jiawen Shi et.al. | 2403.17710 | link |
2024-03-26 | Healthcare Data Governance, Privacy, and Security – A Conceptual Framework | Amen Faridoon et.al. | 2403.17648 | null |
2024-03-27 | Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering | Pascal Tilli et.al. | 2403.17647 | link |
2024-03-26 | Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems | Siyu Wang et.al. | 2403.17634 | null |
2024-03-26 | UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps | Maciej K Wozniak et.al. | 2403.17633 | link |
2024-03-26 | Quadratic speed-ups in quantum kernelized binary classification | Jungyun Lee et.al. | 2403.17453 | null |
2024-03-26 | Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion | Kazi Shahriar Sanjid et.al. | 2403.17432 | null |
2024-03-26 | A Survey on Resource Management in Joint Communication and Computing-Embedded SAGIN | Qian Chen et.al. | 2403.17400 | null |
2024-03-26 | AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving | Mingfu Liang et.al. | 2403.17373 | null |
2024-03-26 | Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent | Paula Stocco et.al. | 2403.17358 | link |
2024-03-26 | Deep Support Vectors | Junhoo Lee et.al. | 2403.17329 | null |
2024-03-27 | Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng et.al. | 2403.17301 | link |
2024-03-25 | Review Ecosystems to access Educational XR Experiences: a Scoping Review | Shaun Bangay et.al. | 2403.17243 | null |
2024-03-25 | SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving | Yiming Xie et.al. | 2403.17094 | null |
2024-03-25 | Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making | Shuai Ma et.al. | 2403.16812 | null |
2024-03-25 | An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems | Hanqing Yang et.al. | 2403.16809 | link |
2024-03-25 | A Blotto Game Approach to Ride-hailing Markets with Electric Vehicles | Marko Maljkovic et.al. | 2403.16755 | null |
2024-03-25 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | Instantaneous Visual Analysis of Blood Flow in Stenoses Using Morphological Similarity | Pepe Eulzer et.al. | 2403.16653 | null |
2024-03-25 | Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts | Rabindra Lamsal et.al. | 2403.16614 | null |
2024-03-25 | ROXIE: Defining a Robotic eXplanation and Interpretability Engine | Francisco J. Rodríguez-Lera et.al. | 2403.16606 | null |
2024-03-25 | Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art | Neeloy Chakraborty et.al. | 2403.16527 | null |
2024-03-25 | Harnessing the power of LLMs for normative reasoning in MASs | Bastin Tony Roy Savarimuthu et.al. | 2403.16524 | null |
2024-03-25 | Learning To Guide Human Decision Makers With Vision-Language Models | Debodeep Banerjee et.al. | 2403.16501 | null |
2024-03-25 | RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection | Zhiwei Lin et.al. | 2403.16440 | link |
2024-03-25 | An image-computable model of speeded decision-making | Paul I. Jaffe et.al. | 2403.16382 | link |
2024-03-25 | ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving | Yinke Dong et.al. | 2403.16374 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-25 | MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Yasamin Medghalchi et.al. | 2403.16335 | link |
2024-03-24 | Social Deliberation vs. Social Contracts in Self-Governing Voluntary Organisations | Matthew Scott et.al. | 2403.16329 | null |
2024-03-24 | MRSch: Multi-Resource Scheduling for HPC | Boyang Li et.al. | 2403.16298 | link |
2024-03-24 | Engineering Safety Requirements for Autonomous Driving with Large Language Models | Ali Nouri et.al. | 2403.16289 | null |
2024-03-24 | Sample Empirical Likelihood Methods for Causal Inference | Jingyue Huang et.al. | 2403.16283 | null |
2024-03-24 | The Evolution of Football Betting- A Machine Learning Approach to Match Outcome Forecasting and Bookmaker Odds Estimation | Purnachandra Mandadapu et.al. | 2403.16282 | null |
2024-03-24 | Interference Management for Integrated Sensing and Communication Systems: A Survey | Yangyang Niu et.al. | 2403.16189 | null |
2024-03-24 | Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation | Manisha Natarajan et.al. | 2403.16178 | link |
2024-03-24 | Self-Supervised Multi-Frame Neural Scene Flow | Dongrui Liu et.al. | 2403.16116 | null |
2024-03-22 | Can large language models explore in-context? | Akshay Krishnamurthy et.al. | 2403.15371 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-03-22 | CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Nicolas Baumann et.al. | 2403.15313 | link |
2024-03-22 | Measuring Gender and Racial Biases in Large Language Models | Jiafu An et.al. | 2403.15281 | null |
2024-03-22 | IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin et.al. | 2403.15241 | link |
2024-03-22 | Robust optimization for adversarial learning with finite sample complexity guarantees | André Bertolace et.al. | 2403.15207 | null |
2024-03-22 | An Agent-Centric Perspective on Norm Enforcement and Sanctions | Elena Yan et.al. | 2403.15128 | link |
2024-03-22 | Learning from Visual Demonstrations through Differentiable Nonlinear MPC for Personalized Autonomous Driving | Flavia Sofia Acerbo et.al. | 2403.15102 | null |
2024-03-22 | End-to-End Mineral Exploration with Artificial Intelligence and Ambient Noise Tomography | Jack Muir et.al. | 2403.15095 | null |
2024-03-22 | Robust Conformal Prediction under Distribution Shift via Physics-Informed Structural Causal Model | Rui Xu et.al. | 2403.15025 | null |
2024-03-22 | Extracting Human Attention through Crowdsourced Patch Labeling | Minsuk Chang et.al. | 2403.15013 | null |
2024-03-22 | Tri-Perspective View Decomposition for Geometry-Aware Depth Completion | Zhiqiang Yan et.al. | 2403.15008 | null |
2024-03-22 | Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline | Shuhao Li et.al. | 2403.14941 | link |
2024-03-22 | A Stochastic Model-Based Control Methodology for Glycemic Management in the Intensive Care Unit | Melike Sirlanci et.al. | 2403.14934 | null |
2024-03-21 | Establishing a leader in a pairwise comparisons method | Jacek Szybowski et.al. | 2403.14885 | null |
2024-03-21 | Consensus formation in quality-sensitive interdependent agent systems | David March-Pons et.al. | 2403.14856 | null |
2024-03-21 | ReAct Meets ActRe: Autonomous Annotations of Agent Trajectories for Contrastive Self-Training | Zonghan Yang et.al. | 2403.14589 | null |
2024-03-21 | Physics-Based Causal Reasoning for Safe & Robust Next-Best Action Selection in Robot Manipulation Tasks | Ricardo Cannizzaro et.al. | 2403.14488 | null |
2024-03-21 | The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs) | Joschka Haltaufderheide et.al. | 2403.14473 | null |
2024-03-21 | SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Lizhe Liu et.al. | 2403.14366 | null |
2024-03-21 | Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives | Jiaxin Liu et.al. | 2403.14341 | null |
2024-03-21 | Investigating the validity of structure learning algorithms in identifying risk factors for intervention in patients with diabetes | Sheresh Zahoor et.al. | 2403.14327 | null |
2024-03-21 | UAV-Assisted Maritime Search and Rescue: A Holistic Approach | Martin Messmer et.al. | 2403.14281 | null |
2024-03-21 | Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation | Minqin Zhu et.al. | 2403.14232 | link |
2024-03-21 | MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation | Longzheng Wang et.al. | 2403.14171 | link |
2024-03-21 | Hypothesis-Driven Deep Learning for Out of Distribution Detection | Yasith Jayawardana et.al. | 2403.14058 | null |
2024-03-20 | Spatial Fairness: The Case for its Importance, Limitations of Existing Work, and Guidelines for Future Research | Nripsuta Ani Saxena et.al. | 2403.14040 | null |
2024-03-20 | Pricing-driven Development and Operation of SaaS : Challenges and Opportunities | Alejandro García-Fernández et.al. | 2403.14007 | null |
2024-03-20 | “This is not a data problem”: Algorithms and Power in Public Higher Education in Canada | Kelly McConvey et.al. | 2403.13969 | null |
2024-03-20 | Sequential Modeling of Complex Marine Navigation: Case Study on a Passenger Vessel (Student Abstract) | Yimeng Fan et.al. | 2403.13909 | link |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts | Guangzeng Han et.al. | 2403.13786 | link |
2024-03-20 | Towards Principled Representation Learning from Videos for Reinforcement Learning | Dipendra Misra et.al. | 2403.13765 | link |
2024-03-20 | Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Luca Giamattei et.al. | 2403.13729 | null |
2024-03-20 | Multimodal Variational Autoencoder for Low-cost Cardiac Hemodynamics Instability Detection | Mohammod N. I. Suvon et.al. | 2403.13658 | link |
2024-03-21 | Adversarial Attacks and Defenses in Automated Control Systems: A Comprehensive Benchmark | Vitaliy Pozdnyakov et.al. | 2403.13502 | link |
2024-03-20 | Uncertainty quantification for data-driven weather models | Christopher Bülte et.al. | 2403.13458 | link |
2024-03-20 | IndiTag: An Online Media Bias Analysis and Annotation System Using Fine-Grained Bias Indicators | Luyang Lin et.al. | 2403.13446 | link |
2024-03-21 | AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving | Xiaosong Jia et.al. | 2403.13331 | null |
2024-03-20 | AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting | Mengyu Yang et.al. | 2403.13282 | null |
2024-03-20 | Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations | Kewei Wang et.al. | 2403.13261 | link |
2024-03-20 | A Rule-Compliance Path Planner for Lane-Merge Scenarios Based on Responsibility-Sensitive Safety | Pengfei Lin et.al. | 2403.13251 | null |
2024-03-20 | Diffusion Model for Data-Driven Black-Box Optimization | Zihao Li et.al. | 2403.13219 | null |
2024-03-19 | Fast Value Tracking for Deep Reinforcement Learning | Frank Shih et.al. | 2403.13178 | null |
2024-03-19 | Interspecific dispersal constraints suppress pattern formation in metacommunities | Patrick Lawton et.al. | 2403.13098 | null |
2024-03-19 | Yell At Your Robot: Improving On-the-Fly from Language Corrections | Lucy Xiaoyang Shi et.al. | 2403.12910 | null |
2024-03-19 | Tighter Confidence Bounds for Sequential Kernel Regression | Hamish Flynn et.al. | 2403.12732 | null |
2024-03-19 | Deciphering AutoML Ensembles: cattleia’s Assistance in Decision-Making | Anna Kozak et.al. | 2403.12664 | null |
2024-03-19 | A Practical Guide to Statistical Distances for Evaluating Generative Models in Science | Sebastian Bischoff et.al. | 2403.12636 | link |
2024-03-19 | M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Dongyang Xu et.al. | 2403.12552 | null |
2024-03-19 | Embodied LLM Agents Learn to Cooperate in Organized Teams | Xudong Guo et.al. | 2403.12482 | link |
2024-03-19 | INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations | Lirui Luo et.al. | 2403.12451 | link |
2024-03-19 | On Predictive planning and counterfactual learning in active inference | Aswin Paul et.al. | 2403.12417 | link |
2024-03-19 | Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Kuang-Da Wang et.al. | 2403.12406 | link |
2024-03-19 | Hierarchical Digital Twin for Efficient 6G Network Orchestration via Adaptive Attribute Selection and Scalable Network Modeling | Pengyi Jia et.al. | 2403.12398 | null |
2024-03-18 | The Best of Many Robustness Criteria in Decision Making: Formulation and Application to Robust Pricing | Jerry Anunrojwong et.al. | 2403.12260 | null |
2024-03-18 | Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving | Shahin Atakishiyev et.al. | 2403.12176 | null |
2024-03-18 | HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Ce Zhang et.al. | 2403.12033 | link |
2024-03-18 | Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Da-Wei Zhou et.al. | 2403.12030 | link |
2024-03-18 | From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models | Kung-Hsiang Huang et.al. | 2403.12027 | link |
2024-03-18 | Supervised Fine-Tuning as Inverse Reinforcement Learning | Hao Sun et.al. | 2403.12017 | null |
2024-03-18 | Proposal of a general framework to categorize continuous predictor variables | Irantzu Barrio et.al. | 2403.11983 | null |
2024-03-18 | Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Christian Schlauch et.al. | 2403.11966 | null |
2024-03-18 | Probabilistic Calibration by Design for Neural Network Regression | Victor Dheur et.al. | 2403.11964 | link |
2024-03-18 | AI-Assisted Cervical Cancer Screening | Kanchan Poudel et.al. | 2403.11936 | null |
2024-03-18 | BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Jonas Schramm et.al. | 2403.11761 | link |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | Sensitivity Assessment of Multi-Criteria Decision-Making Methods in Chemical Engineering Optimization Applications | Seyed Reza Nabavi et.al. | 2403.11569 | null |
2024-03-18 | OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System | Chih-Chung Hsu et.al. | 2403.11536 | null |
2024-03-18 | State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Yuto Tanimoto et.al. | 2403.11520 | link |
2024-03-18 | SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications | Amira Guesmi et.al. | 2403.11515 | null |
2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | null |
2024-03-18 | LLM Guided Evolution - The Automation of Models Advancing Models | Clint Morris et.al. | 2403.11446 | link |
2024-03-18 | Demystifying Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making | Hanxi Wan et.al. | 2403.11432 | null |
2024-03-17 | Driving Style Alignment for LLM-powered Driver Agent | Ruoxuan Yang et.al. | 2403.11368 | link |
2024-03-17 | Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving | Matt Schmittle et.al. | 2403.11298 | null |
2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297 | null |
2024-03-17 | Barely Random Algorithms for Metrical Task Systems | Romain Cosson et.al. | 2403.11267 | null |
2024-03-17 | A learning-based solution approach to the application placement problem in mobile edge computing under uncertainty | Taha-Hossein Hejazi et.al. | 2403.11259 | null |
2024-03-17 | Learning-Based Pricing and Matching for Two-Sided Queues | Zixian Yang et.al. | 2403.11093 | null |
2024-03-17 | Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping | Haoxi Zhang et.al. | 2403.11073 | null |
2024-03-17 | Large Language Models Powered Context-aware Motion Prediction | Xiaoji Zheng et.al. | 2403.11057 | link |
2024-03-17 | JustQ: Automated Deployment of Fair and Accurate Quantum Neural Networks | Ruhan Wang et.al. | 2403.11048 | null |
2024-03-17 | From Pixels to Predictions: Spectrogram and Vision Transformer for Better Time Series Forecasting | Zhen Zeng et.al. | 2403.11047 | null |
2024-03-16 | Advancing multivariate time series similarity assessment: an integrated computational approach | Franck Tonle et.al. | 2403.11044 | null |
2024-03-15 | Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst? | Bruno de Melo et.al. | 2403.10482 | null |
2024-03-15 | Gradient based Feature Attribution in Explainable AI: A Technical Review | Yongjie Wang et.al. | 2403.10415 | null |
2024-03-15 | Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search | Hongyuan Yu et.al. | 2403.10413 | link |
2024-03-15 | Evaluating Perceptual Distances by Fitting Binomial Distributions to Two-Alternative Forced Choice Data | Alexander Hepburn et.al. | 2403.10390 | null |
2024-03-15 | Regret Minimization via Saddle Point Optimization | Johannes Kirschner et.al. | 2403.10379 | null |
2024-03-15 | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang et.al. | 2403.10353 | link |
2024-03-15 | Interactive Trimming against Evasive Online Data Manipulation Attacks: A Game-Theoretic Approach | Yue Fu et.al. | 2403.10313 | null |
2024-03-15 | Designing User-Centered Simulations of Leadership Situations for Cave Automatic Virtual Environments: Development and Usability Study | Francesco Vona et.al. | 2403.10312 | null |
2024-03-15 | A Multi-constraint and Multi-objective Allocation Model for Emergency Rescue in IoT Environment | Xinrun Xu et.al. | 2403.10299 | null |
2024-03-15 | The long-term and disparate impact of job loss on individual mobility behaviour | Simone Centellegher et.al. | 2403.10276 | null |
2024-03-15 | Interpretable Machine Learning for Survival Analysis | Sophie Hanna Langbein et.al. | 2403.10250 | link |
2024-03-15 | CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning | Yukun Li et.al. | 2403.10245 | link |
2024-03-15 | Explainability through uncertainty: Trustworthy decision-making with neural networks | Arthur Thuy et.al. | 2403.10168 | null |
2024-03-15 | RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception | Ruiyang Hao et.al. | 2403.10145 | link |
2024-03-15 | Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Hang Zhang et.al. | 2403.10107 | null |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | link |
2024-03-15 | Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation | Jingyi Xu et.al. | 2403.10001 | link |
2024-03-15 | Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries | Swetha Ganesh et.al. | 2403.09940 | null |
2024-03-14 | Reality Bites: Assessing the Realism of Driving Scenarios with Large Language Models | Jiahui Wu et.al. | 2403.09906 | link |
2024-03-14 | Robust Subgraph Learning by Monitoring Early Training Representations | Sepideh Neshatfar et.al. | 2403.09901 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Scalable Autonomous Drone Flight in the Forest with Visual-Inertial SLAM and Dense Submaps Built without LiDAR | Sebastián Barbas Laina et.al. | 2403.09596 | null |
2024-03-14 | Iterative Forgetting: Online Data Stream Regression Using Database-Inspired Adaptive Granulation | Niket Kathiriya et.al. | 2403.09588 | null |
2024-03-14 | Are you a robot? Detecting Autonomous Vehicles from Behavior Analysis | Fabio Maresca et.al. | 2403.09571 | null |
2024-03-14 | Characterization of Polarimetric Properties in Various Brain Tumor Types Using Wide-Field Imaging Mueller Polarimetry | Romane Gros et.al. | 2403.09561 | null |
2024-03-14 | “Are You Really Sure?” Understanding the Effects of Human Self-Confidence Calibration in AI-Assisted Decision Making | Shuai Ma et.al. | 2403.09552 | null |
2024-03-14 | On STPA for Distributed Development of Safe Autonomous Driving: An Interview Study | Ali Nouri et.al. | 2403.09509 | null |
2024-03-14 | An Industrial Experience Report about Challenges from Continuous Monitoring, Improvement, and Deployment for Autonomous Driving Features | Ali Nouri et.al. | 2403.09474 | null |
2024-03-14 | Exploring the Interplay of Intrinsic Fluctuation and Complexity in Intracellular Calcium Dynamics | Athokpam Langlen Chanu et.al. | 2403.09386 | null |
2024-03-14 | EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection | Jiaqing Zhang et.al. | 2403.09323 | link |
2024-03-14 | Generating Feasible and Plausible Counterfactual Explanations for Outcome Prediction of Business Processes | Alexander Stevens et.al. | 2403.09232 | link |
2024-03-14 | Unlocking the Potential of Open Government Data: Exploring the Strategic, Technical, and Application Perspectives of High-Value Datasets Opening in Taiwan | Hsien-Lee Tseng et.al. | 2403.09216 | null |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents | Yao Fu et.al. | 2403.08978 | null |
2024-03-13 | Managing Distributional Ambiguity in Stochastic Optimization through a Statistical Upper Bound Framework | Shixin Liu et.al. | 2403.08966 | null |
2024-03-13 | Language-based game theory in the age of artificial intelligence | Valerio Capraro et.al. | 2403.08944 | null |
2024-03-13 | FogGuard: guarding YOLO against fog using perceptual loss | Soheil Gharatappeh et.al. | 2403.08939 | link |
2024-03-13 | CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Chenbin Pan et.al. | 2403.08919 | null |
2024-03-13 | A Framework for Strategic Discovery of Credible Neural Network Surrogate Models under Uncertainty | Pratyush Kumar Singh et.al. | 2403.08901 | null |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
2024-03-13 | Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework | Jingling Li et.al. | 2403.08743 | null |
2024-03-13 | Optimal sub-Gaussian variance proxy for truncated Gaussian and exponential random variables | Mathias Barreto et.al. | 2403.08628 | null |
2024-03-13 | Towards a Privacy and Security-Aware Framework for Ethical AI: Guiding the Development and Assessment of AI Systems | Daria Korobenko et.al. | 2403.08624 | null |
2024-03-13 | Pig aggression classification using CNN, Transformers and Recurrent Networks | Junior Silva Souza et.al. | 2403.08528 | null |
2024-03-13 | IAMCV Multi-Scenario Vehicle Interaction Dataset | Novel Certad et.al. | 2403.08455 | null |
2024-03-13 | DeepCSHAP: Utilizing Shapley Values to Explain Deep Complex-Valued Neural Networks | Florian Eilers et.al. | 2403.08428 | null |
2024-03-13 | Causal Graph Neural Networks for Wildfire Danger Prediction | Shan Zhao et.al. | 2403.08414 | null |
2024-03-13 | LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Maonan Wang et.al. | 2403.08337 | link |
2024-03-13 | Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural Networks | Dhruv Toshniwal et.al. | 2403.08283 | null |
2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | null |
2024-03-13 | Can Large Language Models Identify Authorship? | Baixiang Huang et.al. | 2403.08213 | link |
2024-03-14 | Data Monetization Pathways and Complex Dynamic Game Equilibrium Analysis in the Energy Industry | Zongxian Wang et.al. | 2403.08082 | null |
2024-03-12 | What would Plato say? Concepts and notions from Greek philosophy applied to gamification mechanics for a meaningful and ethical gamification | Kostas Karpouzis et.al. | 2403.08041 | null |
2024-03-12 | A Review of Cybersecurity Incidents in the Food and Agriculture Sector | Ajay Kulkarni et.al. | 2403.08036 | null |
2024-03-12 | Supervised Time Series Classification for Anomaly Detection in Subsea Engineering | Ergys Çokaj et.al. | 2403.08013 | null |
2024-03-12 | When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis | Sahar Moradizeyveh et.al. | 2403.07834 | null |
2024-03-12 | FairRR: Pre-Processing for Group Fairness through Randomized Response | Xianli Zeng et.al. | 2403.07780 | link |
2024-03-12 | Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Carlos Jose Xavier Cruz et.al. | 2403.07769 | link |
2024-03-12 | Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception | Philipp Wolters et.al. | 2403.07746 | link |
2024-03-12 | Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs | Neel Kanwal et.al. | 2403.07743 | link |
2024-03-12 | DSEG-LIME - Improving Image Explanation by Hierarchical Data-Driven Segmentation | Patrick Knab et.al. | 2403.07733 | link |
2024-03-12 | A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Quoc-Vinh Lai-Dang et.al. | 2403.07542 | null |
2024-03-12 | Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | JunDa Cheng et.al. | 2403.07535 | link |
2024-03-12 | Spatiotemporal Representation Learning for Short and Long Medical Image Time Series | Chengzhi Shen et.al. | 2403.07513 | link |
2024-03-12 | Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Dipesh Tamboli et.al. | 2403.07309 | link |
2024-03-12 | Improved Algebraic Inverter Modelling for Four-Wire Power Flow Optimization | Rahmat Heidari et.al. | 2403.07285 | null |
2024-03-12 | Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Alexander Timans et.al. | 2403.07263 | link |
2024-03-12 | Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving | Adam Villaflor et.al. | 2403.07232 | null |
2024-03-11 | Bigraph Matching Weighted with Learnt Incentive Function for Multi-Robot Task Allocation | Steve Paul et.al. | 2403.07131 | null |
2024-03-11 | RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning | Raphael Trumpp et.al. | 2403.07129 | link |
2024-03-11 | Better than classical? The subtle art of benchmarking quantum machine learning models | Joseph Bowles et.al. | 2403.07059 | link |
2024-03-11 | Numerical simulation of individual coil placement – A proof-of-concept study for the prediction of recurrence after aneurysm coiling | Julian Schwarting et.al. | 2403.06889 | null |
2024-03-11 | Model Predictive Control Strategies for Electric Endurance Race Cars Accounting for Competitors Interactions | Jorn van Kampen et.al. | 2403.06885 | null |
2024-03-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | From Factor Models to Deep Learning: Machine Learning in Reshaping Empirical Asset Pricing | Junyi Ye et.al. | 2403.06779 | null |
2024-03-11 | Real-Time Multimodal Cognitive Assistant for Emergency Medical Services | Keshara Weerasinghe et.al. | 2403.06734 | link |
2024-03-11 | PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification | Mert Gulsen et.al. | 2403.06698 | link |
2024-03-11 | Maxitive functions with respect to general orders | M. Kupper et.al. | 2403.06613 | null |
2024-03-11 | Tactical Decision Making for Autonomous Trucks by Deep Reinforcement Learning with Total Cost of Operation Based Reward | Deepthi Pathare et.al. | 2403.06524 | null |
2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
2024-03-11 | CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Junda Wu et.al. | 2403.06447 | null |
2024-03-10 | LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Yun-Ang Wu et.al. | 2403.06230 | null |
2024-03-10 | IDEAS: Information-Driven EV Admission in Charging Station Considering User Impatience to Improve QoS and Station Utilization | Animesh Chattopadhyay et.al. | 2403.06223 | null |
2024-03-10 | TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision | Ruiwen Zhou et.al. | 2403.06221 | link |
2024-03-10 | On depth prediction for autonomous driving using self-supervised learning | Houssem Boulahbal et.al. | 2403.06194 | null |
2024-03-10 | Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving | Zhili Chen et.al. | 2403.06166 | null |
2024-03-10 | Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue | Jian Wang et.al. | 2403.06063 | link |
2024-03-09 | CarbonNet: How Computer Vision Plays a Role in Climate Change? Application: Learning Geomechanics from Subsurface Geometry of CCS to Mitigate Global Warming | Wei Chen et.al. | 2403.06025 | null |
2024-03-09 | End-to-end solution for linked open data query logs analytics | Dihia Lanasri et.al. | 2403.06016 | null |
2024-03-09 | Deep learning for multi-label classification of coral conditions in the Indo-Pacific via underwater photogrammetry | Xinlei Shao et.al. | 2403.05930 | link |
2024-03-09 | Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Zana Buçinca et.al. | 2403.05911 | null |
2024-03-08 | JointMotion: Joint Self-supervision for Joint Motion Prediction | Royden Wagner et.al. | 2403.05489 | link |
2024-03-08 | OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2403.05329 | null |
2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Jinyang Li et.al. | 2403.05307 | link |
2024-03-08 | Engineering consensus in static networks with unknown disruptors | Agathe Bouis et.al. | 2403.05272 | null |
2024-03-08 | Developing Federated Time-to-Event Scores Using Heterogeneous Real-World Survival Data | Siqi Li et.al. | 2403.05229 | link |
2024-03-08 | Interactive Perception for Deformable Object Manipulation | Zehang Weng et.al. | 2403.05177 | null |
2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
2024-03-08 | LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves | Jiayan Cao et.al. | 2403.05155 | null |
2024-03-08 | Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Ceyao Zhang et.al. | 2403.05149 | null |
2024-03-08 | DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception | Xiang Huang et.al. | 2403.05050 | null |
2024-03-08 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-07 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-07 | A Survey on Human-AI Teaming with Large Pre-Trained Models | Vanshika Vats et.al. | 2403.04931 | null |
2024-03-07 | Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values | Meng Qi et.al. | 2403.04753 | null |
2024-03-07 | A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures | Kensuke Nakamura et.al. | 2403.04745 | null |
2024-03-07 | Literature Review of Current Sustainability Assessment Frameworks and Approaches for Organizations | Sarah Farahdel et.al. | 2403.04717 | null |
2024-03-07 | End-to-end Conditional Robust Optimization | Abhilash Chenreddy et.al. | 2403.04670 | null |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | Cooperative Bayesian Optimization for Imperfect Agents | Ali Khoshvishkaie et.al. | 2403.04442 | null |
2024-03-07 | iTRPL: An Intelligent and Trusted RPL Protocol based on Multi-Agent Reinforcement Learning | Debasmita Dey et.al. | 2403.04416 | null |
2024-03-07 | Conjugate operators for transparent, explorable research outputs | Joseph Bond et.al. | 2403.04403 | null |
2024-03-07 | LitSim: Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-07 | Generalizing Cooperative Eco-driving via Multi-residual Task Learning | Vindula Jayawardana et.al. | 2403.04232 | null |
2024-03-07 | Incremental Bayesian Learning for Fail-Operational Control in Autonomous Driving | Lei Zheng et.al. | 2403.04143 | null |
2024-03-06 | Hitchhiker’s guide to cancer-associated lymphoid aggregates in histology images: manual and deep learning-based quantification approaches | Karina Silina et.al. | 2403.04142 | null |
2024-03-07 | Towards learning-based planning:The nuPlan benchmark for real-world autonomous driving | Napat Karnchanachari et.al. | 2403.04133 | null |
2024-03-07 | An Explainable AI Framework for Artificial Intelligence of Medical Things | Al Amin et.al. | 2403.04130 | null |
2024-03-06 | Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving | Riccardo Pieroni et.al. | 2403.04112 | null |
2024-03-06 | Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology | Omar S. M. El Nahhas et.al. | 2403.03891 | link |
2024-03-06 | Confidence-Aware Decision-Making and Control for Tool Selection | Ajith Anil Meera et.al. | 2403.03808 | null |
2024-03-06 | 3D Object Visibility Prediction in Autonomous Driving | Chuanyu Luo et.al. | 2403.03681 | null |
2024-03-06 | Learning Adversarial MDPs with Stochastic Hard Constraints | Francesco Emanuele Stradi et.al. | 2403.03672 | null |
2024-03-06 | Development and evaluation of Artificial Intelligence techniques for IoT data quality assessment and curation | Laura Martín et.al. | 2403.03661 | null |
2024-03-06 | A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation | Di Zhang et.al. | 2403.03643 | null |
2024-03-06 | Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving | He Li et.al. | 2403.03541 | null |
2024-03-06 | Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling | Nicholas Kunz et.al. | 2403.03489 | link |
2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
2024-03-06 | Human vs. Machine: Language Models and Wargames | Max Lamparth et.al. | 2403.03407 | link |
2024-03-05 | RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging | Jordan Poots et.al. | 2403.03359 | null |
2024-03-05 | Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement | Rafaela Martelo et.al. | 2403.03188 | link |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | link |
2024-03-05 | Deep-Learned Compression for Radio-Frequency Signal Classification | Armani Rodriguez et.al. | 2403.03150 | null |
2024-03-05 | Language Guided Exploration for RL Agents in Text Environments | Hitesh Golchha et.al. | 2403.03141 | null |
2024-03-05 | MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding | Chun-Peng Chang et.al. | 2403.03077 | link |
2024-03-05 | SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents | Zhitao He et.al. | 2403.02959 | link |
2024-03-05 | XAI-Based Detection of Adversarial Attacks on Deepfake Detectors | Ben Pinhasov et.al. | 2403.02955 | link |
2024-03-05 | User-Driven Adaptation: Tailoring Autonomous Driving Systems with Dynamic Preferences | Mingyue Zhang et.al. | 2403.02928 | null |
2024-03-05 | Risk-Constrained Community Battery Utilisation Optimisation for Electric Vehicle Charging with Photovoltaic Resources | Khalil Gholami et.al. | 2403.02927 | null |
2024-03-05 | Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization | Yuan Lin et.al. | 2403.02882 | null |
2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
2024-03-05 | HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative | Cong Ma et.al. | 2403.02640 | null |
2024-03-05 | World Models for Autonomous Driving: An Initial Survey | Yanchen Guan et.al. | 2403.02622 | null |
2024-03-05 | Deep Cooperation in ISAC System: Resource, Node and Infrastructure Perspectives | Zhiqing Wei et.al. | 2403.02565 | null |
2024-03-04 | MORBDD: Multiobjective Restricted Binary Decision Diagrams by Learning to Sparsify | Rahul Patel et.al. | 2403.02482 | null |
2024-03-04 | The Ink Splotch Effect: A Case Study on ChatGPT as a Co-Creative Game Designer | Asad Anjum et.al. | 2403.02454 | null |
2024-03-04 | Uncertainty-Aware Prediction and Application in Planning for Autonomous Driving: Definitions, Methods, and Comparison | Wenbo Shao et.al. | 2403.02297 | null |
2024-03-04 | Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling | Pål V. Johnsen et.al. | 2403.02150 | link |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-03 | Optimization decision model of vegetable stock and pricing based on TCN-Attention and genetic algorithm | Linhan Xia et.al. | 2403.01367 | null |
2024-03-02 | Summary Paper: Use Case on Building Collaborative Safe Autonomous Systems-A Robotdog for Guiding Visually Impaired People | Aman Malhotra et.al. | 2403.01286 | null |
2024-03-02 | Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey | Hamza Kheddar et.al. | 2403.01255 | null |
2024-03-02 | AcME-AD: Accelerated Model Explanations for Anomaly Detection | Valentina Zaccaria et.al. | 2403.01245 | null |
2024-03-02 | On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Kaituo Feng et.al. | 2403.01238 | link |
2024-03-02 | Results and Lessons Learned from Autonomous Driving Transportation Services in Airfield, Crowded Indoor, and Urban Environments | Doosan Baek et.al. | 2403.01233 | null |
2024-03-02 | Control of cascading failures using protective measures | Davood Fazli et.al. | 2403.01205 | null |
2024-03-01 | On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Awni Altabaa et.al. | 2403.00993 | null |
2024-03-01 | Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Ratio Analysis and Best-of-Both-Worlds | Shinji Ito et.al. | 2403.00715 | null |
2024-03-01 | Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents | Dominik Jeurissen et.al. | 2403.00690 | link |
2024-03-01 | Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change | Ruichen Xu et.al. | 2403.00446 | null |
2024-03-01 | MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes | Xiaqiang Tang et.al. | 2403.00353 | null |
2024-03-01 | Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode | Jinyang Jiang et.al. | 2403.00318 | null |
2024-03-01 | Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale | Emile Anand et.al. | 2403.00222 | null |
2024-02-29 | Identification of important nodes in the information propagation network based on the artificial intelligence method | Bin Yuan et.al. | 2403.00190 | null |
2024-02-29 | Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems | Zijie Huang et.al. | 2403.00178 | null |
2024-02-29 | Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence | Marios Constantinides et.al. | 2403.00148 | null |
2024-02-29 | ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Yifei Zhou et.al. | 2402.19446 | link |
2024-02-29 | Genie: Smart ROS-based Caching for Connected Autonomous Robots | Zexin Li et.al. | 2402.19410 | null |
2024-02-29 | Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction | Wenbo Shao et.al. | 2402.19385 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | DISCERN: Designing Decision Support Interfaces to Investigate the Complexities of Workplace Social Decision-Making With Line Managers | Pranav Khadpe et.al. | 2402.19318 | null |
2024-02-29 | T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition | Zhiyuan Yang et.al. | 2402.19264 | null |
2024-02-29 | A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Haicheng Liao et.al. | 2402.19251 | link |
2024-02-29 | Prediction of vaccination coverage level in the heterogeneous mixing population | Fan Bai et.al. | 2402.19190 | null |
2024-02-29 | MemoNav: Working Memory Model for Visual Navigation | Hongxin Li et.al. | 2402.19161 | link |
2024-02-29 | ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration | Angelo Caregnato-Neto et.al. | 2402.19128 | null |
2024-02-29 | CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Domenique Zipperling et.al. | 2402.19105 | link |
2024-02-29 | GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction | Ching-Lin Lee et.al. | 2402.19002 | null |
2024-02-29 | Applications of 0-1 Neural Networks in Prescription and Prediction | Vrishabh Patil et.al. | 2402.18851 | null |
2024-02-29 | A simple model of global cascades on random hypergraphs | Lei Chen et.al. | 2402.18850 | null |
2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
2024-02-29 | On the Decision-Making Abilities in Role-Playing using Large Language Models | Chenglei Shen et.al. | 2402.18807 | null |
2024-02-29 | Conjectural Online Learning with First-order Beliefs in Asymmetric Information Stochastic Games | Tao Li et.al. | 2402.18781 | null |
2024-02-29 | The Situate AI Guidebook: Co-Designing a Toolkit to Support Multi-Stakeholder Early-stage Deliberations Around Public Sector AI Proposals | Anna Kawakami et.al. | 2402.18774 | null |
2024-02-28 | A revision on Multi-Criteria Decision Making methods for Multi-UAV Mission Planning Support | Cristian Ramirez-Atencia et.al. | 2402.18743 | null |
2024-03-01 | RORA: Robust Free-Text Rationale Evaluation | Zhengping Jiang et.al. | 2402.18678 | link |
2024-02-28 | Approaching Human-Level Forecasting with Language Models | Danny Halawi et.al. | 2402.18563 | null |
2024-02-28 | Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture | Vaishali Swaminathan et.al. | 2402.18553 | null |
2024-02-28 | FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist | Wentao Zhang et.al. | 2402.18485 | null |
2024-02-28 | Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing | Mingfei Cheng et.al. | 2402.18393 | null |
2024-02-28 | Unveiling the Potential of Robustness in Evaluating Causal Inference Models | Yiyan Huang et.al. | 2402.18392 | link |
2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Jiacheng Lin et.al. | 2402.18302 | link |
2024-02-28 | PiShield: A NeSy Framework for Learning with Requirements | Mihaela Cătălina Stoian et.al. | 2402.18285 | link |
2024-02-28 | EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods | Huiyuan Xiong et.al. | 2402.18278 | null |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-02-28 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | Jian Liu et.al. | 2402.18140 | null |
2024-02-28 | DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Jianxiong Li et.al. | 2402.18137 | link |
2024-02-28 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-02-27 | ICAT: An Indoor Connected and Autonomous Testbed for Vehicle Computing | Zhaofeng Tian et.al. | 2402.17933 | null |
2024-02-27 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-02-27 | Public Goods Games in Disease Evolution and Spread | Christo Morison et.al. | 2402.17842 | null |
2024-02-27 | Personalizing Smart Home Privacy Protection With Individuals’ Regulatory Focus: Would You Preserve or Enhance Your Information Privacy? | Reza Ghaiumy Anaraky et.al. | 2402.17838 | null |
2024-02-27 | Federated Learning for Estimating Heterogeneous Treatment Effects | Disha Makhija et.al. | 2402.17705 | null |
2024-02-27 | Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-minimum Phase Systems | Fatemeh Tavakkoli et.al. | 2402.17703 | null |
2024-02-27 | Autonomous Vehicles: Evolution of Artificial Intelligence and Learning Algorithms | Sneha Sudhir Shetiya et.al. | 2402.17690 | null |
2024-02-27 | QoS prediction in radio vehicular environments via prior user information | Noor Ul Ain et.al. | 2402.17689 | null |
2024-02-27 | Multi-Agent Deep Reinforcement Learning for Distributed Satellite Routing | Federico Lozano-Cuadra et.al. | 2402.17666 | null |
2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
2024-02-27 | Comparison of the Effects of Interaction with Intentional Agent and Artificial Intelligence using fNIRS | Mohammad Ghalavand et.al. | 2402.17650 | null |
2024-02-27 | Chronicles of CI/CD: A Deep Dive into its Usage Over Time | Hugo da Gião et.al. | 2402.17588 | null |
2024-02-27 | An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains | George Eskandar et.al. | 2402.17562 | null |
2024-02-27 | Emergency Caching: Coded Caching-based Reliable Map Transmission in Emergency Networks | Zeyu Tian et.al. | 2402.17550 | null |
2024-02-27 | Highway Discretionary Lane-change Decision and Control Using Model Predictive Control | Zishun Zheng et.al. | 2402.17524 | null |
2024-02-27 | Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Zihao Liu et.al. | 2402.17430 | link |
2024-02-27 | Determinants of LLM-assisted Decision-Making | Eva Eigner et.al. | 2402.17385 | null |
2024-02-27 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
2024-02-27 | VCD: Knowledge Base Guided Visual Commonsense Discovery in Images | Xiangqing Shen et.al. | 2402.17213 | null |
2024-02-27 | Benchmarking Data Science Agents | Yuge Zhang et.al. | 2402.17168 | link |
2024-02-27 | Video as the New Language for Real-World Decision Making | Sherry Yang et.al. | 2402.17139 | null |
2024-02-27 | Deep Reinforcement Learning (DRL)-based Methods for Serverless Stream Processing Engines: A Vision, Architectural Elements, and Future Directions | Maria R. Read et.al. | 2402.17117 | null |
2024-02-26 | Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test | Kathy Jang et.al. | 2402.17050 | null |
2024-02-26 | Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao et.al. | 2402.16973 | null |
2024-02-26 | Trajectory Prediction for Autonomous Driving Using a Transformer Network | Zhenning Li et.al. | 2402.16501 | null |
2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | Jin Ding et.al. | 2402.16479 | null |
2024-02-26 | Learning to Schedule Online Tasks with Bandit Feedback | Yongxin Xu et.al. | 2402.16463 | null |
2024-02-26 | Contingency Planning Using Bi-level Markov Decision Processes for Space Missions | Somrita Banerjee et.al. | 2402.16342 | link |
2024-02-26 | Achieving $\tilde{O}(1/ε)$ Sample Complexity for Constrained Markov Decision Process | Jiashuo Jiang et.al. | 2402.16324 | null |
2024-02-26 | From Large Language Models and Optimization to Decision Optimization CoPilot: A Research Manifesto | Segev Wasserkrug et.al. | 2402.16269 | null |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | link |
2024-02-25 | How Can LLM Guide RL? A Value-Based Approach | Shenao Zhang et.al. | 2402.16181 | link |
2024-02-25 | From Concept to Implementation: Streamlining Sensor and Actuator Selection for Collaborative Design and Engineering of Interactive Systems | İhsan Ozan Yıldırım et.al. | 2402.16084 | null |
2024-02-25 | EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings | Sunjun Kweon et.al. | 2402.16040 | link |
2024-02-25 | Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving | Hanyi Yu et.al. | 2402.16036 | null |
2024-02-24 | Predicting Outcomes in Video Games with Long Short Term Memory Networks | Kittimate Chulajata et.al. | 2402.15923 | link |
2024-02-24 | Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning | Lunet Yifru et.al. | 2402.15893 | null |
2024-02-24 | Statistical Games | Jozsef Konczer et.al. | 2402.15892 | null |
2024-02-24 | NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | Jiazhao Zhang et.al. | 2402.15852 | null |
2024-02-24 | Multiple Instance Learning for Glioma Diagnosis using Hematoxylin and Eosin Whole Slide Images: An Indian cohort Study | Ekansh Chauhan et.al. | 2402.15832 | null |
2024-02-24 | Reward Design for Justifiable Sequential Decision-Making | Aleksa Sukovic et.al. | 2402.15826 | link |
2024-02-24 | Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data | Yong Wang et.al. | 2402.15796 | null |
2024-02-24 | Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space | Yuan Lin et.al. | 2402.15790 | null |
2024-02-24 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | Lingji Chen et.al. | 2402.15756 | null |
2024-02-23 | The Sample Average Approximation Method for Solving Two-Stage Stochastic Programs with Endogenous Uncertainty | Maria Bazotte et.al. | 2402.15486 | link |
2024-02-23 | Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Yiting Wang et.al. | 2402.15469 | null |
2024-02-23 | Information-Theoretic Safe Bayesian Optimization | Alessandro G. Bottero et.al. | 2402.15347 | null |
2024-02-23 | EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Zhe Wang et.al. | 2402.15272 | link |
2024-02-23 | Multi-Agent Collaboration Framework for Recommender Systems | Zhefan Wang et.al. | 2402.15235 | link |
2024-02-23 | Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization | Homayoun Honari et.al. | 2402.15197 | null |
2024-02-23 | Multi-Armed Bandits with Abstention | Junwen Yang et.al. | 2402.15127 | null |
2024-02-23 | Large Multimodal Agents: A Survey | Junlin Xie et.al. | 2402.15116 | null |
2024-02-22 | Practice Makes Perfect: Planning to Learn Skill Parameter Policies | Nishanth Kumar et.al. | 2402.15025 | null |
2024-02-22 | On the Performance of Empirical Risk Minimization with Smoothed Data | Adam Block et.al. | 2402.14987 | null |
2024-02-22 | Unsupervised Domain Adaptation within Deep Foundation Latent Spaces | Dmitry Kangin et.al. | 2402.14976 | null |
2024-02-22 | Path Planning based on 2D Object Bounding-box | Yanliang Huang et.al. | 2402.14933 | null |
2024-02-22 | Autonomy Oriented Digital Twins for Real2Sim2Real Autoware Deployment | Chinmay Vilas Samak et.al. | 2402.14739 | link |
2024-02-22 | Doing AI: Algorithmic decision support as a human activity | Joachim Meyer et.al. | 2402.14674 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-22 | Reframing the Expected Free Energy: Four Formulations and a Unification | Théophile Champion et.al. | 2402.14460 | null |
2024-02-22 | Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems | Christina Schenk et.al. | 2402.14446 | null |
2024-02-22 | Algorithm-agnostic significance testing in supervised learning with multimodal data | Lucas Kook et.al. | 2402.14416 | link |
2024-02-22 | Human-machine social systems | Milena Tsvetkova et.al. | 2402.14410 | null |
2024-02-22 | RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation | Changsong Pang et.al. | 2402.14380 | link |
2024-02-22 | We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity | Miao Xin et.al. | 2402.14299 | null |
2024-02-22 | Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models | Jinyi Liu et.al. | 2402.14245 | null |
2024-02-22 | BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Catherine Weaver et.al. | 2402.14194 | null |
2024-02-22 | Parking of Connected Automated Vehicles: Vehicle Control, Parking Assignment, and Multi-agent Simulation | Xu Shen et.al. | 2402.14183 | null |
2024-02-21 | Blending Data-Driven Priors in Dynamic Games | Justin Lidard et.al. | 2402.14174 | null |
2024-02-21 | Unveiling Crowdfunding Futures: Analyzing Campaign Outcomes through Distributed Models and Big Data Perspectives | Giuseppe Pipitò et.al. | 2402.14111 | null |
2024-02-21 | Social Environment Design | Edwin Zhang et.al. | 2402.14090 | link |
2024-02-21 | Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Lucas Lehnert et.al. | 2402.14083 | link |
2024-02-21 | Efficient Normalized Conformal Prediction and Uncertainty Quantification for Anti-Cancer Drug Sensitivity Prediction with Deep Regression Forests | Daniel Nolte et.al. | 2402.14080 | null |
2024-02-21 | Information Elicitation in Agency Games | Serena Wang et.al. | 2402.14005 | null |
2024-02-21 | Generative Probabilistic Time Series Forecasting and Applications in Grid Operations | Xinyi Wang et.al. | 2402.13870 | null |
2024-02-21 | Voice-Driven Mortality Prediction in Hospitalized Heart Failure Patients: A Machine Learning Approach Enhanced with Diagnostic Biomarkers | Nihat Ahmadli et.al. | 2402.13812 | null |
2024-02-21 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | link |
2024-02-21 | SaGE: Evaluating Moral Consistency in Large Language Models | Vamshi Krishna Bonagiri et.al. | 2402.13709 | link |
2024-02-21 | Analyizing the Conjunction Fallacy as a Fact | Tomas Veloz et.al. | 2402.13615 | null |
2024-02-21 | Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving | Mehdi Azarafza et.al. | 2402.13602 | link |
2024-02-21 | Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating | Yifan Yanggong et.al. | 2402.13582 | null |
2024-02-21 | EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization | Zhendong Xiao et.al. | 2402.13537 | null |
2024-02-21 | Best of Many in Both Worlds: Online Resource Allocation with Predictions under Unknown Arrival Model | Lin An et.al. | 2402.13530 | null |
2024-02-21 | Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning | Liu Weiwei et.al. | 2402.13481 | null |
2024-02-21 | A rational logit dynamic for decision-making under uncertainty: well-posedness, vanishing-noise limit, and numerical approximation | Hidekazu Yoshioka et.al. | 2402.13453 | null |
2024-02-21 | A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making | Chitra Subramanian et.al. | 2402.13440 | null |
2024-02-20 | Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers | Joshua F. Cooper et.al. | 2402.13380 | null |
2024-02-20 | Referee-Meta-Learning for Fast Adaptation of Locational Fairness | Weiye Chen et.al. | 2402.13379 | null |
2024-02-20 | VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Shaoyu Chen et.al. | 2402.13243 | link |
2024-02-20 | Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies | Ammar N. Abbas et.al. | 2402.13219 | link |
2024-02-20 | Testing Calibration in Subquadratic Time | Lunjia Hu et.al. | 2402.13187 | link |
2024-02-21 | What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents | Mingyu Jin et.al. | 2402.13184 | link |
2024-02-20 | 3D high-resolution imaging algorithm using 1D MIMO array for autonomous driving application | Sen Yuan et.al. | 2402.13062 | null |
2024-02-20 | Random Graph Set and Evidence Pattern Reasoning Model | Tianxiang Zhan et.al. | 2402.13058 | null |
2024-02-20 | Align Your Intents: Offline Imitation Learning via Optimal Transport | Maksim Bobrin et.al. | 2402.13037 | null |
2024-02-20 | Solving the decision-making analysis differential equation using eye fixation data in Unity software with Hermite Long-Short-Term Memory | Kourosh Parand et.al. | 2402.13027 | null |
2024-02-20 | Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey | Anju Rani et.al. | 2402.12923 | null |
2024-02-20 | MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Tianyu Zheng et.al. | 2402.12845 | link |
2024-02-20 | Are Large Language Models Rational Investors? | Yuhang Zhou et.al. | 2402.12713 | null |
2024-02-20 | XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques | Yu Xiong et.al. | 2402.12685 | link |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-20 | Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles | Dong Hu et.al. | 2402.12666 | null |
2024-02-20 | Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Runlong Zhou et.al. | 2402.12621 | link |
2024-02-20 | A System Development Kit for Big Data Applications on FPGA-based Clusters: The EVEREST Approach | Christian Pilato et.al. | 2402.12612 | null |
2024-02-19 | Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? | Nishant Balepur et.al. | 2402.12483 | link |
2024-02-19 | Multi-View Conformal Learning for Heterogeneous Sensor Fusion | Enrique Garcia-Ceja et.al. | 2402.12307 | link |
2024-02-19 | UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Chang Won Lee et.al. | 2402.12303 | link |
2024-02-19 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242 | link |
2024-02-19 | Towards AI-Based Precision Oncology: A Machine Learning Framework for Personalized Counterfactual Treatment Suggestions based on Multi-Omics Data | Manuel Schürch et.al. | 2402.12190 | null |
2024-02-19 | Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations | Dinh An Ngo et.al. | 2402.12179 | null |
2024-02-19 | Modified RRT* for Path Planning in Autonomous Driving | Sugirtha T et.al. | 2402.12129 | null |
2024-02-19 | Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization | Abhishek Kuriyal et.al. | 2402.12098 | link |
2024-02-19 | All Language Models Large and Small | Zhixun Chen et.al. | 2402.12061 | null |
2024-02-19 | Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenge | Daniel Jakab et.al. | 2402.12041 | null |
2024-02-19 | Analyzing the Impact of Design Factors on Solar Module Thermomechanical Durability Using Interpretable Machine Learning Techniques | Xin Chen et.al. | 2402.11911 | link |
2024-02-19 | Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic | Jeremy J. Lin et.al. | 2402.11866 | null |
2024-02-19 | UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction | Yuan Yuan et.al. | 2402.11838 | link |
2024-02-19 | SDGE: Stereo Guided Depth Estimation for 360° Camera Sets | Jialei Xu et.al. | 2402.11791 | null |
2024-02-19 | Statistical Test for Generated Hypotheses by Diffusion Models | Teruyuki Katsuoka et.al. | 2402.11789 | null |
2024-02-19 | MM-SurvNet: Deep Learning-Based Survival Risk Stratification in Breast Cancer Through Multimodal Data Fusion | Raktim Kumar Mondol et.al. | 2402.11788 | null |
2024-02-18 | A Note on Bias to Complete | Jia Xu et.al. | 2402.11710 | null |
2024-02-18 | Challenging the Black Box: A Comprehensive Evaluation of Attribution Maps of CNN Applications in Agriculture and Forestry | Lars Nieradzik et.al. | 2402.11670 | null |
2024-02-18 | Dynamic planning in hierarchical active inference | Matteo Priorelli et.al. | 2402.11658 | link |
2024-02-18 | Self-evolving Autoencoder Embedded Q-Network | J. Senthilnath et.al. | 2402.11604 | null |
2024-02-16 | Agent-based Simulation Evaluation of CBD Tolling: A Case Study from New York City | Qingnan Liang et.al. | 2402.10834 | null |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models | Hariram Veeramani et.al. | 2402.10772 | null |
2024-02-16 | RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction | Jingyi Gu et.al. | 2402.10760 | null |
2024-02-16 | Cloud Kitchen: Using Planning-based Composite AI to Optimize Food Delivery Process | Slavomír Švancár et.al. | 2402.10725 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699 | null |
2024-02-16 | Network Formation and Dynamics Among Multi-LLMs | Marios Papachristou et.al. | 2402.10659 | link |
2024-02-16 | Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks | Niall Taylor et.al. | 2402.10597 | null |
2024-02-16 | Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2402.10580 | null |
2024-02-16 | A novel integrated industrial approach with cobots in the age of industry 4.0 through conversational interaction and computer vision | Andrea Pazienza et.al. | 2402.10553 | null |
2024-02-16 | Quantifying Individual Risk for Binary Outcome: Bounds and Inference | Peng Wu et.al. | 2402.10537 | null |
2024-02-16 | PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem | Ruijie Zheng et.al. | 2402.10450 | link |
2024-02-16 | Barrier-Enhanced Homotopic Parallel Trajectory Optimization for Safety-Critical Autonomous Driving | Lei Zheng et.al. | 2402.10441 | null |
2024-02-16 | Explaining generative diffusion models via visual analysis for interpretable decision-making process | Ji-Hoon Park et.al. | 2402.10404 | link |
2024-02-15 | Thompson Sampling in Partially Observable Contextual Bandits | Hongju Park et.al. | 2402.10289 | null |
2024-02-15 | InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization | Zhengyang Hu et.al. | 2402.10158 | null |
2024-02-15 | Mitigating subjectivity and bias in AI development indices: A robust approach to redefining country rankings | Betania Silva C Campello et.al. | 2402.10122 | link |
2024-02-15 | Neural Network Approaches for Parameterized Optimal Control | Deepanshu Verma et.al. | 2402.10033 | null |
2024-02-15 | Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Quentin Gallouédec et.al. | 2402.09844 | link |
2024-02-15 | Less is more: Ensemble Learning for Retinal Disease Recognition Under Limited Resources | Jiahao Wang et.al. | 2402.09747 | null |
2024-02-15 | Exploiting Alpha Transparency In Language And Vision-Based AI Systems | David Noever et.al. | 2402.09671 | null |
2024-02-15 | Practitioners’ Challenges and Perceptions of CI Build Failure Predictions at Atlassian | Yang Hong et.al. | 2402.09651 | null |
2024-02-14 | Probabilistic Reasoning in Generative Large Language Models | Aliakbar Nafar et.al. | 2402.09614 | link |
2024-02-14 | LogicPrpBank: A Corpus for Logical Implication and Equivalence | Zhexiong Liu et.al. | 2402.09609 | link |
2024-02-14 | Pulmonologists-Level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach | Ricco Noel Hansen Flyckt et.al. | 2402.09596 | null |
2024-02-14 | Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems | Liang Zhang et.al. | 2402.09584 | null |
2024-02-14 | Rationality Report Cards: Assessing the Economic Rationality of Large Language Models | Narun Raman et.al. | 2402.09552 | null |
2024-02-14 | Dataset Clustering for Improved Offline Policy Learning | Qiang Wang et.al. | 2402.09550 | link |
2024-02-14 | How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments? | Congcong Wen et.al. | 2402.09546 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-14 | Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning | Michael Lanier et.al. | 2402.09290 | null |
2024-02-14 | Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification | Agus Hartoyo et.al. | 2402.09281 | null |
2024-02-14 | Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms | Michael Shaham et.al. | 2402.09233 | null |
2024-02-14 | BiasEye: A Bias-Aware Real-time Interactive Material Screening System for Impartial Candidate Assessment | Qianyu Liu et.al. | 2402.09148 | null |
2024-02-14 | Selective decision making and collective behavior of fish by the motion of visual attention | Susumu Ito et.al. | 2402.09073 | null |
2024-02-14 | Cross-Temporal Forecast Reconciliation at Digital Platforms with Machine Learning | Jeroen Rombouts et.al. | 2402.09033 | null |
2024-02-14 | Learning-enabled Flexible Job-shop Scheduling for Scalable Smart Manufacturing | Sihoon Moon et.al. | 2402.08979 | null |
2024-02-14 | Second Order Methods for Bandit Optimization and Control | Arun Suggala et.al. | 2402.08929 | null |
2024-02-14 | Inference for an Algorithmic Fairness-Accuracy Frontier | Yiqi Liu et.al. | 2402.08879 | null |
2024-02-13 | Intelligent Agricultural Management Considering N $_2$ O Emission and Climate Variability with Uncertainties | Zhaoan Wang et.al. | 2402.08832 | null |
2024-02-13 | An Adaptive System Architecture for Multimodal Intelligent Transportation Systems | Muhammad Farooq et.al. | 2402.08817 | null |
2024-02-13 | CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources | Sikha Pentyala et.al. | 2402.08614 | null |
2024-02-13 | Vehicle Behavior Prediction by Episodic-Memory Implanted NDT | Peining Shen et.al. | 2402.08423 | link |
2024-02-13 | LLMs and the Human Condition | Peter Wallis et.al. | 2402.08403 | null |
2024-02-13 | Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution | Tailin Wu et.al. | 2402.08383 | link |
2024-02-13 | The Duet of Representations and How Explanations Exacerbate It | Charles Wan et.al. | 2402.08379 | null |
2024-02-13 | Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring | Taira Tsuchiya et.al. | 2402.08321 | null |
2024-02-13 | Zero Trust Score-based Network-level Access Control in Enterprise Networks | Leonard Bradatsch et.al. | 2402.08299 | null |
2024-02-13 | A survey of recent methods for addressing AI fairness and bias in biomedicine | Yifan Yang et.al. | 2402.08250 | null |
2024-02-13 | Causal Learning for Trustworthy Recommender Systems: A Survey | Jin Li et.al. | 2402.08241 | null |
2024-02-13 | MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain | Xiaohe Li et.al. | 2402.08221 | null |
2024-02-13 | Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | Mandar Pitale et.al. | 2402.08208 | null |
2024-02-13 | Group Decision-Making among Privacy-Aware Agents | Marios Papachristou et.al. | 2402.08156 | link |
2024-02-13 | CMA-R:Causal Mediation Analysis for Explaining Rumour Detection | Lin Tian et.al. | 2402.08155 | link |
2024-02-13 | Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Rushang Karia et.al. | 2402.08145 | link |
2024-02-13 | Average-Case Analysis of Iterative Voting | Joshua Kavner et.al. | 2402.08144 | null |
2024-02-12 | Addressing cognitive bias in medical language models | Samuel Schmidgall et.al. | 2402.08113 | link |
2024-02-12 | From Data to Decisions: The Transformational Power of Machine Learning in Business Recommendations | Kapilya Gangadharan et.al. | 2402.08109 | null |
2024-02-12 | Auditing Work: Exploring the New York City algorithmic bias audit regime | Lara Groves et.al. | 2402.08101 | null |
2024-02-12 | MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning | Ayesha Siddika Nipu et.al. | 2402.07890 | null |
2024-02-12 | Distributed Anomaly Detection in Modern Power Systems: A Penalty-based Mitigation Approach | Erfan Mehdipour Abadi et.al. | 2402.07884 | null |
2024-02-12 | Retrieval-Augmented Thought Process as Sequential Decision Making | Thomas Pouplin et.al. | 2402.07812 | null |
2024-02-12 | From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration | Agathe Fernandes Machado et.al. | 2402.07790 | link |
2024-02-12 | TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Hui Liu et.al. | 2402.07776 | link |
2024-02-12 | Towards Unified Alignment Between Agents, Humans, and Environment | Zonghan Yang et.al. | 2402.07744 | null |
2024-02-12 | Task-conditioned adaptation of visual features in multi-task policy learning | Pierre Marza et.al. | 2402.07739 | null |
2024-02-12 | Interaction-Based Driving Scenario Classification and Labeling | Cheng Chang et.al. | 2402.07720 | null |
2024-02-12 | Online Sequential Decision-Making with Unknown Delays | Ping Wu et.al. | 2402.07703 | null |
2024-02-12 | AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Tanmoy Dam et.al. | 2402.07680 | link |
2024-02-12 | DART: A Compact Platform For Autonomous Driving Research | Lorenzo Lyons et.al. | 2402.07602 | null |
2024-02-12 | Unveiling Group-Specific Distributed Concept Drift: A Fairness Imperative in Federated Learning | Teresa Salazar et.al. | 2402.07586 | link |
2024-02-12 | Topological Safeguard for Evasion Attack based on the Interpretability of Artificial Neural Network Behavior | Xabier Echeberria-Barrio et.al. | 2402.07480 | null |
2024-02-12 | Auxiliary Reward Generation with Transition Distance Representation Learning | Siyuan Li et.al. | 2402.07412 | null |
2024-02-12 | Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support | Igor Svoboda et.al. | 2402.07404 | null |
2024-02-12 | Replicability is Asymptotically Free in Multi-armed Bandits | Junpei Komiyama et.al. | 2402.07391 | null |
2024-02-12 | Re-DiffiNet: Modeling discrepancies in tumor segmentation using diffusion | Tianyi Ren et.al. | 2402.07354 | link |
2024-02-12 | Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Kwang-Sung Jun et.al. | 2402.07341 | link |
2024-02-11 | Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets | Ross Greer et.al. | 2402.07320 | null |
2024-02-11 | Self-Consistent Conformal Prediction | Lars van der Laan et.al. | 2402.07307 | link |
2024-02-09 | What is Hiding in Medicine’s Dark Matter? Learning with Missing Data in Medical Practices | Neslihan Suzen et.al. | 2402.06563 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | An Exercise in Tournament Design: When Some Matches Must Be Scheduled | Sushmita Gupta et.al. | 2402.06538 | null |
2024-02-09 | Scalable Interactive Machine Learning for Future Command and Control | Anna Madison et.al. | 2402.06501 | null |
2024-02-09 | CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention | Yifeng Bai et.al. | 2402.06423 | null |
2024-02-09 | FD-Vision Mamba for Endoscopic Exposure Correction | Zhuoran Zheng et.al. | 2402.06378 | null |
2024-02-09 | High-Precision Geosteering via Reinforcement Learning and Particle Filters | Ressi Bonti Muhammad et.al. | 2402.06377 | null |
2024-02-09 | AI, Meet Human: Learning Paradigms for Hybrid Decision Making Systems | Clara Punzi et.al. | 2402.06287 | null |
2024-02-09 | Premier-TACO: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Ruijie Zheng et.al. | 2402.06187 | link |
2024-02-09 | United We Fall: On the Nash Equilibria of Multiplex and Multilayer Network Games | Raman Ebrahimi et.al. | 2402.06108 | null |
2024-02-08 | Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making | Scotty Black et.al. | 2402.06075 | null |
2024-02-08 | Aggregation of pairwise comparison matrices: A clustering approach | Kolos Csaba Ágoston et.al. | 2402.06061 | null |
2024-02-08 | Impact on Public Health Decision Making by Utilizing Big Data Without Domain Knowledge | Miao Zhang et.al. | 2402.06059 | null |
2024-02-08 | Intelligent Mode-switching Framework for Teleoperation | Burak Kizilkaya et.al. | 2402.06047 | null |
2024-02-08 | Optimizing Predictive AI in Physical Design Flows with Mini Pixel Batch Gradient Descent | Haoyu Yang et.al. | 2402.06034 | null |
2024-02-08 | Game-theoretic Counterfactual Explanation for Graph Neural Networks | Chirag Chhablani et.al. | 2402.06030 | null |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-02-08 | Understanding Social Immunity in Ants: A Markovian Approach to Collective Cleaning Strategies | Isabella Bueno et.al. | 2402.05924 | null |
2024-02-08 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-08 | Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | Giorgio Angelotti et.al. | 2402.05703 | null |
2024-02-08 | Stochastic COLREGs Evaluation for Safe Navigation under Uncertainty | Peter Nicholas Hansen et.al. | 2402.05662 | null |
2024-02-08 | Optimizing Delegation in Collaborative Human-AI Hybrid Teams | Andrew Fuchs et.al. | 2402.05605 | null |
2024-02-08 | Form-From: A Design Space of Social Media Systems | Amy X. Zhang et.al. | 2402.05388 | null |
2024-02-08 | Are We Asking the Right Questions?: Designing for Community Stakeholders’ Interactions with AI in Policing | MD Romael Haque et.al. | 2402.05348 | null |
2024-02-07 | Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Yuan Tian et.al. | 2402.05306 | link |
2024-02-07 | Safe Human-UAS Collaboration Abstraction | Hossein Rastgoftar et.al. | 2402.05277 | null |
2024-02-07 | Exploring Hierarchical Classification Performance for Time Series Data: Dissimilarity Measures and Classifier Comparisons | Celal Alagoz et.al. | 2402.05275 | null |
2024-02-07 | Adaptive Hypergraph Network for Trust Prediction | Rongwei Xu et.al. | 2402.05154 | link |
2024-02-07 | FlowPG: Action-constrained Policy Gradient with Normalizing Flows | Janaka Chathuranga Brahmanage et.al. | 2402.05149 | link |
2024-02-07 | Tuning the feedback controller gains is a simple way to improve autonomous driving performance | Wenyu Liang et.al. | 2402.05064 | null |
2024-02-07 | Conformal Monte Carlo Meta-learners for Predictive Inference of Individual Treatment Effects | Jef Jonkers et.al. | 2402.04906 | link |
2024-02-07 | Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy | Ruichu Cai et.al. | 2402.04869 | null |
2024-02-07 | Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach | Yang Cao et.al. | 2402.04865 | null |
2024-02-07 | Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game | Philipp Sadler et.al. | 2402.04824 | null |
2024-02-07 | Investigating Driving Interactions: A Robust Multi-Agent Simulation Framework for Autonomous Vehicles | Marc Kaufeld et.al. | 2402.04720 | link |
2024-02-07 | Large Language Models As Faithful Explainers | Yu-Neng Chuang et.al. | 2402.04678 | null |