Updated on 2024.12.12
Path Planning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-10 | MAPLE: A Framework for Active Preference Learning Guided by Large Language Models | Saaduddin Mahmud et.al. | 2412.07207 | null |
2024-12-09 | Phaedrus: Exploring Dynamic Application Behavior with Lightweight Generative Models and Large-Language Models | Bodhisatwa Chatterjee et.al. | 2412.06994 | null |
2024-12-07 | Timely reliable Bayesian decision-making enabled using memristors | Lekai Song et.al. | 2412.06838 | null |
2024-12-08 | DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments | Juwon Kim et.al. | 2412.05839 | null |
2024-12-08 | SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Shuzhao Xie et.al. | 2412.05808 | null |
2024-12-07 | Controlled rough SDEs, pathwise stochastic control and dynamic programming principles | Peter K. Friz et.al. | 2412.05698 | null |
2024-12-07 | Quantum Annealing and Tensor Networks: a Powerful Combination to Solve Optimization Problems | Miquel Albertí Binimelis et.al. | 2412.05595 | null |
2024-12-07 | Optimizing Returns from Experimentation Programs | Timothy Sudijono et.al. | 2412.05508 | null |
2024-12-06 | Nonmyopic Global Optimisation via Approximate Dynamic Programming | Filippo Airaldi et.al. | 2412.04882 | null |
2024-12-05 | Generating graph states with a single quantum emitter and the minimum number of fusions | Matthias C. Löbl et.al. | 2412.04587 | null |
2024-12-04 | Summa Summarum: Moessner’s Theorem without Dynamic Programming | Olivier Danvy et.al. | 2412.03127 | null |
2024-11-21 | Quantum Annealing based Hybrid Strategies for Real Time Route Optimization | Sushil Mario et.al. | 2412.02720 | null |
2024-11-30 | A Second Soul: Celebrating the Many Languages of Programming – Festschrift in Honor of Peter Thiemann’s Sixtieth Birthday | Annette Bieniusa et.al. | 2412.01856 | null |
2024-12-01 | Optimization of Delivery Routes for Fresh E-commerce in Pre-warehouse Mode | Alice Harward et.al. | 2412.00634 | null |
2024-11-29 | An Optimal Switching Approach for Bird Migration | Jiawei Chu et.al. | 2411.19467 | null |
2024-11-28 | SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing | Rong-Cheng Tu et.al. | 2411.18983 | null |
2024-11-27 | SCoTT: Wireless-Aware Path Planning with Vision Language Models and Strategic Chains-of-Thought | Aladin Djuhera et.al. | 2411.18212 | null |
2024-11-26 | Structural Parameterization of Locating-Dominating Set and Test Cover | Dipayan Chakraborty et.al. | 2411.17948 | null |
2024-11-26 | Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Vladimir Malinovskii et.al. | 2411.17525 | null |
2024-11-26 | Weakly acyclic diagrams: A data structure for infinite-state symbolic verification | Michael Blondin et.al. | 2411.17250 | null |
2024-11-26 | Dynamic Programming-Based Offline Redundancy Resolution of Redundant Manipulators Along Prescribed Paths with Real-Time Adjustment | Zhihang Yin et.al. | 2411.17052 | null |
2024-11-26 | Dynamic Programming-Based Redundancy Resolution for Path Planning of Redundant Manipulators Considering Breakpoints | Zhihang Yin et.al. | 2411.17034 | null |
2024-11-26 | Entropy-Based Dynamic Programming for Efficient Vehicle Parking | Jean-Luc Lupien et.al. | 2411.17014 | null |
2024-11-25 | Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking | Phuc Nguyen et.al. | 2411.16183 | null |
2024-11-25 | Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach | Shijie Pan et.al. | 2411.16144 | null |
2024-11-24 | Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution | Haiquan Wang et.al. | 2411.15871 | null |
2024-11-24 | Revenue Maximization in Choice-Based Matching Markets | Dan Nissim et.al. | 2411.15727 | null |
2024-11-22 | Jovis: A Visualization Tool for PostgreSQL Query Optimizer | Yoojin Choi et.al. | 2411.14788 | null |
2024-11-22 | Construction and Preliminary Validation of a Dynamic Programming Concept Inventory | Matthew Ferland et.al. | 2411.14655 | null |
2024-11-18 | Controlled Occupied Processes and Viscosity Solutions | H. Mete Soner et.al. | 2411.12080 | null |
2024-11-18 | A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources | Zixuan He et.al. | 2411.11698 | null |
2024-11-18 | gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs | Bertil Schmidt et.al. | 2411.11547 | link |
2024-11-17 | Dynamic Programming: Optimality at a Point Implies Optimality Everywhere | John Stachurski et.al. | 2411.11062 | null |
2024-11-15 | AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment | Yonggan Fu et.al. | 2411.10606 | null |
2024-11-14 | Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control | Qianqian Zhang et.al. | 2411.09600 | null |
2024-11-13 | On the numerical integration of the Fokker-Planck equation driven by a mechanical force and the Bismut-Elworthy-Li formula | Julia Sanders et.al. | 2411.08518 | null |
2024-11-13 | Tractable Robust Markov Decision Processes | Julien Grand-Clément et.al. | 2411.08435 | null |
2024-11-12 | dpvis: A Visual and Interactive Learning Tool for Dynamic Programming | David H. Lee et.al. | 2411.07705 | link |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-11 | Resilient control under denial-of-service and uncertainty: An adaptive dynamic programming approach | Weinan Gao et.al. | 2411.06689 | null |
2024-11-11 | Two Kinds of Learning Algorithms for Continuous-Time VWAP Targeting Execution | Xingyu Zhou et.al. | 2411.06645 | null |
2024-11-10 | Robust optimal stopping with regime switching | Siyu Lv et.al. | 2411.06522 | null |
2024-11-07 | Optimal control under unknown intensity with Bayesian learning | Nicolas Baradel et.al. | 2411.04917 | null |
2024-11-07 | Structure Matters: Dynamic Policy Gradient | Sara Klein et.al. | 2411.04913 | null |
2024-11-07 | Minimax Linear Regulator Problems for Positive Systems | Alba Gurpegui et.al. | 2411.04809 | null |
2024-11-07 | Optimal Execution under Incomplete Information | Etienne Chevalier et.al. | 2411.04616 | null |
2024-11-07 | Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator | Bowen Song et.al. | 2411.04548 | link |
2024-11-05 | DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics | Yingqi Cao et.al. | 2411.03398 | link |
2024-11-04 | Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage | Eric Pilling et.al. | 2411.02211 | null |
2024-11-03 | ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis | Xinyu Geng et.al. | 2411.01564 | null |
2024-10-31 | EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Mujin Cheon et.al. | 2411.00171 | null |
2024-10-31 | Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis | Jia Lin Hau et.al. | 2410.24128 | link |
2024-10-31 | A dynamic programming principle for multiperiod control problems with bicausal constraints | Ruslan Mirmominov et.al. | 2410.23927 | null |
2024-10-30 | Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Ruhan Wang et.al. | 2410.23450 | null |
2024-10-29 | Approximately Counting Knapsack Solutions in Subquadratic Time | Weiming Feng et.al. | 2410.22267 | null |
2024-10-29 | Beating Bellman’s Algorithm for Subset Sum | Karl Bringmann et.al. | 2410.21942 | null |
2024-10-28 | Analysis of Different Algorithmic Design Techniques for Seam Carving | Owais Aijaz et.al. | 2410.21207 | null |
2024-10-27 | A New Method for Inserting Train Paths into a Timetable | David Dekker et.al. | 2410.20561 | link |
2024-10-27 | On the I/O Complexity of the CYK Algorithm and of a Family of Related DP Algorithms | Lorenzo De Stefani et.al. | 2410.20337 | null |
2024-10-25 | An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration | Gengyuan Cai et.al. | 2410.19373 | null |
2024-10-24 | Stochastic dynamic programming under recursive Epstein-Zin preferences | Anna Jaśkiewicz et.al. | 2410.19181 | null |
2024-10-24 | A Counterexample in Cross-Correlation Template Matching | Serap A. Savari et.al. | 2410.19085 | null |
2024-10-23 | Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing | Mikhail Khrenov et.al. | 2410.18207 | null |
2024-10-24 | Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices | Chanwoo Chun et.al. | 2410.17998 | null |
2024-10-21 | Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach | Xinjie Liu et.al. | 2410.16441 | null |
2024-10-21 | All You Need is an Improving Column: Enhancing Column Generation for Parallel Machine Scheduling via Transformers | Amira Hijazi et.al. | 2410.15601 | null |
2024-10-21 | How to Find the Exact Pareto Front for Multi-Objective MDPs? | Yining Li et.al. | 2410.15557 | null |
2024-10-20 | CASET: Complexity Analysis using Simple Execution Traces for CS* submissions | Aaryen Mehta et.al. | 2410.15419 | null |
2024-10-19 | The Constrained Layer Tree Problem and Applications to Solar Farm Cabling | Thomas Bläsius et.al. | 2410.15031 | null |
2024-10-18 | On picking operations in e-commerce warehouses: Insights from the complete-information counterpart | Catherine Lorenz et.al. | 2410.14316 | null |
2024-10-17 | Quasi-quantum states and the quasi-quantum PCP theorem | Itai Arad et.al. | 2410.13549 | null |
2024-10-17 | Joint Antenna Selection and Covariance Matrix Optimization for ISAC Systems | Michail Palaiologos et.al. | 2410.13446 | null |
2024-10-17 | Membership Testing for Semantic Regular Expressions | Yifei Huang et.al. | 2410.13262 | null |
2024-10-22 | Research on Travel Route Planing Problems Based on Greedy Algorithm | Yiquan Wang et.al. | 2410.13226 | link |
2024-10-17 | Algorithmic Content Selection and the Impact of User Disengagement | Emilio Calvano et.al. | 2410.13108 | null |
2024-10-16 | Learning Representations for Reasoning: Generalizing Across Diverse Structures | Zhaocheng Zhu et.al. | 2410.13018 | null |
2024-10-16 | Vehicle Localization in GPS-Denied Scenarios Using Arc-Length-Based Map Matching | Nur Uddin Javed et.al. | 2410.12208 | null |
2024-10-15 | Incremental computation of the set of period sets | Eric Rivals et.al. | 2410.12077 | null |
2024-10-15 | Routing and Scheduling Optimization for Urban Air Mobility Fleet Management using Quantum Annealing | Renichiro Haba et.al. | 2410.11231 | null |
2024-10-16 | SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | Akrit Mudvari et.al. | 2410.10759 | null |
2024-10-14 | Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics | Andreas Boltres et.al. | 2410.10377 | null |
2024-10-09 | Rapid Computation of the Assembly Index of Molecular Graphs | Ian Seet et.al. | 2410.09100 | null |
2024-10-11 | Deep Learning Algorithms for Mean Field Optimal Stopping in Finite Space and Discrete Time | Lorenzo Magnino et.al. | 2410.08850 | null |
2024-10-11 | Hybrid Filtering Heuristic for the Sensor-Placement Problem to Discretize 2D Continuous Environments | Jan Mikula et.al. | 2410.08784 | link |
2024-10-10 | Dynamic Programming based Local Search approaches for Multi-Agent Path Finding problems on Directed Graphs | Irene Saccani et.al. | 2410.07954 | null |
2024-10-10 | Partitioning Trillion Edge Graphs on Edge Devices | Adil Chhabra et.al. | 2410.07732 | null |
2024-10-11 | Q-WSL:Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL | Xing Lei et.al. | 2410.06648 | null |
2024-10-08 | Solvability of Equilibrium Riccati Equations: A Direct Approach | Bowen Ma et.al. | 2410.06090 | null |
2024-10-07 | Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming | Shubham Gupta et.al. | 2410.05455 | link |
2024-10-07 | A Predictive and Optimization Approach for Enhanced Urban Mobility Using Spatiotemporal Data | Shambhavi Mishra et.al. | 2410.05358 | null |
2024-10-05 | AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text | Ximing Lu et.al. | 2410.04265 | null |
2024-10-05 | A branch-&-price approach to the unrooted maximum agreement forest problem | Martin Frohn et.al. | 2410.04122 | null |
2024-10-02 | Electrification of Transportation: A Hybrid Benders/SDDP Algorithm for Optimal Charging Station Trading | Farnaz Sohrabi et.al. | 2410.03763 | null |
2024-10-02 | Effects of eco-driving on energy consumption and battery degradation for electric vehicles at signalized intersections | Yongqiang Wang et.al. | 2410.01685 | null |
2024-10-02 | Krylov-Safonov theory for Pucci-type extremal inequalities on random data clouds | Ángel Arroyo et.al. | 2410.01642 | null |
2024-10-02 | Automated Curvy Waveguide Routing for Large-Scale Photonic Integrated Circuits | Hongjian Zhou et.al. | 2410.01260 | null |
2024-09-30 | Generalised mixed effects models for changepoint analysis of biomedical time series data | Mark B. Fiecas et.al. | 2410.00183 | null |
2024-09-30 | Opt2Skill: Imitating Dynamically-feasible Whole-Body Trajectories for Versatile Humanoid Loco-Manipulation | Fukang Liu et.al. | 2409.20514 | null |
2024-09-28 | On Computing Elastic Shape Distances between Curves in d-dimensional Space | Javier Bernal et.al. | 2409.19380 | null |
2024-09-25 | MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Katharina Anderer et.al. | 2409.16765 | link |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-24 | Partial Elastic Shape Registration of 3D Surfaces using Dynamic Programming | Javier Bernal et.al. | 2409.16462 | null |
2024-09-25 | Efficient Nearest Neighbor Search Using Dynamic Programming | Pengfei Wang et.al. | 2409.15023 | null |
2024-09-22 | Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming | Simon Malan et.al. | 2409.14486 | null |
2024-09-24 | Batch Predictive Inference | Yonghoon Lee et.al. | 2409.13990 | link |
2024-09-20 | A Modified Algorithm for Optimal Picker Routing in a Single Block Warehouse | George Dunn et.al. | 2409.13219 | null |
2024-09-19 | Program Slicing in the Era of Large Language Models | Kimya Khakzad Shahandashti et.al. | 2409.12369 | null |
2024-09-18 | Differential dynamic programming with stagewise equality and inequality constraints using interior point method | Siddharth Prabhu et.al. | 2409.12048 | null |
2024-09-20 | Second-Order Constrained Dynamic Optimization | Yuichiro Aoyama et.al. | 2409.11649 | null |
2024-09-18 | Multi-stage stochastic linear programming for shared autonomous vehicle system operation and design with on-demand and pre-booked requests | Riki Kawase et.al. | 2409.11611 | null |
2024-09-17 | Optimal Investment with Costly Expert Opinions | Christoph Knochenhauer et.al. | 2409.11569 | null |
2024-09-20 | Exact Wavefront Propagation for Globally Optimal One-to-All Path Planning on 2D Cartesian Grids | Ibrahim Ibrahim et.al. | 2409.11545 | link |
2024-09-17 | Neural Networks for Vehicle Routing Problem | László Kovács et.al. | 2409.11290 | null |
2024-09-17 | Selective algorithm processing of subset sum distributions | Nick Dawes et.al. | 2409.11076 | null |
2024-09-17 | Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching | Yixiang Dai et.al. | 2409.11004 | null |
2024-09-17 | Relationship between stochastic maximum principle and dynamic programming principle under convex expectation | Xiaojuan Li et.al. | 2409.10987 | null |
2024-09-16 | Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees | Ramin Esmzad et.al. | 2409.10703 | null |
2024-09-20 | Motion Forecasting via Model-Based Risk Minimization | Aron Distelzweig et.al. | 2409.10585 | null |
2024-09-16 | Estimates for Optimal Multistage Group Partition Testing | Guojiang Shao et.al. | 2409.10410 | null |
2024-09-16 | Pareto Sums of Pareto Sets: Lower Bounds and Algorithms | Daniel Funke et.al. | 2409.10232 | null |
2024-09-12 | Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Teng Yan et.al. | 2409.08062 | null |
2024-09-12 | Super Monotonic Alignment Search | Junhyeok Lee et.al. | 2409.07704 | link |
2024-09-10 | Design of Threshold-Constrained Indirect Quantizers | Ariel Doubchak et.al. | 2409.06839 | null |
2024-09-10 | Cooptimizing Safety and Performance with a Control-Constrained Formulation | Hao Wang et.al. | 2409.06696 | link |
2024-09-12 | Valuation Model of Chinese Convertible Bonds Based on Monte Carlo Simulation | Yu Liu et.al. | 2409.06496 | null |
2024-09-09 | OTFS-MDMA: An Elastic Multi-Domain Resource Utilization Mechanism for High Mobility Scenarios | Jie Chen et.al. | 2409.05724 | null |
2024-09-09 | Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Misalignment in Emotional Perception | Linh H Nghiem et.al. | 2409.05343 | null |
2024-09-08 | Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks | Khai Doan et.al. | 2409.05025 | null |
2024-09-08 | Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels | Wenqian Xue et.al. | 2409.04945 | null |
2024-09-17 | Second-Order Stein Variational Dynamic Optimization | Yuichiro Aoyama et.al. | 2409.04644 | null |
2024-09-06 | Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning | Yunus Emre Demirci et.al. | 2409.04351 | null |
2024-09-05 | Space-Efficient Algorithm for Integer Programming with Few Constraints | Lars Rohwedder et.al. | 2409.03681 | null |
2024-09-05 | Fine-Grained Equivalence for Problems Related to Integer Linear Programming | Lars Rohwedder et.al. | 2409.03675 | null |
2024-09-06 | Revenue Management with Calendar-Aware and Dependent Demands: Asymptotically Tight Fluid Approximations | Weiyuan Li et.al. | 2409.02637 | null |
2024-09-03 | FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Liqun Yang et.al. | 2409.01944 | null |
2024-09-03 | Quantum Algorithms for One-Sided Crossing Minimization | Susanna Caroppo et.al. | 2409.01942 | null |
2024-09-02 | Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning | Hongpei Li et.al. | 2409.00968 | null |
2024-09-02 | Multistage Robust Average Randomized Spectral Risk Optimization | Qiong Wu et.al. | 2409.00892 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-09-01 | Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning | Jiaming Yin et.al. | 2409.00754 | null |
2024-09-01 | The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming | Jihun Kim et.al. | 2409.00655 | null |
2024-08-31 | Foundations of Multivariate Distributional Reinforcement Learning | Harley Wiltzer et.al. | 2409.00328 | null |
2024-08-30 | Approximation Algorithms for Anchored Multiwatchman Routes | Joseph S. B. Mitchell et.al. | 2408.17343 | null |
2024-08-30 | Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR | Xihong Su et.al. | 2408.17286 | null |
2024-08-30 | A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation | Camila Martinez Parra et.al. | 2408.17113 | null |
2024-08-29 | Optimization Models for the Quadratic Traveling Salesperson Problem | Yuxiao Chen et.al. | 2408.16680 | null |
2024-08-27 | On the parameterized complexity of computing good edge-labelings | Davi de Andrade et.al. | 2408.15181 | null |
2024-08-26 | Achieving designed texture and flows in bulk active nematics using optimal control theory | Saptorshi Ghosh et.al. | 2408.14596 | null |
2024-08-25 | Decentralized Stochastic Control in Standard Borel Spaces: Centralized MDP Reductions, Near Optimality of Finite Window Local Information, and Q-Learning | Omar Mrani-Zentar et.al. | 2408.13828 | null |
2024-08-23 | The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Venkatesh Balavadhani Parthasarathy et.al. | 2408.13296 | null |
2024-08-18 | An Introduction to Cognidynamics | Marco Gori et.al. | 2408.13112 | null |
2024-08-20 | Optimal Guarantees for Online Selection Over Time | Sebastian Perez-Salazar et.al. | 2408.11224 | null |
2024-08-20 | Fault Tolerant Dynamic Task Assignment for UAV-based Search Teams | Ali Nasir et.al. | 2408.10564 | null |
2024-08-19 | Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm | Nikolai Rozanov et.al. | 2408.10055 | null |
2024-08-19 | Continuous-Time Dynamic Decision Making with Costly Information | Christoph Knochenhauer et.al. | 2408.09693 | null |
2024-08-19 | Solving stochastic climate-economy models: A deep least-squares Monte Carlo approach | Aleksandar Arandjelović et.al. | 2408.09642 | null |
2024-08-18 | Exploratory Optimal Stopping: A Singular Control Formulation | Jodi Dianetti et.al. | 2408.09335 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-17 | Twin Sorting Dynamic Programming Assisted User Association and Wireless Bandwidth Allocation for Hierarchical Federated Learning | Rung-Hung Gau et.al. | 2408.09076 | null |
2024-08-17 | Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) | Mingkuan Xu et.al. | 2408.09055 | null |
2024-08-15 | Optimal control problems with generalized mean-field dynamics and viscosity solution to Master Bellman equation | Rainer Buckdahn et.al. | 2408.08046 | null |
2024-08-14 | Differentiating Policies for Non-Myopic Bayesian Optimization | Darian Nwankwo et.al. | 2408.07812 | null |
2024-08-11 | Moderate Exponential-time Quantum Dynamic Programming Across the Subsets for Scheduling Problems | Camille Grange et.al. | 2408.05741 | null |
2024-08-10 | Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward | Zetong Xuan et.al. | 2408.05438 | null |
2024-08-09 | MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling | Drew Edwards et.al. | 2408.05024 | null |
2024-08-09 | A Comprehensive System Architecture using Field Programmable Gate Arrays Technology, Dijkstra’s Algorithm, and Edge Computing for Emergency Response in Smart Cities | Mahamat Abdel Aziz Assoul et.al. | 2408.04924 | null |
2024-08-08 | Mathematical Programming For Adaptive Experiments | Ethan Che et.al. | 2408.04570 | null |
2024-08-08 | Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Simon Dima et.al. | 2408.04385 | null |
2024-08-08 | Enhanced Traffic Flow Prediction with Multi-Segment Fusion Tensor Graph Convolutional Networks | Wei Zhang et.al. | 2408.04232 | null |
2024-08-06 | A Course in Dynamic Optimization | Bar Light et.al. | 2408.03034 | null |
2024-08-05 | Positive Dynamic Programming: A Critique | Aaqib Peerzada et.al. | 2408.02809 | null |
2024-08-05 | Multi-level Traffic-Responsive Tilt Camera Surveillance through Predictive Correlated Online Learning | Tao Li et.al. | 2408.02208 | null |
2024-08-04 | Non-local Hamilton-Jacobi-Bellman equations for the stochastic optimal control of path-dependent piecewise deterministic processes | Elena Bandini et.al. | 2408.02147 | null |
2024-08-03 | Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Balázs Opra et.al. | 2408.01640 | null |
2024-08-02 | Occasionally Observed Piecewise-deterministic Markov Processes | Marissa Gee et.al. | 2408.01335 | null |
2024-08-02 | The Impact of Program Reduction on Automated Program Repair | Linas Vidziunas et.al. | 2408.01134 | null |
2024-08-11 | Deep Learning Approach for Changepoint Detection: Penalty Parameter Optimization | Tung L Nguyen et.al. | 2408.00856 | link |
2024-07-31 | Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation | Taehyun Cho et.al. | 2407.21260 | null |
2024-07-30 | A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling | Gabriele Agliardi et.al. | 2407.20802 | null |
2024-07-30 | Generalized replicator dynamics based on mean-field pairwise comparison dynamic | Hidekazu Yoshioka et.al. | 2407.20751 | null |
2024-08-10 | A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks | Dongbin Jiao et.al. | 2407.20585 | null |
2024-07-29 | A Differential Dynamic Programming Framework for Inverse Reinforcement Learning | Kun Cao et.al. | 2407.19902 | null |
2024-07-27 | Map-Matching Queries under Fréchet Distance on Low-Density Spanners | Kevin Buchin et.al. | 2407.19304 | null |
2024-07-26 | RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity | David Zenati et.al. | 2407.18683 | null |
2024-07-26 | Mean-field control of non exchangeable systems | Anna De Crescenzo et.al. | 2407.18635 | null |
2024-08-01 | Stochastic Games with Minimally Bounded Action Costs | David Mguni et.al. | 2407.18010 | null |
2024-07-25 | Personalized and Context-aware Route Planning for Edge-assisted Vehicles | Dinesh Cyril Selvaraj et.al. | 2407.17980 | null |
2024-07-23 | Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings | Petar Bevanda et.al. | 2407.16407 | null |
2024-07-23 | Data-driven Multistage Distributionally Robust Linear Optimization with Nested Distance | Rui Gao et.al. | 2407.16346 | null |
2024-07-22 | Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search | Redha Taguelmimt et.al. | 2407.16092 | null |
2024-07-22 | Scheduling on a Stochastic Number of Machines | Moritz Buchem et.al. | 2407.15737 | null |
2024-07-20 | Interdiction of minimum spanning trees and other matroid bases | Noah Weninger et.al. | 2407.14906 | link |
2024-07-20 | A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems | Kamran Razavi et.al. | 2407.14843 | null |
2024-07-19 | Dynamic Programming Techniques for Planar Orbital Transfer of Low Earth Orbit Satellites | C. Ciancarelli et.al. | 2407.14675 | null |
2024-07-19 | Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs | Du Ouyang et.al. | 2407.14566 | null |
2024-07-19 | On Policy Evaluation Algorithms in Distributional Reinforcement Learning | Julian Gerstenberg et.al. | 2407.14175 | null |
2024-07-18 | Shaded Route Planning Using Active Segmentation and Identification of Satellite Images | Longchao Da et.al. | 2407.13689 | null |
2024-07-18 | The Madness of Multiple Entries in March Madness | Jeff Decary et.al. | 2407.13438 | null |
2024-07-18 | Double interdiction problem on trees on the sum of root-leaf distances by upgrading edges | Xiao Li et.al. | 2407.13391 | null |
2024-07-18 | Deterministic Trajectory Optimization through Probabilistic Optimal Control | Mohammad Mahmoudi Filabadi et.al. | 2407.13316 | null |
2024-07-18 | Integrated Hardware Architecture and Device Placement Search | Irene Wang et.al. | 2407.13143 | link |
2024-07-18 | Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II | Rixin Wu et.al. | 2407.13113 | null |
2024-07-17 | Dynamic Programming Principle and Hamilton-Jacobi-Bellman Equation for Optimal Control Problems with Uncertainty | M. Soledad Aronna et.al. | 2407.13045 | null |
2024-07-17 | Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics | Kevin L. McKinney et.al. | 2407.12775 | null |
2024-07-16 | Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic | Ziyan An et.al. | 2407.10820 | null |
2024-07-14 | Fine Grained Lower Bounds for Multidimensional Knapsack | Ilan Doron-Arad et.al. | 2407.10146 | null |
2024-07-12 | Investigating the Interplay of Prioritized Replay and Generalization | Parham Mohammad Panahi et.al. | 2407.09702 | null |
2024-07-12 | An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands | Ahmed Shalaby et.al. | 2407.09676 | null |
2024-07-12 | Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey | Milan Ganai et.al. | 2407.09645 | null |
2024-07-12 | Integer programs with nearly totally unimodular matrices: the cographic case | Manuel Aprile et.al. | 2407.09477 | null |
2024-07-12 | A new approach to principal-agent problems with volatility control | Alessandro Chiusolo et.al. | 2407.09471 | null |
2024-07-12 | CAACS: A Carbon Aware Ant Colony System | Marina Lin et.al. | 2407.09404 | null |
2024-07-12 | Structure and Independence in Hyperbolic Uniform Disk Graphs | Thomas Bläsius et.al. | 2407.09362 | null |
2024-07-12 | KUNPENG: An Embodied Large Model for Intelligent Maritime | Naiyao Wang et.al. | 2407.09048 | link |
2024-07-09 | Trajectory Data Mining and Trip Travel Time Prediction on Specific Roads | Muhammad Awais Amin et.al. | 2407.07030 | null |
2024-07-08 | Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming | Xihong Su et.al. | 2407.06329 | link |
2024-07-08 | Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization | Daniil Tiapkin et.al. | 2407.05704 | null |
2024-07-06 | Advancing Algorithmic Approaches to Probabilistic Argumentation under the Constellation Approach | Andrei Popescu et.al. | 2407.05058 | null |
2024-07-05 | Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning | Eric Pasewark et.al. | 2407.04787 | link |
2024-07-05 | GOALPlace: Begin with the End in Mind | Anthony Agnesina et.al. | 2407.04579 | null |
2024-07-04 | Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms | Hariram Sampath Kumar et.al. | 2407.04087 | null |
2024-07-04 | Multi-Time Scale Service Caching and Pricing in MEC Systems with Dynamic Program Popularity | Yiming Chen et.al. | 2407.03804 | null |
2024-07-03 | Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios | Alexandra Kapp et.al. | 2407.03237 | null |
2024-07-12 | A Two-stage Identification Method for Switched Linear Systems | Zheng Wenju et.al. | 2407.02743 | null |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-06-28 | Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints | Arash Mozhdehi et.al. | 2407.01615 | null |
2024-07-02 | Contractual Reinforcement Learning: Pulling Arms with Invisible Hands | Jibang Wu et.al. | 2407.01458 | null |
2024-07-01 | Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach | Stef Baas et.al. | 2407.01055 | null |
2024-06-30 | Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models | Sangwoong Yoon et.al. | 2407.00626 | link |
2024-06-30 | Your Car Tells Me Where You Drove: A Novel Path Inference Attack via CAN Bus and OBD-II Data | Tommaso Bianchi et.al. | 2407.00585 | null |
2024-06-29 | A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation | Aicheng Gong et.al. | 2407.00496 | link |
2024-06-29 | Vector-valued robust stochastic control | Igor Cialenco et.al. | 2407.00266 | null |
2024-06-28 | Leveraging Fixed-Parameter Tractability for Robot Inspection Planning | Yosuke Mizutani et.al. | 2407.00251 | null |
2024-06-28 | Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations | Bahar Cavdar et.al. | 2407.00173 | null |
2024-06-28 | Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing | Rui Li et.al. | 2406.19613 | null |
2024-06-27 | Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features | Halil Utku Unlu et.al. | 2406.19461 | link |
2024-06-27 | Cuts in Graphs with Matroid Constraints | Aritra Banik et.al. | 2406.19134 | null |
2024-06-27 | State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems | Tochukwu Elijah Ogri et.al. | 2406.18804 | null |
2024-06-26 | Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem | Malgorzata M. O’Reilly et.al. | 2406.18618 | null |
2024-06-26 | Tiered Service Architecture for Remote Patient Monitoring | Siddharth Chandak et.al. | 2406.18000 | null |
2024-06-25 | Splitting Guarantees for Prophet Inequalities via Nonlinear Systems | Johannes Brustle et.al. | 2406.17767 | null |
2024-06-25 | Using iterated local alignment to aggregate GPS trajectories into a traffic flow map | Tarn Duong et.al. | 2406.17500 | null |
2024-06-24 | A multiplicative surface signature through its Magnus expansion | Ilya Chevyrev et.al. | 2406.16856 | null |
2024-06-24 | Stochastic Path-Dependent Volatility Models for Price-Storage Dynamics in Natural Gas Markets and Discrete-Time Swing Option Pricing | Jinniao Qiu et.al. | 2406.16400 | null |
2024-06-21 | Exact discovery is polynomial for sparse causal Bayesian networks | Felix L. Rios et.al. | 2406.15012 | link |
2024-06-19 | A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials | Jichao Fan et.al. | 2406.13190 | null |
2024-06-14 | Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction | Wenzhao Jiang et.al. | 2406.12923 | null |
2024-06-26 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | link |
2024-06-17 | LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications | Syed Salauddin Mohammad Tariq et.al. | 2406.11734 | null |
2024-06-17 | Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces | Shengbo Wang et.al. | 2406.11281 | null |
2024-06-16 | WeShap: Weak Supervision Source Evaluation with Shapley Values | Naiqing Guan et.al. | 2406.11010 | null |
2024-06-16 | Solving Co-Path/Cycle Packing Faster than $3^k$ | Yuxi Liu et.al. | 2406.10829 | null |
2024-06-15 | Scheduling two types of jobs with minimum makespan | Song Cao et.al. | 2406.10467 | null |
2024-06-14 | CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment | Meihui Wang et.al. | 2406.10069 | link |
2024-06-13 | Optimal Control of Agent-Based Dynamics under Deep Galerkin Feedback Laws | Frederik Kelbel et.al. | 2406.09141 | link |
2024-06-13 | Coordinated Trading Strategies for Battery Storage in Reserve and Spot Markets | Paul E. Seifert et.al. | 2406.08390 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507 | null |
2024-06-11 | Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces | Salvatore Federico et.al. | 2406.07242 | null |
2024-06-10 | Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents | Federico Rossi et.al. | 2406.06724 | null |
2024-06-10 | Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation | Chun-Hsiang Chuang et.al. | 2406.06327 | null |
2024-06-09 | Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study | Babak Javadi et.al. | 2406.05803 | null |
2024-06-09 | Heart Sound Segmentation Using Deep Learning Techniques | Manas Madine et.al. | 2406.05653 | null |
2024-06-11 | Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently | Sergio Calo et.al. | 2406.04056 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-21 | Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees | Ayman Chaouki et.al. | 2406.02175 | link |
2024-06-03 | An efficient solution to Hidden Markov Models on trees with coupled branches | Farzan Vafa et.al. | 2406.01663 | null |
2024-06-03 | A New View on Planning in Online Reinforcement Learning | Kevin Roice et.al. | 2406.01562 | null |
2024-06-02 | Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems | Jiaqi Liang et.al. | 2406.00868 | null |
2024-06-02 | Computing Optimal Equilibria in Repeated Games with Restarts | Ratip Emin Berker et.al. | 2406.00851 | null |
2024-06-02 | A Lazy Abstraction Algorithm for Markov Decision Processes: Theory and Initial Evaluation | Dániel Szekeres et.al. | 2406.00824 | null |
2024-06-10 | Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming | Dimitri P. Bertsekas et.al. | 2406.00592 | null |
2024-06-01 | Optimal Transmission Power Scheduling for Networked Control System under DoS Attack | Siyi Wang et.al. | 2406.00540 | null |
2024-06-01 | A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes | Zhenwei Lin et.al. | 2406.00274 | link |
2024-05-31 | Finding Diverse Solutions Parameterized by Cliquewidth | Karolina Drabik et.al. | 2405.20931 | null |
2024-05-29 | A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost | Chunhui Chen et.al. | 2405.19246 | null |
2024-05-28 | A Pontryagin Perspective on Reinforcement Learning | Onno Eberhard et.al. | 2405.18100 | null |
2024-05-27 | Q-value Regularized Transformer for Offline Reinforcement Learning | Shengchao Hu et.al. | 2405.17098 | null |
2024-05-25 | A Bi-Objective Approach to Last-Mile Delivery Routing Considering Driver Preferences | Juan Pablo Mesa et.al. | 2405.16051 | null |
2024-06-03 | Inference of Utilities and Time Preference in Sequential Decision-Making | Haoyang Cao et.al. | 2405.15975 | null |
2024-05-31 | Stability and Performance Analysis of Model Predictive Control of Uncertain Linear Systems | Changrui Liu et.al. | 2405.15552 | link |
2024-05-24 | An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking | Pratyusha Musunuru et.al. | 2405.15137 | null |
2024-05-23 | Two-Stage ML-Guided Decision Rules for Sequential Decision Making under Uncertainty | Andrew Rosemberg et.al. | 2405.14973 | null |
2024-05-23 | A rolling horizon heuristic approach for a multi-stage stochastic waste collection problem | Andrea Spinelli et.al. | 2405.14499 | link |
2024-05-23 | EdgeShard: Efficient LLM Inference via Collaborative Edge Computing | Mingjin Zhang et.al. | 2405.14371 | null |
2024-05-23 | Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction | Federica Storiale et.al. | 2405.14363 | null |
2024-05-23 | Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time | Jeremy McMahan et.al. | 2405.14183 | null |
2024-05-22 | Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning | Maximilian Nägele et.al. | 2405.13609 | link |
2024-05-21 | Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods | Ryoya Yamasaki et.al. | 2405.12756 | link |
2024-05-21 | Short and simple introduction to Bellman filtering and smoothing | Rutger-Jan Lange et.al. | 2405.12668 | null |
2024-05-21 | Data-driven Coordinated AC/DC Control Strategy for Frequency Safety | Qianni Cao et.al. | 2405.12546 | null |
2024-05-20 | Semantic Trajectory Data Mining with LLM-Informed POI Classification | Yifan Liu et.al. | 2405.11715 | null |
2024-05-18 | On the Trajectory Regularity of ODE-based Diffusion Sampling | Defang Chen et.al. | 2405.11326 | link |
2024-05-15 | Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task | Shurong Wang et.al. | 2405.09477 | null |
2024-05-14 | Treatment Effect Estimation for User Interest Exploration on Recommender Systems | Jiaju Chen et.al. | 2405.08582 | link |
2024-05-27 | Dynamic Programming for Symbolic Boolean Realizability and Synthesis | Yi Lin et.al. | 2405.07975 | null |
2024-05-13 | Space Domain based Ecological Cooperative and Adaptive Cruise Control on Rolling Terrain | Mingyue Lei et.al. | 2405.07553 | null |
2024-05-12 | Deciding regular games: a playground for exponential time algorithms | Zihui Liang et.al. | 2405.07188 | null |
2024-05-12 | Trade execution games in a Markovian environment | Masamitsu Ohnishi et.al. | 2405.07184 | null |
2024-05-10 | Dynamic programming principle and computable prices in financial market models with transaction costs | Emmanuel Lepinette et.al. | 2405.06623 | null |
2024-05-09 | Change point localisation and inference in fragmented functional data | Gengyu Xue et.al. | 2405.05730 | link |
2024-05-09 | Infinite horizon stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems | Sheng Luo et.al. | 2405.05561 | null |
2024-05-14 | Robust Reward Placement under Uncertainty | Petros Petsinis et.al. | 2405.05433 | null |
2024-05-06 | Novel Tour Construction Heuristic for Pick-Up and Delivery Routing Problems | Mithun Goutham et.al. | 2405.03774 | null |
2024-05-05 | TSP Escapes the $O(2^n n^2)$ Curse | Mihail Stoian et.al. | 2405.03018 | link |
2024-05-02 | DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines | Ye Tian et.al. | 2405.01248 | null |
2024-05-02 | Lipschitz constant estimation for general neural network architectures using control tools | Patricia Pauli et.al. | 2405.01125 | link |
2024-05-01 | A biased random-key genetic algorithm with variable mutants to solve a vehicle routing problem | Paola Festa et.al. | 2405.00268 | null |
2024-04-28 | Bi-objective optimization of a VRP problem applied to urban solid waste collection through a model that includes the visual attraction of routes | Diego Rossit et.al. | 2405.00068 | null |
2024-04-26 | Energy Storage Arbitrage in Two-settlement Markets: A Transformer-Based Approach | Saud Alghumayjan et.al. | 2404.17683 | null |
2024-04-25 | Path integral control under McKean-Vlasov dynamics | Timothy Bennett et.al. | 2404.17006 | null |
2024-04-25 | Parallel and (Nearly) Work-Efficient Dynamic Programming | Xiangyun Ding et.al. | 2404.16314 | link |
2024-04-23 | Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes | Yanjun Han et.al. | 2404.15454 | null |
2024-04-26 | Variational Dynamic Programming for Stochastic Optimal Control | Marc Lambert et.al. | 2404.14806 | link |
2024-04-22 | Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 $^\circ$ VR Video Streaming | Haopeng Wang et.al. | 2404.14573 | null |
2024-04-21 | Stochastic Multi-round Submodular Optimization with Budget | Vincenzo Auletta et.al. | 2404.13737 | null |
2024-04-21 | Planning of Truck Platooning for Road-Network Capacitated Vehicle Routing Problem | Yilang Hao et.al. | 2404.13512 | null |
2024-04-20 | Liquidity Pool Design on Automated Market Makers | Xue Dong He et.al. | 2404.13291 | null |
2024-04-19 | Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning | Daniel May et.al. | 2404.13142 | null |
2024-04-18 | NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model | Sevin Mohammadi et.al. | 2404.12460 | null |
2024-04-18 | Recursive stochastic differential games with non-Lipschitzian generators and viscosity solutions of Hamilton-Jacobi-Bellman-Isaacs equation | Guangchen Wang et.al. | 2404.12129 | null |
2024-04-18 | Actor-Critic Reinforcement Learning with Phased Actor | Ruofan Wu et.al. | 2404.11834 | null |
2024-04-18 | Itō and Itō-Wentzell chain rule for flows of conditional laws of continuous semimartingales: an easy approach | Assil Fadle et.al. | 2404.11010 | null |
2024-04-16 | Zero-Sum Games for Volterra Integral Equations and Viscosity Solutions of Path-Dependent Hamilton-Jacobi Equations | Mikhail I. Gomoyunov et.al. | 2404.10428 | null |
2024-04-16 | Urban Water Sprinkler Routing: A Multi-Depot Mixed Capacitated Arc Routing Problem Incorporating Real-Time Demands | Hongtai Yang et.al. | 2404.10230 | null |
2024-04-13 | Fast Gradient Computation for Gromov-Wasserstein Distance | Wei Zhang et.al. | 2404.08970 | null |
2024-04-12 | A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees | Aaresh Bhathena et.al. | 2404.08178 | link |
2024-04-06 | Viscosity solutions for mean field optimal switching with a two-time-scale Markov chain | Tian Chen et.al. | 2404.07998 | null |
2024-04-11 | Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach | Hyun Joe Jeong et.al. | 2404.07431 | null |
2024-04-09 | Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes | Matilde Gargiani et.al. | 2404.06136 | null |
2024-04-09 | fastcpd: Fast Change Point Detection in R | Xingchi Li et.al. | 2404.05933 | link |
2024-04-08 | Non-concave distributionally robust stochastic control in a discrete time finite horizon setting | Ariel Neufeld et.al. | 2404.05230 | link |
2024-04-07 | Percentile Criterion Optimization in Offline Reinforcement Learning | Elita A. Lobo et.al. | 2404.05055 | link |
2024-04-05 | A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping | Javier Rodriguez-Sanchez et.al. | 2404.04404 | null |
2024-04-04 | Forecasting with Neuro-Dynamic Programming | Pedro Afonso Fernandes et.al. | 2404.03737 | null |
2024-04-03 | Reinforcement Learning in Categorical Cybernetics | Jules Hedges et.al. | 2404.02688 | null |
2024-04-03 | Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization | Chanyeong Kim et.al. | 2404.02583 | null |
2024-04-01 | Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Gengyu Zhang et.al. | 2404.02176 | null |
2024-03-31 | Adversarially-Robust Inference on Trees via Belief Propagation | Samuel B. Hopkins et.al. | 2404.00768 | null |
2024-03-28 | A Faster Algorithm for Pigeonhole Equal Sums | Ce Jin et.al. | 2403.19117 | null |
2024-03-27 | Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees | Jonathan de Brusse et.al. | 2403.19007 | null |
2024-03-27 | A Dynamic Programming Approach for Road Traffic Estimation | Mattia Laurini et.al. | 2403.18561 | null |
2024-03-26 | Generalized Maximum Entropy Differential Dynamic Programming | Yuichiro Aoyama et.al. | 2403.18130 | null |
2024-03-26 | Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer | Jeong-Yoon Kim et.al. | 2403.17327 | link |
2024-03-25 | State-Augmented Linear Games with Antagonistic Error for High-Dimensional, Nonlinear Hamilton-Jacobi Reachability | Will Sharpless et.al. | 2403.16982 | link |
2024-03-25 | Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints | Jiping Luo et.al. | 2403.16855 | null |
2024-03-24 | On the Navier-Stokes equations and the Hamilton-Jacobi-Bellman equation on the group of volume preserving diffeomorphisms | Xiang-Dong Li et.al. | 2403.15997 | null |
2024-03-23 | On Merton’s Optimal Portfolio Problem under Sporadic Bankruptcy | Yaacov Kopeliovich et.al. | 2403.15923 | link |
2024-03-22 | Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards | Daniel C. May et.al. | 2403.15617 | null |
2024-03-19 | Most Likely Sequence Generation for $n$ -Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms | Yuchao Li et.al. | 2403.15465 | null |
2024-03-21 | Conservative Linear Envelopes for High-Dimensional, Hamilton-Jacobi Reachability for Nonlinear Systems via the Hopf Formula | Will Sharpless et.al. | 2403.14184 | null |
2024-03-20 | Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements | Hamed Taghavian et.al. | 2403.13605 | null |
2024-03-19 | Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models | Quang Minh Bui et.al. | 2403.12923 | null |
2024-03-18 | AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | SooHwan Eom et.al. | 2403.11578 | null |
2024-03-17 | Multiscale Quantile Regression with Local Error Control | Zhi Liu et.al. | 2403.11356 | link |
2024-03-15 | Fast Generation of Feasible Trajectories in Direct Optimal Control | David Kiessling et.al. | 2403.10115 | link |
2024-03-14 | Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems | Ralf Römer et.al. | 2403.09504 | link |
2024-03-14 | Quantum Dynamic Programming | Jeongrak Son et.al. | 2403.09187 | null |
2024-03-15 | Relationship between General MP and DPP for the Stochastic Recursive Optimal Control Problem With Jumps: Viscosity Solution Framework | Bin Wang et.al. | 2403.09044 | null |
2024-03-13 | Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | Jiajun Shen et.al. | 2403.08948 | null |
2024-03-13 | Online Multi-Contact Feedback Model Predictive Control for Interactive Robotic Tasks | Seo Wook Han et.al. | 2403.08302 | null |
2024-03-12 | Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | Maqsood Hussain Shah et.al. | 2403.07964 | null |
2024-03-12 | The Primal Pathwidth SETH | Michael Lampis et.al. | 2403.07239 | null |
2024-03-10 | A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units | Liyue Chen et.al. | 2403.07022 | link |
2024-03-11 | Domain-Independent Dynamic Programming and Constraint Programming Approaches for Assembly Line Balancing Problems with Setups | Jiachen Zhang et.al. | 2403.06780 | null |
2024-03-11 | Balanced Substructures in Bicolored Graphs | P. S. Ardra et.al. | 2403.06608 | null |
2024-03-11 | An Efficient Solution to the 2D Visibility Problem in Cartesian Grid Maps and its Application in Heuristic Path Planning | Ibrahim Ibrahim et.al. | 2403.06494 | link |
2024-03-11 | AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping | Seongyeon Park et.al. | 2403.06478 | link |
2024-03-09 | Spatial Clustering Approach for Vessel Path Identification | Mohamed Abuella et.al. | 2403.05778 | link |
2024-03-07 | On $[1,2]$ -Domination in Interval and Circle Graphs | Mohsen Alambardar Meybodi et.al. | 2403.04694 | null |
2024-03-07 | Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Sadegh Sadeghi Tabas et.al. | 2403.04195 | null |
2024-03-06 | Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling | Nicholas Kunz et.al. | 2403.03489 | link |
2024-03-06 | SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization | Juntong Chen et.al. | 2403.03449 | link |
2024-03-06 | Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health | Yuanzhe Huang et.al. | 2403.03414 | null |
2024-03-04 | Dynamic programming principle in cost-efficient sequential design: application to switching measurements | Jeongmin Han et.al. | 2403.02245 | null |
2024-03-04 | Cooperative and Interaction-aware Driver Model for Lane Change Maneuver | Jemin Woo et.al. | 2403.01752 | null |
2024-03-01 | DyPyBench: A Benchmark of Executable Python Software | Islem Bouzenia et.al. | 2403.00539 | link |
2024-03-01 | Graph Construction with Flexible Nodes for Traffic Demand Prediction | Jinyan Hou et.al. | 2403.00276 | link |
2024-02-29 | Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress | Ameya Prabhu et.al. | 2402.19472 | link |
2024-02-27 | Globally Convergent Distributed Sequential Quadratic Programming with Overlapping Decomposition and Exact Augmented Lagrangian Merit Function | Runxin Ni et.al. | 2402.17170 | null |
2024-02-24 | Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems | Abdelkarim Ben Sada et.al. | 2402.16904 | null |
2024-02-25 | IKLink: End-Effector Trajectory Tracking with Minimal Reconfigurations | Yeping Wang et.al. | 2402.16154 | link |
2024-02-25 | Evolving E-commerce Logistics Planning- Integrating Embedded Technology and Ant Colony Algorithm for Enhanced Efficiency | Lynn Huang et.al. | 2402.15965 | null |
2024-02-25 | Budget-Constrained Tool Learning with Planning | Yuanhang Zheng et.al. | 2402.15960 | link |
2024-02-23 | Neural optimal controller for stochastic systems via pathwise HJB operator | Zhe Jiao et.al. | 2402.15592 | null |
2024-02-23 | Curve fitting on a quantum annealer for an advanced navigation method | Philipp Isserstedt et.al. | 2402.15308 | null |
2024-02-22 | Quantum Markov Decision Processes Part II: Optimal Solutions and Algorithms | Naci Saldi et.al. | 2402.14651 | null |
2024-02-22 | Quantum Markov Decision Processes Part I: General Theory, Approximations, and Classes of Policies | Naci Saldi et.al. | 2402.14649 | null |
2024-02-21 | Quantum Annealing and Graph Neural Networks for Solving TSP with QUBO | Haoqi He et.al. | 2402.14036 | null |
2024-02-21 | Do Efficient Transformers Really Save Computation? | Kai Yang et.al. | 2402.13934 | null |
2024-02-21 | Benchmarking and Dissecting the Nvidia Hopper GPU Architecture | Weile Luo et.al. | 2402.13499 | null |
2024-02-20 | An Improved Lower Bound on the Number of Pseudoline Arrangements | Fernando Cortés Kühnast et.al. | 2402.13107 | null |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-19 | An algorithm for counting number of all (normal) fuzzy subgroups in $U_{6n}$ | Marek Hyčko et.al. | 2402.12543 | null |
2024-02-29 | Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Zhuoming Chen et.al. | 2402.12374 | link |
2024-02-19 | Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method | Zhijian Duan et.al. | 2402.11904 | null |
2024-02-19 | Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic | Jeremy J. Lin et.al. | 2402.11866 | null |
2024-02-18 | A Fisher Information based Receding Horizon Control Method for Signal Strength Model Estimation | Yancheng Zhu et.al. | 2402.11483 | null |
2024-02-16 | Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior | Hao Liu et.al. | 2402.10768 | null |
2024-02-15 | Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys | Augustin Bouquillard et.al. | 2402.10247 | null |
2024-02-14 | Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem | Wenhan Cao et.al. | 2402.09575 | null |
2024-02-13 | Approximate Sequential Optimization for Informative Path Planning | Joshua Ott et.al. | 2402.08841 | link |
2024-02-13 | Sequence graphs realizations and ambiguity in language models | Sammy Khalife et.al. | 2402.08830 | null |
2024-02-11 | GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of Feature Domains | Yan Lin et.al. | 2402.07232 | link |
2024-02-09 | High-Precision Geosteering via Reinforcement Learning and Particle Filters | Ressi Bonti Muhammad et.al. | 2402.06377 | null |
2024-02-09 | Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series | Zitong Yang et.al. | 2402.05203 | link |
2024-02-04 | Empowering Computing and Networks Convergence System with Distributed Cooperative Routing | Yujiao Hu et.al. | 2402.02381 | null |
2024-02-03 | Multiple sequences Prophet Inequality Under Observation Constraints | Aristomenis Tsopelakos et.al. | 2402.02059 | null |
2024-02-02 | Capturing waste collection planning expert knowledge in a fitness function through preference learning | Laura Fernández Díaz et.al. | 2402.01849 | null |
2024-02-02 | Dynamic programming for the stochastic matching model on general graphs: the case of the `N-graph’ | Loïc Jean et.al. | 2402.01803 | null |
2024-02-01 | AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems | Ruihan Zhou et.al. | 2402.00907 | null |
2024-02-01 | Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization | Zhanhong Tan et.al. | 2402.00629 | null |
2024-02-02 | Branch and Price for the Length-Constrained Cycle Partition Problem | Mohammed Ghannam et.al. | 2401.17937 | link |
2024-01-31 | Revisiting speech segmentation and lexicon learning with better features | Herman Kamper et.al. | 2401.17902 | null |
2024-02-16 | The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games | Jingqi Li et.al. | 2401.15745 | link |
2024-01-28 | HappyRouting: Learning Emotion-Aware Route Trajectories for Scalable In-The-Wild Navigation | David Bethge et.al. | 2401.15695 | null |
2024-01-28 | Constrained Markov decision processes for response-adaptive procedures in clinical trials with binary outcomes | Stef Baas et.al. | 2401.15694 | null |
2024-01-27 | Fair and Efficient Ridesharing: A Dynamic Programming-based Relocation Approach | Aqsa Ashraf Makhdomi et.al. | 2401.15363 | null |
2024-01-27 | Optimal Sparse Survival Trees | Rui Zhang et.al. | 2401.15330 | link |
2024-01-25 | Domain-Independent Dynamic Programming | Ryo Kuroiwa et.al. | 2401.13883 | link |
2024-01-27 | Deep multitask neural networks for solving some stochastic optimal control problems | Christian Yeo et.al. | 2401.12923 | link |
2024-01-23 | Optimal Stopping of Branching Diffusion Processes | Idris Kharroubi et.al. | 2401.12811 | null |
2024-01-22 | On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms | Sergey S. Ketkov et.al. | 2401.12010 | null |
2024-01-22 | Finite horizon optimal control of reaction-diffusion SIV epidemic system with stochastic environment | Zong Wang et.al. | 2401.11744 | null |
2024-01-20 | Closing the Gap between TD Learning and Supervised Learning – A Generalisation Point of View | Raj Ghugare et.al. | 2401.11237 | link |
Large Language Model
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-10 | Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Alan Nawzad Amin et.al. | 2412.07763 | link |
2024-12-10 | SAT: Spatial Aptitude Training for Multimodal Language Models | Arijit Ray et.al. | 2412.07755 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-10 | Zero-Shot ATC Coding with Large Language Models for Clinical Assessments | Zijian Chen et.al. | 2412.07743 | null |
2024-12-10 | AI Expands Scientists’ Impact but Contracts Science’s Focus | Qianyue Hao et.al. | 2412.07727 | null |
2024-12-10 | Granite Guardian | Inkit Padhi et.al. | 2412.07724 | link |
2024-12-10 | Leveraging Content and Context Cues for Low-Light Image Enhancement | Igor Morawski et.al. | 2412.07693 | null |
2024-12-10 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-10 | Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions | Anant Prakash Awasthi et.al. | 2412.07687 | null |
2024-12-10 | TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation | Alfredo Garrachón Ruiz et.al. | 2412.07682 | null |
2024-12-10 | RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models | Greg Heinrich et.al. | 2412.07679 | null |
2024-12-10 | Ask Humans or AI? Exploring Their Roles in Visualization Troubleshooting | Shuyu Shen et.al. | 2412.07673 | null |
2024-12-10 | FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks | Bocheng Chen et.al. | 2412.07672 | null |
2024-12-10 | Automating Business Intelligence Requirements with Generative AI and Semantic Search | Nimrod Busany et.al. | 2412.07668 | null |
2024-12-10 | Searching for Structure: Investigating Emergent Communication with Large Language Models | Tom Kouwenhoven et.al. | 2412.07646 | null |
2024-12-10 | TrojanWhisper: Evaluating Pre-trained LLMs to Detect and Localize Hardware Trojans | Md Omar Faruque et.al. | 2412.07636 | null |
2024-12-10 | ChocoLlama: Lessons Learned From Teaching Llamas Dutch | Matthieu Meeus et.al. | 2412.07633 | null |
2024-12-10 | Piece of Table: A Divide-and-Conquer Approach for Selecting Sub-Tables in Table Question Answering | Wonjin Lee et.al. | 2412.07629 | null |
2024-12-10 | OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Linke Ouyang et.al. | 2412.07626 | link |
2024-12-10 | DRUM: Learning Demonstration Retriever for Large MUlti-modal Models | Ellen Yi-Ge et.al. | 2412.07619 | null |
2024-12-09 | Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Yi-Lun Lee et.al. | 2412.06775 | link |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | Training Large Language Models to Reason in a Continuous Latent Space | Shibo Hao et.al. | 2412.06769 | null |
2024-12-09 | Ranking-aware adapter for text-driven image ordering with CLIP | Wei-Hsiang Yu et.al. | 2412.06760 | link |
2024-12-09 | Why Do Developers Engage with ChatGPT in Issue-Tracker? Investigating Usage and Reliance on ChatGPT-Generated Code | Joy Krishan Das et.al. | 2412.06757 | null |
2024-12-09 | Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models | Neel Jain et.al. | 2412.06748 | null |
2024-12-09 | ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities | Adhiraj Ghosh et.al. | 2412.06745 | null |
2024-12-09 | JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM | Takuro Fujii et.al. | 2412.06738 | null |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | null |
2024-12-09 | How to Merge Your Multimodal Models Over Time? | Sebastian Dziadzio et.al. | 2412.06712 | null |
2024-12-09 | OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions | Yi-Kai Zhang et.al. | 2412.06693 | null |
2024-12-09 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | I Don’t Know: Explicit Modeling of Uncertainty with an [IDK] Token | Roi Cohen et.al. | 2412.06676 | null |
2024-12-09 | ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Chunwei Wang et.al. | 2412.06673 | null |
2024-12-09 | MuMu-LLaMA: Multi-modal Music Understanding and Generation via Large Language Models | Shansong Liu et.al. | 2412.06660 | null |
2024-12-09 | Chatbots im Schulunterricht: Wir testen das Fobizz-Tool zur automatischen Bewertung von Hausaufgaben | Rainer Mühlhoff et.al. | 2412.06651 | null |
2024-12-09 | The Narrow Gate: Localized Image-Text Communication in Vision-Language Models | Alessandro Serra et.al. | 2412.06646 | null |
2024-12-09 | MAVias: Mitigate any Visual Bias | Ioannis Sarridis et.al. | 2412.06632 | null |
2024-12-09 | Copyright-Protected Language Generation via Adaptive Model Fusion | Javier Abad et.al. | 2412.06619 | link |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link |
2024-12-06 | Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Zhe Chen et.al. | 2412.05271 | null |
2024-12-06 | APOLLO: SGD-like Memory, AdamW-level Performance | Hanqing Zhu et.al. | 2412.05270 | null |
2024-12-06 | Uncertainty Quantification for Transformer Models for Dark-Pattern Detection | Javier Muñoz et.al. | 2412.05251 | null |
2024-12-06 | Enhancing Foundation Models for Time Series Forecasting via Wavelet-based Tokenization | Luca Masserano et.al. | 2412.05244 | null |
2024-12-06 | CompCap: Improving Multimodal Large Language Models with Composite Captions | Xiaohui Chen et.al. | 2412.05243 | null |
2024-12-06 | MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale | Jarvis Guo et.al. | 2412.05237 | null |
2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null |
2024-12-06 | 100% Hallucination Elimination Using Acurai | Michael C. Wood et.al. | 2412.05223 | null |
2024-12-06 | Evaluating and Aligning CodeLLMs on Human Preference | Jian Yang et.al. | 2412.05210 | null |
2024-12-06 | A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges | Aditi Singh et.al. | 2412.05208 | null |
2024-12-06 | Are Frontier Large Language Models Suitable for Q&A in Science Centres? | Jacob Watson et.al. | 2412.05200 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | LinVT: Empower Your Image-level Large Language Model to Understand Videos | Lishuai Gao et.al. | 2412.05185 | link |
2024-12-06 | QueEn: A Large Language Model for Quechua-English Translation | Junhao Chen et.al. | 2412.05184 | null |
2024-12-06 | Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models | Kuofeng Gao et.al. | 2412.05167 | null |
2024-12-06 | Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation | Manish Bhattarai et.al. | 2412.05159 | null |
2024-12-06 | Multimodal Fact-Checking with Vision Language Models: A Probing Classifier based Solution with Embedding Strategies | Recep Firat Cekinel et.al. | 2412.05155 | null |
2024-12-06 | A text-to-tabular approach to generate synthetic patient data using LLMs | Margaux Tornqvist et.al. | 2412.05153 | null |
2024-12-05 | Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Luca Bartolomei et.al. | 2412.04472 | link |
2024-12-05 | NVILA: Efficient Frontier Visual Language Models | Zhijian Liu et.al. | 2412.04468 | null |
2024-12-05 | VisionZip: Longer is Better but Not Necessary in Vision Language Models | Senqiao Yang et.al. | 2412.04467 | link |
2024-12-05 | Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection | Enshen Zhou et.al. | 2412.04455 | null |
2024-12-05 | p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Jun Zhang et.al. | 2412.04449 | link |
2024-12-05 | EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios | Lu Qiu et.al. | 2412.04447 | null |
2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
2024-12-05 | Moto: Latent Motion Token as the Bridging Language for Robot Manipulation | Yi Chen et.al. | 2412.04445 | null |
2024-12-05 | Towards Real-Time Open-Vocabulary Video Instance Segmentation | Bin Yan et.al. | 2412.04434 | null |
2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-05 | Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Shaunak Halbe et.al. | 2412.04429 | link |
2024-12-05 | Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion | Jiuhai Chen et.al. | 2412.04424 | link |
2024-12-05 | Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation | Xuying Li et.al. | 2412.04415 | null |
2024-12-05 | Establishing Task Scaling Laws via Compute-Efficient Model Ladders | Akshita Bhagia et.al. | 2412.04403 | null |
2024-12-05 | SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Rong Li et.al. | 2412.04383 | null |
2024-12-05 | Discriminative Fine-tuning of LVLMs | Yassine Ouali et.al. | 2412.04378 | null |
2024-12-05 | Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Edoardo Cetin et.al. | 2412.04368 | null |
2024-12-05 | Approximate Top- $k$ for Increased Parallelism | Oscar Key et.al. | 2412.04358 | null |
2024-12-05 | Retrieval-Augmented Machine Translation with Unstructured Knowledge | Jiaan Wang et.al. | 2412.04342 | link |
2024-12-05 | Liquid: Language Models are Scalable Multi-modal Generators | Junfeng Wu et.al. | 2412.04332 | null |
2024-12-04 | From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Xinyi Mou et.al. | 2412.03563 | link |
2024-12-04 | FLAIR: VLM with Fine-grained Language-informed Image Representations | Rui Xiao et.al. | 2412.03561 | link |
2024-12-04 | Best-of-N Jailbreaking | John Hughes et.al. | 2412.03556 | link |
2024-12-04 | PaliGemma 2: A Family of Versatile VLMs for Transfer | Andreas Steiner et.al. | 2412.03555 | null |
2024-12-04 | SPICE: Smart Projection Interface for Cooking Enhancement | Vera Prohaska et.al. | 2412.03551 | null |
2024-12-04 | Perception Tokens Enhance Visual Reasoning in Multimodal Language Models | Mahtab Bigverdi et.al. | 2412.03548 | null |
2024-12-04 | Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models | Natalie Mackraz et.al. | 2412.03537 | null |
2024-12-04 | A Review on Scientific Knowledge Extraction using Large Language Models in Biomedical Sciences | Gabriel Lino Garcia et.al. | 2412.03531 | null |
2024-12-04 | FANAL – Financial Activity News Alerting Language Modeling Framework | Urjitkumar Patel et.al. | 2412.03527 | null |
2024-12-04 | You’re (Not) My Type – Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks? | Dominic Lohr et.al. | 2412.03516 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Tight PAC-Bayesian Risk Certificates for Contrastive Learning | Anna van Elst et.al. | 2412.03486 | link |
2024-12-04 | Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning | Neale Ratzlaff et.al. | 2412.03467 | null |
2024-12-04 | Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks | Dario Serez et.al. | 2412.03453 | link |
2024-12-04 | From Words to Workflows: Automating Business Processes | Laura Minkova et.al. | 2412.03446 | null |
2024-12-04 | Assessing Foundation Models’ Transferability to Physiological Signals in Precision Medicine | Matthias Christenson et.al. | 2412.03427 | null |
2024-12-04 | PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation | Ao Wang et.al. | 2412.03409 | link |
2024-12-04 | RedStone: Curating General, Code, Math, and QA Data for Large Language Models | Yaoyao Chang et.al. | 2412.03398 | null |
2024-12-04 | Enhancing Supply Chain Visibility with Generative AI: An Exploratory Case Study on Relationship Prediction in Knowledge Graphs | Ge Zheng et.al. | 2412.03390 | null |
2024-12-04 | WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis | Chengwei Hu et.al. | 2412.03359 | null |
2024-12-03 | T-REG: Preference Optimization with Token-Level Reward Regularization | Wenxuan Zhou et.al. | 2412.02685 | null |
2024-12-03 | Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models | Yuda Song et.al. | 2412.02674 | null |
2024-12-03 | LLM-Enhanced Path Planning: Safe and Efficient Autonomous Navigation with Instructional Inputs | Pranav Doma et.al. | 2412.02655 | null |
2024-12-03 | Time-Reversal Provides Unsupervised Feedback to LLMs | Yerram Varun et.al. | 2412.02626 | null |
2024-12-03 | Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions | Kai Sun et.al. | 2412.02621 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-03 | GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot | Aohan Zeng et.al. | 2412.02612 | link |
2024-12-03 | AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? | Kaixiong Gong et.al. | 2412.02611 | null |
2024-12-03 | Interpretable Company Similarity with Sparse Autoencoders | Marco Molinari et.al. | 2412.02605 | null |
2024-12-03 | CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs | Abhas Kumar et.al. | 2412.02602 | null |
2024-12-03 | PrefixLLM: LLM-aided Prefix Circuit Design | Weihua Xiao et.al. | 2412.02594 | null |
2024-12-03 | OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Junyuan Zhang et.al. | 2412.02592 | link |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | Chenyang Liu et.al. | 2412.02573 | link |
2024-12-03 | SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Joongwon Chae et.al. | 2412.02565 | link |
2024-12-03 | Semantic Tokens in Retrieval Augmented Generation | Joel Suro et.al. | 2412.02563 | null |
2024-12-03 | Patent-CR: A Dataset for Patent Claim Revision | Lekang Jiang et.al. | 2412.02549 | null |
2024-12-03 | Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Jinjin Cai et.al. | 2412.02531 | null |
2024-12-03 | LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data | Hanyu Zhang et.al. | 2412.02525 | null |
2024-12-03 | OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations | Caixin Kang et.al. | 2412.02479 | null |
2024-12-02 | T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Shukang Yin et.al. | 2411.19951 | link |
2024-12-02 | Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability | Zicheng Lin et.al. | 2411.19943 | null |
2024-11-29 | VLSBench: Unveiling Visual Leakage in Multimodal Safety | Xuhao Hu et.al. | 2411.19939 | null |
2024-11-29 | On Domain-Specific Post-Training for Multimodal Large Language Models | Daixuan Cheng et.al. | 2411.19930 | null |
2024-11-29 | SIMS: Simulating Human-Scene Interactions with Real World Script Planning | Wenjia Wang et.al. | 2411.19921 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | PDDLFuse: A Tool for Generating Diverse Planning Domains | Vedant Khandelwal et.al. | 2411.19886 | null |
2024-12-02 | LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states | Luis Ibanez-Lissen et.al. | 2411.19876 | null |
2024-11-29 | DeMo: Decoupled Momentum Optimization | Bowen Peng et.al. | 2411.19870 | link |
2024-11-29 | AIDetx: a compression-based method for identification of machine-learning generated text | Leonardo Almeida et.al. | 2411.19869 | link |
2024-11-29 | Reverse Thinking Makes LLMs Stronger Reasoners | Justin Chih-Yao Chen et.al. | 2411.19865 | null |
2024-11-29 | Cross-Domain Recommendation Meets Large Language Models | Ajay Krishna Vajjala et.al. | 2411.19862 | link |
2024-11-29 | What fifty-one years of Linguistics and Artificial Intelligence research tell us about their correlation: A scientometric review | Mohammed Q. Shormani et.al. | 2411.19858 | null |
2024-11-29 | Sensitive Content Classification in Social Media: A Holistic Resource and Evaluation | Dimosthenis Antypas et.al. | 2411.19832 | null |
2024-11-29 | Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | Robin D. Pesl et.al. | 2411.19804 | null |
2024-11-29 | INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge | Angelika Romanou et.al. | 2411.19799 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-11-29 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et.al. | 2411.19774 | null |
2024-11-29 | LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos | Tiantian Geng et.al. | 2411.19772 | null |
2024-11-29 | Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models | Kaican Li et.al. | 2411.19757 | link |
2024-11-27 | Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation | Yueru Jia et.al. | 2411.18623 | null |
2024-11-27 | Cross-modal Information Flow in Multimodal Large Language Models | Zhi Zhang et.al. | 2411.18620 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | Automated Literature Review Using NLP Techniques and LLM-Based Retrieval-Augmented Generation | Nurshat Fateh Ali et.al. | 2411.18583 | null |
2024-11-27 | Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning | Omkar Khade et.al. | 2411.18571 | null |
2024-11-27 | A Pipeline of Neural-Symbolic Integration to Enhance Spatial Reasoning in Large Language Models | Rong Wang et.al. | 2411.18564 | null |
2024-11-27 | DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation | Zhixuan Liang et.al. | 2411.18562 | null |
2024-11-27 | Retrofitting (Large) Language Models with Dynamic Tokenization | Darius Feher et.al. | 2411.18553 | null |
2024-11-27 | AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Dillon Loh et.al. | 2411.18539 | link |
2024-11-27 | Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models | Minhyeok Lee et.al. | 2411.18530 | link |
2024-11-27 | LLM-ABBA: Understand time series via symbolic approximation | Erin Carson et.al. | 2411.18506 | null |
2024-11-27 | GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation | Pengfei Zhou et.al. | 2411.18499 | null |
2024-11-27 | Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS | Jinyang Wu et.al. | 2411.18478 | null |
2024-11-27 | Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding | Ziyin Zhang et.al. | 2411.18462 | link |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2024-11-27 | An AI-Assisted Multi-Agent Dual Dialogue System to Support Mental Health Care Providers | Onno P. Kampman et.al. | 2411.18429 | null |
2024-11-27 | FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Ao Shen et.al. | 2411.18424 | null |
2024-11-27 | Politicians vs ChatGPT. A study of presuppositions in French and Italian political communication | Davide Garassino et.al. | 2411.18403 | null |
2024-11-27 | Topic Modeling and Sentiment Analysis on Japanese Online Media’s Coverage of Nuclear Energy | Yifan Sun et.al. | 2411.18383 | null |
2024-11-27 | ChatGPT as speechwriter for the French presidents | Dominique Labbé et.al. | 2411.18382 | null |
2024-11-26 | Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats | Jiaxin Wen et.al. | 2411.17693 | null |
2024-11-26 | Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens | Xu Ouyang et.al. | 2411.17691 | null |
2024-11-26 | Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration | Yuhang Han et.al. | 2411.17686 | null |
2024-11-26 | Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning | Zhu Xu et.al. | 2411.17679 | link |
2024-11-26 | Instance-Aware Graph Prompt Learning | Jiazheng Li et.al. | 2411.17676 | null |
2024-11-26 | Push the Limit of Multi-modal Emotion Recognition by Prompting LLMs with Receptive-Field-Aware Attention Weighting | Liyun Zhang et.al. | 2411.17674 | null |
2024-11-26 | SketchAgent: Language-Driven Sequential Sketch Generation | Yael Vinker et.al. | 2411.17673 | null |
2024-11-26 | Synthetic Data Generation with LLM for Improved Depression Prediction | Andrea Kang et.al. | 2411.17672 | null |
2024-11-26 | How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations | Hyunji Lee et.al. | 2411.17666 | null |
2024-11-26 | Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism | Yi-Chien Lin et.al. | 2411.17651 | null |
2024-11-26 | On Limitations of LLM as Annotator for Low Resource Languages | Suramya Jadhav et.al. | 2411.17637 | null |
2024-11-26 | MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Harsh Singh et.al. | 2411.17636 | null |
2024-11-26 | Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining | Jaewoong Lee et.al. | 2411.17625 | null |
2024-11-26 | Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Aohan Zeng et.al. | 2411.17607 | null |
2024-11-26 | HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Cong Wei et.al. | 2411.17606 | link |
2024-11-26 | Making History Readable | Bipasha Banerjee et.al. | 2411.17600 | null |
2024-11-26 | Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals | William A. Ingram et.al. | 2411.17598 | null |
2024-11-26 | Can artificial intelligence predict clinical trial outcomes? | Shuyi Jin et.al. | 2411.17595 | null |
2024-11-26 | RTL-Breaker: Assessing the Security of LLMs against Backdoor Attacks on HDL Code Generation | Lakshmi Likhitha Mankali et.al. | 2411.17569 | null |
2024-11-26 | Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey | Jiayi Kuang et.al. | 2411.17558 | null |
2024-11-25 | Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts? | Sohee Yang et.al. | 2411.16679 | null |
2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
2024-11-25 | DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Zun Wang et.al. | 2411.16657 | null |
2024-11-25 | Self-Generated Critiques Boost Reward Modeling for Language Models | Yue Yu et.al. | 2411.16646 | null |
2024-11-25 | Preventing Jailbreak Prompts as Malicious Tools for Cybercriminals: A Cyber Defense Perspective | Jean Marie Tshimula et.al. | 2411.16642 | null |
2024-11-25 | StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training | Kaustubh Ponkshe et.al. | 2411.16618 | null |
2024-11-25 | Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models | Ronghuan Wu et.al. | 2411.16602 | null |
2024-11-25 | From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge | Dawei Li et.al. | 2411.16594 | link |
2024-11-25 | Large Language Model-based Decision-making for COLREGs and the Control of Autonomous Surface Vehicles | Klinsmann Agyei et.al. | 2411.16587 | null |
2024-11-25 | MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series | Aaron Wheeler et.al. | 2411.16585 | link |
2024-11-25 | Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision | Zhiheng Xi et.al. | 2411.16579 | null |
2024-11-25 | Predictive Power of LLMs in Financial Markets | Jerick Shi et.al. | 2411.16569 | null |
2024-11-25 | EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source Code | Shahriyar Zaman Ridoy et.al. | 2411.16561 | null |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Representation Collapsing Problems in Vector Quantization | Wenhao Zhao et.al. | 2411.16550 | null |
2024-11-25 | RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Chan Hee Song et.al. | 2411.16537 | null |
2024-11-25 | Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings | Carolin M. Schuster et.al. | 2411.16527 | null |
2024-11-25 | Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency | Jerry Yao-Chieh Hu et.al. | 2411.16525 | null |
2024-11-25 | LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation | Steven Song et.al. | 2411.16523 | null |
2024-11-25 | Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis | Boming Miao et.al. | 2411.16503 | null |
2024-11-22 | Measuring Bullshit in the Language Games played by ChatGPT | Alessandro Trevisan et.al. | 2411.15129 | null |
2024-11-22 | Health AI Developer Foundations | Atilla P. Kiraly et.al. | 2411.15128 | null |
2024-11-22 | TÜLU 3: Pushing Frontiers in Open Language Model Post-Training | Nathan Lambert et.al. | 2411.15124 | link |
2024-11-22 | RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Hjalmar Wijk et.al. | 2411.15114 | link |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-11-22 | AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution | Fengyuan Liu et.al. | 2411.15102 | link |
2024-11-22 | What You See is Not What You Get: Neural Partial Differential Equations and The Illusion of Learning | Arvind Mohan et.al. | 2411.15101 | null |
2024-11-22 | XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models | Yixin Dong et.al. | 2411.15100 | null |
2024-11-22 | Context-Aware Multimodal Pretraining | Karsten Roth et.al. | 2411.15099 | null |
2024-11-22 | mR $^2$ AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA | Tao Zhang et.al. | 2411.15041 | null |
2024-11-22 | One to rule them all: natural language to bind communication, perception and action | Simone Colombani et.al. | 2411.15033 | null |
2024-11-22 | Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot | Simone Colombani et.al. | 2411.15027 | null |
2024-11-22 | DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models | Keda Tao et.al. | 2411.15024 | null |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Junhong Shen et.al. | 2411.15004 | link |
2024-11-22 | Generative AI may backfire for counterspeech | Dominik Bär et.al. | 2411.14986 | null |
2024-11-22 | Exploring Foundation Models Fine-Tuning for Cytology Classification | Manon Dausort et.al. | 2411.14975 | link |
2024-11-22 | Open-Amp: Synthetic Data Framework for Audio Effect Foundation Models | Alec Wright et.al. | 2411.14972 | link |
2024-11-22 | SwissADT: An Audio Description Translation System for Swiss Languages | Lukas Fischer et.al. | 2411.14967 | null |
2024-11-22 | LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement | Jieming Bian et.al. | 2411.14961 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | From RNNs to Foundation Models: An Empirical Study on Commercial Building Energy Consumption | Shourya Bose et.al. | 2411.14421 | null |
2024-11-21 | Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding | Yiming Zhang et.al. | 2411.14401 | null |
2024-11-21 | Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings | Aaron Zheng et.al. | 2411.14398 | null |
2024-11-21 | UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages | Bethel Melesse Tessema et.al. | 2411.14343 | link |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | null |
2024-11-21 | Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training | Zheheng Luo et.al. | 2411.14318 | null |
2024-11-21 | Automated Generation of Code Debugging Exercises | Victor-Alexandru Pădurean et.al. | 2411.14303 | null |
2024-11-21 | Auto-SPICE: Leveraging LLMs for Dataset Creation via Automated SPICE Netlist Extraction from Analog Circuit Diagrams | Jitendra Bhandari et.al. | 2411.14299 | link |
2024-11-21 | EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Yumeng Liu et.al. | 2411.14280 | null |
2024-11-21 | Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance | Haozhe Zhao et.al. | 2411.14279 | null |
2024-11-21 | Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models | Iacopo Ghinassi et.al. | 2411.14272 | link |
2024-11-21 | Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective | Ernests Lavrinovics et.al. | 2411.14258 | null |
2024-11-21 | Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models | Javier Ferrando et.al. | 2411.14257 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification | Junhua Liu et.al. | 2411.14252 | null |
2024-11-21 | Natural Language Reinforcement Learning | Xidong Feng et.al. | 2411.14251 | null |
2024-11-21 | FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression | Yuke Zhu et.al. | 2411.14228 | null |
2024-11-21 | Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data | Paul Fergus et.al. | 2411.14219 | null |
2024-11-20 | Find Any Part in 3D | Ziqi Ma et.al. | 2411.13550 | null |
2024-11-20 | SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs | Shirley Kokane et.al. | 2411.13547 | null |
2024-11-20 | Promoting User Data Autonomy During the Dissolution of a Monopolistic Firm | Rushabh Solanki et.al. | 2411.13546 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Metacognition for Unknown Situations and Environments (MUSE) | Rodolfo Valiente et.al. | 2411.13537 | null |
2024-11-20 | Predictive Insights into LGBTQ+ Minority Stress: A Transductive Exploration of Social Media Discourse | S. Chapagain et.al. | 2411.13534 | link |
2024-11-20 | Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models | Chanseo Lee et.al. | 2411.13518 | null |
2024-11-20 | Disentangling Memory and Reasoning Ability in Large Language Models | Mingyu Jin et.al. | 2411.13504 | link |
2024-11-20 | Neural machine translation of seismic waves for petrophysical inversion | José Cunha Teixeira et.al. | 2411.13491 | null |
2024-11-20 | Utilizing Large Language Models to Synthesize Product Desirability Datasets | John D. Hastings et.al. | 2411.13485 | null |
2024-11-20 | PatentEdits: Framing Patent Novelty as Textual Entailment | Ryan Lee et.al. | 2411.13477 | null |
2024-11-20 | When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training | Haonan Wang et.al. | 2411.13476 | link |
2024-11-20 | SoK: A Systems Perspective on Compound AI Threats and Countermeasures | Sarbartha Banerjee et.al. | 2411.13459 | null |
2024-11-20 | LIMBA: An Open-Source Framework for the Preservation and Valorization of Low-Resource Languages using Generative Models | Salvatore Mario Carta et.al. | 2411.13453 | null |
2024-11-20 | AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations | Gaurav Verma et.al. | 2411.13451 | null |
2024-11-20 | WaterPark: A Robustness Assessment of Language Model Watermarking | Jiacheng Liang et.al. | 2411.13425 | link |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Unification of Balti and trans-border sister dialects in the essence of LLMs and AI Technology | Muhammad Sharif et.al. | 2411.13409 | null |
2024-11-20 | Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese | Dat Van-Thanh Nguyen et.al. | 2411.13407 | null |
2024-11-19 | ACING: Actor-Critic for Instruction Learning in Black-Box Large Language Models | Salma Kharrat et.al. | 2411.12736 | link |
2024-11-19 | Information Theory of Meaningful Communication | Doron Sivan et.al. | 2411.12728 | null |
2024-11-19 | CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs | Zhehan Kan et.al. | 2411.12713 | null |
2024-11-19 | Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs | Ahmed Akib Jawad Karim et.al. | 2411.12712 | null |
2024-11-19 | Strengthening Fake News Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques. Defying BERT? | Ahmed Akib Jawad Karim et.al. | 2411.12703 | null |
2024-11-19 | When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations | Huaizhi Ge et.al. | 2411.12701 | null |
2024-11-19 | SparseInfer: Training-free Prediction of Activation Sparsity for Fast LLM Inference | Jiho Shin et.al. | 2411.12692 | null |
2024-11-19 | Neurosymbolic Graph Enrichment for Grounded World Models | Stefano De Giorgis et.al. | 2411.12671 | null |
2024-11-19 | DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Vinay Kumar Sankarapu et.al. | 2411.12643 | link |
2024-11-19 | Improving Controllability and Editability for Pretrained Text-to-Music Generation Models | Yixiao Zhang et.al. | 2411.12641 | null |
2024-11-19 | Provable unlearning in topic modeling and downstream tasks | Stanley Wei et.al. | 2411.12600 | null |
2024-11-19 | AdaCM $^2$ : On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Yuanbin Man et.al. | 2411.12593 | null |
2024-11-19 | Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models | Laura Ruis et.al. | 2411.12580 | link |
2024-11-19 | Large Language Models for Combinatorial Optimization of Design Structure Matrix | Shuo Jiang et.al. | 2411.12571 | null |
2024-11-19 | Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Riccardo Grazzi et.al. | 2411.12537 | link |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-19 | Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Terufumi Morishita et.al. | 2411.12498 | link |
2024-11-19 | AI Flow at the Network Edge | Jiawei Shao et.al. | 2411.12469 | null |
2024-11-19 | Guide-to-Explain for Controllable Summarization | Sangwon Ryu et.al. | 2411.12460 | null |
2024-11-19 | \textsc{Neon}: News Entity-Interaction Extraction for Enhanced Question Answering | Sneha Singhania et.al. | 2411.12449 | null |
2024-11-18 | Bi-Mamba: Towards Accurate 1-Bit State Space Models | Shengkun Tang et.al. | 2411.11843 | null |
2024-11-18 | Tackling prediction tasks in relational databases with LLMs | Marek Wydmuch et.al. | 2411.11829 | null |
2024-11-18 | Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev et.al. | 2411.11795 | null |
2024-11-18 | LLM-IE: A Python Package for Generative Information Extraction with Large Language Models | Enshuo Hsu et.al. | 2411.11779 | null |
2024-11-18 | sMoRe: Enhancing Object Manipulation and Organization in Mixed Reality Spaces with LLMs and Generative AI | Yunhao Xing et.al. | 2411.11752 | null |
2024-11-18 | BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration | Yuzong Chen et.al. | 2411.11745 | link |
2024-11-18 | Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment | Allison Huang et.al. | 2411.11731 | link |
2024-11-18 | Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Mingchao Qi et.al. | 2411.11714 | link |
2024-11-18 | FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models | Tao Fan et.al. | 2411.11707 | null |
2024-11-18 | MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Ruichuan An et.al. | 2411.11706 | link |
2024-11-18 | Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search | Jinhao Jiang et.al. | 2411.11694 | null |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment | Jiawei Li et.al. | 2411.11681 | link |
2024-11-18 | Dissecting Misalignment of Multimodal Large Language Models via Influence Function | Lijie Hu et.al. | 2411.11667 | null |
2024-11-18 | TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Mengxuan Li et.al. | 2411.11641 | link |
2024-11-18 | Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare | Leon Kopitar et.al. | 2411.11635 | null |
2024-11-18 | Signaling and Social Learning in Swarms of Robots | Leo Cazenille et.al. | 2411.11616 | null |
2024-11-18 | Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining | Danny Barash et.al. | 2411.11613 | null |
2024-11-18 | VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation | Bangguo Yu et.al. | 2411.11609 | null |
2024-11-18 | Exploring LLMs for Verifying Technical System Specifications Against Requirements | Lasse M. Reinpold et.al. | 2411.11582 | null |
2024-11-15 | VeriGraph: Scene Graphs for Execution Verifiable Robot Planning | Daniel Ekpo et.al. | 2411.10446 | null |
2024-11-15 | Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization | Weiyun Wang et.al. | 2411.10442 | null |
2024-11-15 | LLaVA-o1: Let Vision Language Models Reason Step-by-Step | Guowei Xu et.al. | 2411.10440 | link |
2024-11-15 | MARS: Unleashing the Power of Variance Reduction for Training Large Models | Huizhuo Yuan et.al. | 2411.10438 | link |
2024-11-15 | Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Yuhan Fu et.al. | 2411.10436 | null |
2024-11-15 | Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Parsa Hejabi et.al. | 2411.10422 | link |
2024-11-15 | On the Foundation Model for Cardiac MRI Reconstruction | Chi Zhang et.al. | 2411.10403 | null |
2024-11-15 | Interactive Cycle Model – The Linkage Combination among Automatic Speech Recognition, Large Language Models and Smart Glasses | Libo Wang et.al. | 2411.10362 | null |
2024-11-15 | Bias Unveiled: Investigating Social Bias in LLM-Generated Code | Lin Ling et.al. | 2411.10351 | null |
2024-11-15 | Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images | Ammar Qammaz et.al. | 2411.10334 | null |
2024-11-15 | Number it: Temporal Grounding Videos like Flipping Manga | Yongliang Wu et.al. | 2411.10332 | link |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | Static network structure cannot stabilize cooperation among Large Language Model agents | Jin Han et.al. | 2411.10294 | null |
2024-11-15 | Scaling Law for Post-training after Model Pruning | Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-15 | Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Jingru Yang et.al. | 2411.10252 | null |
2024-11-15 | Measuring Non-Adversarial Reproduction of Training Data in Large Language Models | Michael Aerni et.al. | 2411.10242 | null |
2024-11-15 | Generative AI in Multimodal User Interfaces: Trends, Challenges, and Cross-Platform Adaptability | J. Bieniek et.al. | 2411.10234 | null |
2024-11-15 | An Empirical Study on LLM-based Agents for Automated Bug Fixing | Xiangxin Meng et.al. | 2411.10213 | null |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | CART: Compositional Auto-Regressive Transformer for Image Generation | Siddharth Roheda et.al. | 2411.10180 | null |
2024-11-14 | MagicQuill: An Intelligent Interactive Image Editing System | Zichen Liu et.al. | 2411.09703 | null |
2024-11-14 | Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models | Wei Wang et.al. | 2411.09691 | null |
2024-11-14 | Squeezed Attention: Accelerating Long Context Length LLM Inference | Coleman Hooper et.al. | 2411.09688 | link |
2024-11-14 | Adaptive Decoding via Latent Preference Optimization | Shehzaad Dhuliawala et.al. | 2411.09661 | null |
2024-11-14 | On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse | Alkis Kalavasis et.al. | 2411.09642 | null |
2024-11-14 | Local deployment of large-scale music AI models on commodity hardware | Xun Zhou et.al. | 2411.09625 | null |
2024-11-14 | PTR: Precision-Driven Tool Recommendation for Large Language Models | Hang Gao et.al. | 2411.09613 | null |
2024-11-14 | The Moral Foundations Weibo Corpus | Renjie Cao et.al. | 2411.09612 | null |
2024-11-14 | Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework | Ronak Pradeep et.al. | 2411.09607 | null |
2024-11-14 | Accelerating Knowledge Graph and Ontology Engineering with Large Language Models | Cogan Shimizu et.al. | 2411.09601 | null |
2024-11-14 | Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images | Bipasha Kundu et.al. | 2411.09598 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2024-11-14 | Adopting RAG for LLM-Aided Future Vehicle Design | Vahid Zolfaghari et.al. | 2411.09590 | null |
2024-11-14 | BabyLM Challenge: Exploring the Effect of Variation Sets on Language Model Training Efficiency | Akari Haga et.al. | 2411.09587 | null |
2024-11-14 | Software Performance Engineering for Foundation Model-Powered Software (FMware) | Haoxiang Zhang et.al. | 2411.09580 | null |
2024-11-14 | Piecing It All Together: Verifying Multi-Hop Multimodal Claims | Haoran Wang et.al. | 2411.09547 | null |
2024-11-14 | A Practical Guide to Fine-tuning Language Models with Limited Data | Márton Szép et.al. | 2411.09539 | null |
2024-11-14 | Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents | Yuyou Gan et.al. | 2411.09523 | null |
2024-11-14 | Communication Compression for Tensor Parallel LLM Inference | Jan Hansen-Palmus et.al. | 2411.09510 | null |
2024-11-14 | Spider: Any-to-Many Multimodal LLM | Jinxiang Lai et.al. | 2411.09439 | null |
2024-11-13 | Large Wireless Model (LWM): A Foundation Model for Wireless Channels | Sadjad Alikhani et.al. | 2411.08872 | link |
2024-11-13 | The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Daniel P. Jeong et.al. | 2411.08870 | link |
2024-11-13 | CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Wissam Antoun et.al. | 2411.08868 | null |
2024-11-13 | LLMStinger: Jailbreaking LLMs using RL fine-tuned LLMs | Piyush Jha et.al. | 2411.08862 | null |
2024-11-13 | Multimodal Instruction Tuning with Hybrid State Space Models | Jianing Zhou et.al. | 2411.08840 | null |
2024-11-13 | FinRobot: AI Agent for Equity Research and Valuation with Large Language Models | Tianyu Zhou et.al. | 2411.08804 | link |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | Can sparse autoencoders be used to decompose and interpret steering vectors? | Harry Mayne et.al. | 2411.08790 | link |
2024-11-13 | Sharingan: Extract User Action Sequence from Desktop Recordings | Yanting Chen et.al. | 2411.08768 | null |
2024-11-13 | Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers | Clément Dumas et.al. | 2411.08745 | link |
2024-11-13 | A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models | Dingdong Wang et.al. | 2411.08742 | null |
2024-11-13 | Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla et.al. | 2411.08733 | link |
2024-11-13 | Polymetis:Large Language Modeling for Multiple Material Domains | Chao Huang et.al. | 2411.08728 | null |
2024-11-13 | Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification | Jose-Luis Matez-Bandera et.al. | 2411.08727 | link |
2024-11-13 | Theoretical Analysis of Byte-Pair Encoding | László Kozma et.al. | 2411.08671 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-13 | UniMat: Unifying Materials Embeddings through Multi-modal Learning | Janghoon Ock et.al. | 2411.08664 | null |
2024-11-13 | Accelerating Quasi-Static Time Series Simulations with Foundation Models | Alban Puech et.al. | 2411.08652 | null |
2024-11-13 | A System Level Performance Evaluation for Superconducting Digital Systems | Joyjit Kundu et.al. | 2411.08645 | null |
2024-11-13 | Towards Secure Intelligent O-RAN Architecture: Vulnerabilities, Threats and Promising Technical Solutions using LLMs | Mojdeh Karbalaee Motalleb et.al. | 2411.08640 | null |
2024-11-12 | Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data | Juanhui Li et.al. | 2411.08028 | null |
2024-11-12 | LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models | Anoop Cherian et.al. | 2411.08027 | null |
2024-11-12 | Language Models as Causal Effect Generators | Lucius E. J. Bynum et.al. | 2411.08019 | link |
2024-11-12 | ExpressivityArena: Can LLMs Express Information Implicitly? | Joshua Tint et.al. | 2411.08010 | null |
2024-11-12 | Can adversarial attacks by large language models be attributed? | Manuel Cebrian et.al. | 2411.08003 | null |
2024-11-12 | Derivational Morphology Reveals Analogical Generalization in Large Language Models | Valentin Hofmann et.al. | 2411.07990 | null |
2024-11-12 | JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation | Yiyang Ma et.al. | 2411.07975 | link |
2024-11-12 | From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents | Chuyi Kong et.al. | 2411.07965 | null |
2024-11-12 | Towards Low-bit Communication for Tensor Parallel LLM Inference | Harry Dong et.al. | 2411.07942 | null |
2024-11-12 | Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer’s Disease | Francesco Chiumento et.al. | 2411.07871 | null |
2024-11-12 | Trustful LLMs: Customizing and Grounding Text Generation with Knowledge Bases and Dual Decoders | Xiaofeng Zhu et.al. | 2411.07870 | null |
2024-11-12 | Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models | Yusen Zhang et.al. | 2411.07858 | link |
2024-11-12 | Tucano: Advancing Neural Text Generation for Portuguese | Nicholas Kluge Corrêa et.al. | 2411.07854 | link |
2024-11-12 | NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN | Sonia Raychaudhuri et.al. | 2411.07848 | null |
2024-11-12 | Chain Association-based Attacking and Shielding Natural Language Processing Systems | Jiacheng Huang et.al. | 2411.07843 | null |
2024-11-12 | FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Philip Zmushko et.al. | 2411.07837 | link |
2024-11-12 | Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices | Kilian Pfeiffer et.al. | 2411.07826 | null |
2024-11-12 | Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models | Youan Cong et.al. | 2411.07820 | null |
2024-11-12 | Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks | Tianqu Kang et.al. | 2411.07806 | null |
2024-11-12 | Likelihood as a Performance Gauge for Retrieval-Augmented Generation | Tianyu Liu et.al. | 2411.07773 | link |
2024-11-11 | UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts | Bo Yang et.al. | 2411.07240 | link |
2024-11-11 | OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Sumeth Yuenyong et.al. | 2411.07238 | null |
2024-11-11 | Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations | Chaitanya Malaviya et.al. | 2411.07237 | null |
2024-11-11 | Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving | Botao Yu et.al. | 2411.07228 | null |
2024-11-11 | TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models | Matheus Simão et.al. | 2411.07224 | null |
2024-11-11 | Comparing Bottom-Up and Top-Down Steering Approaches on In-Context Learning Tasks | Madeline Brumley et.al. | 2411.07213 | null |
2024-11-11 | General Geospatial Inference with a Population Dynamics Foundation Model | Mohit Agarwal et.al. | 2411.07207 | null |
2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | link |
2024-11-11 | The Super Weight in Large Language Models | Mengxia Yu et.al. | 2411.07191 | link |
2024-11-11 | NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics | David Robinson et.al. | 2411.07186 | null |
2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
2024-11-11 | Counterfactual Generation from Language Models | Shauli Ravfogel et.al. | 2411.07180 | link |
2024-11-11 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | link |
2024-11-11 | Continual Memorization of Factoids in Large Language Models | Howard Chen et.al. | 2411.07175 | link |
2024-11-11 | A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19 | Vedant Khandelwal et.al. | 2411.07163 | null |
2024-11-11 | Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models | Yancheng He et.al. | 2411.07140 | null |
2024-11-11 | Stronger Models are NOT Stronger Teachers for Instruction Tuning | Zhangchen Xu et.al. | 2411.07133 | null |
2024-11-11 | Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis | Taihang Hu et.al. | 2411.07132 | link |
2024-11-11 | Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation | Kaijian Zou et.al. | 2411.07130 | link |
2024-11-11 | Benchmarking LLMs’ Judgments with No Gold Standard | Shengwei Xu et.al. | 2411.07127 | link |
2024-11-08 | Recycled Attention: Efficient inference for long-context language models | Fangyuan Xu et.al. | 2411.05787 | null |
2024-11-08 | Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua et.al. | 2411.05781 | link |
2024-11-08 | Fact or Fiction? Can LLMs be Reliable Annotators for Political Truths? | Veronica Chatrath et.al. | 2411.05775 | null |
2024-11-08 | Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024 | Christopher Malon et.al. | 2411.05762 | null |
2024-11-08 | End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Dylan Goetting et.al. | 2411.05755 | link |
2024-11-08 | Aioli: A Unified Optimization Framework for Language Model Data Mixing | Mayee F. Chen et.al. | 2411.05735 | link |
2024-11-08 | Poze: Sports Technique Feedback under Data Constraints | Agamdeep Singh et.al. | 2411.05734 | null |
2024-11-08 | STARS: Sensor-agnostic Transformer Architecture for Remote Sensing | Ethan King et.al. | 2411.05714 | null |
2024-11-08 | Unmasking the Limits of Large Language Models: A Systematic Evaluation of Masked Text Processing Ability through MskQA and MskCal | Fuka Matsuzaki et.al. | 2411.05665 | link |
2024-11-08 | The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent | Leon O. H. Kroczek et.al. | 2411.05653 | null |
2024-11-08 | LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution | Yuheng Zhao et.al. | 2411.05651 | null |
2024-11-08 | Harnessing High-Level Song Descriptors towards Natural Language-Based Music Recommendation | Elena V. Epure et.al. | 2411.05649 | link |
2024-11-08 | Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Long Truong To et.al. | 2411.05641 | null |
2024-11-08 | Assessing Open-Source Large Language Models on Argumentation Mining Subtasks | Mohammad Yeghaneh Abkenar et.al. | 2411.05639 | null |
2024-11-08 | A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion Diagnosis | Cristiano Patrício et.al. | 2411.05609 | link |
2024-11-08 | Evaluating and Adapting Large Language Models to Represent Folktales in Low-Resource Languages | JA Meaney et.al. | 2411.05593 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | Training objective drives the consistency of representational similarity across datasets | Laure Ciernik et.al. | 2411.05561 | link |
2024-11-08 | AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality | Ilias Bournias et.al. | 2411.05555 | null |
2024-11-08 | Assessing the Answerability of Queries in Retrieval-Augmented Code Generation | Geonmin Kim et.al. | 2411.05547 | null |
2024-11-07 | SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | Analyzing The Language of Visual Tokens | David M. Chan et.al. | 2411.05001 | null |
2024-11-07 | Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? | Jonathan Roberts et.al. | 2411.05000 | null |
2024-11-07 | DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation | Peiqi Liu et.al. | 2411.04999 | link |
2024-11-07 | LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation | Weiquan Huang et.al. | 2411.04997 | link |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives | Hao Sun et.al. | 2411.04991 | link |
2024-11-07 | The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities | Zhaofeng Wu et.al. | 2411.04986 | null |
2024-11-07 | Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries | Dylan Manuel et.al. | 2411.04981 | null |
2024-11-07 | SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference | Gabriele Oliaro et.al. | 2411.04975 | null |
2024-11-07 | BitNet a4.8: 4-bit Activations for 1-bit LLMs | Hongyu Wang et.al. | 2411.04965 | null |
2024-11-07 | Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | Yanjun Gao et.al. | 2411.04962 | null |
2024-11-07 | CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM | Jingwei Xu et.al. | 2411.04954 | null |
2024-11-07 | M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding | Jaemin Cho et.al. | 2411.04952 | null |
2024-11-07 | A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Panwen Hu et.al. | 2411.04942 | null |
2024-11-07 | VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Shehan Munasinghe et.al. | 2411.04923 | null |
2024-11-07 | GPTKB: Building Very Large Knowledge Bases from Language Models | Yujia Hu et.al. | 2411.04920 | link |
2024-11-07 | OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models | Siming Huang et.al. | 2411.04905 | null |
2024-11-07 | In the Era of Prompt Learning with Vision-Language Models | Ankit Jha et.al. | 2411.04892 | null |
2024-11-07 | GUI Agents with Foundation Models: A Comprehensive Survey | Shuai Wang et.al. | 2411.04890 | null |
2024-11-06 | Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P. Jeong et.al. | 2411.04118 | link |
2024-11-06 | How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis | Guan Zhe Hong et.al. | 2411.04105 | null |
2024-11-06 | RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Maya Varma et.al. | 2411.04097 | link |
2024-11-06 | Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation | Ke Fan et.al. | 2411.04079 | null |
2024-11-06 | H-POPE: Hierarchical Polling-based Probing Evaluation of Hallucinations in Large Vision-Language Models | Nhi Pham et.al. | 2411.04077 | null |
2024-11-06 | M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models | Chuhan Li et.al. | 2411.04075 | null |
2024-11-06 | Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning | Ping Li et.al. | 2411.04059 | link |
2024-11-06 | Beemo: Benchmark of Expert-edited Machine-generated Outputs | Ekaterina Artemova et.al. | 2411.04032 | null |
2024-11-06 | Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages | Aniket Deroy et.al. | 2411.04025 | null |
2024-11-06 | Select2Plan: Training-Free ICL-Based Planning through VQA and Memory Retrieval | Davide Buoso et.al. | 2411.04006 | null |
2024-11-06 | Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning | Jiawei Yao et.al. | 2411.03978 | link |
2024-11-06 | What Really is Commonsense Knowledge? | Quyet V. Do et.al. | 2411.03964 | null |
2024-11-06 | How Does A Text Preprocessing Pipeline Affect Ontology Syntactic Matching? | Zhangcheng Qiang et.al. | 2411.03962 | null |
2024-11-06 | Face Reconstruction from Face Embeddings using Adapter to a Face Foundation Model | Hatef Otroshi Shahreza et.al. | 2411.03960 | null |
2024-11-06 | Fine-Grained Guidance for Retrievers: Leveraging LLMs’ Feedback in Retrieval-Augmented Generation | Yuhang Liu et.al. | 2411.03957 | null |
2024-11-06 | Long-Form Text-to-Music Generation with Adaptive Prompts: A Case of Study in Tabletop Role-Playing Games Soundtracks | Felipe Marra et.al. | 2411.03948 | null |
2024-11-06 | Interactions Across Blocks in Post-Training Quantization of Large Language Models | Khasmamad Shabanovi et.al. | 2411.03934 | null |
2024-11-06 | Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models | Minh Duc Bui et.al. | 2411.03888 | link |
2024-11-06 | Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models | Zhijian Zhuo et.al. | 2411.03884 | link |
2024-11-06 | MEG: Medical Knowledge-Augmented Large Language Models for Question Answering | Laura Cabello et.al. | 2411.03883 | link |
2024-11-05 | Inference Optimal VLMs Need Only One Visual Token but Larger Models | Kevin Y. Li et.al. | 2411.03312 | link |
2024-11-05 | LLMs for Domain Generation Algorithm Detection | Reynier Leyva La O et.al. | 2411.03307 | null |
2024-11-05 | VERITAS: A Unified Approach to Reliability Evaluation | Rajkumar Ramamurthy et.al. | 2411.03300 | null |
2024-11-05 | Examining Human-AI Collaboration for Co-Writing Constructive Comments Online | Farhana Shahid et.al. | 2411.03295 | null |
2024-11-05 | Interaction2Code: How Far Are We From Automatic Interactive Webpage Generation? | Jingyu Xiao et.al. | 2411.03292 | link |
2024-11-05 | The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare | Souren Pashangpour et.al. | 2411.03287 | null |
2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-05 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou et.al. | 2411.03250 | null |
2024-11-05 | From Pen to Prompt: How Creative Writers Integrate AI into their Writing Practice | Alicia Guo et.al. | 2411.03137 | null |
2024-11-05 | “Create a Fear of Missing Out” – ChatGPT Implements Unsolicited Deceptive Designs in Generated Websites Without Warning | Veronika Krauß et.al. | 2411.03108 | null |
2024-11-05 | Utilizing Precise and Complete Code Context to Guide LLM in Automatic False Positive Mitigation | Jinbao Chen et.al. | 2411.03079 | null |
2024-11-05 | Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Bei Li et.al. | 2411.03042 | null |
2024-11-05 | HumanVLM: Foundation for Human-Scene Vision-Language Model | Dawei Dai et.al. | 2411.03034 | null |
2024-11-05 | Leveraging Large Language Models in Code Question Answering: Baselines and Issues | Georgy Andryushchenko et.al. | 2411.03012 | link |
2024-11-05 | Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Samuel Lee et.al. | 2411.03004 | null |
2024-11-05 | Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation | Junchen Fu et.al. | 2411.02992 | null |
2024-11-05 | Growing a Tail: Increasing Output Diversity in Large Language Models | Michal Shur-Ofry et.al. | 2411.02989 | null |
2024-11-05 | [Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Maren Pielka et.al. | 2411.02973 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | link |
2024-11-04 | Adaptive Length Image Tokenization via Recurrent Allocation | Shivam Duggal et.al. | 2411.02393 | link |
2024-11-04 | Attacking Vision-Language Computer Agents via Pop-ups | Yanzhe Zhang et.al. | 2411.02391 | link |
2024-11-04 | Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models | Guangzhi Xiong et.al. | 2411.02382 | null |
2024-11-04 | Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI | Ramneet Kaur et.al. | 2411.02381 | null |
2024-11-04 | Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis | Neel Dey et.al. | 2411.02372 | link |
2024-11-04 | DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Yang Yue et.al. | 2411.02359 | link |
2024-11-04 | “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization | Eldar Kurtic et.al. | 2411.02355 | null |
2024-11-04 | Machine learning identification of maternal inflammatory response and histologic choroamnionitis from placental membrane whole slide images | Abhishek Sharma et.al. | 2411.02354 | null |
2024-11-04 | Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences | Ruotong Wang et.al. | 2411.02353 | null |
2024-11-04 | Can Large Language Models generalize analogy solving like people can? | Claire E. Stevenson et.al. | 2411.02348 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | Sparsing Law: Towards Large Language Models with Greater Activation Sparsity | Yuqi Luo et.al. | 2411.02335 | link |
2024-11-04 | Disrupting Test Development with AI Assistants | Vijay Joshi et.al. | 2411.02328 | null |
2024-11-04 | PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance | Ruyang Liu et.al. | 2411.02327 | link |
2024-11-04 | An Empirical Study on the Code Refactoring Capability of Large Language Models | Jonathan Cordeiro et.al. | 2411.02320 | null |
2024-11-04 | Evaluating the Ability of Large Language Models to Generate Verifiable Specifications in VeriFast | Marilyn Rego et.al. | 2411.02318 | null |
2024-11-04 | Defining and Evaluating Physical Safety for Large Language Models | Yung-Chen Tang et.al. | 2411.02317 | null |
2024-11-04 | Evaluating Creative Short Story Generation in Humans and Large Language Models | Mete Ismayilzada et.al. | 2411.02316 | link |
2024-11-04 | Taking AI Welfare Seriously | Robert Long et.al. | 2411.00986 | null |
2024-10-31 | P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation | Mohamed Elgaar et.al. | 2410.24201 | null |
2024-11-01 | SelfCodeAlign: Self-Alignment for Code Generation | Yuxiang Wei et.al. | 2410.24198 | link |
2024-10-31 | DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | Heng-Jui Chang et.al. | 2410.24177 | null |
2024-10-31 | Constraint Back-translation Improves Complex Instruction Following of Large Language Models | Yunjia Qi et.al. | 2410.24175 | null |
2024-10-31 | $π_0$ : A Vision-Language-Action Flow Model for General Robot Control | Kevin Black et.al. | 2410.24164 | null |
2024-10-31 | GPT or BERT: why not both? | Lucas Georges Gabriel Charpentier et.al. | 2410.24159 | link |
2024-10-31 | Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | Jinghan Zhang et.al. | 2410.24155 | null |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | Exploring Vision Language Models for Facial Attribute Recognition: Emotion, Race, Gender, and Age | Nouar AlDahoul et.al. | 2410.24148 | null |
2024-10-31 | Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing | Akash Dhruv et.al. | 2410.24119 | link |
2024-10-31 | Repository-Level Compositional Code Translation and Validation | Ali Reza Ibrahimzada et.al. | 2410.24117 | link |
2024-10-31 | Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | Nabeel Seedat et.al. | 2410.24105 | null |
2024-10-31 | Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning | Nabil Omi et.al. | 2410.24096 | null |
2024-10-31 | In-Context Fine-Tuning for Time-Series Foundation Models | Abhimanyu Das et.al. | 2410.24087 | null |
2024-10-31 | Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs | Muhammed Saeed et.al. | 2410.24049 | null |
2024-10-31 | Handwriting Recognition in Historical Documents with Multimodal LLM | Lucian Li et.al. | 2410.24034 | null |
2024-10-31 | Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks | Yingzhe Peng et.al. | 2410.24032 | null |
2024-10-31 | AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Yifan Xu et.al. | 2410.24024 | link |
2024-10-31 | SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation | Liang He et.al. | 2410.24022 | null |
2024-10-31 | Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Ioannis Tsiamas et.al. | 2410.24019 | null |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287 | null |
2024-10-30 | A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction | Qidong Yang et.al. | 2410.23272 | null |
2024-10-30 | TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models | Ziyao Shangguan et.al. | 2410.23266 | link |
2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | Keypoint Abstraction using Large Models for Object-Relative Imitation Learning | Xiaolin Fang et.al. | 2410.23254 | null |
2024-10-30 | Evaluating Cultural and Social Awareness of LLM Web Agents | Haoyi Qiu et.al. | 2410.23252 | null |
2024-10-30 | Carrot and Stick: Eliciting Comparison Data and Beyond | Yiling Chen et.al. | 2410.23243 | null |
2024-10-30 | A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf et.al. | 2410.23242 | link |
2024-10-30 | EMOTION: Expressive Motion Sequence Generation for Humanoid Robots with In-Context Learning | Peide Huang et.al. | 2410.23234 | null |
2024-10-30 | COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences | Yixin Liu et.al. | 2410.23223 | link |
2024-10-30 | Partial Channel Dependence with Channel Masks for Time Series Foundation Models | Seunghan Lee et.al. | 2410.23222 | null |
2024-10-30 | OS-ATLAS: A Foundation Action Model for Generalist GUI Agents | Zhiyong Wu et.al. | 2410.23218 | link |
2024-10-31 | Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval | Sheryl Hsu et.al. | 2410.23214 | null |
2024-10-30 | ProTransformer: Robustify Transformers via Plug-and-Play Paradigm | Zhichao Hou et.al. | 2410.23182 | null |
2024-10-30 | ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning | Millennium Bismay et.al. | 2410.23180 | link |
2024-10-30 | TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Haiyang Wang et.al. | 2410.23168 | link |
2024-10-30 | SciPIP: An LLM-based Scientific Paper Idea Proposer | Wenxiao Wang et.al. | 2410.23166 | link |
2024-10-30 | FlexTSF: A Universal Forecasting Model for Time Series with Variable Regularities | Jingge Xiao et.al. | 2410.23160 | link |
2024-10-30 | VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | Yichao Liang et.al. | 2410.23156 | null |
2024-10-30 | Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms | Jordan Meyer et.al. | 2410.23144 | null |
2024-10-29 | Local Policies Enable Zero-shot Long-horizon Manipulation | Murtaza Dalal et.al. | 2410.22332 | null |
2024-10-29 | Task Vectors are Cross-Modal | Grace Luo et.al. | 2410.22330 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI’s Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323 | null |
2024-10-29 | Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Can Chen et.al. | 2410.22318 | link |
2024-10-29 | Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier | Kai Wang et.al. | 2410.22317 | link |
2024-10-29 | Natural Language Inference Improves Compositionality in Vision-Language Models | Paola Cascante-Bonilla et.al. | 2410.22315 | null |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-30 | GPT-4o reads the mind in the eyes | James W. A. Strachan et.al. | 2410.22309 | null |
2024-10-29 | SVIP: Towards Verifiable Inference of Open-source Large Language Models | Yifan Sun et.al. | 2410.22307 | null |
2024-10-29 | Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning | Yihe Deng et.al. | 2410.22304 | null |
2024-10-29 | LLMs are Highly-Constrained Biophysical Sequence Optimizers | Angelica Chen et.al. | 2410.22296 | null |
2024-10-29 | Fine-Tuning LLMs for Code Mutation: A New Era of Cyber Threats | Mohammad Setak et.al. | 2410.22293 | null |
2024-10-29 | From melodic note sequences to pitches using word2vec | Daniel Defays et.al. | 2410.22285 | null |
2024-10-29 | Embedding-based classifiers can detect prompt injection attacks | Md. Ahsan Ayub et.al. | 2410.22284 | link |
2024-10-29 | Whose ChatGPT? Unveiling Real-World Educational Inequalities Introduced by Large Language Models | Renzhe Yu et.al. | 2410.22282 | null |
2024-10-29 | Fourier Head: Helping Large Language Models Learn Complex Probability Distributions | Nate Gillman et.al. | 2410.22269 | null |
2024-10-29 | Meta-Learning Adaptable Foundation Models | Jacob L. Block et.al. | 2410.22264 | null |
2024-10-29 | FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | Farima Fatahi Bayat et.al. | 2410.22257 | null |
2024-10-29 | Abrupt Learning in Transformers: A Case Study on Matrix Completion | Pulkit Gopalani et.al. | 2410.22244 | null |
2024-10-29 | Are Decoder-Only Large Language Models the Silver Bullet for Code Search? | Yuxuan Chen et.al. | 2410.22240 | link |
2024-10-28 | Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Yaniv Nikankin et.al. | 2410.21272 | link |
2024-10-28 | LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior | Hanyu Wang et.al. | 2410.21264 | null |
2024-10-28 | BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference | Changwoo Lee et.al. | 2410.21262 | link |
2024-10-29 | AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Han Bao et.al. | 2410.21259 | link |
2024-10-28 | Multi-modal AI for comprehensive breast cancer prognostication | Jan Witowski et.al. | 2410.21256 | null |
2024-10-28 | LongReward: Improving Long-context Large Language Models with AI Feedback | Jiajie Zhang et.al. | 2410.21252 | link |
2024-10-28 | Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback | Nour Jedidi et.al. | 2410.21242 | null |
2024-10-28 | Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Zhantao Yang et.al. | 2410.21237 | null |
2024-10-28 | Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Weizhe Chen et.al. | 2410.21236 | null |
2024-10-28 | LoRA vs Full Fine-tuning: An Illusion of Equivalence | Reece Shuttleworth et.al. | 2410.21228 | null |
2024-10-28 | Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines | Zhixin Zhang et.al. | 2410.21220 | link |
2024-10-28 | Lifting the Veil on the Large Language Model Supply Chain: Composition, Risks, and Mitigations | Kaifeng Huang et.al. | 2410.21218 | null |
2024-10-28 | BongLLaMA: LLaMA for Bangla Language | Abdullah Khan Zehady et.al. | 2410.21200 | null |
2024-10-28 | Belief in the Machine: Investigating Epistemological Blind Spots of Language Models | Mirac Suzgun et.al. | 2410.21195 | link |
2024-10-29 | Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction | Qintong Zhang et.al. | 2410.21169 | null |
2024-10-28 | M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation | Jiaheng Liu et.al. | 2410.21157 | null |
2024-10-28 | Palisade – Prompt Injection Detection Framework | Sahasra Kokkula et.al. | 2410.21146 | null |
2024-10-28 | LLM-initialized Differentiable Causal Discovery | Shiv Kampani et.al. | 2410.21141 | null |
2024-10-28 | Do LLMs generate test oracles that capture the actual or the expected program behaviour? | Michael Konstantinou et.al. | 2410.21136 | null |
2024-10-28 | Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments | Marharyta Domnich et.al. | 2410.21131 | null |
2024-10-25 | The Potential and Value of AI Chatbot in Personalized Cognitive Training | Zilong Wang et.al. | 2410.19733 | null |
2024-10-25 | Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models | Yucheng Zhou et.al. | 2410.19732 | null |
2024-10-25 | Counting Ability of Large Language Models and Impact of Tokenization | Xiang Zhang et.al. | 2410.19730 | link |
2024-10-25 | FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Nicole Cho et.al. | 2410.19727 | null |
2024-10-25 | 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision | Shilong Li et.al. | 2410.19720 | null |
2024-10-25 | Multi-view biomedical foundation models for molecule-target and property prediction | Parthasarathy Suryanarayanan et.al. | 2410.19704 | link |
2024-10-25 | TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning | Xiangyu Zeng et.al. | 2410.19702 | null |
2024-10-25 | IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Kaixian Qu et.al. | 2410.19697 | null |
2024-10-25 | Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs | Yifei Zhang et.al. | 2410.19694 | null |
2024-10-25 | APRICOT: Active Preference Learning and Constraint-Aware Task Planning with LLMs | Huaxiaoyue Wang et.al. | 2410.19656 | null |
2024-10-25 | Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models | Shenghao Fu et.al. | 2410.19635 | null |
2024-10-25 | Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina | Yuan Gao et.al. | 2410.19599 | null |
2024-10-25 | Diverse Sign Language Translation | Xin Shen et.al. | 2410.19586 | link |
2024-10-25 | ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems | Ritvik Aggarwal Ishneet Sukhvinder Singh Ibrahim Allahverdiyev et.al. | 2410.19572 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-25 | Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad? | Antonia Wüst et.al. | 2410.19546 | link |
2024-10-25 | Brain-like Functional Organization within Large Language Models | H. Sun et.al. | 2410.19542 | null |
2024-10-25 | Detection of Human and Machine-Authored Fake News in Urdu | Muhammad Zain Ali et.al. | 2410.19517 | link |
2024-10-25 | SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models | Jahyun Koo et.al. | 2410.19503 | null |
2024-10-25 | Introducing MAPO: Momentum-Aided Gradient Descent Prompt Optimization | Anthony Cui et.al. | 2410.19499 | null |
2024-10-24 | Unbounded: A Generative Infinite Game of Character Life Simulation | Jialu Li et.al. | 2410.18975 | null |
2024-10-24 | Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques | David Ortiz-Perez et.al. | 2410.18972 | null |
2024-10-24 | ConceptDrift: Uncovering Biases through the Lens of Foundational Models | Cristian Daniel Păduraru et.al. | 2410.18970 | null |
2024-10-24 | Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Zhangheng Li et.al. | 2410.18967 | null |
2024-10-24 | Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions | Yujuan Fu et.al. | 2410.18966 | null |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning | Xiaoqiang Wang et.al. | 2410.18963 | null |
2024-10-24 | Context is Key: A Benchmark for Forecasting with Essential Textual Information | Andrew Robert Williams et.al. | 2410.18959 | link |
2024-10-24 | Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code | Jipeng Zhang et.al. | 2410.18957 | null |
2024-10-24 | BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning | Yujuan Velvin Fu et.al. | 2410.18955 | null |
2024-10-24 | Dynamic Vocabulary Pruning in Early-Exit LLMs | Jort Vincenti et.al. | 2410.18952 | link |
2024-10-24 | SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models | Zonghao Ying et.al. | 2410.18927 | null |
2024-10-24 | From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity on Faulty Mathematical Problems | A M Muntasir Rahman et.al. | 2410.18921 | null |
2024-10-25 | A Survey on Speech Large Language Models | Jing Peng et.al. | 2410.18908 | null |
2024-10-24 | PRISM: A Methodology for Auditing Biases in Large Language Models | Leif Azzopardi et.al. | 2410.18906 | link |
2024-10-24 | LLMs for Extremely Low-Resource Finno-Ugric Languages | Taido Purason et.al. | 2410.18902 | null |
2024-10-24 | Creating and Repairing Robot Programs in Open-World Domains | Claire Schlesinger et.al. | 2410.18893 | null |
2024-10-24 | Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Graziano A. Manduzio et.al. | 2410.18890 | null |
2024-10-24 | Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance | Omer Nahum et.al. | 2410.18889 | null |
2024-10-24 | Provably Robust Watermarks for Open-Source Language Models | Miranda Christ et.al. | 2410.18861 | null |
2024-10-23 | TP-Eval: Tap Multimodal LLMs’ Potential in Evaluation by Customizing Prompts | Yuxuan Xie et.al. | 2410.18071 | null |
2024-10-23 | CLEAR: Character Unlearning in Textual and Visual Modalities | Alexey Dontsov et.al. | 2410.18057 | null |
2024-10-23 | LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao et.al. | 2410.18050 | link |
2024-10-23 | Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases | Anna Glazkova et.al. | 2410.18040 | null |
2024-10-23 | MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Jingfan Zhang et.al. | 2410.18035 | null |
2024-10-23 | GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Xin Li et.al. | 2410.18032 | link |
2024-10-23 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-23 | Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and Validation | Suho Kang et.al. | 2410.18001 | link |
2024-10-23 | MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers | Zebin Yang et.al. | 2410.17957 | null |
2024-10-23 | ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Xin He et.al. | 2410.17954 | null |
2024-10-23 | SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains | Ran Xu et.al. | 2410.17952 | null |
2024-10-23 | Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling | Nirav Bhan et.al. | 2410.17950 | null |
2024-10-23 | Toward path-invariant embeddings for local distance source characterization | Lisa Linville et.al. | 2410.17937 | null |
2024-10-23 | Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models | He Cao et.al. | 2410.17922 | link |
2024-10-23 | Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Shansan Gong et.al. | 2410.17891 | link |
2024-10-23 | R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models | Linger Deng et.al. | 2410.17885 | link |
2024-10-23 | Lightweight Neural App Control | Filippos Christianos et.al. | 2410.17883 | null |
2024-10-23 | AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning | Yehonathan Refael et.al. | 2410.17881 | null |
2024-10-23 | Understanding Layer Significance in LLM Alignment | Guangyuan Shi et.al. | 2410.17875 | null |
2024-10-23 | DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang et.al. | 2410.17859 | link |
2024-10-22 | PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction | Long Xing et.al. | 2410.17247 | link |
2024-10-22 | Towards Reliable Evaluation of Behavior Steering Interventions in LLMs | Itamar Pres et.al. | 2410.17245 | null |
2024-10-22 | Frontiers in Intelligent Colonoscopy | Ge-Peng Ji et.al. | 2410.17241 | link |
2024-10-22 | Large Language Models Empowered Personalized Web Agents | Hongru Cai et.al. | 2410.17236 | null |
2024-10-22 | Automated Spinal MRI Labelling from Reports Using a Large Language Model | Robin Y. Park et.al. | 2410.17235 | link |
2024-10-22 | Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy | Benedict Aaron Tjandra et.al. | 2410.17234 | null |
2024-10-22 | Few-shot In-Context Preference Learning Using Large Language Models | Chao Yu et.al. | 2410.17233 | null |
2024-10-22 | Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods | Tsachi Blau et.al. | 2410.17222 | null |
2024-10-22 | MiniPLM: Knowledge Distillation for Pre-Training Language Models | Yuxian Gu et.al. | 2410.17215 | link |
2024-10-22 | Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling | Azmine Toushik Wasi et.al. | 2410.17210 | link |
2024-10-22 | VoiceBench: Benchmarking LLM-Based Voice Assistants | Yiming Chen et.al. | 2410.17196 | link |
2024-10-23 | Non-myopic Generation of Language Model for Reasoning and Planning | Chang Ma et.al. | 2410.17195 | link |
2024-10-22 | Remote Timing Attacks on Efficient Language Model Inference | Nicholas Carlini et.al. | 2410.17175 | null |
2024-10-22 | From Attention to Activation: Unravelling the Enigmas of Large Language Models | Prannay Kaul et.al. | 2410.17174 | null |
2024-10-22 | Self-calibration for Language Model Quantization and Pruning | Miles Williams et.al. | 2410.17170 | null |
2024-10-22 | Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence | İlker Işık et.al. | 2410.17161 | null |
2024-10-22 | Improving Pinterest Search Relevance Using Large Language Models | Han Wang et.al. | 2410.17152 | null |
2024-10-22 | Are Visual-Language Models Effective in Action Recognition? A Comparative Study | Mahmoud Ali et.al. | 2410.17149 | null |
2024-10-22 | Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ? | Jirat Chiaranaipanich et.al. | 2410.17145 | null |
2024-10-22 | Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements | Isamu Isozaki et.al. | 2410.17141 | link |
2024-10-21 | Reflection-Bench: probing AI intelligence with reflection | Lingyu Li et.al. | 2410.16270 | link |
2024-10-21 | SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree | Shuangrui Ding et.al. | 2410.16268 | link |
2024-10-21 | xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs | Michael S. Ryoo et.al. | 2410.16267 | null |
2024-10-22 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Elucidating the design space of language models for image generation | Xuantong Liu et.al. | 2410.16257 | link |
2024-10-21 | CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution | Maosong Cao et.al. | 2410.16256 | link |
2024-10-21 | Can Knowledge Editing Really Correct Hallucinations? | Baixiang Huang et.al. | 2410.16251 | link |
2024-10-21 | Analyzing Context Contributions in LLM-based Machine Translation | Emmanouil Zaranis et.al. | 2410.16246 | null |
2024-10-21 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao et.al. | 2410.16237 | null |
2024-10-21 | LLaVA-KD: A Framework of Distilling Multimodal Large Language Models | Yuxuan Cai et.al. | 2410.16236 | link |
2024-10-21 | ToW: Thoughts of Words Improve Reasoning in Large Language Models | Zhikun Xu et.al. | 2410.16235 | null |
2024-10-21 | Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | Ryan Li et.al. | 2410.16232 | null |
2024-10-21 | Building A Coding Assistant via the Retrieval-Augmented Language Model | Xinze Li et.al. | 2410.16229 | link |
2024-10-21 | A Realistic Threat Model for Large Language Model Jailbreaks | Valentyn Boreiko et.al. | 2410.16222 | link |
2024-10-21 | Pre-training Distillation for Large Language Models: A Design Space Exploration | Hao Peng et.al. | 2410.16215 | null |
2024-10-21 | Comprehensive benchmarking of large language models for RNA secondary structure prediction | L. I. Zablocki et.al. | 2410.16212 | link |
2024-10-21 | CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning | Kumar Manas et.al. | 2410.16207 | null |
2024-10-21 | Improve Vision Language Model Chain-of-thought Reasoning | Ruohong Zhang et.al. | 2410.16198 | link |
2024-10-22 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Contamination Report for Multilingual Benchmarks | Sanchit Ahuja et.al. | 2410.16186 | null |
2024-10-18 | Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts | German Gritsai et.al. | 2410.14677 | null |
2024-10-18 | SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment | Qin Liu et.al. | 2410.14676 | null |
2024-10-18 | Enhancing Large Language Models’ Situated Faithfulness to External Contexts | Yukun Huang et.al. | 2410.14675 | link |
2024-10-18 | Decomposing The Dark Matter of Sparse Autoencoders | Joshua Engels et.al. | 2410.14670 | link |
2024-10-18 | NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Baiqi Li et.al. | 2410.14669 | null |
2024-10-18 | MiCEval: Unveiling Multimodal Chain of Thought’s Quality via Image Description and Reasoning Steps | Xiongtao Zhou et.al. | 2410.14668 | link |
2024-10-18 | A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning | Shengjie Sun et.al. | 2410.14660 | null |
2024-10-18 | Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Zhepeng Cen et.al. | 2410.14655 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-10-18 | Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs | Runchu Tian et.al. | 2410.14641 | link |
2024-10-18 | GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Raghuveer Thirukovalluru et.al. | 2410.14635 | link |
2024-10-18 | Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning | Yuxiang Lu et.al. | 2410.14633 | null |
2024-10-18 | On the Regularization of Learnable Embeddings for Time Series Processing | Luca Butera et.al. | 2410.14630 | null |
2024-10-18 | CELI: Controller-Embedded Language Model Interactions | Jan-Samuel Wagner et.al. | 2410.14627 | null |
2024-10-18 | DiSCo Meets LLMs: A Unified Approach for Sparse Retrieval and Contextual Distillation in Conversational Search | Simon Lupart et.al. | 2410.14609 | null |
2024-10-18 | Teaching Models to Balance Resisting and Accepting Persuasion | Elias Stengel-Eskin et.al. | 2410.14596 | link |
2024-10-18 | Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets | Namid R. Stillman et.al. | 2410.14587 | null |
2024-10-18 | Do LLMs estimate uncertainty well in instruction-following? | Juyeon Heo et.al. | 2410.14582 | null |
2024-10-18 | Large Language Models Are Overparameterized Text Encoders | Thennal D K et.al. | 2410.14578 | null |
2024-10-18 | MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts | Rachel S. Y. Teo et.al. | 2410.14574 | link |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-17 | PUMA: Empowering Unified MLLM with Multi-granular Visual Generation | Rongyao Fang et.al. | 2410.13861 | link |
2024-10-17 | VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Runsen Xu et.al. | 2410.13860 | link |
2024-10-17 | $γ-$ MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models | Yaxin Luo et.al. | 2410.13859 | null |
2024-10-17 | How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs | Guhao Feng et.al. | 2410.13857 | null |
2024-10-17 | Can MLLMs Understand the Deep Implication Behind Chinese Images? | Chenhao Zhang et.al. | 2410.13854 | link |
2024-10-17 | Retrospective Learning from Interactions | Zizhao Chen et.al. | 2410.13852 | null |
2024-10-17 | Differentiable Robot Rendering | Ruoshi Liu et.al. | 2410.13851 | null |
2024-10-17 | SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction | Xuan Zhang et.al. | 2410.13846 | link |
2024-10-17 | A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models | Qiaoyu Tang et.al. | 2410.13841 | null |
2024-10-17 | Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs | Tianyu Guo et.al. | 2410.13835 | link |
2024-10-17 | A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement | Hui Yuan et.al. | 2410.13828 | link |
2024-10-17 | Unearthing Skill-Level Insights for Understanding Trade-Offs of Foundation Models | Mazda Moayeri et.al. | 2410.13826 | null |
2024-10-17 | AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents | Ke Yang et.al. | 2410.13825 | null |
2024-10-18 | Harnessing Webpage UIs for Text-Rich Visual Understanding | Junpeng Liu et.al. | 2410.13824 | null |
2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | link |
2024-10-17 | Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance | Mitsuhiko Nakamoto et.al. | 2410.13816 | null |
2024-10-17 | De-mark: Watermark Removal in Large Language Models | Ruibo Chen et.al. | 2410.13808 | null |
2024-10-17 | A Watermark for Order-Agnostic Language Models | Ruibo Chen et.al. | 2410.13805 | null |
2024-10-18 | BenTo: Benchmark Task Reduction with In-Context Transferability | Hongyu Zhao et.al. | 2410.13804 | link |
2024-10-16 | Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models | Ce Zhang et.al. | 2410.12790 | link |
2024-10-16 | Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception | Jihao Zhao et.al. | 2410.12788 | link |
2024-10-16 | In-Context Learning Enables Robot Action Prediction in LLMs | Yida Yin et.al. | 2410.12782 | null |
2024-10-16 | Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information | Yingya Li et.al. | 2410.12774 | null |
2024-10-16 | Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions | Zhenyu Jiang et.al. | 2410.12773 | null |
2024-10-16 | Towards Zero-Shot Camera Trap Image Categorization | Jiří Vyskočil et.al. | 2410.12769 | null |
2024-10-16 | The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse | Ekansh Sharma et.al. | 2410.12766 | null |
2024-10-16 | StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples | Ajay Patel et.al. | 2410.12757 | null |
2024-10-17 | CREAM: Consistency Regularized Self-Rewarding Language Models | Zhaoyang Wang et.al. | 2410.12735 | null |
2024-10-16 | WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation | João Matos et.al. | 2410.12722 | link |
2024-10-16 | FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression | Zhenheng Tang et.al. | 2410.12707 | null |
2024-10-16 | WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines | Genta Indra Winata et.al. | 2410.12705 | link |
2024-10-16 | Sarcasm Detection in a Less-Resourced Language | Lazar Đoković et.al. | 2410.12704 | link |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Lingxiao Luo et.al. | 2410.12694 | link |
2024-10-16 | Automatic Mapping of Anatomical Landmarks from Free-Text Using Large Language Models: Insights from Llama-2 | Mohamad Abdi et.al. | 2410.12686 | null |
2024-10-16 | 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation | Dewei Zhou et.al. | 2410.12669 | null |
2024-10-16 | Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models | Shicheng Xu et.al. | 2410.12662 | null |
2024-10-16 | Evaluating Morphological Compositional Generalization in Large Language Models | Mete Ismayilzada et.al. | 2410.12656 | null |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-15 | GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Fei Tang et.al. | 2410.11841 | link |
2024-10-15 | A Hitchhiker’s Guide to Scaling Law Estimation | Leshem Choshen et.al. | 2410.11840 | link |
2024-10-15 | MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding | Yue Cao et.al. | 2410.11829 | link |
2024-10-15 | Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws | Yiding Jiang et.al. | 2410.11820 | link |
2024-10-15 | Improving Long-Text Alignment for Text-to-Image Diffusion Models | Luping Liu et.al. | 2410.11817 | link |
2024-10-15 | SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Zhiyuan Zhang et.al. | 2410.11815 | null |
2024-10-15 | NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models | Han Han et.al. | 2410.11805 | null |
2024-10-15 | FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting | Zhe Li et.al. | 2410.11802 | null |
2024-10-15 | Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability | Tsz Ting Chung et.al. | 2410.11786 | null |
2024-10-15 | Latent BKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty | Joey Wilson et.al. | 2410.11783 | link |
2024-10-15 | G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks | Guibin Zhang et.al. | 2410.11782 | null |
2024-10-15 | Language Models Encode Numbers Using Digit Representations in Base 10 | Amit Arnold Levy et.al. | 2410.11781 | link |
2024-10-15 | MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Chenxi Wang et.al. | 2410.11779 | link |
2024-10-15 | Time-Series Foundation Model for Value-at-Risk | Anubha Goel et.al. | 2410.11773 | link |
2024-10-15 | Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models | Kai Yao et.al. | 2410.11772 | link |
2024-10-15 | SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding | Ying Chen et.al. | 2410.11761 | null |
2024-10-15 | Latent Action Pretraining from Videos | Seonghyeon Ye et.al. | 2410.11758 | null |
2024-10-15 | Personas with Attitudes: Controlling LLMs for Diverse Data Annotation | Leon Fröhling et.al. | 2410.11745 | link |
2024-10-15 | DySpec: Faster Speculative Decoding with Dynamic Token Tree Structure | Yunfan Xiong et.al. | 2410.11744 | null |
2024-10-16 | Light-Weight Fault Tolerant Attention for Large Language Model Training | Yuhang Liang et.al. | 2410.11720 | null |
2024-10-14 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Guangxuan Xiao et.al. | 2410.10819 | link |
2024-10-14 | Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Ziyue Li et.al. | 2410.10814 | link |
2024-10-14 | LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Di Wu et.al. | 2410.10813 | link |
2024-10-14 | Local and Global Decoding in Text Generation | Daniel Gareev et.al. | 2410.10810 | link |
2024-10-14 | Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning | Aakanksha et.al. | 2410.10801 | null |
2024-10-14 | Towards Foundation Models for 3D Vision: How Close Are We? | Yiming Zuo et.al. | 2410.10799 | null |
2024-10-15 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling | Jian Yang et.al. | 2410.10798 | null |
2024-10-14 | Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance | Sachin Goyal et.al. | 2410.10796 | link |
2024-10-15 | LiveXiv – A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Nimrod Shabtay et.al. | 2410.10783 | link |
2024-10-14 | When Attention Sink Emerges in Language Models: An Empirical View | Xiangming Gu et.al. | 2410.10781 | link |
2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null |
2024-10-14 | AFlow: Automating Agentic Workflow Generation | Jiayi Zhang et.al. | 2410.10762 | link |
2024-10-14 | Denial-of-Service Poisoning Attacks against Large Language Models | Kuofeng Gao et.al. | 2410.10760 | link |
2024-10-14 | SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | Akrit Mudvari et.al. | 2410.10759 | null |
2024-10-14 | Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification | Jan Cegin et.al. | 2410.10756 | link |
2024-10-14 | NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models | Yanbiao Ji et.al. | 2410.10743 | null |
2024-10-14 | SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing | Pengrui Quan et.al. | 2410.10741 | link |
2024-10-14 | Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs | Ishan Jindal et.al. | 2410.10739 | null |
2024-10-14 | Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | Kuofeng Gao et.al. | 2410.10735 | null |
2024-10-14 | Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection | Giorgos Iacovides et.al. | 2410.10728 | null |
2024-10-11 | Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models | Qin Liu et.al. | 2410.09047 | null |
2024-10-11 | AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation | Zijun Wang et.al. | 2410.09040 | link |
2024-10-11 | Semi-Supervised Learning of Noisy Mixture of Experts Models | Oh-Ran Kwon et.al. | 2410.09039 | null |
2024-10-11 | SimpleStrat: Diversifying Language Model Generation with Stratification | Justin Wong et.al. | 2410.09038 | null |
2024-10-11 | Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee et.al. | 2410.09037 | link |
2024-10-11 | PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Xiangyu Yin et.al. | 2410.09034 | link |
2024-10-11 | MedMobile: A mobile-sized language model with expert-level clinical capabilities | Krithik Vishwanath et.al. | 2410.09019 | link |
2024-10-11 | Parameter-Efficient Fine-Tuning of State Space Models | Kevin Galim et.al. | 2410.09016 | link |
2024-10-11 | The Impact of Visual Information in Chinese Characters: Evaluating Large Models’ Ability to Recognize and Utilize Radicals | Xiaofeng Wu et.al. | 2410.09013 | null |
2024-10-11 | Software Engineering and Foundation Models: Insights from Industry Blogs Using a Jury of Foundation Models | Hao Li et.al. | 2410.09012 | link |
2024-10-11 | SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights | Ling Yang et.al. | 2410.09008 | link |
2024-10-11 | From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts | Zhuohao Jerry Zhang et.al. | 2410.09006 | null |
2024-10-11 | DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Haochen Li et.al. | 2410.09004 | null |
2024-10-11 | Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference | Grace Proebsting et.al. | 2410.08996 | null |
2024-10-11 | The structure of the token space for large language models | Michael Robinson et.al. | 2410.08993 | null |
2024-10-11 | Science is Exploration: Computational Frontiers for Conceptual Metaphor Theory | Rebecca M. M. Hicke et.al. | 2410.08991 | link |
2024-10-11 | SubZero: Random Subspace Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning | Ziming Yu et.al. | 2410.08989 | link |
2024-10-11 | Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective | Bo Ni et.al. | 2410.08985 | null |
2024-10-11 | NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models | Zheng Yi Ho et.al. | 2410.08970 | null |
2024-10-11 | Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements | Jingyu Zhang et.al. | 2410.08968 | null |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | null |
2024-10-10 | Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training | Gen Luo et.al. | 2410.08202 | null |
2024-10-10 | Adam Exploits $\ell_\infty$ -geometry of Loss Landscape via Coordinate-wise Adaptivity | Shuo Xie et.al. | 2410.08198 | link |
2024-10-10 | From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Changle Qu et.al. | 2410.08197 | link |
2024-10-10 | MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code | Zimu Lu et.al. | 2410.08196 | link |
2024-10-10 | Features are fate: a theory of transfer learning in high-dimensional regression | Javan Tahir et.al. | 2410.08194 | null |
2024-10-10 | GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment | Yuancheng Xu et.al. | 2410.08193 | null |
2024-10-10 | MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models | Wenbo Hu et.al. | 2410.08182 | null |
2024-10-10 | Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models | Qingni Wang et.al. | 2410.08174 | null |
2024-10-10 | On the Evaluation of Generative Robotic Simulations | Feng Chen et.al. | 2410.08172 | null |
2024-10-10 | Visual Scratchpads: Enabling Global Reasoning in Vision | Aryo Lotfi et.al. | 2410.08165 | null |
2024-10-10 | Agent S: An Open Agentic Framework that Uses Computers Like a Human | Saaket Agashe et.al. | 2410.08164 | link |
2024-10-10 | The Effect of Surprisal on Reading Times in Information Seeking and Repeated Reading | Keren Gruteke Klein et.al. | 2410.08162 | link |
2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | null |
2024-10-10 | Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning | Amrith Setlur et.al. | 2410.08146 | null |
2024-10-10 | Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs | Xiaoyuan Liu et.al. | 2410.08145 | link |
2024-10-10 | DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Yutong Wang et.al. | 2410.08143 | link |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | Think Beyond Size: Dynamic Prompting for More Effective Reasoning | Kamesh R et.al. | 2410.08130 | null |
2024-10-10 | Mars: Situated Inductive Reasoning in an Open-World Environment | Xiaojuan Tang et.al. | 2410.08126 | null |
2024-10-09 | MM-Ego: Towards Building Egocentric Multimodal LLMs | Hanrong Ye et.al. | 2410.07177 | null |
2024-10-09 | Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models | Fei Wang et.al. | 2410.07176 | null |
2024-10-09 | Do better language models have crisper vision? | Jona Ruthardt et.al. | 2410.07173 | null |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Cheol Jun Cho et.al. | 2410.07168 | link |
2024-10-09 | Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate | Qidong Huang et.al. | 2410.07167 | link |
2024-10-09 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-09 | Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning | Chongyu Fan et.al. | 2410.07163 | link |
2024-10-09 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Bohan Zeng et.al. | 2410.07155 | link |
2024-10-09 | Towards Interpreting Visual Information Processing in Vision-Language Models | Clement Neo et.al. | 2410.07149 | link |
2024-10-09 | Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling | Yingfa Chen et.al. | 2410.07145 | null |
2024-10-09 | Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates | Xiaosen Zheng et.al. | 2410.07137 | link |
2024-10-10 | EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Rui Zhao et.al. | 2410.07133 | link |
2024-10-09 | Mental Disorders Detection in the Era of Large Language Models | Gleb Kuzmin et.al. | 2410.07129 | null |
2024-10-09 | Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy | Tagore Rao Kosireddy et.al. | 2410.07118 | link |
2024-10-09 | Personalized Visual Instruction Tuning | Renjie Pi et.al. | 2410.07113 | link |
2024-10-09 | VHELM: A Holistic Evaluation of Vision Language Models | Tony Lee et.al. | 2410.07112 | link |
2024-10-09 | I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy | Gian Maria Campedelli et.al. | 2410.07109 | link |
2024-10-09 | Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context | Sangwon Yu et.al. | 2410.07103 | null |
2024-10-09 | MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering | Jun Shern Chan et.al. | 2410.07095 | link |
2024-10-07 | Fine-Tuning CLIP’s Last Visual Projector: A Few-Shot Cornucopia | Mohammad Fahes et.al. | 2410.05270 | link |
2024-10-07 | Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models | Fei Wang et.al. | 2410.05269 | null |
2024-10-07 | PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs | Mengzhao Chen et.al. | 2410.05265 | link |
2024-10-07 | TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles | Qingchen Yu et.al. | 2410.05262 | link |
2024-10-07 | TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens | Ya-Qi Yu et.al. | 2410.05261 | null |
2024-10-07 | Differential Transformer | Tianzhu Ye et.al. | 2410.05258 | link |
2024-10-07 | GLEE: A Unified Framework and Benchmark for Language-based Economic Environments | Eilam Shapira et.al. | 2410.05254 | link |
2024-10-07 | Causal Micro-Narratives | Mourad Heddaya et.al. | 2410.05252 | null |
2024-10-07 | SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe | Yuxin Xiao et.al. | 2410.05248 | null |
2024-10-07 | Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Boyu Gou et.al. | 2410.05243 | link |
2024-10-08 | TuneVLSeg: Prompt Tuning Benchmark for Vision-Language Segmentation Models | Rabin Adhikari et.al. | 2410.05239 | link |
2024-10-07 | GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Iman Mirzadeh et.al. | 2410.05229 | null |
2024-10-07 | Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates | Avanika Narayan et.al. | 2410.05224 | null |
2024-10-07 | Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato et.al. | 2410.05222 | null |
2024-10-07 | Density estimation with LLMs: a geometric investigation of in-context learning trajectories | Toni J. B. Liu et.al. | 2410.05218 | null |
2024-10-07 | Organizing Unstructured Image Collections using Natural Language | Mingxuan Liu et.al. | 2410.05217 | null |
2024-10-07 | Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh et.al. | 2410.05210 | link |
2024-10-07 | RevisEval: Improving LLM-as-a-Judge via Response-Adapted References | Qiyuan Zhang et.al. | 2410.05193 | null |
2024-10-07 | Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspective | Kaiyue Wen et.al. | 2410.05192 | null |
2024-10-07 | LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation | Zhijie Wang et.al. | 2410.05191 | null |
2024-10-04 | Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models | Zhuochun Li et.al. | 2410.03663 | null |
2024-10-04 | Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models | Tinghui Zhu et.al. | 2410.03659 | link |
2024-10-04 | RAFT: Realistic Attacks to Fool Text Detectors | James Wang et.al. | 2410.03658 | link |
2024-10-04 | Aligning LLMs with Individual Preferences via Interaction | Shujin Wu et.al. | 2410.03642 | link |
2024-10-04 | Conditional Enzyme Generation Using Protein Language Models with Adapters | Jason Yang et.al. | 2410.03634 | null |
2024-10-04 | Large Language Model Performance Benchmarking on Mobile Platforms: A Thorough Evaluation | Jie Xiao et.al. | 2410.03613 | null |
2024-10-04 | TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation | Jonathan Cook et.al. | 2410.03608 | null |
2024-10-04 | LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos | Noriaki Hirose et.al. | 2410.03603 | null |
2024-10-04 | Efficiently Identifying Watermarked Segments in Mixed-Source Texts | Xuandong Zhao et.al. | 2410.03600 | null |
2024-10-04 | Understanding Reasoning in Chain-of-Thought from the Hopfieldian View | Lijie Hu et.al. | 2410.03595 | null |
2024-10-04 | Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models | Xin Zou et.al. | 2410.03577 | link |
2024-10-04 | Towards Linguistically-Aware and Language-Independent Tokenization for Large Language Models (LLMs) | Abrar Rahman et.al. | 2410.03568 | null |
2024-10-04 | Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding | Wei Wu et.al. | 2410.03553 | null |
2024-10-04 | Re-examining Sexism and Misogyny Classification with Annotator Attitudes | Aiqi Jiang et.al. | 2410.03543 | null |
2024-10-04 | No Need to Talk: Asynchronous Mixture of Language Models | Anastasiia Filippova et.al. | 2410.03529 | null |
2024-10-04 | Steering Large Language Models between Code Execution and Textual Reasoning | Yongchao Chen et.al. | 2410.03524 | null |
2024-10-04 | A Probabilistic Perspective on Unlearning and Alignment for Large Language Models | Yan Scholten et.al. | 2410.03523 | null |
2024-10-04 | CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang et.al. | 2410.03502 | link |
2024-10-04 | FedStein: Enhancing Multi-Domain Federated Learning Through James-Stein Estimator | Sunny Gupta et.al. | 2410.03499 | link |
2024-10-04 | Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores | Robert E. Blackwell et.al. | 2410.03492 | null |
2024-10-03 | Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | Nick Jiang et.al. | 2410.02762 | link |
2024-10-03 | FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models | Zhipei Xu et.al. | 2410.02761 | link |
2024-10-03 | Erasing Conceptual Knowledge from Language Models | Rohit Gandikota et.al. | 2410.02760 | link |
2024-10-03 | Loong: Generating Minute-level Long Videos with Autoregressive Language Models | Yuqing Wang et.al. | 2410.02757 | null |
2024-10-03 | SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost | Jifan Zhang et.al. | 2410.02755 | null |
2024-10-03 | Training Language Models on Synthetic Edit Sequences Improves Code Synthesis | Ulyana Piterbarg et.al. | 2410.02749 | link |
2024-10-03 | CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation | Han He et.al. | 2410.02748 | null |
2024-10-03 | Contrastive Localized Language-Image Pre-Training | Hong-You Chen et.al. | 2410.02746 | null |
2024-10-03 | Neutral residues: revisiting adapters for model extension | Franck Signe Talla et.al. | 2410.02744 | null |
2024-10-03 | MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions | Yekun Chai et.al. | 2410.02743 | null |
2024-10-03 | Grounding Large Language Models In Embodied Environment With Imperfect World Models | Haolan Liu et.al. | 2410.02742 | null |
2024-10-03 | Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization | Lei Xu et.al. | 2410.02741 | link |
2024-10-03 | Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Zhengfeng Lai et.al. | 2410.02740 | null |
2024-10-04 | Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge | Jiayi Ye et.al. | 2410.02736 | null |
2024-10-03 | DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects | Zhaowei Wang et.al. | 2410.02730 | link |
2024-10-03 | Unified Multi-Modal Interleaved Document Representation for Information Retrieval | Jaewoo Lee et.al. | 2410.02729 | null |
2024-10-03 | Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation | Rohin Manvi et.al. | 2410.02725 | null |
2024-10-03 | Large Language Models as Markov Chains | Oussama Zekri et.al. | 2410.02724 | null |
2024-10-03 | Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization | Ryan C. Barron et.al. | 2410.02721 | null |
2024-10-03 | UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation | Zixuan Li et.al. | 2410.02719 | null |
2024-10-02 | Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads | Yuxiang Huang et.al. | 2410.01805 | link |
2024-10-02 | Efficient $1$ -bit tensor approximations | Alex W. Neal Riasanovsky et.al. | 2410.01799 | null |
2024-10-02 | Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models | Joseph Lee et.al. | 2410.01795 | link |
2024-10-02 | When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | R. Thomas McCoy et.al. | 2410.01792 | null |
2024-10-02 | Investigating on RLHF methodology | Alexey Kutalev et.al. | 2410.01789 | null |
2024-10-02 | OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models | Heng Yang et.al. | 2410.01784 | link |
2024-10-02 | Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Shayekh Bin Islam et.al. | 2410.01782 | link |
2024-10-03 | Quantifying Generalization Complexity for Large Language Models | Zhenting Qi et.al. | 2410.01769 | link |
2024-10-02 | Integrating Protein Sequence and Expression Level to Analysis Molecular Characterization of Breast Cancer Subtypes | Hossein Sholehrasa et.al. | 2410.01755 | null |
2024-10-02 | LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks | Mengzhao Jia et.al. | 2410.01744 | link |
2024-10-02 | VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models | Kailai Feng et.al. | 2410.01738 | link |
2024-10-02 | Visual Perception in Text Strings | Qi Jia et.al. | 2410.01733 | link |
2024-10-02 | Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing | Yilmazcan Ozyurt et.al. | 2410.01727 | link |
2024-10-02 | Auto-Demo Prompting: Leveraging Generated Outputs as Demonstrations for Enhanced Batch Prompting | Longyu Feng et.al. | 2410.01724 | null |
2024-10-02 | Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective | Zeyu Gan et.al. | 2410.01720 | link |
2024-10-02 | Examining the Role of Relationship Alignment in Large Language Models | Kristen M. Altenburger et.al. | 2410.01708 | null |
2024-10-02 | Interpretable Contrastive Monte Carlo Tree Search Reasoning | Zitian Gao et.al. | 2410.01707 | link |
2024-10-02 | An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings | Soham Govande et.al. | 2410.01704 | link |
2024-10-02 | CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs | Kangsheng Wang et.al. | 2410.01696 | null |
2024-10-02 | U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models | Tung-Yu Wu et.al. | 2410.01692 | null |
2024-09-30 | MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning | Haotian Zhang et.al. | 2409.20566 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos | Md Mohaiminul Islam et.al. | 2409.20557 | null |
2024-09-30 | UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models | Qiaojun Yu et.al. | 2409.20551 | null |
2024-09-30 | LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Ziyao Zhang et.al. | 2409.20550 | null |
2024-09-30 | Robi Butler: Remote Multimodal Interactions with Household Robot Assistant | Anxing Xiao et.al. | 2409.20548 | null |
2024-09-30 | Uncertainty-Informed Screening for Safer Solvents Used in the Synthesis of Perovskite via Language Models | Arpan Mukherjee et.al. | 2409.20512 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | A Weakly Supervised Data Labeling Framework for Machine Lexical Normalization in Vietnamese Social Media | Dung Ha Nguyen et.al. | 2409.20467 | null |
2024-09-30 | Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments | Mohamed Elnoor et.al. | 2409.20445 | null |
2024-10-01 | Instance-adaptive Zero-shot Chain-of-Thought Prompting | Xiaosong Yuan et.al. | 2409.20441 | null |
2024-09-30 | HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan et.al. | 2409.20429 | null |
2024-09-30 | World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang et.al. | 2409.20424 | link |
2024-09-30 | Anti-stereotypical Predictive Text Suggestions Do Not Reliably Yield Anti-stereotypical Writing | Connor Baumler et.al. | 2409.20390 | null |
2024-09-30 | Wait, but Tylenol is Acetaminophen… Investigating and Improving Language Models’ Ability to Resist Requests for Misinformation | Shan Chen et.al. | 2409.20385 | null |
2024-09-30 | Word-wise intonation model for cross-language TTS systems | Tomilov A. A. et.al. | 2409.20374 | null |
2024-09-30 | The Perfect Blend: Redefining RLHF with Mixture of Judges | Tengyu Xu et.al. | 2409.20370 | null |
2024-09-30 | VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs | Ruotong Liao et.al. | 2409.20365 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference | Ke Yi et.al. | 2409.20361 | null |
2024-09-27 | Exploring Token Pruning in Vision State Space Models | Zheng Zhan et.al. | 2409.18962 | null |
2024-09-27 | LML: Language Model Learning a Dataset for Data-Augmented Prediction | Praneeth Vadlapati et.al. | 2409.18957 | link |
2024-09-27 | Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models | Jiaming Li et.al. | 2409.18943 | link |
2024-09-27 | From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding | Heqing Zou et.al. | 2409.18938 | null |
2024-09-27 | Social Media Bot Policies: Evaluating Passive and Active Enforcement | Kristina Radivojevic et.al. | 2409.18931 | null |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Soft Measures for Extracting Causal Collective Intelligence | Maryam Berijanian et.al. | 2409.18911 | link |
2024-09-27 | Improving Visual Object Tracking through Visual Prompting | Shih-Fang Chen et.al. | 2409.18901 | link |
2024-09-27 | IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation | Fan Lin et.al. | 2409.18892 | link |
2024-09-27 | Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models | Zehan Li et.al. | 2409.18878 | null |
2024-09-27 | Predicting and analyzing memorization within fine-tuned Large Language Models | Jérémie Dentan et.al. | 2409.18858 | null |
2024-09-27 | Mitigating Selection Bias with Node Pruning and Auxiliary Options | Hyeong Kyu Choi et.al. | 2409.18857 | null |
2024-09-27 | LLMs4Synthesis: Leveraging Large Language Models for Scientific Synthesis | Hamed Babaei Giglou et.al. | 2409.18812 | link |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | A Survey on the Honesty of Large Language Models | Siheng Li et.al. | 2409.18786 | link |
2024-09-27 | Enhancing Explainability in Multimodal Large Language Models Using Ontological Context | Jihen Amara et.al. | 2409.18753 | null |
2024-09-27 | OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph | Yujie Tang et.al. | 2409.18743 | null |
2024-09-27 | Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Gleb Mezentsev et.al. | 2409.18721 | link |
2024-09-27 | Read Over the Lines: Attacking LLMs and Toxicity Detection Systems with ASCII Art to Mask Profanity | Sergey Berezin et.al. | 2409.18708 | link |
2024-09-27 | Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models | Yiming Chen et.al. | 2409.18680 | link |
2024-09-26 | EgoLM: Multi-Modal Language Model of Egocentric Motions | Fangzhou Hong et.al. | 2409.18127 | null |
2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography | Yuexi Du et.al. | 2409.18119 | null |
2024-09-26 | E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding | Ye Liu et.al. | 2409.18111 | link |
2024-09-26 | Open-World Evaluation for Retrieving Diverse Perspectives | Hung-Ting Chen et.al. | 2409.18110 | null |
2024-09-26 | MALPOLON: A Framework for Deep Species Distribution Modeling | Theo Larcher et.al. | 2409.18102 | link |
2024-09-26 | SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation | Xin Li et.al. | 2409.18082 | null |
2024-09-26 | Infer Human’s Intentions Before Following Natural Language Instructions | Yanming Wan et.al. | 2409.18073 | link |
2024-09-26 | Infering Alt-text For UI Icons With Large Language Models During App Development | Sabrina Haque et.al. | 2409.18060 | null |
2024-09-26 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions | Kai Chen et.al. | 2409.18042 | null |
2024-09-26 | Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective | Yotam Wolf et.al. | 2409.18028 | null |
2024-09-26 | An Adversarial Perspective on Machine Unlearning for AI Safety | Jakub Łucki et.al. | 2409.18025 | link |
2024-09-26 | DARE: Diverse Visual Question Answering with Robustness Evaluation | Hannah Sterz et.al. | 2409.18023 | null |
2024-09-26 | Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles | Lewei He et.al. | 2409.18014 | null |
2024-09-26 | Control Industrial Automation System with Large Language Models | Yuchen Xia et.al. | 2409.18009 | link |
2024-09-26 | Multilingual Evaluation of Long Context Retrieval and Reasoning | Ameeta Agrawal et.al. | 2409.18006 | link |
2024-09-26 | Enhancing Tourism Recommender Systems for Sustainable City Trips Using Retrieval-Augmented Generation | Ashmi Banerjee et.al. | 2409.18003 | null |
2024-09-26 | Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models | Georg Ahnert et.al. | 2409.17990 | link |
2024-09-26 | LLM4Brain: Training a Large Language Model for Brain Video Understanding | Ruizhe Zheng et.al. | 2409.17987 | null |
2024-09-25 | Attention Prompting on Image for Large Vision-Language Models | Runpeng Yu et.al. | 2409.17143 | link |
2024-09-25 | FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression | Fazal Mittu et.al. | 2409.17141 | link |
2024-09-25 | Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents | Junting Lu et.al. | 2409.17140 | null |
2024-09-25 | Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset | Andrew Goldberg et.al. | 2409.17126 | null |
2024-09-25 | Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale | Fan Zhou et.al. | 2409.17115 | link |
2024-09-25 | Unveiling Ontological Commitment in Multi-Modal Foundation Models | Mert Keser et.al. | 2409.17109 | null |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-25 | Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Bowen Zhao et.al. | 2409.17080 | link |
2024-09-25 | VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu et.al. | 2409.17066 | link |
2024-09-25 | Benchmarking Domain Generalization Algorithms in Computational Pathology | Neda Zamanitajeddin et.al. | 2409.17063 | null |
2024-09-25 | Using LLM for Real-Time Transcription and Summarization of Doctor-Patient Interactions into ePuskesmas in Indonesia | Azmul Asmar Irfan et.al. | 2409.17054 | null |
2024-09-25 | GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design | Phillip Mueller et.al. | 2409.17045 | null |
2024-09-25 | How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Francesco Verdini et.al. | 2409.17044 | null |
2024-09-25 | Counterfactual Token Generation in Large Language Models | Ivi Chatzi et.al. | 2409.17027 | link |
2024-09-25 | LLM-CARD: Towards a Description and Landscape of Large Language Models | Shengwei Tian et.al. | 2409.17011 | link |
2024-09-25 | Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sasha Boguraev et.al. | 2409.17005 | null |
2024-09-26 | INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Shimao Chen et.al. | 2409.16997 | link |
2024-09-25 | Harnessing Diversity for Important Data Selection in Pretraining Large Language Models | Chi Zhang et.al. | 2409.16986 | null |
2024-09-25 | AXCEL: Automated eXplainable Consistency Evaluation using LLMs | P Aditya Sreekar et.al. | 2409.16984 | null |
2024-09-25 | Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions | Zeyneb N. Kaya et.al. | 2409.16974 | null |
2024-09-20 | Gender Representation and Bias in Indian Civil Service Mock Interviews | Somonnoy Banerjee et.al. | 2409.12194 | null |
2024-09-18 | Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution | Peng Wang et.al. | 2409.12191 | link |
2024-09-18 | To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | Zayne Sprague et.al. | 2409.12183 | link |
2024-09-23 | A Controlled Study on Long Context Extension and Generalization in LLMs | Yi Lu et.al. | 2409.12181 | link |
2024-09-18 | Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | Arslan Chaudhry et.al. | 2409.12180 | null |
2024-09-18 | Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference | Najmeh Forouzandehmehr et.al. | 2409.12150 | null |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-09-18 | MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | null |
2024-09-24 | Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models | Sijing Chen et.al. | 2409.12139 | null |
2024-09-18 | GRIN: GRadient-INformed MoE | Liyuan Liu et.al. | 2409.12136 | null |
2024-09-18 | Linguini: A benchmark for language-agnostic linguistic reasoning | Eduardo Sánchez et.al. | 2409.12126 | link |
2024-09-18 | Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement | An Yang et.al. | 2409.12122 | null |
2024-09-18 | Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference | Edresson Casanova et.al. | 2409.12117 | null |
2024-09-18 | Measuring Human and AI Values based on Generative Psychometrics with Large Language Models | Haoran Ye et.al. | 2409.12106 | link |
2024-09-19 | Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval | Warren Jouanneau et.al. | 2409.12097 | null |
2024-09-19 | The Impact of Element Ordering on LM Agent Performance | Wayne Chi et.al. | 2409.12089 | link |
2024-09-18 | Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking | Ningyuan Xi et.al. | 2409.12059 | null |
2024-09-19 | Using Large Language Models to Generate Clinical Trial Tables and Figures | Yumeng Yang et.al. | 2409.12046 | null |
2024-09-18 | All-in-one foundational models learning across quantum chemical levels | Yuxinxin Chen et.al. | 2409.12015 | link |
2024-09-18 | Mixture of Prompt Learning for Vision Language Models | Yu Du et.al. | 2409.12011 | null |
2024-09-17 | AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs | Basel Mousi et.al. | 2409.11404 | null |
2024-09-17 | NVLM: Open Frontier-Class Multimodal LLMs | Wenliang Dai et.al. | 2409.11402 | null |
2024-09-17 | Says Who? Effective Zero-Shot Annotation of Focalization | Rebecca M. M. Hicke et.al. | 2409.11390 | null |
2024-09-17 | Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Simon Yu et.al. | 2409.11378 | link |
2024-09-17 | Towards Time Series Reasoning with LLMs | Winnie Chow et.al. | 2409.11376 | null |
2024-09-17 | Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification | Fatema-E- Jannat et.al. | 2409.11375 | null |
2024-09-17 | Learning Spatially-Aware Language and Audio Embedding | Bhavika Devnani et.al. | 2409.11369 | null |
2024-09-17 | CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration | Jiahui Gao et.al. | 2409.11365 | null |
2024-09-17 | CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark | Zachary S. Siegel et.al. | 2409.11363 | link |
2024-09-17 | AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances | Dhruv Agarwal et.al. | 2409.11360 | null |
2024-09-17 | THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Mengfei Liang et.al. | 2409.11353 | link |
2024-09-17 | LPT++: Efficient Training on Mixture of Long-tailed Experts | Bowen Dong et.al. | 2409.11323 | null |
2024-09-17 | SOAP: Improving and Stabilizing Shampoo using Adam | Nikhil Vyas et.al. | 2409.11321 | link |
2024-09-17 | Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models | Divij Gupta et.al. | 2409.11302 | null |
2024-09-17 | Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5 | Marcel Lamott et.al. | 2409.11282 | null |
2024-09-17 | P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task | Weiye Xu et.al. | 2409.11279 | null |
2024-09-17 | Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments | Maria Rigaki et.al. | 2409.11276 | null |
2024-09-17 | Task Arithmetic for Language Expansion in Speech Translation | Yao-Fei Cheng et.al. | 2409.11274 | null |
2024-09-18 | LOLA – An Open-Source Massively Multilingual Large Language Model | Nikit Srivastava et.al. | 2409.11272 | link |
2024-09-17 | Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models | Jiahao Qin et.al. | 2409.11263 | null |
2024-09-16 | RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval | Di Liu et.al. | 2409.10516 | link |
2024-09-16 | Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models | Momoko Shiraishi et.al. | 2409.10506 | null |
2024-09-16 | DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction | John Wu et.al. | 2409.10504 | null |
2024-09-16 | Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Kulin Shah et.al. | 2409.10502 | link |
2024-09-16 | Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models | Shaznin Sultana et.al. | 2409.10490 | null |
2024-09-16 | Do Pre-trained Vision-Language Models Encode Object States? | Kaleb Newman et.al. | 2409.10488 | null |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-17 | Schrodinger’s Memory: Large Language Models | Wei Wang et.al. | 2409.10482 | null |
2024-09-16 | Towards Semantic Versioning of Open Pre-trained Language Model Releases on Hugging Face | Adekunle Ajibode et.al. | 2409.10472 | null |
2024-09-16 | LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning | Jicong Ao et.al. | 2409.10444 | link |
2024-09-16 | CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera | Jingpei Lu et.al. | 2409.10441 | null |
2024-09-16 | HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models | Vineet Bhat et.al. | 2409.10419 | null |
2024-09-16 | A Large-Scale Privacy Assessment of Android Third-Party SDKs | Mark Huasong Meng et.al. | 2409.10411 | null |
2024-09-16 | A Knowledge-Enhanced Disease Diagnosis Method Based on Prompt Learning and BERT Integration | Zhang Zheng et.al. | 2409.10403 | null |
2024-09-17 | Learnings from a Large-Scale Deployment of an LLM-Powered Expert-in-the-Loop Healthcare Chatbot | Bhuvan Sachdeva et.al. | 2409.10354 | null |
2024-09-16 | Large Language Model Enhanced Hard Sample Identification for Denoising Recommendation | Tianrui Song et.al. | 2409.10343 | null |
2024-09-16 | The 20 questions game to distinguish large language models | Gurvan Richardeau et.al. | 2409.10338 | null |
2024-09-16 | MGSA: Multi-granularity Graph Structure Attention for Knowledge Graph-to-Text Generation | Shanshan Wang et.al. | 2409.10294 | null |
2024-09-16 | ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | Jiahao Yuan et.al. | 2409.10289 | link |
2024-09-16 | ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code | Jia Feng et.al. | 2409.10280 | link |
2024-09-13 | Agents in Software Engineering: Survey, Landscape, and Vision | Yanxian Huang et.al. | 2409.09030 | link |
2024-09-13 | Contri(e)ve: Context + Retrieve for Scholarly Question Answering | Kanchan Shivashankar et.al. | 2409.09010 | null |
2024-09-13 | Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance | Lucio La Cava et.al. | 2409.08963 | null |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | SynSUM – Synthetic Benchmark with Structured and Unstructured Medical Records | Paloma Rabaey et.al. | 2409.08936 | link |
2024-09-13 | LLM-based Weak Supervision Framework for Query Intent Classification in Video Search | Farnoosh Javadi et.al. | 2409.08931 | null |
2024-09-13 | Affective Computing Has Changed: The Foundation Model Disruption | Björn Schuller et.al. | 2409.08907 | null |
2024-09-13 | AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models | Yifei Yao et.al. | 2409.08904 | link |
2024-09-13 | A Market for Lemons? Strategic Directions for a Vigilant Application of Artificial Intelligence in Entrepreneurship Research | Martin Obschonka et.al. | 2409.08890 | null |
2024-09-13 | Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Xuchen Li et.al. | 2409.08887 | null |
2024-09-13 | Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies | Zhiqiang Zhong et.al. | 2409.08864 | null |
2024-09-13 | FP-VEC: Fingerprinting Large Language Models via Efficient Vector Addition | Zhenhua Xu et.al. | 2409.08846 | null |
2024-09-13 | AIPO: Improving Training Objective for Iterative Preference Optimization | Yaojie Shen et.al. | 2409.08845 | link |
2024-09-13 | A RAG Approach for Generating Competency Questions in Ontology Engineering | Xueli Pan et.al. | 2409.08820 | null |
2024-09-13 | Your Weak LLM is Secretly a Strong Teacher for Alignment | Leitian Tao et.al. | 2409.08813 | null |
2024-09-13 | Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task | Shao Zhang et.al. | 2409.08811 | null |
2024-09-13 | LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment | Huan Zhang et.al. | 2409.08795 | link |
2024-09-13 | Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes | Luis Rita et.al. | 2409.08792 | null |
2024-09-13 | Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Jialu Tang et.al. | 2409.08788 | null |
2024-09-13 | Uncertainty and Generalizability in Foundation Models for Earth Observation | Raul Ramos-Pollan et.al. | 2409.08744 | null |
2024-09-12 | Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale | Rogerio Bonatti et.al. | 2409.08264 | link |
2024-09-12 | OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering | Jiahao Nick Li et.al. | 2409.08250 | null |
2024-09-12 | Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources | Alisia Lupidi et.al. | 2409.08239 | null |
2024-09-12 | LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems | Hakan T. Otal et.al. | 2409.08234 | link |
2024-09-12 | Adaptive Language-Guided Abstraction from Contrastive Explanations | Andi Peng et.al. | 2409.08212 | null |
2024-09-12 | ComAlign: Compositional Alignment in Vision-Language Models | Ali Abdollah et.al. | 2409.08206 | null |
2024-09-12 | What Makes a Maze Look Like a Maze? | Joy Hsu et.al. | 2409.08202 | null |
2024-09-12 | AudioBERT: Audio Knowledge Augmented Language Model | Hyunjong Ok et.al. | 2409.08199 | link |
2024-09-12 | Fine-tuning Large Language Models for Entity Matching | Aaron Steiner et.al. | 2409.08185 | link |
2024-09-12 | On the Role of Context in Reading Time Prediction | Andreas Opedal et.al. | 2409.08160 | link |
2024-09-12 | Faster Speech-LLaMA Inference with Multi-token Prediction | Desh Raj et.al. | 2409.08148 | null |
2024-09-12 | LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models | Zhengliang Liu et.al. | 2409.08147 | null |
2024-09-12 | Towards a graph-based foundation model for network traffic analysis | Louis Van Langendonck et.al. | 2409.08111 | null |
2024-09-12 | The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language | Michael Ong et.al. | 2409.08103 | null |
2024-09-12 | The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal | Huiyuan Xie et.al. | 2409.08098 | null |
2024-09-12 | Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks | Benji Peng et.al. | 2409.08087 | null |
2024-09-12 | SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality | Chenyang Lei et.al. | 2409.08083 | link |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | null |
2024-09-12 | TravelAgent: An AI Assistant for Personalized Travel Planning | Aili Chen et.al. | 2409.08069 | null |
2024-09-12 | An Evaluation Framework for Attributed Information Retrieval using Large Language Models | Hanane Djeddal et.al. | 2409.08014 | link |
2024-09-11 | “My Grade is Wrong!”: A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays | Shengxin Hong et.al. | 2409.07453 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin et.al. | 2409.07440 | link |
2024-09-11 | A Suite for Acoustic Language Model Evaluation | Gallil Maimon et.al. | 2409.07437 | link |
2024-09-11 | Synthetic continued pretraining | Zitong Yang et.al. | 2409.07431 | link |
2024-09-11 | Agent Workflow Memory | Zora Zhiruo Wang et.al. | 2409.07429 | link |
2024-09-11 | CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification | Zeqing Qin et.al. | 2409.07407 | null |
2024-09-11 | AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Han Wang et.al. | 2409.07394 | link |
2024-09-11 | Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination | Daniel Zhang-Li et.al. | 2409.07372 | null |
2024-09-11 | Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code | Khiem Ton et.al. | 2409.07368 | null |
2024-09-11 | Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation | SeongYeub Chu et.al. | 2409.07355 | link |
2024-09-11 | Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks | Md Zarif Hossain et.al. | 2409.07353 | link |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering | Weixi Weng et.al. | 2409.07331 | null |
2024-09-11 | MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Praveen K Kanithi et.al. | 2409.07314 | null |
2024-09-11 | Exploring User-level Gradient Inversion with a Diffusion Prior | Zhuohang Li et.al. | 2409.07291 | null |
2024-09-11 | STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM | Qijiong Liu et.al. | 2409.07276 | null |
2024-09-11 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-12 | Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Buhua Liu et.al. | 2409.07253 | link |
2024-09-11 | PiTe: Pixel-Temporal Alignment for Large Video-Language Model | Yang Liu et.al. | 2409.07239 | link |
2024-09-10 | Benchmarking Sub-Genre Classification For Mainstage Dance Music | Hongzhi Shu et.al. | 2409.06690 | null |
2024-09-10 | E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning | Zihan Liao et.al. | 2409.06679 | null |
2024-09-10 | LLaMA-Omni: Seamless Speech Interaction with Large Language Models | Qingkai Fang et.al. | 2409.06666 | link |
2024-09-10 | Human Perception of LLM-generated Text Content in Social Media Environments | Kristina Radivojevic et.al. | 2409.06653 | null |
2024-09-10 | Optimal Workload Placement on Multi-Instance GPUs | Bekir Turkkan et.al. | 2409.06646 | null |
2024-09-11 | EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis | Danli Shi et.al. | 2409.06644 | null |
2024-09-11 | Segmenting sea ice floes in close-range optical imagery with active contour and foundation models | Giulio Passerotti et.al. | 2409.06641 | null |
2024-09-10 | TeXBLEU: Automatic Metric for Evaluate LaTeX Format | Kyudan Jung et.al. | 2409.06639 | link |
2024-09-10 | MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders | Wenyu Zhang et.al. | 2409.06635 | null |
2024-09-10 | A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | Ningyuan Xi et.al. | 2409.06624 | null |
2024-09-10 | Exploring Italian sentence embeddings properties through multi-tasking | Vivi Nastase et.al. | 2409.06622 | link |
2024-09-10 | Alleviating Hallucinations in Large Language Models with Scepticism Modeling | Yetao Wu et.al. | 2409.06601 | null |
2024-09-10 | GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering | Sacha Muller et.al. | 2409.06595 | link |
2024-09-10 | Quantifying and Enabling the Interpretability of CLIP-like Models | Avinash Madasu et.al. | 2409.06579 | null |
2024-09-10 | Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement | Vivi Nastase et.al. | 2409.06567 | null |
2024-09-10 | MAPS: Energy-Reliability Tradeoff Management in Autonomous Vehicles Through LLMs Penetrated Science | Mahdieh Aliazam et.al. | 2409.06558 | null |
2024-09-10 | Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games | Juhwan Choi et.al. | 2409.06518 | link |
2024-09-10 | Aligning Machine and Human Visual Representations across Abstraction Levels | Lukas Muttenthaler et.al. | 2409.06509 | null |
2024-09-10 | Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding | Xiaoyu Liang et.al. | 2409.06485 | null |
2024-09-10 | Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles | Qiujing Lu et.al. | 2409.06450 | null |
2024-09-09 | MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct | Run Luo et.al. | 2409.05840 | null |
2024-09-09 | Are Large Language Models a Threat to Programming Platforms? An Exploratory Study | Md Mustakim Billah et.al. | 2409.05824 | null |
2024-09-09 | VFA: Vision Frequency Analysis of Foundation Models and Human | Mohammad-Javad Darvishi-Bayazi et.al. | 2409.05817 | null |
2024-09-09 | Improving Pretraining Data Using Perplexity Correlations | Tristan Thrush et.al. | 2409.05816 | null |
2024-09-09 | Benchmarking Chinese Knowledge Rectification in Large Language Models | Tianhe Lu et.al. | 2409.05806 | link |
2024-09-09 | Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models | Emily Cheng et.al. | 2409.05771 | null |
2024-09-09 | Model Input Verification of Large Scale Simulations | Rumyana Neykova et.al. | 2409.05768 | null |
2024-09-09 | A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System | B. Sankar et.al. | 2409.05747 | null |
2024-09-09 | LLMs Will Always Hallucinate, and We Need to Live With This | Sourav Banerjee et.al. | 2409.05746 | null |
2024-09-09 | A System and Benchmark for LLM-based Q\&A on Heterogeneous Data | Achille Fokoue et.al. | 2409.05735 | null |
2024-09-09 | Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach | Meng Zhou et.al. | 2409.05732 | null |
2024-09-09 | The Influence of Task and Group Disparities over Users’ Attitudes Toward Using Large Language Models for Psychotherapy | Qihang He et.al. | 2409.05703 | null |
2024-09-09 | Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features | Jacob Gildenblat et.al. | 2409.05697 | null |
2024-09-09 | Zero-shot Outlier Detection via Prior-data Fitted Networks: Model Selection Bygone! | Yuchen Shen et.al. | 2409.05672 | null |
2024-09-09 | Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case | Vagrant Gautam et.al. | 2409.05653 | link |
2024-09-10 | MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery | Hongjin Qian et.al. | 2409.05591 | link |
2024-09-09 | Leveraging Content and Acoustic Representations for Efficient Speech Emotion Recognition | Soumya Dutta et.al. | 2409.05566 | null |
2024-09-09 | CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning | Jinwei He et.al. | 2409.05559 | null |
2024-09-09 | SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Alireza Ghafarollahi et.al. | 2409.05556 | link |
2024-09-09 | Harmonic Reasoning in Large Language Models | Anna Kruspe et.al. | 2409.05521 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | link |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424 | null |
2024-09-06 | RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs | Jiaxing Wu et.al. | 2409.04421 | null |
2024-09-06 | Question-Answering Dense Video Events | Hangyu Qin et.al. | 2409.04388 | null |
2024-09-06 | Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs | Aliakbar Nafar et.al. | 2409.04318 | link |
2024-09-06 | An optically accelerated extreme learning machine using hot atomic vapors | Pierre Azam et.al. | 2409.04312 | null |
2024-09-06 | Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Desiree Heim et.al. | 2409.04286 | null |
2024-09-06 | Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models | Yuxiao Huang et.al. | 2409.04270 | null |
2024-09-06 | An overview of domain-specific foundation model: key technologies, applications and challenges | Haolong Chen et.al. | 2409.04267 | null |
2024-09-06 | UniDet3D: Multi-dataset Indoor 3D Object Detection | Maksim Kolodiazhnyi et.al. | 2409.04234 | link |
2024-09-06 | Fast Forwarding Low-Rank Training | Adir Rahamim et.al. | 2409.04206 | null |
2024-09-06 | Residual Stream Analysis with Multi-Layer SAEs | Tim Lawson et.al. | 2409.04185 | link |
2024-09-06 | GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding | Ziyin Zhang et.al. | 2409.04183 | null |
2024-09-06 | Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering | Larissa Pusch et.al. | 2409.04181 | null |
2024-09-06 | From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks | Andreas Stephan et.al. | 2409.04168 | null |
2024-09-06 | Can OpenSource beat ChatGPT? – A Comparative Study of Large Language Models for Text-to-Code Generation | Luis Mayer et.al. | 2409.04164 | null |
2024-09-06 | Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering | Jan Hofmann et.al. | 2409.04122 | null |
2024-09-06 | Multi-Programming Language Ensemble for Code Generation in Large Language Model | Tengfei Xue et.al. | 2409.04114 | link |
2024-09-06 | Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Chenglei Si et.al. | 2409.04109 | link |
2024-09-06 | UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity | Yicheng Fu et.al. | 2409.04081 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution | Marga Don et.al. | 2409.03754 | link |
2024-09-05 | Attention Heads of Large Language Models: A Survey | Zifan Zheng et.al. | 2409.03752 | link |
2024-09-05 | LLM-CI: Assessing Contextual Integrity Norms in Language Models | Yan Shvartzshnaider et.al. | 2409.03735 | null |
2024-09-05 | Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry | Meena Jagadeesan et.al. | 2409.03734 | null |
2024-09-05 | Planning In Natural Language Improves LLM Search For Code Generation | Evan Wang et.al. | 2409.03733 | link |
2024-09-06 | RAG based Question-Answering for Contextual Response Prediction System | Sriram Veturi et.al. | 2409.03708 | null |
2024-09-05 | LAST: Language Model Aware Speech Tokenization | Arnon Turetzky et.al. | 2409.03701 | null |
2024-09-05 | TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems | Stylianos Loukas Vasileiou et.al. | 2409.03671 | null |
2024-09-05 | A Fused Large Language Model for Predicting Startup Success | Abdurahman Maarouf et.al. | 2409.03668 | null |
2024-09-05 | The representation landscape of few-shot learning and fine-tuning in large language models | Diego Doimo et.al. | 2409.03662 | link |
2024-09-06 | LLM-based multi-agent poetry generation in non-cooperative environments | Ran Zhang et.al. | 2409.03659 | link |
2024-09-05 | On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization | Yong Lin et.al. | 2409.03650 | null |
2024-09-05 | Text-Guided Mixup Towards Long-Tailed Image Categorization | Richard Franklin et.al. | 2409.03583 | link |
2024-09-05 | FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation | Xi Chen et.al. | 2409.03525 | null |
2024-09-05 | Have Large Vision-Language Models Mastered Art History? | Ombretta Strafforello et.al. | 2409.03521 | null |
2024-09-05 | Tissue Concepts: supervised foundation models in computational pathology | Till Nicke et.al. | 2409.03519 | link |
2024-09-05 | From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents | Jifan Yu et.al. | 2409.03512 | null |
2024-09-05 | LLM-based event abstraction and integration for IoT-sourced logs | Mohsen Shirali et.al. | 2409.03478 | link |
2024-09-05 | How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes | Inacio Vieira et.al. | 2409.03454 | null |
2024-09-04 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) | Yao Mu et.al. | 2409.02920 | null |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-04 | Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng et.al. | 2409.02908 | null |
2024-09-05 | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Jiajie Zhang et.al. | 2409.02897 | link |
2024-09-04 | LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture | Xidong Wang et.al. | 2409.02889 | link |
2024-09-04 | CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently | Jonathan Zalach et.al. | 2409.02885 | null |
2024-09-04 | Benchmarking Spurious Bias in Few-Shot Image Classifiers | Guangtao Zheng et.al. | 2409.02882 | link |
2024-09-04 | Configurable Foundation Models: Building LLMs from a Modular Perspective | Chaojun Xiao et.al. | 2409.02877 | null |
2024-09-04 | Historical German Text Normalization Using Type- and Token-Based Language Modeling | Anton Ehrmanntraut et.al. | 2409.02841 | null |
2024-09-04 | Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Moein Shahiki Tash et.al. | 2409.02836 | null |
2024-09-04 | CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models | Wentao Liu et.al. | 2409.02834 | link |
2024-09-04 | ExpLLM: Towards Chain of Thought for Facial Expression Recognition | Xing Lan et.al. | 2409.02828 | null |
2024-09-04 | Design Contradictions: Help or Hindrance? | Aron E. Owen et.al. | 2409.02823 | null |
2024-09-04 | Language Understanding as a Constraint on Consensus Size in LLM Societies | Giordano De Marzo et.al. | 2409.02822 | null |
2024-09-04 | Towards a Unified View of Preference Learning for Large Language Models: A Survey | Bofei Gao et.al. | 2409.02795 | link |
2024-09-05 | Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Yixuan Tang et.al. | 2409.02727 | link |
2024-09-04 | Pre-training data selection for biomedical domain adaptation using journal impact metrics | Mathieu Laï-king et.al. | 2409.02725 | null |
2024-09-04 | Alignment-Aware Model Extraction Attacks on Large Language Models | Zi Liang et.al. | 2409.02718 | link |
2024-09-04 | Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL | Mohammad Reshadati et.al. | 2409.02711 | null |
2024-09-04 | LLM-Assisted Visual Analytics: Opportunities and Challenges | Maeve Hutchinson et.al. | 2409.02691 | null |
2024-08-30 | SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists | Raoyuan Zhao et.al. | 2408.17437 | link |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-08-30 | Advancing Multi-talker ASR Performance with Large Language Models | Mohan Shi et.al. | 2408.17431 | null |
2024-08-30 | CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Jonathan Bourne et.al. | 2408.17428 | null |
2024-09-03 | Open-vocabulary Temporal Action Localization using VLMs | Naoki Wake et.al. | 2408.17422 | null |
2024-08-30 | Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach | Jialiang Wei et.al. | 2408.17404 | link |
2024-08-30 | EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution | Francesco Argenziano et.al. | 2408.17379 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Francesca Grasso et.al. | 2408.17362 | link |
2024-08-30 | Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage | Md Rafi Ur Rashid et.al. | 2408.17354 | null |
2024-09-02 | LSMS: Language-guided Scale-aware MedSegmentor for Medical Image Referring Segmentation | Shuyi Ouyang et.al. | 2408.17347 | null |
2024-08-30 | Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering | Nicholas Pochinkov et.al. | 2408.17322 | link |
2024-08-30 | Bridging Domain Knowledge and Process Discovery Using Large Language Models | Ali Norouzifar et.al. | 2408.17316 | link |
2024-08-30 | Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts | Rhui Dih Lee et.al. | 2408.17280 | null |
2024-08-30 | Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach | Tong Nie et.al. | 2408.17258 | null |
2024-08-30 | VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Mouxiang Chen et.al. | 2408.17253 | link |
2024-08-30 | Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study | Shubham Agarwal et.al. | 2408.17181 | null |
2024-08-30 | Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Zhen Ye et.al. | 2408.17175 | link |
2024-08-30 | Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning | Xiaoye Qu et.al. | 2408.17150 | link |
2024-08-30 | Reasoning AI Performance Degradation in 6G Networks with Large Language Models | Liming Huang et.al. | 2408.17097 | null |
2024-08-29 | PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning | Noor Hussein et.al. | 2408.16769 | link |
2024-08-29 | How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models | Jiyue Jiang et.al. | 2408.16756 | link |
2024-08-29 | Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models | Alec Solway et.al. | 2408.16753 | null |
2024-08-29 | A Gradient Analysis Framework for Rewarding Good and Penalizing Bad Examples in Language Models | Yi-Lin Tuan et.al. | 2408.16751 | null |
2024-08-29 | Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge | Beidi Dong et.al. | 2408.16749 | null |
2024-08-29 | Theoretical and Methodological Framework for Studying Texts Produced by Large Language Models | Jiří Milička et.al. | 2408.16740 | null |
2024-08-29 | Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling | Hritik Bansal et.al. | 2408.16737 | null |
2024-08-29 | VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation | Shiwei Wu et.al. | 2408.16730 | null |
2024-08-30 | Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming | Zhifei Xie et.al. | 2408.16725 | link |
2024-08-29 | GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2408.16700 | link |
2024-08-29 | Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | Ziniu Li et.al. | 2408.16673 | null |
2024-08-29 | Space3D-Bench: Spatial 3D Question Answering Benchmark | Emilia Szymanska et.al. | 2408.16662 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-29 | Examination of Code generated by Large Language Models | Robin Beer et.al. | 2408.16601 | link |
2024-08-29 | Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies | Zhiyang Qi et.al. | 2408.16586 | null |
2024-08-29 | WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling | Shengpeng Ji et.al. | 2408.16532 | link |
2024-08-29 | CNIMA: A Universal Evaluation Framework and Automated Approach for Assessing Second Language Dialogues | Rena Gao et.al. | 2408.16518 | link |
2024-08-29 | LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs? | Jan Cegin et.al. | 2408.16502 | null |
2024-08-29 | CogVLM2: Visual Language Models for Image and Video Understanding | Wenyi Hong et.al. | 2408.16500 | link |
2024-08-29 | A Survey on Evaluating Large Language Models in Code Generation Tasks | Liguo Chen et.al. | 2408.16498 | null |
2024-08-28 | Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | Min Shi et.al. | 2408.15998 | link |
2024-08-29 | Spatio-Temporal Context Prompting for Zero-Shot Action Detection | Wei-Jhe Huang et.al. | 2408.15996 | null |
2024-08-28 | Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration | Xu Zhang et.al. | 2408.15994 | null |
2024-08-28 | BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems | Wei Wang et.al. | 2408.15971 | null |
2024-08-28 | More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding | Yuan Tang et.al. | 2408.15966 | link |
2024-08-28 | Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games | Nicholas R. Waytowich et.al. | 2408.15950 | null |
2024-08-28 | DeMoBot: Deformable Mobile Manipulation with Vision-based Sub-goal Retrieval | Yuying Zhang et.al. | 2408.15919 | null |
2024-08-28 | Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models | Yuncheng Yang et.al. | 2408.15915 | link |
2024-08-28 | Decentralized LLM Inference over Edge Networks with Energy Harvesting | Aria Khoshsirat et.al. | 2408.15907 | null |
2024-08-28 | LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments | Ruirui Chen et.al. | 2408.15903 | null |
2024-08-28 | Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts | Nikolas Gritsch et.al. | 2408.15901 | null |
2024-08-28 | Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models | Sebastian Vallejo Vera et.al. | 2408.15895 | null |
2024-08-28 | LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Fangxun Shu et.al. | 2408.15881 | link |
2024-08-28 | Persuasion Games using Large Language Models | Ganesh Prasath Ramani et.al. | 2408.15879 | null |
2024-08-28 | Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection | Sagar Srinivas Sakhinana et.al. | 2408.15866 | null |
2024-08-28 | Benchmarking foundation models as feature extractors for weakly-supervised computational pathology | Peter Neidlinger et.al. | 2408.15823 | null |
2024-08-28 | Visual Prompt Engineering for Medical Vision Language Models in Radiology | Stefan Denner et.al. | 2408.15802 | null |
2024-08-28 | Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization | Léo Hemamou et.al. | 2408.15801 | null |
2024-08-28 | Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models | Hédi Zhegidi et.al. | 2408.15796 | link |
2024-08-28 | Efficient LLM Scheduling by Learning to Rank | Yichao Fu et.al. | 2408.15792 | link |
2024-08-27 | Generative Verifiers: Reward Modeling as Next-Token Prediction | Lunjun Zhang et.al. | 2408.15240 | null |
2024-08-27 | The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Junxiong Wang et.al. | 2408.15237 | link |
2024-08-27 | Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang et.al. | 2408.15232 | null |
2024-08-27 | LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Nathaniel Li et.al. | 2408.15221 | null |
2024-08-27 | Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks | Shide Zhou et.al. | 2408.15207 | null |
2024-08-27 | Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation | Jian Hu et.al. | 2408.15205 | link |
2024-08-27 | Can Unconfident LLM Annotations Be Used for Confident Conclusions? | Kristina Gligorić et.al. | 2408.15204 | link |
2024-08-27 | Infusing Acoustic Pause Context into Text-Based Dementia Assessment | Franziska Braun et.al. | 2408.15188 | null |
2024-08-27 | Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement | Longshen Ou et.al. | 2408.15176 | null |
2024-08-27 | X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation | Hanjia Lyu et.al. | 2408.15172 | null |
2024-08-27 | Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation | N. E. Kriman et.al. | 2408.15171 | null |
2024-08-27 | How transformers learn structured data: insights from hierarchical filtering | Jerome Garnier-Brun et.al. | 2408.15138 | null |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models | Xiyu Liu et.al. | 2408.15091 | null |
2024-08-27 | BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | Guosheng Dong et.al. | 2408.15079 | null |
2024-08-27 | Constraining Participation: Affordances of Feedback Features in Interfaces to Large Language Models | Ned Cooper et.al. | 2408.15066 | null |
2024-08-27 | The Benefits of Balance: From Information Projections to Variance Reduction | Lang Liu et.al. | 2408.15065 | null |
2024-08-28 | DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding | Wenhui Liao et.al. | 2408.15045 | null |
2024-08-28 | A Survey of Large Language Models for European Languages | Wazir Ali et.al. | 2408.15040 | null |
2024-08-27 | Speech Recognition Transformers: Topological-lingualism Perspective | Shruti Singh et.al. | 2408.14991 | null |
2024-08-26 | A Practitioner’s Guide to Continual Multimodal Pretraining | Karsten Roth et.al. | 2408.14471 | link |
2024-08-27 | Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models | Aradhye Agarwal et.al. | 2408.14470 | link |
2024-08-26 | Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Qirui Chen et.al. | 2408.14469 | null |
2024-08-26 | Explicit Inductive Inference using Large Language Models | Tianyang Liu et.al. | 2408.14467 | null |
2024-08-26 | Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study | Liuchang Xu Shuo Zhao et.al. | 2408.14438 | null |
2024-08-26 | Social perception of faces in a vision-language model | Carina I. Hausladen et.al. | 2408.14435 | link |
2024-08-26 | CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models | Shubham Bharti et.al. | 2408.14419 | null |
2024-08-26 | MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | Kuluhan Binici et.al. | 2408.14418 | null |
2024-08-26 | Hyperdimensional Computing Empowered Federated Foundation Model over Wireless Networks for Metaverse | Yahao Ding et.al. | 2408.14416 | null |
2024-08-26 | Language-specific Calibration for Pruning Multilingual Language Models | Simon Kurz et.al. | 2408.14398 | null |
2024-08-26 | Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning | Sakhinana Sagar Srinivas et.al. | 2408.14387 | null |
2024-08-26 | Probing Causality Manipulation of Large Language Models | Chenyang Zhang et.al. | 2408.14380 | link |
2024-08-26 | An Embedding is Worth a Thousand Noisy Labels | Francesco Di Salvo et.al. | 2408.14358 | link |
2024-08-26 | SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Daoguang Zan et.al. | 2408.14354 | link |
2024-08-26 | Assessing Contamination in Large Language Models: Introducing the LogProber method | Nicolas Yax et.al. | 2408.14352 | null |
2024-08-27 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | Claim Verification in the Age of Large Language Models: A Survey | Alphaeus Dmonte et.al. | 2408.14317 | null |
2024-08-26 | LLM-3D Print: Large Language Models To Monitor and Control 3D Printing | Yayati Jadhav et.al. | 2408.14307 | null |
2024-08-26 | Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails | Malte Josten et.al. | 2408.14293 | link |
2024-08-26 | Predictability and Causality in Spanish and English Natural Language Generation | Andrea Busto-Castiñeira et.al. | 2408.14283 | null |
2024-08-23 | MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? | Yi-Fan Zhang et.al. | 2408.13257 | null |
2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D’Cruz et.al. | 2408.13253 | null |
2024-08-23 | Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption | Sakhinana Sagar Srinivas et.al. | 2408.13248 | null |
2024-08-23 | Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time | Yingyu Liang et.al. | 2408.13233 | null |
2024-08-23 | EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods | Hongcheng Ding et.al. | 2408.13214 | null |
2024-08-23 | DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation | Qiming Zhu et.al. | 2408.13204 | null |
2024-08-23 | Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Hourui Deng et.al. | 2408.13184 | null |
2024-08-23 | IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models | Zhihao Yu et.al. | 2408.13073 | link |
2024-08-23 | Guiding IoT-Based Healthcare Alert Systems with Large Language Models | Yulan Gao et.al. | 2408.13071 | null |
2024-08-23 | SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks | Kai-Wei Chang et.al. | 2408.13040 | null |
2024-08-23 | VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models | Wentao Wu et.al. | 2408.13031 | link |
2024-08-23 | In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting | Haowei Du et.al. | 2408.13028 | null |
2024-08-23 | A Web-Based Solution for Federated Learning with LLM-Based Automation | Chamith Mawela et.al. | 2408.13010 | null |
2024-08-23 | Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates | Hui Wei et.al. | 2408.13006 | link |
2024-08-23 | CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution | Ruiyang Xu et.al. | 2408.13001 | null |
2024-08-23 | Open Llama2 Model for the Lithuanian Language | Artūras Nakvosas et.al. | 2408.12963 | null |
2024-08-23 | Multimodal Contrastive In-Context Learning | Yosuke Miyanishi et.al. | 2408.12959 | null |
2024-08-23 | Image Segmentation in Foundation Model Era: A Survey | Tianfei Zhou et.al. | 2408.12957 | link |
2024-08-23 | E-code: Mastering Efficient Code Generation through Pretrained Models and Expert Encoder Group | Yue Pan et.al. | 2408.12948 | null |
2024-08-23 | Causal-Guided Active Learning for Debiasing Large Language Models | Zhouhao Sun et.al. | 2408.12942 | link |
2024-08-22 | Controllable Text Generation for Large Language Models: A Survey | Xun Liang et.al. | 2408.12599 | link |
2024-08-23 | Non-Homophilic Graph Pre-Training and Prompt Learning | Xingtong Yu et.al. | 2408.12594 | null |
2024-08-22 | RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment | Xiaohan Wang et.al. | 2408.12579 | null |
2024-08-22 | MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Haojun Shi et.al. | 2408.12574 | link |
2024-08-22 | Jamba-1.5: Hybrid Transformer-Mamba Models at Scale | Jamba Team et.al. | 2408.12570 | null |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-22 | Towards Evaluating and Building Versatile Large Language Models for Medicine | Chaoyi Wu et.al. | 2408.12547 | link |
2024-08-22 | Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Jinheng Xie et.al. | 2408.12528 | null |
2024-08-22 | MEDCO: Medical Education Copilots Based on A Multi-Agent Framework | Hao Wei et.al. | 2408.12496 | null |
2024-08-22 | GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models | Kunsheng Tang et.al. | 2408.12494 | link |
2024-08-23 | Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Khang T. Doan et.al. | 2408.12480 | null |
2024-08-22 | Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition | Bozheng Li et.al. | 2408.12475 | null |
2024-08-22 | DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems | Jiaju Chen et.al. | 2408.12470 | null |
2024-08-22 | Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning | Mushui Liu et.al. | 2408.12469 | null |
2024-08-22 | Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing | Mengqi Zhang et.al. | 2408.12456 | null |
2024-08-22 | Positional Description for Numerical Normalization | Deepanshu Gupta et.al. | 2408.12430 | null |
2024-08-22 | FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing | Jue Wang et.al. | 2408.12429 | link |
2024-08-22 | Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification | Sudi Murindanyi et.al. | 2408.12426 | null |
2024-08-22 | Unlearning Trojans in Large Language Models: A Comparison Between Natural Language and Source Code | Mahdi Kazemi et.al. | 2408.12416 | null |
2024-08-22 | Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes | Sota Kato et.al. | 2408.12406 | link |
2024-08-21 | Great Memory, Shallow Reasoning: Limits of $k$ NN-LMs | Shangyi Geng et.al. | 2408.11815 | link |
2024-08-21 | SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs | Yuanyang Yin et.al. | 2408.11813 | null |
2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | null |
2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
2024-08-21 | Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Yuzhou Huang et.al. | 2408.11801 | null |
2024-08-21 | PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain | Rounak Meyur et.al. | 2408.11800 | null |
2024-08-21 | Practical token pruning for foundation models in few-shot conversational virtual assistant systems | Haode Qi et.al. | 2408.11799 | null |
2024-08-21 | EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model | Feipeng Ma et.al. | 2408.11795 | null |
2024-08-21 | Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design | Nathaniel H. Park et.al. | 2408.11793 | null |
2024-08-21 | Critique-out-Loud Reward Models | Zachary Ankner et.al. | 2408.11791 | link |
2024-08-21 | DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Zhifei Xie et.al. | 2408.11788 | null |
2024-08-21 | Personality Alignment of Large Language Models | Minjun Zhu et.al. | 2408.11779 | link |
2024-08-21 | Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Omar Erak et.al. | 2408.11775 | link |
2024-08-21 | Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks | Yiyi Chen et.al. | 2408.11749 | link |
2024-08-21 | DH-Bench: Probing Depth and Height Perception of Large Visual-Language Models | Shehreen Azad et.al. | 2408.11748 | link |
2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | null |
2024-08-21 | Mixed Sparsity Training: Achieving 4 $\times$ FLOP Reduction for Transformer Pretraining | Pihe Hu et.al. | 2408.11746 | null |
2024-08-21 | FocusLLM: Scaling LLM’s Context by Parallel Decoding | Zhenyu Li et.al. | 2408.11745 | null |
2024-08-21 | MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models | Elias Frantar et.al. | 2408.11743 | link |
2024-08-21 | CluMo: Cluster-based Modality Fusion Prompt for Continual Learning in Visual Question Answering | Yuliang Cai et.al. | 2408.11742 | link |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks | Nathaniel Pinckney et.al. | 2408.11053 | link |
2024-08-20 | FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Yunzhe Xu et.al. | 2408.11051 | link |
2024-08-21 | MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Jian Chen et.al. | 2408.11049 | link |
2024-08-20 | Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders | Yuan Xin et.al. | 2408.11046 | null |
2024-08-20 | Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research | Sreyoshi Bhaduri et.al. | 2408.11043 | null |
2024-08-20 | Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Chunting Zhou et.al. | 2408.11039 | null |
2024-08-20 | Scaling Law with Learning Rate Annealing | Howe Tissue et.al. | 2408.11029 | null |
2024-08-20 | Athena: Safe Autonomous Agents with Verbal Contrastive Learning | Tanmana Sadhu et.al. | 2408.11021 | null |
2024-08-20 | While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output? | Wen Cheng et.al. | 2408.11006 | link |
2024-08-20 | SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining | Jonathan Prexl et.al. | 2408.11000 | link |
2024-08-20 | CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models | Michael Reinisch et.al. | 2408.10995 | null |
2024-08-20 | Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models | Yuyan Chen et.al. | 2408.10947 | null |
2024-08-20 | Large Language Model Driven Recommendation | Anton Korikov et.al. | 2408.10946 | null |
2024-08-20 | HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments | Kazi Hasan Ibn Arif et.al. | 2408.10945 | link |
2024-08-20 | SysBench: Can Large Language Models Follow System Messages? | Yanzhao Qin et.al. | 2408.10943 | link |
2024-08-20 | Proxona: Leveraging LLM-Driven Personas to Enhance Creators’ Understanding of Their Audience | Yoonseo Choi et.al. | 2408.10937 | null |
2024-08-21 | LBC: Language-Based-Classifier for Out-Of-Variable Generalization | Kangjun Noh et.al. | 2408.10923 | link |
2024-08-21 | BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Yeyong Yu et.al. | 2408.10903 | link |
2024-08-20 | Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs | John Mendonça et.al. | 2408.10902 | link |
2024-08-19 | SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP | Yusuke Hirota et.al. | 2408.10202 | null |
2024-08-19 | Demystifying the Communication Characteristics for Distributed Transformer Models | Quentin Anthony et.al. | 2408.10197 | null |
2024-08-19 | Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models | Aviv Bick et.al. | 2408.10189 | null |
2024-08-19 | LongVILA: Scaling Long-Context Visual Language Models for Long Videos | Fuzhao Xue et.al. | 2408.10188 | link |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Xiaoyu Kong et.al. | 2408.10159 | link |
2024-08-19 | Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models | Amey Hengle et.al. | 2408.10151 | link |
2024-08-19 | In-Context Learning with Representations: Contextual Generalization of Trained Transformers | Tong Yang et.al. | 2408.10147 | null |
2024-08-19 | Instruction Finetuning for Leaderboard Generation from Empirical AI Research | Salomon Kabongo et.al. | 2408.10141 | null |
2024-08-19 | Rhyme-aware Chinese lyric generator based on GPT | Yixiao Yuan et.al. | 2408.10130 | null |
2024-08-19 | Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | Feiyu Pan et.al. | 2408.10125 | null |
2024-08-19 | Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models | Tianyu Zhang et.al. | 2408.10124 | link |
2024-08-19 | Geometry Informed Tokenization of Molecules for Language Model Generation | Xiner Li et.al. | 2408.10120 | null |
2024-08-19 | GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization | Ran Liu et.al. | 2408.10115 | link |
2024-08-20 | PLUTUS: A Well Pre-trained Large Unified Transformer can Unveil Financial Time Series Regularities | Yuanjian Xu et.al. | 2408.10111 | null |
2024-08-19 | ARMADA: Attribute-Based Multimodal Data Augmentation | Xiaomeng Jin et.al. | 2408.10086 | null |
2024-08-19 | Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning | Sriyash Poddar et.al. | 2408.10075 | null |
2024-08-19 | FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Zhengchao Huang et.al. | 2408.10072 | link |
2024-08-19 | Privacy Checklist: Privacy Violation Detection Grounding on Contextual Integrity Theory | Haoran Li et.al. | 2408.10053 | null |
2024-08-19 | Defense Priorities in the Open-Source AI Debate: A Preliminary Assessment | Masao Dahlgren et.al. | 2408.10026 | null |
2024-08-16 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation | Xinyu Xiong et.al. | 2408.08870 | link |
2024-08-16 | PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars | Sumanth Prabhu et.al. | 2408.08869 | null |
2024-08-16 | A Hassle-free Algorithm for Private Learning in Practice: Don’t Use Tree Aggregation, Use BLTs | H. Brendan McMahan et.al. | 2408.08868 | null |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862 | link |
2024-08-16 | DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models | Eman Ali et.al. | 2408.08855 | null |
2024-08-16 | GeoTransformer: Enhancing Urban Forecasting with Geospatial Attention Mechanisms | Yuhao Jia et.al. | 2408.08852 | null |
2024-08-16 | ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis | Yubao Zhao et.al. | 2408.08849 | link |
2024-08-16 | PsychoLex: Unveiling the Psychological Mind of Large Language Models | Mohammad Amin Abbasi et.al. | 2408.08848 | null |
2024-08-16 | FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats | Xuanliang Zhang et.al. | 2408.08841 | link |
2024-08-16 | EasyRec: Simple yet Effective Language Models for Recommendation | Xubin Ren et.al. | 2408.08821 | link |
2024-08-16 | Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models | Lin Zhao et.al. | 2408.08813 | null |
2024-08-16 | Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors | Felipe A. Csaszar et.al. | 2408.08811 | null |
2024-08-16 | Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge | Ravi Raju et.al. | 2408.08808 | null |
2024-08-16 | CIKMar: A Dual-Encoder Approach to Prompt-Based Reranking in Educational Dialogue Systems | Joanito Agili Lopo et.al. | 2408.08805 | null |
2024-08-16 | A Disease-Specific Foundation Model Using Over 100K Fundus Images: Release and Validation for Abnormality and Multi-Disease Classification on Downstream Tasks | Boa Jang et.al. | 2408.08790 | link |
2024-08-16 | EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics | Chenwei Wan et.al. | 2408.08782 | link |
2024-08-16 | Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Chenming Tang et.al. | 2408.08780 | null |
2024-08-16 | DAC: Decomposed Automation Correction for Text-to-SQL | Dingzirui Wang et.al. | 2408.08779 | link |
2024-08-16 | Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused | Dingwei Chen et.al. | 2408.08769 | null |
2024-08-16 | Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM | Wanting Yang et.al. | 2408.08765 | null |
2024-08-15 | Can Large Language Models Understand Symbolic Graphics Programs? | Zeju Qiu et.al. | 2408.08313 | null |
2024-08-15 | ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li et.al. | 2408.08310 | null |
2024-08-15 | Towards Flexible Visual Relationship Segmentation | Fangrui Zhu et.al. | 2408.08305 | null |
2024-08-15 | Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors | Usman Syed et.al. | 2408.08302 | null |
2024-08-15 | VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps | Senthil Hariharan Arul et.al. | 2408.08301 | null |
2024-08-15 | HELP: Hierarchical Embeddings-based Log Parsing | Andy Xu et.al. | 2408.08300 | null |
2024-08-15 | The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community | Shachar Don-Yehiya et.al. | 2408.08291 | null |
2024-08-15 | Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model | Jin Wang et.al. | 2408.08282 | null |
2024-08-15 | BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts | Qizhen Zhang et.al. | 2408.08274 | null |
2024-08-15 | DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System | Xihong Yang et.al. | 2408.08231 | null |
2024-08-15 | RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Classifiers for Computational Social Science | David Farr et.al. | 2408.08217 | null |
2024-08-15 | Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models | Javier González et.al. | 2408.08210 | null |
2024-08-15 | LLM4DSR: Leveraing Large Language Model for Denoising Sequential Recommendation | Bohao Wang et.al. | 2408.08208 | null |
2024-08-15 | Heavy Labels Out! Dataset Distillation with Label Space Lightening | Ruonan Yu et.al. | 2408.08201 | null |
2024-08-15 | Scaling Up Natural Language Understanding for Multi-Robots Through the Lens of Hierarchy | Shaojun Xu et.al. | 2408.08188 | null |
2024-08-15 | General-purpose Clothes Manipulation with Semantic Keypoints | Yuhong Deng et.al. | 2408.08160 | null |
2024-08-15 | EmBARDiment: an Embodied AI Agent for Productivity in XR | Riccardo Bovo et.al. | 2408.08158 | null |
2024-08-15 | DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search | Huajian Xin et.al. | 2408.08152 | link |
2024-08-15 | P/D-Serve: Serving Disaggregated Large Language Model at Scale | Yibo Jin et.al. | 2408.08147 | null |
2024-08-15 | KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning | Kaiqi Zhang et.al. | 2408.08146 | null |
2024-08-14 | The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models | Karime Maamari et.al. | 2408.07702 | null |
2024-08-15 | Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities | Enneng Yang et.al. | 2408.07666 | link |
2024-08-14 | Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models | Yi-Cheng Lin et.al. | 2408.07665 | link |
2024-08-14 | Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu et.al. | 2408.07663 | link |
2024-08-14 | WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs | Weijian Xie et.al. | 2408.07611 | null |
2024-08-14 | Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey | Hamza Kheddar et.al. | 2408.07583 | null |
2024-08-15 | MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Minxuan Zhou et.al. | 2408.07543 | link |
2024-08-15 | Usefulness of data flow diagrams and large language models for security threat validation: a registered report | Winnie Bahati Mbaka et.al. | 2408.07537 | null |
2024-08-14 | Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments | Seungjun Han et.al. | 2408.07531 | null |
2024-08-14 | Large Language Models Know What Makes Exemplary Contexts | Quanyu Long et.al. | 2408.07505 | null |
2024-08-14 | Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Shizhou Zhang et.al. | 2408.07500 | link |
2024-08-14 | QirK: Question Answering via Intermediate Representation on Knowledge Graphs | Jan Luca Scheerer et.al. | 2408.07494 | null |
2024-08-14 | Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Ning Lu et.al. | 2408.07482 | null |
2024-08-14 | Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization | Yuxin Jiang et.al. | 2408.07471 | link |
2024-08-14 | Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification | Yongcheng Li et.al. | 2408.07467 | link |
2024-08-14 | Large Language Models Prompting With Episodic Memory | Dai Do et.al. | 2408.07465 | null |
2024-08-14 | From Brazilian Portuguese to European Portuguese | João Sanches et.al. | 2408.07457 | null |
2024-08-14 | Fact or Fiction? Improving Fact Verification with Knowledge Graphs through Simplified Subgraph Retrievals | Tobias A. Opsahl et.al. | 2408.07453 | link |
2024-08-15 | BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning | Asif Hanif et.al. | 2408.07440 | link |
2024-08-14 | Beyond Inter-Item Relations: Dynamic Adaptive Mixture-of-Experts for LLM-Based Sequential Recommendation | CanYi Liu et.al. | 2408.07427 | null |
2024-08-13 | Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents | Kexun Zhang et.al. | 2408.07060 | null |
2024-08-13 | LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | Yushi Bai et.al. | 2408.07055 | link |
2024-08-13 | Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models | Chun Jie Chong et.al. | 2408.07004 | null |
2024-08-13 | LLMs can Schedule | Henrik Abgaryan et.al. | 2408.06993 | link |
2024-08-13 | DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs | Dongyuan Li et.al. | 2408.06966 | null |
2024-08-13 | Towards Holistic Disease Risk Prediction using Small Language Models | Liv Björkdahl et.al. | 2408.06943 | null |
2024-08-13 | OpenResearcher: Unleashing AI for Accelerated Scientific Research | Yuxiang Zheng et.al. | 2408.06941 | link |
2024-08-13 | The advantages of context specific language models: the case of the Erasmian Language Model | João Gonçalves et.al. | 2408.06931 | link |
2024-08-13 | Evaluating Cultural Adaptability of a Large Language Model via Simulation of Synthetic Personas | Louis Kwok et.al. | 2408.06929 | link |
2024-08-13 | SceneGPT: A Language Model for 3D Scene Understanding | Shivam Chandhok et.al. | 2408.06926 | null |
2024-08-13 | Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives | Zhihu Wang et.al. | 2408.06904 | null |
2024-08-13 | Leveraging Language Models for Emotion and Behavior Analysis in Education | Kaito Tanaka et.al. | 2408.06874 | null |
2024-08-13 | LoRA $^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models | Jia-Chen Zhang et.al. | 2408.06854 | null |
2024-08-13 | Causal Agent based on Large Language Model | Kairong Han et.al. | 2408.06849 | link |
2024-08-13 | DracoGPT: Extracting Visualization Design Preferences from Large Language Models | Huichen Will Wang et.al. | 2408.06845 | null |
2024-08-13 | How Aligned are Human Chart Takeaways and LLM Predictions? A Case Study on Bar Charts with Varying Layouts | Huichen Will Wang et.al. | 2408.06837 | null |
2024-08-13 | Efficient Search for Customized Activation Functions with Gradient Descent | Lukas Strack et.al. | 2408.06820 | link |
2024-08-13 | MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Yongjin Yang et.al. | 2408.06816 | null |
2024-08-13 | HLSPilot: LLM-based High-Level Synthesis | Chenwei Xiong et.al. | 2408.06810 | link |
2024-08-13 | Layerwise Recurrent Router for Mixture-of-Experts | Zihan Qiu et.al. | 2408.06793 | link |
2024-08-12 | FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Yufei Huang et.al. | 2408.06333 | link |
2024-08-12 | Animate, or Inanimate, That is the Question for Large Language Models | Leonardo Ranaldi et.al. | 2408.06332 | null |
2024-08-12 | Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let’s Take TravelPlanner as an Example | Yanan Chen et.al. | 2408.06318 | null |
2024-08-12 | Long-Form Answers to Visual Questions from Blind and Low Vision People | Mina Huh et.al. | 2408.06303 | null |
2024-08-12 | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Chris Lu et.al. | 2408.06292 | link |
2024-08-12 | MovieSum: An Abstractive Summarization Dataset for Movie Screenplays | Rohit Saxena et.al. | 2408.06281 | link |
2024-08-13 | Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation | Jieyong Kim et.al. | 2408.06276 | null |
2024-08-13 | FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Haoran Sun et.al. | 2408.06273 | link |
2024-08-12 | A RAG-Based Question-Answering Solution for Cyber-Attack Investigation and Attribution | Sampath Rajapaksha et.al. | 2408.06272 | null |
2024-08-12 | Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment | Karel D’Oosterlinck et.al. | 2408.06266 | link |
2024-08-12 | Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning | Yingjin Song et.al. | 2408.06259 | null |
2024-08-12 | On Effects of Steering Latent Representation for Large Language Model Unlearning | Dang Huu-Tien et.al. | 2408.06223 | null |
2024-08-12 | Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Zhenting Qi et.al. | 2408.06195 | link |
2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-12 | Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting | Halley Young et.al. | 2408.06186 | null |
2024-08-12 | OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning | Mushui Liu et.al. | 2408.06158 | link |
2024-08-12 | LipidBERT: A Lipid Language Model Pre-trained on METiS de novo Lipid Library | Tianhao Yu et.al. | 2408.06150 | null |
2024-08-12 | Self-Supervised Learning on MeerKAT Wide-Field Continuum Images | Erica Lastufka et.al. | 2408.06147 | link |
2024-08-12 | Med42-v2: A Suite of Clinical LLMs | Clément Christophe et.al. | 2408.06142 | null |
2024-08-12 | Utilize Transformers for translating Wikipedia category names | Hoang-Thang Ta et.al. | 2408.06124 | null |
2024-08-10 | Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions | Michele Miranda et.al. | 2408.05212 | link |
2024-08-09 | VITA: Towards Open-Source Interactive Omni Multimodal LLM | Chaoyou Fu et.al. | 2408.05211 | link |
2024-08-09 | Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners | Michael Vaccaro Jr et.al. | 2408.05204 | null |
2024-08-09 | TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning | Yujie Feng et.al. | 2408.05200 | link |
2024-08-09 | ECG-FM: An Open Electrocardiogram Foundation Model | Kaden McKeen et.al. | 2408.05178 | link |
2024-08-09 | Weak-Annotation of HAR Datasets using Vision Foundation Models | Marius Bock et.al. | 2408.05169 | link |
2024-08-09 | AttackER: Towards Enhancing Cyber-Attack Attribution with a Named Entity Recognition Dataset | Pritam Deka et.al. | 2408.05149 | null |
2024-08-09 | A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning | Ye Yuan et.al. | 2408.05141 | null |
2024-08-09 | Is ChatGPT a Good Software Librarian? An Exploratory Study on the Use of ChatGPT for Software Library Recommendations | Jasmine Latendresse et.al. | 2408.05128 | null |
2024-08-09 | Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media | Petre Breazu et.al. | 2408.05126 | null |
2024-08-09 | Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video | Chunggi Lee et.al. | 2408.05123 | null |
2024-08-09 | A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? | Xinyu Liu et.al. | 2408.05109 | link |
2024-08-09 | Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection | Xincheng Pang et.al. | 2408.05107 | null |
2024-08-09 | How Well Do LLMs Identify Cultural Unity in Diversity? | Jialin Li et.al. | 2408.05102 | link |
2024-08-09 | Hyperbolic Learning with Multimodal Large Language Models | Paolo Mandica et.al. | 2408.05097 | null |
2024-08-09 | Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts | Tingchen Fu et.al. | 2408.05094 | null |
2024-08-09 | Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models | Zikai Xie et.al. | 2408.05093 | link |
2024-08-09 | Generating novel experimental hypotheses from language models: A case study on cross-dative generalization | Kanishka Misra et.al. | 2408.05086 | link |
2024-08-09 | RT-Surv: Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring of Large-Scale Unstructured Electronic Health Records | Sangjoon Park et.al. | 2408.05074 | null |
2024-08-09 | Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil | Marcelo Sartori Locatelli et.al. | 2408.05035 | null |
2024-08-08 | Better Alignment with Instruction Back-and-Forth Translation | Thao Nguyen et.al. | 2408.04614 | null |
2024-08-08 | Code-switching in text and speech reveals information-theoretic audience design | Debasmita Bhattacharya et.al. | 2408.04596 | null |
2024-08-09 | Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Qirui Jiao et.al. | 2408.04594 | link |
2024-08-08 | Towards Resilient and Efficient LLMs: A Comparative Study of Efficiency, Performance, and Adversarial Robustness | Xiaojing Fan et.al. | 2408.04585 | null |
2024-08-08 | SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Tianrun Chen et.al. | 2408.04579 | null |
2024-08-08 | SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals | Haoran Zheng et.al. | 2408.04575 | null |
2024-08-08 | Learning Fine-Grained Grounded Citations for Attributed Large Language Models | Lei Huang et.al. | 2408.04568 | link |
2024-08-08 | Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models | Yupeng Chang et.al. | 2408.04556 | link |
2024-08-08 | Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Daniele Rege Cambrin et.al. | 2408.04523 | link |
2024-08-08 | Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models | Fabio Pernisi et.al. | 2408.04522 | null |
2024-08-08 | What You Need is What You Get: Theory of Mind for an LLM-Based Code Understanding Assistant | Jonan Richards et.al. | 2408.04477 | null |
2024-08-08 | Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate | Yiqun Zhang et.al. | 2408.04472 | link |
2024-08-08 | RiskAwareBench: Towards Evaluating Physical Risk Awareness for High-level Planning of LLM-based Embodied Agents | Zihao Zhu et.al. | 2408.04449 | link |
2024-08-08 | Large Language Models for cross-language code clone detection | Micheline Bénédicte Moumoula et.al. | 2408.04430 | null |
2024-08-08 | Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models | Philipp Müller et.al. | 2408.04420 | null |
2024-08-08 | Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning | Seong-Il Park et.al. | 2408.04414 | null |
2024-08-08 | Deeploy: Enabling Energy-Efficient Deployment of Small Language Models On Heterogeneous Microcontrollers | Moritz Scherer et.al. | 2408.04413 | null |
2024-08-08 | Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset | Kentaro Ozeki et.al. | 2408.04403 | link |
2024-08-08 | Automated Educational Question Generation at Different Bloom’s Skill Levels using Large Language Models: Strategies and Evaluation | Nicy Scaria et.al. | 2408.04394 | link |
2024-08-08 | Open-domain Implicit Format Control for Large Language Model Generation | Yiqun Yao et.al. | 2408.04392 | link |
2024-08-07 | How Well Can Vision Language Models See Image Details? | Chenhui Gou et.al. | 2408.03940 | null |
2024-08-07 | SLIM-RAFT: A Novel Fine-Tuning Approach to Improve Cross-Linguistic Performance for Mercosur Common Nomenclature | Vinícius Di Oliveira et.al. | 2408.03936 | null |
2024-08-07 | CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases | Xiangyan Liu et.al. | 2408.03910 | link |
2024-08-07 | Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models | Shachi H Kumar et.al. | 2408.03907 | null |
2024-08-07 | Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Beomseok Lee et.al. | 2408.03900 | link |
2024-08-07 | Simplifying Scholarly Abstracts for Accessible Digital Libraries | Haining Wang et.al. | 2408.03899 | link |
2024-08-07 | From Data to Story: Towards Automatic Animated Data Video Creation with LLM-based Multi-Agent Systems | Leixian Shen et.al. | 2408.03876 | null |
2024-08-07 | PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | Haoran Xu et.al. | 2408.03865 | null |
2024-08-07 | GAIA – A Large Language Model for Advanced Power Dispatch | Yuheng Cheng et.al. | 2408.03847 | null |
2024-08-07 | MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models | Yuchen Dong et.al. | 2408.03841 | null |
2024-08-07 | WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models | Prannaya Gupta et.al. | 2408.03837 | link |
2024-08-07 | Target Prompting for Information Extraction with Vision Language Model | Dipankar Medhi et.al. | 2408.03834 | null |
2024-08-07 | Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning | Simret Araya Gebreegziabher et.al. | 2408.03819 | null |
2024-08-07 | Generative Language Models with Retrieval Augmented Generation for Automated Short Answer Scoring | Zifan Wang et.al. | 2408.03811 | null |
2024-08-07 | ‘Finance Wizard’ at the FinLLM Challenge Task: Financial Text Summarization | Meisin Lee et.al. | 2408.03762 | null |
2024-08-07 | MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video | Xiaoqing Guo et.al. | 2408.03761 | null |
2024-08-07 | Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation | Jingjing Xie et.al. | 2408.03735 | link |
2024-08-07 | Question Rephrasing for Quantifying Uncertainty in Large Language Models: Applications in Molecular Chemistry Tasks | Zizhang Chen et.al. | 2408.03732 | null |
2024-08-07 | A Convex-optimization-based Layer-wise Post-training Pruner for Large Language Models | Pengxiang Zhao et.al. | 2408.03728 | null |
2024-08-07 | Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction | Benjamin Matthias Ruppik et.al. | 2408.03706 | null |
2024-08-06 | CoverBench: A Challenging Benchmark for Complex Claim Verification | Alon Jacovi et.al. | 2408.03325 | null |
2024-08-06 | Segment Anything in Medical Images and Videos: Benchmark and Deployment | Jun Ma et.al. | 2408.03322 | link |
2024-08-06 | TextIM: Part-aware Interactive Motion Synthesis from Text | Siyuan Fan et.al. | 2408.03302 | null |
2024-08-06 | KaPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models | Ruizhe Zhang et.al. | 2408.03297 | null |
2024-08-06 | Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Zhiling Yan et.al. | 2408.03286 | link |
2024-08-07 | StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation | Boxi Cao et.al. | 2408.03281 | link |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | Synthesizing Text-to-SQL Data from Weak and Strong LLMs | Jiaxi Yang et.al. | 2408.03256 | null |
2024-08-06 | Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang et.al. | 2408.03247 | link |
2024-08-06 | Making Long-Context Language Models Better Multi-Hop Reasoners | Yanyang Li et.al. | 2408.03246 | link |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | Conditioning LLMs with Emotion in Neural Machine Translation | Charles Brazier et.al. | 2408.03150 | null |
2024-08-06 | Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization | Yanghai Zhang et.al. | 2408.03149 | link |
2024-08-06 | Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations | Leo Donisch et.al. | 2408.03130 | null |
2024-08-06 | Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation | Artur Guimarães et.al. | 2408.03127 | link |
2024-08-06 | Evaluating the Translation Performance of Large Language Models Based on Euas-20 | Yan Huang et.al. | 2408.03119 | null |
2024-08-06 | Topic Modeling with Fine-tuning LLMs and Bag of Sentences | Johannes Schneider et.al. | 2408.03099 | link |
2024-08-07 | TestART: Improving LLM-based Unit Test via Co-evolution of Automated Generation and Repair Iteration | Siqi Gu et.al. | 2408.03095 | null |
2024-08-06 | 500xCompressor: Generalized Prompt Compression for Large Language Models | Zongqian Li et.al. | 2408.03094 | link |
2024-08-06 | Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement | Le Yu et.al. | 2408.03092 | link |
2024-08-05 | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Dongyang Liu et.al. | 2408.02657 | link |
2024-08-05 | Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models? | Mohammad Bahrami Karkevandi et.al. | 2408.02651 | null |
2024-08-05 | Command-line Obfuscation Detection using Small Language Models | Vojtech Outrata et.al. | 2408.02637 | null |
2024-08-05 | SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models | Muxi Diao et.al. | 2408.02632 | null |
2024-08-05 | Language Model Can Listen While Speaking | Ziyang Ma et.al. | 2408.02622 | null |
2024-08-05 | Progressively Selective Label Enhancement for Language Model Alignment | Biao Liu et.al. | 2408.02599 | null |
2024-08-05 | Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection | Sajal Aggarwal et.al. | 2408.02595 | null |
2024-08-05 | Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization | Ankan Mullick et.al. | 2408.02584 | null |
2024-08-05 | DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions | Siying Hu et.al. | 2408.02574 | null |
2024-08-05 | Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information | Yauwai Yim et.al. | 2408.02559 | null |
2024-08-05 | Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning | Hao Zhou et.al. | 2408.02549 | null |
2024-08-05 | RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation | Daniel Fleischer et.al. | 2408.02545 | link |
2024-08-05 | Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Xinbei Ma et.al. | 2408.02544 | link |
2024-08-05 | Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph | Zhao Kaichen et.al. | 2408.02535 | null |
2024-08-05 | Practical Attacks against Black-box Code Completion Engines | Slobodan Jenko et.al. | 2408.02509 | null |
2024-08-05 | UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Zhaowei Li et.al. | 2408.02503 | link |
2024-08-05 | Context Conquers Parameters: Outperforming Proprietary LLM in Commit Message Generation | Aaron Imani et.al. | 2408.02502 | null |
2024-08-05 | A First Look at License Compliance Capability of LLMs in Code Generation | Weiwei Xu et.al. | 2408.02487 | link |
2024-08-05 | Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection | Ting Lei et.al. | 2408.02484 | link |
2024-08-05 | From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Haolin Jin et.al. | 2408.02479 | null |
2024-08-02 | Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting | Xiangyu Zhao et.al. | 2408.01423 | null |
2024-08-02 | Mission Impossible: A Statistical Perspective on Jailbreaking LLMs | Jingtong Su et.al. | 2408.01420 | null |
2024-08-02 | DebateQA: Evaluating Question Answering on Debatable Knowledge | Rongwu Xu et.al. | 2408.01419 | link |
2024-08-02 | Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs | Yilun Hua et.al. | 2408.01417 | null |
2024-08-02 | Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer | Yu Yang et.al. | 2408.01402 | null |
2024-08-02 | Coalitions of Large Language Models Increase the Robustness of AI Agents | Prattyush Mangal et.al. | 2408.01380 | null |
2024-08-02 | Toward Automatic Relevance Judgment using Vision–Language Models for Image–Text Retrieval Evaluation | Jheng-Hong Yang et.al. | 2408.01363 | null |
2024-08-02 | Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Peng Ding et.al. | 2408.01355 | link |
2024-08-02 | MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code | Kaiwen Ning et.al. | 2408.01354 | link |
2024-08-02 | Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks | Anders Giovanni Møller et.al. | 2408.01346 | null |
2024-08-02 | MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models | Benno Weck et.al. | 2408.01337 | link |
2024-08-02 | A Backbone for Long-Horizon Robot Task Understanding | Xiaoshuai Chen et.al. | 2408.01334 | null |
2024-08-02 | FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only | He Zhu et.al. | 2408.01323 | null |
2024-08-02 | A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks | Jiaqi Wang et.al. | 2408.01319 | null |
2024-08-02 | Reconsidering Token Embeddings with the Definitions for Pre-trained Language Models | Ying Zhang et.al. | 2408.01308 | null |
2024-08-02 | The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | Hannah Chen et.al. | 2408.01285 | null |
2024-08-02 | RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Kunlun Zhu et.al. | 2408.01262 | link |
2024-08-02 | The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models | Simone Caldarella et.al. | 2408.01228 | null |
2024-08-02 | High-Throughput Phenotyping of Clinical Text Using Large Language Models | Daniel B. Hier et.al. | 2408.01214 | null |
2024-08-02 | Misinforming LLMs: vulnerabilities, challenges and opportunities | Bo Zhou et.al. | 2408.01168 | null |
2024-08-01 | AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Mengkang Hu et.al. | 2408.00764 | null |
2024-08-01 | UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Xiangyu Fan et.al. | 2408.00762 | null |
2024-08-01 | Tamper-Resistant Safeguards for Open-Weight LLMs | Rishub Tamirisa et.al. | 2408.00761 | link |
2024-08-01 | Thermal Conductivity Predictions with Foundation Atomistic Models | Balázs Póta et.al. | 2408.00755 | link |
2024-08-01 | Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model | Benlin Liu et.al. | 2408.00754 | null |
2024-08-01 | Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao et.al. | 2408.00744 | link |
2024-08-01 | DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency | Jovan Stojkovic et.al. | 2408.00741 | null |
2024-08-01 | Virchow 2: Scaling Self-Supervised Mixed Magnification Models in Pathology | Eric Zimmermann et.al. | 2408.00738 | null |
2024-08-01 | Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions | Guangzhi Xiong et.al. | 2408.00727 | link |
2024-08-01 | An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models | Yangzhen Wu et.al. | 2408.00724 | null |
2024-08-01 | Pathway to Secure and Trustworthy 6G for LLMs: Attacks, Defense, and Opportunities | Sunder Ali Khowaja et.al. | 2408.00722 | null |
2024-08-01 | SAM 2: Segment Anything in Images and Videos | Nikhila Ravi et.al. | 2408.00714 | link |
2024-08-01 | Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM | Xiaofeng Liu et.al. | 2408.00706 | null |
2024-08-02 | Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning | Trapoom Ukarapol et.al. | 2408.00690 | link |
2024-08-01 | Can Developers Prompt? A Controlled Experiment for Code Documentation Generation | Hans-Alexander Kruse et.al. | 2408.00686 | null |
2024-08-01 | ExpertAF: Expert Actionable Feedback from Video | Kumar Ashutosh et.al. | 2408.00672 | null |
2024-08-01 | AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models | Daqin Luo et.al. | 2408.00665 | link |
2024-08-01 | Disentangling Dense Embeddings with Sparse Autoencoders | Charles O’Neill et.al. | 2408.00657 | null |
2024-08-02 | SentenceVAE: Faster, Longer and More Accurate Inference with Next-sentence Prediction for Large Language Models | Hongjun An et.al. | 2408.00655 | link |
2024-08-01 | Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning | Xuri Ge et.al. | 2408.00644 | null |
2024-07-31 | Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey | Atsuyuki Miyai et.al. | 2407.21794 | null |
2024-07-31 | Vision-Language Model Based Handwriting Verification | Mihir Chauhan et.al. | 2407.21788 | null |
2024-07-31 | Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Bradley Brown et.al. | 2407.21787 | null |
2024-07-31 | The Llama 3 Herd of Models | Abhimanyu Dubey et.al. | 2407.21783 | null |
2024-07-31 | Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Shi Liu et.al. | 2407.21771 | null |
2024-07-31 | MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts | Xi Victoria Lin et.al. | 2407.21770 | null |
2024-07-31 | ReplanVLM: Replanning Robotic Tasks with Visual Language Models | Aoran Mei et.al. | 2407.21762 | null |
2024-07-31 | Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin et.al. | 2407.21757 | link |
2024-07-31 | A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation | Mothilal Asokan et.al. | 2407.21739 | null |
2024-07-31 | Open-Vocabulary Audio-Visual Semantic Segmentation | Ruohao Guo et.al. | 2407.21721 | null |
2024-07-31 | Adaptive Retrieval-Augmented Generation for Conversational Systems | Xi Wang et.al. | 2407.21712 | null |
2024-07-31 | CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature | Stefan Langer et.al. | 2407.21708 | null |
2024-07-31 | TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang et.al. | 2407.21693 | link |
2024-07-31 | Synth-Empathy: Towards High-Quality Synthetic Empathy Data | Hao Liang et.al. | 2407.21669 | link |
2024-08-01 | Defending Jailbreak Attack in VLMs via Cross-modality Information Detector | Yue Xu et.al. | 2407.21659 | link |
2024-07-31 | MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Anurag Das et.al. | 2407.21654 | null |
2024-07-31 | Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo et.al. | 2407.21633 | link |
2024-07-31 | TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods | Gabriel Loiseau et.al. | 2407.21630 | link |
2024-07-31 | LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows | Lukas Teufelberger et.al. | 2407.21593 | null |
2024-07-31 | A Performance Study of LLM-Generated Code on Leetcode | Tristan Coignion et.al. | 2407.21579 | null |
2024-07-30 | ThinK: Thinner Key Cache by Query-Driven Pruning | Yuhui Xu et.al. | 2407.21018 | null |
2024-07-30 | CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Yuexi Du et.al. | 2407.21011 | link |
2024-07-30 | GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models | Ali Abdollahi et.al. | 2407.21001 | link |
2024-07-31 | MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning | Yupeng Chen et.al. | 2407.20999 | null |
2024-07-30 | From Feature Importance to Natural Language Explanations Using LLMs with RAG | Sule Tekkesinoglu et.al. | 2407.20990 | link |
2024-07-30 | Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks | Alakesh Kalita et.al. | 2407.20970 | null |
2024-07-30 | MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions | Xiaowei Chi et.al. | 2407.20962 | link |
2024-07-30 | UniProcessor: A Text-induced Unified Low-level Image Processor | Huiyu Duan et.al. | 2407.20928 | link |
2024-07-30 | SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition | Hao Tan et.al. | 2407.20920 | null |
2024-07-30 | Automated Review Generation Method Based on Large Language Models | Shican Wu et.al. | 2407.20906 | link |
2024-07-30 | Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Adam Wojciechowski et.al. | 2407.20899 | link |
2024-07-30 | ThinkRepair: Self-Directed Automated Program Repair | Xin Yin et.al. | 2407.20898 | link |
2024-07-30 | Effective Black Box Testing of Sentiment Analysis Classification Networks | Parsa Karbasizadeh et.al. | 2407.20884 | null |
2024-07-30 | Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification | Boyang Zhang et.al. | 2407.20859 | null |
2024-07-30 | Learn by Selling: Equipping Large Language Models with Product Knowledge for Context-Driven Recommendations | Sarthak Anand et.al. | 2407.20856 | null |
2024-07-30 | Large Language Model (LLM)-enabled Graphs in Dynamic Networking | Geng Sun et.al. | 2407.20840 | null |
2024-07-30 | How to Measure the Intelligence of Large Language Models? | Nils Körber et.al. | 2407.20828 | null |
2024-07-30 | Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Norman Di Palo et.al. | 2407.20798 | null |
2024-07-30 | Interpretable Pre-Trained Transformers for Heart Time-Series Data | Harry J. Davies et.al. | 2407.20775 | link |
2024-07-30 | OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance | Yongqiang Yao et.al. | 2407.20761 | link |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | null |
2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229 | null |
2024-07-29 | FlexAttention for Efficient High-Resolution Vision-Language Models | Junyan Li et.al. | 2407.20228 | null |
2024-07-29 | Can Editing LLMs Inject Harm? | Canyu Chen et.al. | 2407.20224 | null |
2024-07-29 | SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction | Çağhan Köksal et.al. | 2407.20214 | null |
2024-07-29 | QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval | Hongming Tan et.al. | 2407.20207 | null |
2024-07-29 | MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Zehui Chen et.al. | 2407.20183 | link |
2024-07-29 | Theia: Distilling Diverse Vision Foundation Models for Robot Learning | Jinghuan Shang et.al. | 2407.20179 | link |
2024-07-29 | AutoScale: Automatic Prediction of Compute-optimal Data Composition for Training LLMs | Feiyang Kang et.al. | 2407.20177 | link |
2024-07-29 | Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning | Xingchen Zeng et.al. | 2407.20174 | link |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
2024-07-29 | Language-Conditioned Offline RL for Multi-Robot Navigation | Steven Morad et.al. | 2407.20164 | null |
2024-07-29 | rLLM: Relational Table Learning with LLMs | Weichen Li et.al. | 2407.20157 | link |
2024-07-29 | ByteCheckpoint: A Unified Checkpointing System for LLM Development | Borui Wan et.al. | 2407.20143 | null |
2024-07-29 | Strong Copyright Protection for Language Models via Adaptive Model Fusion | Javier Abad et.al. | 2407.20105 | null |
2024-07-29 | Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models | Zhe Li et.al. | 2407.20053 | null |
2024-07-29 | Exploring Large Language Models to generate Easy to Read content | Paloma Martínez et.al. | 2407.20046 | null |
2024-07-29 | MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Walid Bousselham et.al. | 2407.20034 | null |
2024-07-29 | Efficient Training of Large Language Models on Distributed Infrastructures: A Survey | Jiangfei Duan et.al. | 2407.20018 | null |
2024-07-29 | Rosetta Statements: Lowering the Barrier for Semantic Parsing and Increasing the Cognitive Interoperability of Knowledge Graphs | Lars Vogt et.al. | 2407.20007 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-26 | A Flexible and Scalable Approach for Collecting Wildlife Advertisements on the Web | Juliana Barbosa et.al. | 2407.18898 | link |
2024-07-26 | Small Molecule Optimization with Large Language Models | Philipp Guevorguian et.al. | 2407.18897 | link |
2024-07-26 | Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models | Mutahar Safdar et.al. | 2407.18827 | null |
2024-07-26 | Automatic Detection of Moral Values in Music Lyrics | Vjosa Preniqi et.al. | 2407.18787 | link |
2024-07-26 | The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs | Aleix Sant et.al. | 2407.18786 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-07-26 | TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals | Kevin Kliimask et.al. | 2407.18764 | null |
2024-07-26 | Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery | Yuni Susanti et.al. | 2407.18752 | link |
2024-07-26 | Towards Effective and Efficient Continual Pre-training of Large Language Models | Jie Chen et.al. | 2407.18743 | null |
2024-07-26 | Towards Generalized Offensive Language Identification | Alphaeus Dmonte et.al. | 2407.18738 | null |
2024-07-26 | LLASP: Fine-tuning Large Language Models for Answer Set Programming | Erica Coppolillo et.al. | 2407.18723 | null |
2024-07-26 | Neurosymbolic AI for Enhancing Instructability in Generative AI | Amit Sheth et.al. | 2407.18722 | null |
2024-07-26 | Cluster-norm for Unsupervised Probing of Knowledge | Walter Laurito et.al. | 2407.18712 | link |
2024-07-26 | Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Esteban Garces Arias et.al. | 2407.18698 | link |
2024-07-26 | Collaborative Evolving Strategy for Automatic Data-Centric Development | Xu Yang et.al. | 2407.18690 | null |
2024-07-26 | The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European Languages | Alexandre Puttick et.al. | 2407.18689 | link |
2024-07-26 | Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift | Seongho Son et.al. | 2407.18676 | null |
2024-07-26 | Every Part Matters: Integrity Verification of Scientific Figures Based on Multimodal Large Language Models | Xiang Shi et.al. | 2407.18626 | link |
2024-07-25 | Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang et.al. | 2407.18248 | link |
2024-07-25 | LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Zhengbo Wang et.al. | 2407.18242 | link |
2024-07-26 | Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Yuxiao Qu et.al. | 2407.18219 | null |
2024-07-26 | Exploring Scaling Trends in LLM Robustness | Nikolaus Howe et.al. | 2407.18213 | null |
2024-07-25 | AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction | Chunan Liu et.al. | 2407.18184 | link |
2024-07-25 | Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning | Sindhura Kommu et.al. | 2407.18181 | null |
2024-07-25 | Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Sanae Lotfi et.al. | 2407.18158 | null |
2024-07-25 | $\mathbb{X}$ -Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs | Vlad Sobal et.al. | 2407.18134 | null |
2024-07-26 | Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Fakhraddin Alwajih et.al. | 2407.18129 | null |
2024-07-25 | Efficient Inference of Vision Instruction-Following Models with Elastic Cache | Zuyan Liu et.al. | 2407.18121 | link |
2024-07-25 | Multi-Resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping | Jack Breen et.al. | 2407.18105 | link |
2024-07-25 | Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow | Tian Guo et.al. | 2407.18103 | null |
2024-07-25 | PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization | Christopher Clarke et.al. | 2407.18078 | link |
2024-07-25 | C2P: Featuring Large Language Models with Causal Reasoning | Abdolmahdi Bagheri et.al. | 2407.18069 | null |
2024-07-25 | ComPeer: A Generative Conversational Agent for Proactive Peer Support | Tianjian Liu et.al. | 2407.18064 | link |
2024-07-25 | Audio Entailment: Assessing Deductive Reasoning for Audio Understanding | Soham Deshmukh et.al. | 2407.18062 | link |
2024-07-25 | Difficulty Estimation and Simplification of French Text Using LLMs | Henri Jamet et.al. | 2407.18061 | null |
2024-07-25 | The Geometry of Queries: Query-Based Innovations in Retrieval-Augmented Generation | Eric Yang et.al. | 2407.18044 | null |
2024-07-25 | RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models | Haoyu Chen et.al. | 2407.18035 | null |
2024-07-25 | GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy | Jan Batzner et.al. | 2407.18008 | null |
2024-07-24 | I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao et.al. | 2407.17469 | link |
2024-07-24 | WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Wenting Zhao et.al. | 2407.17468 | null |
2024-07-24 | CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu et.al. | 2407.17467 | null |
2024-07-24 | $VILA^2$ : VILA Augmented VILA | Yunhao Fang et.al. | 2407.17453 | null |
2024-07-24 | Fluent Student-Teacher Redteaming | T. Ben Thompson et.al. | 2407.17447 | link |
2024-07-24 | Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? | Michael-Andrei Panaitescu-Liess et.al. | 2407.17417 | null |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-24 | Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Yida Zhao et.al. | 2407.17406 | link |
2024-07-24 | Grammar-based Game Description Generation using Large Language Models | Tsunehiko Tanaka et.al. | 2407.17404 | null |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | PERSONA: A Reproducible Testbed for Pluralistic Alignment | Louis Castricato et.al. | 2407.17387 | null |
2024-07-24 | A Comprehensive Approach to Misspelling Correction with BERT and Levenshtein Distance | Amirreza Naziri et.al. | 2407.17383 | null |
2024-07-24 | MMRA: A Benchmark for Multi-granularity Multi-image Relational Association | Siwei Wu et.al. | 2407.17379 | link |
2024-07-24 | ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi et.al. | 2407.17365 | null |
2024-07-24 | Gradient-based inference of abstract task representations for generalization in neural networks | Ali Hummos et.al. | 2407.17356 | null |
2024-07-24 | Scalify: scale propagation for efficient low-precision LLM training | Paul Balança et.al. | 2407.17353 | link |
2024-07-24 | Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching | Yuyang Ding et.al. | 2407.17349 | link |
2024-07-24 | DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation | Qian Feng et.al. | 2407.17348 | null |
2024-07-24 | Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition | Ke Bao et.al. | 2407.17344 | null |
2024-07-24 | How Good (Or Bad) Are LLMs at Detecting Misleading Visualizations? | Leo Yu-Ho Lo et.al. | 2407.17291 | null |
2024-07-23 | PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Junyi Li et.al. | 2407.16696 | link |
2024-07-23 | Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack | Xiaoyue Xu et.al. | 2407.16695 | link |
2024-07-23 | Can Large Language Models Automatically Jailbreak GPT-4V? | Yuanwei Wu et.al. | 2407.16686 | null |
2024-07-23 | SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation | Pengfei Chen et.al. | 2407.16682 | null |
2024-07-23 | RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent | Huiyu Xu et.al. | 2407.16667 | null |
2024-07-23 | Course-Correction: Safety Alignment Using Synthetic Preferences | Rongwu Xu et.al. | 2407.16637 | link |
2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | null |
2024-07-23 | Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? | Jonathan Hayase et.al. | 2407.16607 | link |
2024-07-23 | Shared Imagination: LLMs Hallucinate Alike | Yilun Zhou et.al. | 2407.16604 | null |
2024-07-23 | A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions | Giorgos Lysandrou et.al. | 2407.16593 | null |
2024-07-23 | Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs | Yifan Xia et.al. | 2407.16576 | null |
2024-07-23 | TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback | Eunseop Yoon et.al. | 2407.16574 | null |
2024-07-23 | Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Ioana Buhnila et.al. | 2407.16565 | link |
2024-07-23 | Patched RTC: evaluating LLMs for diverse software development tasks | Asankhaya Sharma et.al. | 2407.16557 | link |
2024-07-24 | MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues | Liyun Zhang et.al. | 2407.16552 | null |
2024-07-23 | Quantifying the Role of Textual Predictability in Automatic Speech Recognition | Sean Robertson et.al. | 2407.16537 | null |
2024-07-23 | Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models | Aristeidis Panos et.al. | 2407.16526 | null |
2024-07-24 | AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game | Yizhou Chi et.al. | 2407.16521 | null |
2024-07-23 | Language-Based Security for Low-Level MPC | Christian Skalka et.al. | 2407.16504 | null |
2024-07-23 | Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Kenza Benkirane et.al. | 2407.16470 | link |
2024-07-22 | AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Junyu Xie et.al. | 2407.15850 | link |
2024-07-22 | LLMmap: Fingerprinting For Large Language Models | Dario Pasquini et.al. | 2407.15847 | link |
2024-07-22 | SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Mingze Xu et.al. | 2407.15841 | link |
2024-07-22 | MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Yangzhou Liu et.al. | 2407.15838 | link |
2024-07-22 | dMel: Speech Tokenization made Simple | He Bai et.al. | 2407.15835 | null |
2024-07-22 | J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling | Wataru Nakata et.al. | 2407.15828 | null |
2024-07-22 | Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight | Ziyuan Huang et.al. | 2407.15819 | null |
2024-07-22 | Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belem et.al. | 2407.15814 | link |
2024-07-22 | AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Yunkang Cao et.al. | 2407.15795 | link |
2024-07-22 | CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning | Emanuele Frascaroli et.al. | 2407.15793 | link |
2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | null |
2024-07-22 | Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels | Zhuorui Ye et.al. | 2407.15786 | null |
2024-07-22 | Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning | Kaiwen Wang et.al. | 2407.15762 | null |
2024-07-22 | MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation | Marco Simoni et.al. | 2407.15748 | null |
2024-07-22 | OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context | Steffen Kleinle et.al. | 2407.15736 | null |
2024-07-22 | TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON | John Chong Min Tan et.al. | 2407.15734 | link |
2024-07-22 | Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders | Laura Niss et.al. | 2407.15731 | null |
2024-07-22 | SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection | Dimitrios Kollias et.al. | 2407.15728 | null |
2024-07-22 | DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Zhi Hao Luo et.al. | 2407.15723 | link |
2024-07-22 | Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability | Zhuoyan Xu et.al. | 2407.15720 | link |
2024-07-19 | Internal Consistency and Self-Feedback in Large Language Models: A Survey | Xun Liang et.al. | 2407.14507 | link |
2024-07-19 | On Pre-training of Multimodal Language Models Customized for Chart Understanding | Wan-Cyuan Fan et.al. | 2407.14506 | null |
2024-07-19 | PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding | Chenshu Hou et.al. | 2407.14491 | null |
2024-07-19 | Evaluating the Reliability of Self-Explanations in Large Language Models | Korbinian Randl et.al. | 2407.14487 | link |
2024-07-19 | Data-Centric Human Preference Optimization with Rationales | Hoang Anh Just et.al. | 2407.14477 | link |
2024-07-19 | Contrastive Learning with Counterfactual Explanations for Radiology Report Generation | Mingjie Li et.al. | 2407.14474 | null |
2024-07-19 | Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira et.al. | 2407.14467 | null |
2024-07-19 | Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier | Zachary Wojtowicz et.al. | 2407.14452 | null |
2024-07-19 | Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding | Renshan Zhang et.al. | 2407.14439 | link |
2024-07-19 | Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders | Senthooran Rajamanoharan et.al. | 2407.14435 | null |
2024-07-19 | Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | HamidReza Imani et.al. | 2407.14417 | null |
2024-07-19 | System-1.x: Learning to Balance Fast and Slow Planning with Language Models | Swarnadeep Saha et.al. | 2407.14414 | link |
2024-07-19 | DEAL: Disentangle and Localize Concept-level Explanations for VLMs | Tang Li et.al. | 2407.14412 | link |
2024-07-19 | The Vision of Autonomic Computing: Can LLMs Make It a Reality? | Zhiyang Zhang et.al. | 2407.14402 | null |
2024-07-19 | Frontiers of Deep Learning: From Novel Application to Real-World Deployment | Rui Xie et.al. | 2407.14386 | null |
2024-07-19 | Open Artificial Knowledge | Vadim Borisov et.al. | 2407.14371 | null |
2024-07-19 | Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models | Xuenan Xu et.al. | 2407.14355 | link |
2024-07-19 | Improving Retrieval in Sponsored Search by Leveraging Query Context Signals | Akash Kumar Mohankumar et.al. | 2407.14346 | null |
2024-07-19 | LLMs left, right, and center: Assessing GPT’s capabilities to label political bias from web domains | Raphael Hernandes et.al. | 2407.14344 | null |
2024-07-19 | Multimodal Misinformation Detection using Large Vision-Language Models | Sahar Tahmasebi et.al. | 2407.14321 | null |
2024-07-18 | Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data | Charles Jin et.al. | 2407.13765 | null |
2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | null |
2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | null |
2024-07-18 | CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications | Mirza Masfiqur Rahman et.al. | 2407.13742 | null |
2024-07-18 | Baba Is AI: Break the Rules to Beat the Benchmark | Nathan Cloos et.al. | 2407.13729 | null |
2024-07-18 | CoDefeater: Using LLMs To Find Defeaters in Assurance Cases | Usman Gohar et.al. | 2407.13717 | link |
2024-07-18 | Understanding Reference Policies in Direct Preference Optimization | Yixin Liu et.al. | 2407.13709 | link |
2024-07-18 | A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice | Shaina Raza et.al. | 2407.13699 | null |
2024-07-18 | Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation | Yotam Perlitz et.al. | 2407.13696 | link |
2024-07-18 | Prover-Verifier Games improve legibility of LLM outputs | Jan Hendrik Kirchner et.al. | 2407.13692 | null |
2024-07-18 | Shaded Route Planning Using Active Segmentation and Identification of Satellite Images | Longchao Da et.al. | 2407.13689 | null |
2024-07-18 | FuLG: 150B Romanian Corpus for Language Model Pretraining | Vlad-Andrei Bădoiu et.al. | 2407.13657 | null |
2024-07-18 | COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization | Skyler Grandel et.al. | 2407.13648 | null |
2024-07-18 | Weak-to-Strong Reasoning | Yuqing Yang et.al. | 2407.13647 | link |
2024-07-18 | Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies | Chaofan Tao et.al. | 2407.13623 | link |
2024-07-18 | KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration | Youfu Yan et.al. | 2407.13598 | null |
2024-07-18 | PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks | Vishal Pallagani et.al. | 2407.13597 | null |
2024-07-18 | EarthMarker: A Visual Prompt Learning Framework for Region-level and Point-level Remote Sensing Imagery Comprehension | Wei Zhang et.al. | 2407.13596 | link |
2024-07-18 | Robust Calibration of Large Vision-Language Adapters | Balamurali Murugesan et.al. | 2407.13588 | link |
2024-07-18 | Towards Zero-Shot Multimodal Machine Translation | Matthieu Futeral et.al. | 2407.13579 | link |
2024-07-17 | LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | Kaichen Zhang et.al. | 2407.12772 | link |
2024-07-17 | EchoSight: Advancing Visual-Language Models with Wiki Knowledge | Yibin Yan et.al. | 2407.12735 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-17 | Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? | Ben Yao et.al. | 2407.12725 | null |
2024-07-17 | The Future of Learning: Large Language Models through the Lens of Students | He Zhang et.al. | 2407.12723 | null |
2024-07-17 | MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models | Leyang Shen et.al. | 2407.12709 | link |
2024-07-17 | Subgraph-Aware Training of Text-based Methods for Knowledge Graph Completion | Youmin Ko et.al. | 2407.12703 | null |
2024-07-17 | Patch-Level Training for Large Language Models | Chenze Shao et.al. | 2407.12665 | link |
2024-07-17 | Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon et.al. | 2407.12642 | null |
2024-07-17 | Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? | Aman Sinha et.al. | 2407.12626 | null |
2024-07-17 | Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences | Claudio Pinhanez et.al. | 2407.12620 | null |
2024-07-17 | AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism | William Brannon et.al. | 2407.12613 | link |
2024-07-17 | VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding | Ofir Abramovich et.al. | 2407.12594 | null |
2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
2024-07-17 | E5-V: Universal Embeddings with Multimodal Large Language Models | Ting Jiang et.al. | 2407.12580 | link |
2024-07-17 | Audio Conditioning for Music Generation via Discrete Bottleneck Features | Simon Rouard et.al. | 2407.12563 | null |
2024-07-17 | Conspiracy theories and where to find them on TikTok | Francesco Corso et.al. | 2407.12545 | null |
2024-07-17 | Abstraction Alignment: Comparing Model and Human Conceptual Relationships | Angie Boggust et.al. | 2407.12543 | link |
2024-07-17 | Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models | Xihe Qiu et.al. | 2407.12532 | null |
2024-07-17 | Crafting the Path: Robust Query Rewriting for Information Retrieval | Ingeol Baek et.al. | 2407.12529 | null |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-16 | NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Mo Li et.al. | 2407.11963 | link |
2024-07-16 | Code Documentation and Analysis to Secure Software Development | Paul Attie et.al. | 2407.11934 | null |
2024-07-16 | What’s Wrong? Refining Meeting Summaries with LLM Feedback | Frederic Kirstein et.al. | 2407.11919 | null |
2024-07-16 | GraphFM: A Scalable Framework for Multi-Graph Pretraining | Divyansha Lachi et.al. | 2407.11907 | null |
2024-07-16 | Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads | Aritra Dhar et.al. | 2407.11888 | null |
2024-07-16 | Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche et.al. | 2407.11854 | null |
2024-07-16 | Schema Matching with Large Language Models: an Experimental Study | Marcel Parciak et.al. | 2407.11852 | link |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text | Kyle Hamilton et.al. | 2407.11827 | null |
2024-07-16 | PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler et.al. | 2407.11798 | null |
2024-07-16 | Large Language Models as Misleading Assistants in Conversation | Betty Li Hou et.al. | 2407.11789 | null |
2024-07-16 | SwitchCIT: Switching for Continual Instruction Tuning of Large Language Models | Xinbo Wu et.al. | 2407.11780 | null |
2024-07-16 | Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text | Seyedeh Fatemeh Ebrahimi et.al. | 2407.11774 | null |
2024-07-16 | Educational Personalized Learning Path Planning with Large Language Models | Chee Ng et.al. | 2407.11773 | null |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-16 | Robust Utility-Preserving Text Anonymization Based on Large Language Models | Tianyu Yang et.al. | 2407.11770 | link |
2024-07-16 | Vectoring Languages | Joseph Chen et.al. | 2407.11766 | null |
2024-07-16 | Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Kamran Chitsaz et.al. | 2407.11722 | link |
2024-07-17 | Harnessing Large Language Models for Multimodal Product Bundling | Xiaohao Liu et.al. | 2407.11712 | null |
2024-07-15 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation | Bocheng Zou et.al. | 2407.10972 | link |
2024-07-15 | Q-Sparse: All Large Language Models can be Fully Sparsely-Activated | Hongyu Wang et.al. | 2407.10969 | null |
2024-07-15 | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Han Guo et.al. | 2407.10960 | link |
2024-07-15 | Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? | Ruisheng Cao et.al. | 2407.10956 | link |
2024-07-15 | MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Chengguang Gan et.al. | 2407.10953 | null |
2024-07-15 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Yaoting Wang et.al. | 2407.10947 | link |
2024-07-15 | Learning from Naturally Occurring Feedback | Shachar Don-Yehiya et.al. | 2407.10944 | link |
2024-07-15 | GRUtopia: Dream General Robots in a City at Scale | Hanqing Wang et.al. | 2407.10943 | link |
2024-07-15 | Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together | Dilara Soylu et.al. | 2407.10930 | null |
2024-07-15 | Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak et.al. | 2407.10920 | null |
2024-07-15 | FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets | Xiaohui Victor Li et.al. | 2407.10909 | link |
2024-07-15 | Hey, That’s My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique | Mark Russinovich et.al. | 2407.10887 | null |
2024-07-15 | SLIP: Securing LLMs IP Using Weights Decomposition | Yehonathan Refael et.al. | 2407.10886 | null |
2024-07-15 | Understanding the Importance of Evolutionary Search in Automated Heuristic Design with Large Language Models | Rui Zhang et.al. | 2407.10873 | null |
2024-07-15 | GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM | Keshav Bimbraw et.al. | 2407.10870 | null |
2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | null |
2024-07-15 | Weighted Grouped Query Attention in Transformers | Sai Sena Chinnakonduru et.al. | 2407.10855 | null |
2024-07-15 | An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Dylan Bouchard et.al. | 2407.10853 | null |
2024-07-15 | MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs | Quang H. Nguyen et.al. | 2407.10834 | null |
2024-07-15 | BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy | Tim Menzner et.al. | 2407.10829 | null |
2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | null |
2024-07-12 | Human-like Episodic Memory for Infinite Context LLMs | Zafeirios Fountas et.al. | 2407.09450 | link |
2024-07-12 | ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts | Amelia F. Hardy et.al. | 2407.09447 | link |
2024-07-12 | MUSCLE: A Model Update Strategy for Compatible LLM Evolution | Jessica Echterhoff et.al. | 2407.09435 | null |
2024-07-12 | A Perspective on Foundation Models for the Electric Power Grid | Hendrik F. Hamann et.al. | 2407.09434 | null |
2024-07-12 | Open (Clinical) LLMs are Sensitive to Instruction Phrasings | Alberto Mario Ceballos Arroyo et.al. | 2407.09429 | link |
2024-07-12 | TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | Hang Zou et.al. | 2407.09424 | null |
2024-07-12 | Mitigating Entity-Level Hallucination in Large Language Models | Weihang Su et.al. | 2407.09417 | link |
2024-07-12 | SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers | Shraman Pramanick et.al. | 2407.09413 | link |
2024-07-12 | Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce | Zhe Lin et.al. | 2407.09395 | null |
2024-07-12 | PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Saber Zerhoudi et.al. | 2407.09394 | link |
2024-07-12 | GAVEL: Generating Games Via Evolution and Language Models | Graham Todd et.al. | 2407.09388 | link |
2024-07-12 | Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text | Lucio La Cava et.al. | 2407.09364 | null |
2024-07-12 | Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses | Marios Constantinides et.al. | 2407.09322 | link |
2024-07-12 | Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis | Nikolay Babakov et.al. | 2407.09311 | null |
2024-07-12 | Transformer Layers as Painters | Qi Sun et.al. | 2407.09298 | link |
2024-07-12 | Security Matrix for Multimodal Agents on Mobile Devices: A Systematic and Proof of Concept Study | Yulong Yang et.al. | 2407.09295 | null |
2024-07-12 | CEIPA: Counterfactual Explainable Incremental Prompt Attack Analysis on Large Language Models | Dong Shu et.al. | 2407.09292 | null |
2024-07-12 | Structuring Authenticity Assessments on Historical Documents using LLMs | Andrea Schimmenti et.al. | 2407.09290 | null |
2024-07-12 | WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation | Robin Schön et.al. | 2407.09288 | link |
2024-07-11 | MAVIS: Mathematical Visual Instruction Tuning | Renrui Zhang et.al. | 2407.08739 | link |
2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735 | null |
2024-07-11 | Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Zihao Zhou et.al. | 2407.08733 | null |
2024-07-11 | A Taxonomy for Data Contamination in Large Language Models | Medha Palavalli et.al. | 2407.08716 | null |
2024-07-11 | GTA: A Benchmark for General Tool Agents | Jize Wang et.al. | 2407.08713 | link |
2024-07-11 | eyeballvul: a future-proof benchmark for vulnerability detection in the wild | Timothee Chauvin et.al. | 2407.08708 | link |
2024-07-11 | Extracting Training Data from Document-Based VQA Models | Francesco Pinto et.al. | 2407.08707 | null |
2024-07-11 | HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models | Runhui Huang et.al. | 2407.08706 | null |
2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null |
2024-07-11 | Mitigating Catastrophic Forgetting in Language Transfer via Model Merging | Anton Alexandrov et.al. | 2407.08699 | null |
2024-07-11 | Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight | Zhiqiang Xie et.al. | 2407.08694 | null |
2024-07-11 | Robotic Control via Embodied Chain-of-Thought Reasoning | Zawalski Michał et.al. | 2407.08693 | null |
2024-07-11 | SEED-Story: Multimodal Long Story Generation with Large Language Model | Shuai Yang et.al. | 2407.08683 | link |
2024-07-11 | NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning | Yi Zhang et.al. | 2407.08672 | null |
2024-07-11 | Uncertainty Estimation of Large Language Models in Medical Question Answering | Jiaxin Wu et.al. | 2407.08662 | null |
2024-07-11 | Towards Building Specialized Generalist AI with System 1 and System 2 Fusion | Kaiyan Zhang et.al. | 2407.08642 | null |
2024-07-11 | $β$-DPO: Direct Preference Optimization with Dynamic $β$ | Junkang Wu et.al. | 2407.08639 | link |
2024-07-11 | RoboMorph: Evolving Robot Morphology using Large Language Models | Kevin Qiu et.al. | 2407.08626 | null |
2024-07-11 | Tamil Language Computing: the Present and the Future | Kengatharaiyer Sarveswaran et.al. | 2407.08618 | null |
2024-07-11 | FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision | Jay Shah et.al. | 2407.08608 | link |
2024-07-10 | Training on the Test Task Confounds Evaluation and Emergence | Ricardo Dominguez-Olmedo et.al. | 2407.07890 | link |
2024-07-10 | Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization | Junkang Wu et.al. | 2407.07880 | link |
2024-07-11 | Toto: Time Series Optimized Transformer for Observability | Ben Cohen et.al. | 2407.07874 | null |
2024-07-10 | FACTS About Building Retrieval Augmented Generation-based Chatbots | Rama Akkiraju et.al. | 2407.07858 | null |
2024-07-10 | OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Sami Jaghouar et.al. | 2407.07852 | link |
2024-07-10 | Natural Language Mechanisms via Self-Resolution with Foundation Models | Nicolas Della Penna et.al. | 2407.07845 | null |
2024-07-10 | Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective | Shengjia Chen et.al. | 2407.07841 | link |
2024-07-10 | Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang et.al. | 2407.07840 | null |
2024-07-10 | Transformer Alignment in Large Language Models | Murdock Aubry et.al. | 2407.07810 | null |
2024-07-11 | AVCap: Leveraging Audio-Visual Features as Text Tokens for Captioning | Jongsuk Kim et.al. | 2407.07801 | link |
2024-07-10 | Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann et.al. | 2407.07799 | link |
2024-07-11 | Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard | Oguzhan Topsakal et.al. | 2407.07796 | link |
2024-07-10 | Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Tianjie Ju et.al. | 2407.07791 | link |
2024-07-10 | WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment | Jiefu Ou et.al. | 2407.07778 | null |
2024-07-10 | Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs | Hao-Tien Lewis Chiang et.al. | 2407.07775 | null |
2024-07-10 | Can ChatGPT Pass a Theory of Computing Course? | Matei A. Golesteanu et.al. | 2407.07757 | null |
2024-07-10 | Fine-Tuning Large Language Models with User-Level Differential Privacy | Zachary Charles et.al. | 2407.07737 | null |
2024-07-10 | PaliGemma: A versatile 3B VLM for transfer | Lucas Beyer et.al. | 2407.07726 | link |
2024-07-10 | Why should we ever automate moral decision making? | Vincent Conitzer et.al. | 2407.07671 | null |
2024-07-10 | A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability | Ting Fang Tan et.al. | 2407.07666 | null |
2024-07-09 | AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jiaxi Cui et.al. | 2407.07094 | link |
2024-07-09 | FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Liqun Ma et.al. | 2407.07093 | link |
2024-07-09 | CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen et.al. | 2407.07087 | link |
2024-07-09 | Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models | Logan Cross et.al. | 2407.07086 | link |
2024-07-09 | Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities | Shaltiel Shmidman et.al. | 2407.07080 | null |
2024-07-09 | Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang et.al. | 2407.07071 | link |
2024-07-09 | Prompting Techniques for Secure Code Generation: A Systematic Investigation | Catherine Tony et.al. | 2407.07064 | null |
2024-07-10 | Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Weize Chen et.al. | 2407.07061 | link |
2024-07-10 | Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang et.al. | 2407.07053 | link |
2024-07-09 | ProtoSAM – One Shot Medical Image Segmentation With Foundational Models | Lev Ayzenberg et.al. | 2407.07042 | link |
2024-07-09 | Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models | Yue Zhang et.al. | 2407.07035 | link |
2024-07-09 | Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization | Jeongseok Hyun et.al. | 2407.07024 | link |
2024-07-09 | Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies | Inwon Kang et.al. | 2407.07019 | null |
2024-07-09 | End-To-End Causal Effect Estimation from Unstructured Natural Language Data | Nikita Dhawan et.al. | 2407.07018 | null |
2024-07-09 | Is Large Language Model All You Need to Predict the Synthesizability and Precursors of Crystal Structures? | Zhilong Song et.al. | 2407.07016 | null |
2024-07-09 | Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning | J. Crosbie et.al. | 2407.07011 | null |
2024-07-09 | Metron: Holistic Performance Evaluation Framework for LLM Inference Systems | Amey Agrawal et.al. | 2407.07000 | link |
2024-07-09 | Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective | Yu-An Liu et.al. | 2407.06992 | link |
2024-07-09 | Segment-Based Interactive Machine Translation for Pre-trained Models | Angel Navarro et.al. | 2407.06990 | null |
2024-07-09 | Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Yi-Cheng Lin et.al. | 2407.06957 | link |
2024-07-08 | Multi-Object Hallucination in Vision-Language Models | Xuweiyi Chen et.al. | 2407.06192 | link |
2024-07-08 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188 | null |
2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187 | null |
2024-07-08 | Vision-Language Models under Cultural and Inclusive Considerations | Antonia Karamolegkou et.al. | 2407.06177 | null |
2024-07-08 | On Speeding Up Language Model Evaluation | Jin Peng Zhou et.al. | 2407.06172 | null |
2024-07-08 | What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study | Shihan Dou et.al. | 2407.06153 | null |
2024-07-08 | Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks | Lukas Netz et.al. | 2407.06146 | null |
2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | link |
2024-07-08 | Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization | Hannah K. Bako et.al. | 2407.06129 | link |
2024-07-08 | Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities | Avinash Anand et.al. | 2407.06125 | null |
2024-07-08 | Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning | Yadong Zhang et.al. | 2407.06112 | null |
2024-07-08 | Artificial Intuition: Efficient Classification of Scientific Abstracts | Harsh Sakhrani et.al. | 2407.06093 | null |
2024-07-08 | Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models | Jinliang Lu et.al. | 2407.06089 | null |
2024-07-08 | From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty | Maor Ivgi et.al. | 2407.06071 | link |
2024-07-08 | Variational Best-of-N Alignment | Afra Amini et.al. | 2407.06057 | null |
2024-07-08 | MST5 – Multilingual Question Answering over Knowledge Graphs | Nikit Srivastava et.al. | 2407.06041 | link |
2024-07-08 | PAS: Data-Efficient Plug-and-Play Prompt Augmentation System | Miao Zheng et.al. | 2407.06027 | null |
2024-07-08 | iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement | Aoyu Pang et.al. | 2407.06025 | link |
2024-07-05 | Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Rudolf Laine et.al. | 2407.04694 | link |
2024-07-05 | ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Yuzhe Gu et.al. | 2407.04693 | link |
2024-07-05 | Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Yuanze Lin et.al. | 2407.04681 | null |
2024-07-05 | Lost in Translation: The Algorithmic Gap Between LMs and the Brain | Tommaso Tosato et.al. | 2407.04680 | null |
2024-07-05 | Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Ye Bai et.al. | 2407.04675 | null |
2024-07-05 | Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Yongji Wu et.al. | 2407.04656 | null |
2024-07-05 | Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models | Bolaji Yusuf et.al. | 2407.04641 | null |
2024-07-05 | Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework | Reza Averly et.al. | 2407.04629 | null |
2024-07-05 | On scalable oversight with weak LLMs judging strong LLMs | Zachary Kenton et.al. | 2407.04622 | null |
2024-07-05 | CountGD: Multi-Modal Open-World Counting | Niki Amini-Naieni et.al. | 2407.04619 | null |
2024-07-05 | ARM: Efficient Guided Decoding with Autoregressive Reward Models | Sergey Troshin et.al. | 2407.04615 | null |
2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | link |
2024-07-05 | Written Term Detection Improves Spoken Term Detection | Bolaji Yusuf et.al. | 2407.04601 | link |
2024-07-05 | Testing learning hypotheses using neural networks by manipulating learning data | Cara Su-Yi Leong et.al. | 2407.04593 | null |
2024-07-05 | Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | Shumaila Javaid et.al. | 2407.04581 | null |
2024-07-05 | VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models | Hang Gao et.al. | 2407.04573 | null |
2024-07-05 | Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition | Aditya K Surikuchi et.al. | 2407.04559 | link |
2024-07-05 | Spontaneous Reward Hacking in Iterative Self-Refinement | Jane Pan et.al. | 2407.04549 | null |
2024-07-05 | PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts | Ana-Cristina Rogoz et.al. | 2407.04541 | link |
2024-07-05 | GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek et.al. | 2407.04528 | null |
2024-07-03 | Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Max Zuo et.al. | 2407.03321 | link |
2024-07-03 | InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | Pan Zhang et.al. | 2407.03320 | link |
2024-07-03 | BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations | Zhantao Yang et.al. | 2407.03314 | null |
2024-07-03 | Universal Length Generalization with Turing Programs | Kaiying Hou et.al. | 2407.03310 | null |
2024-07-03 | Large Language Models for JSON Schema Discovery | Michael J. Mior et.al. | 2407.03286 | null |
2024-07-03 | LLM Internal States Reveal Hallucination Risk Faced With a Query | Ziwei Ji et.al. | 2407.03282 | link |
2024-07-03 | STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data | Kheir Eddine Daouadi et.al. | 2407.03253 | null |
2024-07-03 | Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen et.al. | 2407.03227 | null |
2024-07-03 | How Does Quantization Affect Multilingual LLMs? | Kelly Marchisio et.al. | 2407.03211 | null |
2024-07-03 | TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang et.al. | 2407.03203 | link |
2024-07-03 | Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models | Haritz Puerto et.al. | 2407.03181 | link |
2024-07-03 | Investigating Decoder-only Large Language Models for Speech-to-text Translation | Chao-Wei Huang et.al. | 2407.03169 | null |
2024-07-03 | SOS! Soft Prompt Attack Against Open-Source Large Language Models | Ziqing Yang et.al. | 2407.03160 | null |
2024-07-03 | Let the Code LLM Edit Itself When You Edit the Code | Zhenyu He et.al. | 2407.03157 | null |
2024-07-03 | Reinforcement Learning for Sequence Design Leveraging Protein Language Models | Jithendaraa Subramanian et.al. | 2407.03154 | null |
2024-07-03 | Enhancing Translation Accuracy of Large Language Models through Continual Pre-Training on Parallel Data | Minato Kondo et.al. | 2407.03145 | null |
2024-07-03 | Social Bias Evaluation for Large Language Models Requires Prompt Variations | Rem Hida et.al. | 2407.03129 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-03 | Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory | Suyeon Lee et.al. | 2407.03103 | link |
2024-07-03 | ScreenTK: Seamless Detection of Time-Killing Moments Using Continuous Mobile Screen Text Monitoring | Le Fang et.al. | 2407.03063 | null |
2024-07-02 | MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Huiqiang Jiang et.al. | 2407.02490 | link |
2024-07-02 | Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Ali Safaya et.al. | 2407.02486 | link |
2024-07-02 | RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Yue Yu et.al. | 2407.02485 | null |
2024-07-02 | MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Binxu Li et.al. | 2407.02483 | link |
2024-07-02 | Understanding Alignment in Multimodal LLMs: A Comprehensive Study | Elmira Amirloo et.al. | 2407.02477 | null |
2024-07-02 | Open Scene Graphs for Open World Object-Goal Navigation | Joel Loo et.al. | 2407.02473 | null |
2024-07-02 | ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions | Chan Young Park et.al. | 2407.02472 | link |
2024-07-02 | Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I | Harrie Oosterhuis et.al. | 2407.02464 | null |
2024-07-02 | Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets | Kheir Eddine Daouadi et.al. | 2407.02448 | null |
2024-07-03 | Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs | Jinmin Li et.al. | 2407.02411 | null |
2024-07-02 | CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models | Song Wang et.al. | 2407.02408 | null |
2024-07-02 | Assessing the Code Clone Detection Capability of Large Language Models | Zixian Zhang et.al. | 2407.02402 | null |
2024-07-02 | Learning to Refine with Fine-Grained Natural Language Feedback | Manya Wadhwa et.al. | 2407.02397 | link |
2024-07-02 | Is Your AI-Generated Code Really Secure? Evaluating Large Language Models on Secure Code Generation with CodeSecEval | Jiexin Wang et.al. | 2407.02395 | null |
2024-07-02 | TokenPacker: Efficient Visual Projector for Multimodal LLM | Wentong Li et.al. | 2407.02392 | link |
2024-07-02 | Talking to Machines: do you read me? | Lina M. Rojas-Barahona et.al. | 2407.02354 | null |
2024-07-02 | Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu et.al. | 2407.02352 | null |
2024-07-02 | Generative Large Language Models in Automated Fact-Checking: A Survey | Ivan Vykopal et.al. | 2407.02351 | null |
2024-07-02 | Conceptual Codebook Learning for Vision-Language Models | Yi Zhang et.al. | 2407.02350 | null |
2024-07-02 | MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang et.al. | 2407.02345 | null |
2024-06-28 | Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs | Sukmin Yun et.al. | 2406.20098 | link |
2024-06-28 | LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Xiang Li et.al. | 2406.20095 | link |
2024-06-28 | Scaling Synthetic Data Creation with 1,000,000,000 Personas | Xin Chan et.al. | 2406.20094 | link |
2024-06-28 | LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression | Jieneng Chen et.al. | 2406.20092 | link |
2024-06-28 | ProgressGym: Alignment with a Millennium of Moral Progress | Tianyi Qiu et.al. | 2406.20087 | link |
2024-06-28 | Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language | Yicheng Chen et.al. | 2406.20085 | null |
2024-06-28 | Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification | Anisha Gunjal et.al. | 2406.20079 | link |
2024-06-28 | EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Yuxuan Zhang et.al. | 2406.20076 | link |
2024-06-28 | To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard et.al. | 2406.20054 | null |
2024-06-28 | Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation | Danny Halawi et.al. | 2406.20053 | null |
2024-07-02 | BMW Agents – A Framework For Task Automation Through Multi-Agent Collaboration | Noel Crawford et.al. | 2406.20041 | null |
2024-06-28 | BioMNER: A Dataset for Biomedical Method Entity Recognition | Chen Tang et.al. | 2406.20038 | null |
2024-06-28 | LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang et.al. | 2406.20030 | null |
2024-06-28 | ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang et.al. | 2406.20015 | link |
2024-06-28 | The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models | Xinyi Chen et.al. | 2406.19999 | link |
2024-06-28 | Single Parent Family: A Spectrum of Family Members from a Single Pre-Trained Foundation Model | Habib Hajimolahoseini et.al. | 2406.19995 | null |
2024-06-28 | ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting | Rui Pan et.al. | 2406.19976 | null |
2024-06-28 | STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical | Guohao Sun et.al. | 2406.19973 | link |
2024-06-28 | Into the Unknown: Generating Geospatial Descriptions for New Environments | Tzuf Paz-Argaman et.al. | 2406.19967 | null |
2024-06-28 | Simulating Financial Market via Large Language Model based Agents | Shen Gao et.al. | 2406.19966 | null |
2024-06-27 | ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos | Jr-Jen Chen et.al. | 2406.19392 | link |
2024-06-27 | The Remarkable Robustness of LLMs: Stages of Inference? | Vedang Lad et.al. | 2406.19384 | link |
2024-06-27 | The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models | Xiliang Zhu et.al. | 2406.19358 | null |
2024-06-27 | DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez et.al. | 2406.19356 | link |
2024-06-27 | Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? | Peter Hase et.al. | 2406.19354 | null |
2024-06-27 | IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language | Lucky Susanto et.al. | 2406.19349 | null |
2024-06-27 | Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari et.al. | 2406.19317 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Zheyang Xiong et.al. | 2406.19292 | link |
2024-06-27 | PhysioLLM: Supporting Personalized Health Insights with Wearables and Large Language Models | Cathy Mengying Fang et.al. | 2406.19283 | null |
2024-06-27 | HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen et.al. | 2406.19280 | link |
2024-06-27 | VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation | Yixiao Song et.al. | 2406.19276 | link |
2024-06-27 | AutoPureData: Automated Filtering of Web Data for LLM Fine-tuning | Praneeth Vadlapati et.al. | 2406.19271 | link |
2024-06-27 | Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan et.al. | 2406.19263 | link |
2024-06-27 | Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment | Hao Fei et.al. | 2406.19255 | null |
2024-06-27 | AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation | Jia Fu et.al. | 2406.19251 | null |
2024-06-27 | Revealing Fine-Grained Values and Opinions in Large Language Models | Dustin Wright et.al. | 2406.19238 | link |
2024-06-28 | FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Shubhankar Singh et.al. | 2406.19237 | null |
2024-06-27 | Seeing Is Believing: Black-Box Membership Inference Attacks Against Retrieval Augmented Generation | Yuying Li et.al. | 2406.19234 | null |
2024-06-28 | RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva et.al. | 2406.19232 | link |
2024-06-26 | Towards Compositionality in Concept Learning | Adam Stein et.al. | 2406.18534 | link |
2024-06-26 | Symbolic Learning Enables Self-Evolving Agents | Wangchunshu Zhou et.al. | 2406.18532 | link |
2024-06-26 | PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter et.al. | 2406.18528 | link |
2024-06-26 | CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs | Zirui Wang et.al. | 2406.18521 | link |
2024-06-26 | “Is ChatGPT a Better Explainer than My Professor?”: Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline | Grace Li et.al. | 2406.18512 | null |
2024-06-26 | WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models | Liwei Jiang et.al. | 2406.18510 | link |
2024-06-26 | Mental Modeling of Reinforcement Learning Agents by Language Models | Wenhao Lu et.al. | 2406.18505 | null |
2024-06-26 | Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming | Zhenghao Zhou et.al. | 2406.18501 | null |
2024-06-26 | Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation | Ahmed Njifenjou et.al. | 2406.18460 | null |
2024-06-26 | Cascading Large Language Models for Salient Event Graph Generation | Xingwei Tan et.al. | 2406.18449 | link |
2024-06-26 | New intelligent empowerment for digital transformation | Peng Yifeng et.al. | 2406.18440 | null |
2024-06-26 | IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons | Dan Shi et.al. | 2406.18406 | link |
2024-06-26 | Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers | Yibo Jiang et.al. | 2406.18400 | null |
2024-06-26 | Adversarial Search Engine Optimization for Large Language Models | Fredrik Nestaas et.al. | 2406.18382 | null |
2024-06-26 | MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization | Haolang Lu et.al. | 2406.18379 | null |
2024-06-26 | Themis: Towards Flexible and Interpretable NLG Evaluation | Xinyu Hu et.al. | 2406.18365 | link |
2024-06-26 | AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations | Adam Dahlgren Lindström et.al. | 2406.18346 | null |
2024-06-26 | PDFA Distillation via String Probability Queries {PDFA Distillation via String Probability Queries} | Robert Baumgartner et.al. | 2406.18328 | link |
2024-06-26 | PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models | Huixuan Zhang et.al. | 2406.18326 | null |
2024-06-26 | MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data | Meng Fang et.al. | 2406.18321 | null |
2024-06-25 | MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Xiangyu Zhao et.al. | 2406.17770 | link |
2024-06-25 | EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data | Jesse Zhang et.al. | 2406.17768 | null |
2024-06-25 | BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning | Ercong Nie et.al. | 2406.17764 | null |
2024-06-25 | CaLMQA: Exploring culturally specific long-form question answering across 23 languages | Shane Arora et.al. | 2406.17761 | link |
2024-06-25 | Accelerating Clinical Evidence Synthesis with Large Language Models | Zifeng Wang et.al. | 2406.17755 | null |
2024-06-25 | Measuring and Benchmarking Large Language Models’ Capabilities to Generate Persuasive Language | Amalie Brogaard Pauli et.al. | 2406.17753 | null |
2024-06-25 | Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon | USVSN Sai Prashanth et.al. | 2406.17746 | link |
2024-06-25 | Point-SAM: Promptable 3D Segmentation Model for Point Clouds | Yuchen Zhou et.al. | 2406.17741 | link |
2024-06-25 | Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model | Fei Xia et.al. | 2406.17739 | null |
2024-06-25 | LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users | Elinor Poole-Dayan et.al. | 2406.17737 | null |
2024-06-25 | FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model | Feijie Wu et.al. | 2406.17706 | link |
2024-06-25 | From Distributional to Overton Pluralism: Investigating Large Language Model Alignment | Thom Lake et.al. | 2406.17692 | link |
2024-06-26 | VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Kun Qian et.al. | 2406.17681 | link |
2024-06-25 | Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models | Yuan Li et.al. | 2406.17675 | null |
2024-06-25 | LaTable: Towards Large Tabular Models | Boris van Breugel et.al. | 2406.17673 | null |
2024-06-25 | LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic | Aditya Kalyanpur et.al. | 2406.17663 | null |
2024-06-25 | Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed et.al. | 2406.17660 | link |
2024-06-25 | DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning | Xiaohan Zhang et.al. | 2406.17659 | null |
2024-06-25 | Leveraging Large Language Models for Software Model Completion: Results from Industrial and Public Datasets | Christof Tinnes et.al. | 2406.17651 | link |
2024-06-25 | Variationist: Exploring Multifaceted Variation and Bias in Written Language Data | Alan Ramponi et.al. | 2406.17647 | link |
2024-06-24 | Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Shengbang Tong et.al. | 2406.16860 | link |
2024-06-24 | EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li et.al. | 2406.16858 | link |
2024-06-24 | Long Context Transfer from Language to Vision | Peiyuan Zhang et.al. | 2406.16852 | link |
2024-06-24 | Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts | Aditya Sharma et.al. | 2406.16851 | null |
2024-06-24 | RaTEScore: A Metric for Radiology Report Generation | Weike Zhao et.al. | 2406.16845 | link |
2024-06-24 | From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models | Sean Welleck et.al. | 2406.16838 | null |
2024-06-24 | USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$ onversations | Mounika Marreddy et.al. | 2406.16833 | null |
2024-06-24 | Understanding and Mitigating Tokenization Bias in Language Models | Buu Phan et.al. | 2406.16829 | null |
2024-06-24 | Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track | Ronak Pradeep et.al. | 2406.16828 | link |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-06-24 | RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Beck LaBash et.al. | 2406.16801 | link |
2024-06-25 | Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs | Ashwinee Panda et.al. | 2406.16797 | link |
2024-06-24 | Adam-mini: Use Fewer Learning Rates To Gain More | Yushun Zhang et.al. | 2406.16793 | link |
2024-06-24 | M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models | Rishabh Maheshwary et.al. | 2406.16783 | null |
2024-06-24 | It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension | Sagi Shaier et.al. | 2406.16779 | null |
2024-06-24 | Finding Transformer Circuits with Edge Pruning | Adithya Bhaskar et.al. | 2406.16778 | link |
2024-06-24 | Blending LLMs into Cascaded Speech Translation: KIT’s Offline Speech Translation System for IWSLT 2024 | Sai Koneru et.al. | 2406.16777 | null |
2024-06-24 | WARP: On the Benefits of Weight Averaged Rewarded Policies | Alexandre Ramé et.al. | 2406.16768 | null |
2024-06-24 | The GPT-WritingPrompts Dataset: A Comparative Analysis of Character Portrayal in Short Stories | Xi Yu Huang et.al. | 2406.16767 | link |
2024-06-24 | Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi et.al. | 2406.16758 | link |
2024-06-21 | GenoTEX: A Benchmark for Evaluating LLM-Based Exploration of Gene Expression Data in Alignment with Bioinformaticians | Haoyang Liu et.al. | 2406.15341 | link |
2024-06-21 | Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance | Haoling Li et.al. | 2406.15330 | null |
2024-06-21 | Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks | Hokyung Lee et.al. | 2406.15325 | link |
2024-06-21 | Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model | Doyoung Kim et.al. | 2406.15275 | link |
2024-06-21 | Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics | Weijia Zhang et.al. | 2406.15264 | null |
2024-06-21 | Unsupervised Morphological Tree Tokenizer | Qingyang Zhu et.al. | 2406.15245 | null |
2024-06-21 | Large Batch Analysis for Adagrad Under Anisotropic Smoothness | Yuxing Liu et.al. | 2406.15244 | null |
2024-06-21 | Detecting Synthetic Lyrics with Few-Shot Inference | Yanis Labrak et.al. | 2406.15231 | null |
2024-06-21 | A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation | Irune Zubiaga et.al. | 2406.15227 | link |
2024-06-21 | Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar et.al. | 2406.15214 | null |
2024-06-21 | Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding | Mohan Li et.al. | 2406.15209 | null |
2024-06-21 | Exploring the Efficacy of Robotic Assistants with ChatGPT and Claude in Enhancing ADHD Therapy: Innovating Treatment Paradigms | Santiago Berrezueta-Guzman et.al. | 2406.15198 | null |
2024-06-21 | UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis | Yulong Hui et.al. | 2406.15187 | link |
2024-06-21 | Hybrid Alignment Training for Large Language Models | Chenglong Wang et.al. | 2406.15178 | link |
2024-06-21 | EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot | Hao Fei et.al. | 2406.15177 | link |
2024-06-21 | Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss | Wei He et.al. | 2406.15175 | null |
2024-06-21 | Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d’historiens | Mathieu Chartier et.al. | 2406.15173 | null |
2024-06-21 | Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks | Victor Hugo Nascimento Rocha et.al. | 2406.15130 | link |
2024-06-21 | Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network | Badr AlKhamissi et.al. | 2406.15109 | link |
2024-06-21 | PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts et.al. | 2406.15053 | null |
2024-06-20 | Model Merging and Safety Alignment: One Bad Model Spoils the Bunch | Hasan Abed Al Kader Hammoud et.al. | 2406.14563 | null |
2024-06-20 | Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon et.al. | 2406.14562 | null |
2024-06-20 | How to Compute the Probability of a Word | Tiago Pimentel et.al. | 2406.14561 | link |
2024-06-21 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | Shilong Li et.al. | 2406.14550 | null |
2024-06-20 | Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models | Sunny Duan et.al. | 2406.14549 | null |
2024-06-20 | Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data | Johannes Treutlein et.al. | 2406.14546 | link |
2024-06-20 | Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems | Đorđe Klisura et.al. | 2406.14545 | null |
2024-06-20 | Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs | Yuxuan Qiao et.al. | 2406.14544 | link |
2024-06-21 | Are LLMs Naturally Good at Synthetic Tabular Data Generation? | Shengzhe Xu et.al. | 2406.14541 | link |
2024-06-20 | PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang et.al. | 2406.14517 | link |
2024-06-20 | MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding | Xinyu Fang et.al. | 2406.14515 | link |
2024-06-20 | Evidence of a log scaling law for political persuasion with large language models | Kobi Hackenburg et.al. | 2406.14508 | link |
2024-06-20 | Overview of the CAIL 2023 Argument Mining Track | Jingcong Liang et.al. | 2406.14503 | null |
2024-06-20 | Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary | Xingmeng Zhao et.al. | 2406.14500 | null |
2024-06-20 | LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors | Sheikh Asif Imran et.al. | 2406.14498 | link |
2024-06-20 | CodeRAG-Bench: Can Retrieval Augment Code Generation? | Zora Zhiruo Wang et.al. | 2406.14497 | link |
2024-06-20 | African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle et.al. | 2406.14496 | link |
2024-06-20 | Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle et.al. | 2406.14492 | null |
2024-06-20 | Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng et.al. | 2406.14491 | link |
2024-06-18 | DrVideo: Document Retrieval Based Long Video Understanding | Ziyu Ma et.al. | 2406.12846 | null |
2024-06-18 | Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts | Haoxiang Wang et.al. | 2406.12845 | link |
2024-06-18 | Synergizing Foundation Models and Federated Learning: A Survey | Shenghui Li et.al. | 2406.12844 | null |
2024-06-18 | GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation | Ci-Siang Lin et.al. | 2406.12834 | null |
2024-06-18 | LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Seyedarmin Azizi et.al. | 2406.12832 | link |
2024-06-18 | What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri et.al. | 2406.12830 | link |
2024-06-18 | From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries | Hitesh Wadhwa et.al. | 2406.12824 | null |
2024-06-18 | Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen et.al. | 2406.12822 | null |
2024-06-18 | Adversarial Attacks on Multimodal Agents | Chen Henry Wu et.al. | 2406.12814 | link |
2024-06-18 | Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang et.al. | 2406.12809 | link |
2024-06-18 | Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents | Zehao Wang et.al. | 2406.12806 | null |
2024-06-18 | Supporting Human Raters with the Detection of Harmful Content using Large Language Models | Kurt Thomas et.al. | 2406.12800 | null |
2024-06-18 | ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Team GLM et.al. | 2406.12793 | link |
2024-06-18 | In-Context Learning of Energy Functions | Rylan Schaeffer et.al. | 2406.12785 | null |
2024-06-18 | UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions | Xunzhi Wang et.al. | 2406.12784 | link |
2024-06-18 | Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran et.al. | 2406.12775 | link |
2024-06-18 | Towards Exact Gradient-based Training on Analog In-memory Computing | Zhaoxian Wu et.al. | 2406.12774 | null |
2024-06-18 | GFM4MPM: Towards Geospatial Foundation Models for Mineral Prospectivity Mapping | Angel Daruna et.al. | 2406.12756 | null |
2024-06-18 | OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI | Zhen Huang et.al. | 2406.12753 | link |
2024-06-18 | Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning | Bingchen Zhao et.al. | 2406.12742 | link |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang et.al. | 2406.11839 | null |
2024-06-17 | MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs | Ziyu Liu et.al. | 2406.11833 | link |
2024-06-17 | Unveiling Encoder-Free Vision-Language Models | Haiwen Diao et.al. | 2406.11832 | link |
2024-06-17 | Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Bingqi Ma et.al. | 2406.11831 | null |
2024-06-17 | Language Modeling with Editable External Knowledge | Belinda Z. Li et.al. | 2406.11830 | link |
2024-06-17 | WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou et.al. | 2406.11827 | link |
2024-06-17 | On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim et.al. | 2406.11823 | link |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-17 | Embodied Instruction Following in Unknown Environments | Zhenyu Wu et.al. | 2406.11818 | null |
2024-06-17 | Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level | Jie Liu et.al. | 2406.11817 | null |
2024-06-17 | VideoLLM-online: Online Video Large Language Model for Streaming Video | Joya Chen et.al. | 2406.11816 | null |
2024-06-17 | How Do Large Language Models Acquire Factual Knowledge During Pretraining? | Hoyeon Chang et.al. | 2406.11813 | link |
2024-06-17 | RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Joao Monteiro et.al. | 2406.11811 | link |
2024-06-17 | Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra et.al. | 2406.11801 | link |
2024-06-17 | DataComp-LM: In search of the next generation of training sets for language models | Jeffrey Li et.al. | 2406.11794 | null |
2024-06-17 | CELL your Model: Contrastive Explanation Methods for Large Language Models | Ronny Luss et.al. | 2406.11785 | null |
2024-06-17 | Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs | Swanand Ravindra Kadhe et.al. | 2406.11780 | null |
2024-06-17 | Improving Multi-Agent Debate with Sparse Communication Topology | Yunxuan Li et.al. | 2406.11776 | null |
2024-06-17 | Task Me Anything | Jieyu Zhang et.al. | 2406.11775 | link |
2024-06-14 | Quantifying Variance in Evaluation Benchmarks | Lovish Madaan et.al. | 2406.10229 | null |
2024-06-14 | EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Julian Straub et.al. | 2406.10224 | link |
2024-06-14 | Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding | Ridouane Ghermi et.al. | 2406.10221 | link |
2024-06-14 | Semantic Membership Inference Attack against Large Language Models | Hamid Mozaffari et.al. | 2406.10218 | null |
2024-06-14 | Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Rui Yang et.al. | 2406.10216 | link |
2024-06-14 | DevBench: A multimodal developmental benchmark for language learning | Alvin Wei Ming Tan et.al. | 2406.10215 | link |
2024-06-14 | Be like a Goldfish, Don’t Memorize! Mitigating Memorization in Generative LLMs | Abhimanyu Hans et.al. | 2406.10209 | link |
2024-06-14 | A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan et.al. | 2406.10203 | link |
2024-06-14 | TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Tomas de la Rosa et.al. | 2406.10196 | null |
2024-06-14 | Detecting and Evaluating Medical Hallucinations in Large Vision Language Models | Jiawei Chen et.al. | 2406.10185 | null |
2024-06-14 | Practical offloading for fine-tuning LLM on commodity GPU via learned subspace projectors | Siyuan Chen et.al. | 2406.10181 | null |
2024-06-14 | Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation | Mohamad Elzohbi et.al. | 2406.10174 | link |
2024-06-14 | IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce | Wenxuan Ding et.al. | 2406.10173 | link |
2024-06-14 | Datasets for Multilingual Answer Sentence Selection | Matteo Gabburo et.al. | 2406.10172 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models | Carson Denison et.al. | 2406.10162 | link |
2024-06-14 | RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model | Hantao Zhou et.al. | 2406.10157 | null |
2024-06-14 | BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack | Yuri Kuratov et.al. | 2406.10149 | link |
2024-06-14 | Evaluation of Large Language Models: STEM education and Gender Stereotypes | Smilla Due et.al. | 2406.10133 | null |
2024-06-14 | The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models | Yan Liu et.al. | 2406.10130 | link |
2024-06-13 | VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Muhammad Maaz et.al. | 2406.09418 | link |
2024-06-13 | Explore the Limits of Omni-modal Pretraining at Scale | Yiyuan Zhang et.al. | 2406.09412 | link |
2024-06-13 | 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Roman Bachmann et.al. | 2406.09406 | null |
2024-06-13 | Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Yushi Hu et.al. | 2406.09403 | null |
2024-06-13 | OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation | Junke Wang et.al. | 2406.09399 | link |
2024-06-13 | Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms | Miaosen Zhang et.al. | 2406.09397 | null |
2024-06-13 | Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA | Jongwoo Park et.al. | 2406.09396 | link |
2024-06-13 | Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Youngtaek Oh et.al. | 2406.09388 | link |
2024-06-13 | Towards Vision-Language Geo-Foundation Model: A Survey | Yue Zhou et.al. | 2406.09385 | link |
2024-06-13 | Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models | Lukas Thede et.al. | 2406.09384 | null |
2024-06-13 | Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs | Zijia Zhao et.al. | 2406.09367 | link |
2024-06-13 | ElicitationGPT: Text Elicitation Mechanisms via Language Models | Yifan Wu et.al. | 2406.09363 | null |
2024-06-13 | Enhancing Domain Adaptation through Prompt Gradient Alignment | Hoang Phan et.al. | 2406.09353 | link |
2024-06-13 | Separations in the Representational Capabilities of Transformers and Recurrent Architectures | Satwik Bhattamishra et.al. | 2406.09347 | null |
2024-06-13 | DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | Suwon Shon et.al. | 2406.09345 | null |
2024-06-13 | ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | David Anugraha et.al. | 2406.09334 | link |
2024-06-13 | REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space | Tomer Ashuach et.al. | 2406.09325 | null |
2024-06-13 | Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Zhao Xu et.al. | 2406.09324 | link |
2024-06-13 | JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models | Delong Ran et.al. | 2406.09321 | link |
2024-06-13 | Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases | Meng Wang et.al. | 2406.09317 | link |
2024-06-12 | What If We Recaption Billions of Web Images with LLaMA-3? | Xianhang Li et.al. | 2406.08478 | null |
2024-06-12 | Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens | Ting-Ji Huang et.al. | 2406.08477 | null |
2024-06-12 | Real2Code: Reconstruct Articulated Objects via Code Generation | Zhao Mandi et.al. | 2406.08474 | null |
2024-06-12 | PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences | Daiwei Chen et.al. | 2406.08469 | null |
2024-06-12 | Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing | Zhangchen Xu et.al. | 2406.08464 | link |
2024-06-12 | AToM-Bot: Embodied Fulfillment of Unspoken Human Needs with Affective Theory of Mind | Wei Ding et.al. | 2406.08455 | null |
2024-06-12 | OLMES: A Standard for Language Model Evaluations | Yuling Gu et.al. | 2406.08446 | null |
2024-06-12 | SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models | Chun Yin et.al. | 2406.08445 | null |
2024-06-12 | TasTe: Teaching Large Language Models to Translate through Self-Reflection | Yutong Wang et.al. | 2406.08434 | link |
2024-06-12 | Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | Zijin Hong et.al. | 2406.08426 | null |
2024-06-12 | OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text | Qingyun Li et.al. | 2406.08418 | link |
2024-06-12 | Discovering Preference Optimization Algorithms with and for Large Language Models | Chris Lu et.al. | 2406.08414 | link |
2024-06-12 | Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference | Christopher Wolters et.al. | 2406.08413 | null |
2024-06-13 | MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos | Xuehai He et.al. | 2406.08407 | link |
2024-06-12 | Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models | Chun-Yi Kuan et.al. | 2406.08402 | link |
2024-06-12 | cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers | Anirudh Sundar et.al. | 2406.08398 | null |
2024-06-12 | VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jiannan Wu et.al. | 2406.08394 | link |
2024-06-12 | Large Language Models Must Be Taught to Know What They Don’t Know | Sanyam Kapoor et.al. | 2406.08391 | link |
2024-06-12 | Banal Deception Human-AI Ecosystems: A Study of People’s Perceptions of LLM-generated Deceptive Behaviour | Xiao Zhan et.al. | 2406.08386 | null |
2024-06-13 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Weizhao He et.al. | 2406.08372 | null |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545 | link |
2024-06-11 | QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Jingyao Li et.al. | 2406.07528 | link |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524 | link |
2024-06-11 | Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Liliang Ren et.al. | 2406.07522 | link |
2024-06-11 | Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement | Yunzhen Feng et.al. | 2406.07515 | null |
2024-06-11 | THaLLE: Text Hyperlocally Augmented Large Language Extension – Technical Report | KBTG Labs et.al. | 2406.07505 | null |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502 | link |
2024-06-11 | TextGrad: Automatic “Differentiation” via Text | Mert Yuksekgonul et.al. | 2406.07496 | link |
2024-06-12 | CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization | Frederic Kirstein et.al. | 2406.07494 | null |
2024-06-11 | Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei et.al. | 2406.07492 | null |
2024-06-11 | PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction | Adnan Abbas et.al. | 2406.07485 | null |
2024-06-11 | Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | Mao Li et.al. | 2406.07483 | null |
2024-06-11 | VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs | Zesen Cheng et.al. | 2406.07476 | link |
2024-06-11 | Anomaly Detection on Unstable Logs with GPT Models | Fatemeh Hadadi et.al. | 2406.07467 | null |
2024-06-11 | Estimating the Hallucination Rate of Generative AI | Andrew Jesson et.al. | 2406.07457 | null |
2024-06-11 | Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis | Qining Zhang et.al. | 2406.07455 | null |
2024-06-11 | On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations | Shiao Meng et.al. | 2406.07444 | link |
2024-06-11 | McEval: Massively Multilingual Code Evaluation | Linzheng Chai et.al. | 2406.07436 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | Shivani Upadhyay et.al. | 2406.06519 | link |
2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512 | null |
2024-06-10 | NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative | Asmar Nadeem et.al. | 2406.06499 | null |
2024-06-10 | Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation | Oishi Banerjee et.al. | 2406.06496 | null |
2024-06-10 | Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang et.al. | 2406.06485 | null |
2024-06-10 | Parallelizing Linear Transformers with the Delta Rule over Sequence Length | Songlin Yang et.al. | 2406.06484 | link |
2024-06-10 | Towards a Personal Health Large Language Model | Justin Cosentino et.al. | 2406.06474 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465 | null |
2024-06-10 | Transforming Wearable Data into Health Insights using Large Language Model Agents | Mike A. Merrill et.al. | 2406.06464 | null |
2024-06-10 | VCR: Visual Caption Restoration | Tianyu Zhang et.al. | 2406.06462 | link |
2024-06-11 | Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang et.al. | 2406.06461 | null |
2024-06-10 | Evaluating the Retrieval Component in LLM-Based Question Answering Systems | Ashkan Alinejad et.al. | 2406.06458 | null |
2024-06-10 | A Large Language Model Pipeline for Breast Cancer Oncology | Tristen Pool et.al. | 2406.06455 | null |
2024-06-10 | Insights from Social Shaping Theory: The Appropriation of Large Language Models in an Undergraduate Programming Course | Aadarsh Padiyath et.al. | 2406.06451 | null |
2024-06-10 | LLM Dataset Inference: Did you train on my dataset? | Pratyush Maini et.al. | 2406.06443 | link |
2024-06-10 | Interpretability of Language Models via Task Spaces | Lucas Weber et.al. | 2406.06441 | null |
2024-06-10 | Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain | Brian Hu et.al. | 2406.06435 | link |
2024-06-10 | Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking | Gabriel Rioux et.al. | 2406.06425 | null |
2024-06-10 | An Empirical Design Justice Approach to Identifying Ethical Considerations in the Intersection of Large Language Models and Social Robotics | Alva Markelius et.al. | 2406.06400 | null |
2024-06-07 | 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs | Jianing Yang et.al. | 2406.05132 | link |
2024-06-07 | An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Xiongtao Zhou et.al. | 2406.05130 | link |
2024-06-07 | Towards Semantic Equivalence of Tokenization in Multimodal LLM | Shengqiong Wu et.al. | 2406.05127 | null |
2024-06-07 | Large Generative Graph Models | Yu Wang et.al. | 2406.05109 | null |
2024-06-07 | LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration | Tavor Lipman et.al. | 2406.05107 | null |
2024-06-07 | Corpus Poisoning via Approximate Greedy Gradient Descent | Jinyan Su et.al. | 2406.05087 | link |
2024-06-07 | Multi-Head RAG: Solving Multi-Aspect Problems with LLMs | Maciej Besta et.al. | 2406.05085 | link |
2024-06-07 | SUMIE: A Synthetic Benchmark for Incremental Entity Summarization | Eunjeong Hwang et.al. | 2406.05079 | null |
2024-06-07 | Are Large Language Models More Empathetic than Humans? | Anuradha Welivita et.al. | 2406.05063 | null |
2024-06-07 | Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions | Shi-Yu Tian et.al. | 2406.05055 | null |
2024-06-07 | Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation | Nachiket Kotalwar et.al. | 2406.05053 | null |
2024-06-07 | Bootstrapping Referring Multi-Object Tracking | Yani Zhang et.al. | 2406.05039 | link |
2024-06-07 | Scenarios and Approaches for Situated Natural Language Explanations | Pengshuo Qiu et.al. | 2406.05035 | null |
2024-06-07 | CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo et.al. | 2406.05013 | link |
2024-06-07 | Compositional Generalization with Grounded Language Models | Sondre Wold et.al. | 2406.04989 | link |
2024-06-07 | Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences | Patrick Haller et.al. | 2406.04988 | link |
2024-06-07 | MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jitai Hao et.al. | 2406.04984 | link |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | Quantifying Geospatial in the Common Crawl Corpus | Ilya Ilyankou et.al. | 2406.04952 | null |
2024-06-07 | BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Baktash Ansari et.al. | 2406.04947 | link |
2024-06-06 | Verbalized Machine Learning: Revisiting Machine Learning with Language Models | Tim Z. Xiao et.al. | 2406.04344 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343 | link |
2024-06-06 | Learning 1D Causal Visual Representation with De-focus Attention Networks | Chenxin Tao et.al. | 2406.04342 | link |
2024-06-06 | RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation | Jiaming Liu et.al. | 2406.04339 | null |
2024-06-06 | Coherent Zero-Shot Visual Instruction Generation | Quynh Phung et.al. | 2406.04337 | null |
2024-06-06 | DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs | Lingchen Meng et.al. | 2406.04334 | null |
2024-06-06 | PaCE: Parsimonious Concept Engineering for Large Language Models | Jinqi Luo et.al. | 2406.04331 | link |
2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | link |
2024-06-06 | Simplified and Generalized Masked Diffusion for Discrete Data | Jiaxin Shi et.al. | 2406.04329 | null |
2024-06-06 | Causal Estimation of Memorisation Profiles | Pietro Lesci et.al. | 2406.04327 | link |
2024-06-06 | ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | Lin Chen et.al. | 2406.04325 | null |
2024-06-06 | Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step | Zhanhao Liang et.al. | 2406.04314 | link |
2024-06-06 | Improving Alignment and Robustness with Short Circuiting | Andy Zou et.al. | 2406.04313 | link |
2024-06-06 | Semantically Diverse Language Generation for Uncertainty Estimation in Language Models | Lukas Aichberger et.al. | 2406.04306 | link |
2024-06-06 | Quixer: A Quantum Transformer Model | Nikhil Khatri et.al. | 2406.04305 | null |
2024-06-06 | Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models | Phat Nguyen et.al. | 2406.04300 | null |
2024-06-06 | VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval | Junjie Zhou et.al. | 2406.04292 | link |
2024-06-06 | Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation | Adam Fisch et.al. | 2406.04291 | null |
2024-06-07 | What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Nadav Borenstein et.al. | 2406.04289 | null |
2024-06-06 | Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People | Dun-Ming Huang et.al. | 2406.04278 | link |
2024-06-05 | Wings: Learning Multimodal LLMs without Text-only Forgetting | Yi-Kai Zhang et.al. | 2406.03496 | null |
2024-06-06 | Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training | Ao Sun et.al. | 2406.03488 | link |
2024-06-05 | Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Sanjana Ramprasad et.al. | 2406.03487 | null |
2024-06-05 | BIPED: Pedagogically Informed Tutoring System for ESL Education | Soonwoo Kwon et.al. | 2406.03486 | null |
2024-06-05 | Does your data spark joy? Performance gains from domain upsampling at the end of training | Cody Blakeney et.al. | 2406.03476 | null |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-05 | What is the Best Way for ChatGPT to Translate Poetry? | Shanshan Wang et.al. | 2406.03450 | null |
2024-06-05 | Pre-trained Large Language Models Use Fourier Features to Compute Addition | Tianyi Zhou et.al. | 2406.03445 | null |
2024-06-05 | Are language models rational? The case of coherence norms and belief revision | Thomas Hofweber et.al. | 2406.03442 | null |
2024-06-05 | Cycles of Thought: Measuring LLM Confidence through Stable Explanations | Evan Becker et.al. | 2406.03441 | null |
2024-06-05 | Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Moein Heidari et.al. | 2406.03430 | link |
2024-06-05 | Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee et.al. | 2406.03411 | link |
2024-06-05 | Automating Turkish Educational Quiz Generation Using Large Language Models | Kamyar Zeinalipour et.al. | 2406.03397 | link |
2024-06-05 | Log Parsing with Self-Generated In-Context Learning and Self-Correction | Yifan Wu et.al. | 2406.03376 | null |
2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | null |
2024-06-05 | CLMASP: Coupling Large Language Models with Answer Set Programming for Robotic Task Planning | Xinrui Lin et.al. | 2406.03367 | null |
2024-06-05 | LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Timon Ziegenbein et.al. | 2406.03363 | null |
2024-06-05 | Save It for the “Hot” Day: An LLM-Empowered Visual Analytics System for Heat Risk Management | Haobo Li et.al. | 2406.03317 | null |
2024-06-05 | The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games | Mikhail Mozikov et.al. | 2406.03299 | null |
2024-06-05 | SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms | Xingrun Xing et.al. | 2406.03287 | link |
2024-06-04 | Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks | Tianyu He et.al. | 2406.02550 | link |
2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | link |
2024-06-04 | Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Alex Jinpeng Wang et.al. | 2406.02547 | link |
2024-06-04 | To Believe or Not to Believe Your LLM | Yasin Abbasi Yadkori et.al. | 2406.02543 | null |
2024-06-04 | Loki: Low-Rank Keys for Efficient Sparse Attention | Prajwal Singhania et.al. | 2406.02542 | link |
2024-06-04 | Parrot: Multilingual Visual Instruction Tuning | Hai-Long Sun et.al. | 2406.02539 | link |
2024-06-04 | TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li et.al. | 2406.02537 | link |
2024-06-04 | Mitigate Position Bias in Large Language Models via Scaling a Single Dimension | Yijiong Yu et.al. | 2406.02536 | link |
2024-06-04 | SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices | Ruslan Svirschevski et.al. | 2406.02532 | link |
2024-06-04 | Scalable MatMul-free Language Modeling | Rui-Jie Zhu et.al. | 2406.02528 | link |
2024-06-04 | CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks | Maciej Besta et.al. | 2406.02524 | link |
2024-06-04 | RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots | Soroush Nasiriany et.al. | 2406.02523 | null |
2024-06-04 | Demystifying the Compression of Mixture-of-Experts Through a Unified Framework | Shwai He et.al. | 2406.02500 | link |
2024-06-04 | Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion | Jakub Hoscilowicz et.al. | 2406.02481 | link |
2024-06-04 | Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding | Zhihan Zhang et.al. | 2406.02472 | link |
2024-06-04 | Meta-Designing Quantum Experiments with Language Models | Sören Arlt et.al. | 2406.02470 | null |
2024-06-04 | Seed-TTS: A Family of High-Quality Versatile Speech Generation Models | Philip Anastassiou et.al. | 2406.02430 | link |
2024-06-04 | Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion | Ruiqi Li et.al. | 2406.02429 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-04 | Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data | Maxime Griot et.al. | 2406.02394 | link |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Code Pretraining Improves Entity Tracking Abilities of Language Models | Najoung Kim et.al. | 2405.21068 | null |
2024-05-31 | Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality | Tri Dao et.al. | 2405.21060 | link |
2024-05-31 | RydbergGPT | David Fitzek et.al. | 2405.21052 | link |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Grammar-Aligned Decoding | Kanghee Park et.al. | 2405.21047 | null |
2024-05-31 | Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF | Tengyang Xie et.al. | 2405.21046 | null |
2024-05-31 | Direct Alignment of Language Models via Quality-Aware Self-Refinement | Runsheng Yu et.al. | 2405.21040 | null |
2024-05-31 | Standards for Belief Representations in LLMs | Daniel A. Herrmann et.al. | 2405.21030 | null |
2024-05-31 | LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | Elias Stengel-Eskin et.al. | 2405.21028 | link |
2024-05-31 | You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | Zhen Qin et.al. | 2405.21022 | null |
2024-05-31 | Improved Techniques for Optimization-Based Jailbreaking on Large Language Models | Xiaojun Jia et.al. | 2405.21018 | link |
2024-06-04 | StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond | Pengyuan Lyu et.al. | 2405.21013 | null |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Linli Yao et.al. | 2405.20985 | link |
2024-05-31 | Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Feiteng Fang et.al. | 2405.20978 | link |
2024-05-31 | SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu et.al. | 2405.20974 | link |
2024-05-31 | LCQ: Low-Rank Codebook based Quantization for Large Language Models | Wen-Pu Cai et.al. | 2405.20973 | null |
2024-06-03 | Large Language Models are Zero-Shot Next Location Predictors | Ciro Beneduce et.al. | 2405.20962 | link |
2024-06-03 | A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs’ Humour Alignment with Comedians | Piotr Wojciech Mirowski et.al. | 2405.20956 | null |
2024-05-30 | MotionLLM: Understanding Human Behaviors from Human Motions and Videos | Ling-Hao Chen et.al. | 2405.20340 | link |
2024-05-30 | Visual Perception by Large Language Model’s Weights | Feipeng Ma et.al. | 2405.20339 | link |
2024-05-30 | Xwin-LM: Strong and Scalable Alignment Practice for LLMs | Bolin Ni et.al. | 2405.20335 | link |
2024-05-31 | ParSEL: Parameterized Shape Editing with Language | Aditya Ganeshan et.al. | 2405.20319 | null |
2024-05-30 | CausalQuest: Collecting Natural Causal Questions for AI Agents | Roberto Ceraolo et.al. | 2405.20318 | link |
2024-05-30 | ANAH: Analytical Annotation of Hallucinations in Large Language Models | Ziwei Ji et.al. | 2405.20315 | link |
2024-05-30 | Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | Guillaume Huguet et.al. | 2405.20313 | null |
2024-05-30 | Large Language Models Can Self-Improve At Web Agent Tasks | Ajay Patel et.al. | 2405.20309 | link |
2024-05-30 | Can’t make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models | Himangi Mittal et.al. | 2405.20305 | null |
2024-05-30 | Group Robust Preference Optimization in Reward-free RLHF | Shyam Sundhar Ramesh et.al. | 2405.20304 | link |
2024-05-30 | Who Writes the Review, Human or AI? | Panagiotis C. Theocharopoulos et.al. | 2405.20285 | null |
2024-05-30 | ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections | Massimo Bini et.al. | 2405.20271 | link |
2024-05-30 | Evaluating Large Language Model Biases in Persona-Steered Generation | Andy Liu et.al. | 2405.20253 | link |
2024-05-30 | Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization | Yuchi Liu et.al. | 2405.20252 | link |
2024-05-30 | Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use | Franz Louis Cesista et.al. | 2405.20245 | null |
2024-05-30 | Context Injection Attacks on Large Language Models | Cheng’an Wei et.al. | 2405.20234 | null |
2024-05-30 | Data-efficient fine-tuning of foundational models for first-principles quality sublimation enthalpies | Harveen Kaur et.al. | 2405.20217 | null |
2024-05-30 | TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models | Chen Zhang et.al. | 2405.20215 | null |
2024-05-30 | One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments | Ke Yi et.al. | 2405.20202 | null |
2024-05-31 | Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations | Zilin Ma et.al. | 2405.20195 | null |
2024-05-29 | X-VILA: Cross-Modality Alignment for Large Language Model | Hanrong Ye et.al. | 2405.19335 | null |
2024-05-29 | LLMs Meet Multimodal Generation and Editing: A Survey | Yingqing He et.al. | 2405.19334 | link |
2024-05-29 | Multi-Modal Generative Embedding Model | Feipeng Ma et.al. | 2405.19333 | null |
2024-05-29 | Self-Exploring Language Models: Active Preference Elicitation for Online Alignment | Shenao Zhang et.al. | 2405.19332 | link |
2024-05-29 | Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation | Atrisha Sarkar et.al. | 2405.19328 | null |
2024-05-29 | MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | Ge Zhang et.al. | 2405.19327 | link |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Nearest Neighbor Speculative Decoding for LLM Generation and Attribution | Minghan Li et.al. | 2405.19325 | null |
2024-05-29 | Are Large Language Models Chameleons? | Mingmeng Geng et.al. | 2405.19323 | null |
2024-05-29 | Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF | Shicong Cen et.al. | 2405.19320 | null |
2024-05-29 | Robust Preference Optimization through Reward Model Distillation | Adam Fisch et.al. | 2405.19316 | null |
2024-05-29 | Matryoshka Query Transformer for Large Vision-Language Models | Wenbo Hu et.al. | 2405.19315 | link |
2024-05-29 | Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice | Jian-Qiao Zhu et.al. | 2405.19313 | null |
2024-05-29 | Expert-Guided Extinction of Toxic Tokens for Debiased Generation | Xueyao Sun et.al. | 2405.19299 | null |
2024-05-29 | MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection | Michael Regan et.al. | 2405.19285 | null |
2024-05-29 | Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform | Viviane Potocnik et.al. | 2405.19284 | null |
2024-05-29 | Programmable Motion Generation for Open-Set Motion Control Tasks | Hanchao Liu et.al. | 2405.19283 | null |
2024-05-29 | PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications | Dingkang Yang et.al. | 2405.19266 | link |
2024-05-29 | AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data | Zifan Song et.al. | 2405.19265 | link |
2024-05-29 | Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | Zhanhui Zhou et.al. | 2405.19262 | link |
2024-05-28 | Why are Visually-Grounded Language Models Bad at Image Classification? | Yuhui Zhang et.al. | 2405.18415 | link |
2024-05-28 | Don’t Forget to Connect! Improving RAG with Graph-based Reranking | Jialin Dong et.al. | 2405.18414 | null |
2024-05-28 | WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization | Jiawei Ma et.al. | 2405.18405 | null |
2024-05-29 | Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | Ethan Shen et.al. | 2405.18400 | link |
2024-05-28 | Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning | Yixiao Zhang et.al. | 2405.18386 | link |
2024-05-28 | OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning | Pengxiang Li et.al. | 2405.18380 | link |
2024-05-28 | LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models | Anthony Sarah et.al. | 2405.18377 | null |
2024-05-28 | Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning | Dongjie Chen et.al. | 2405.18376 | link |
2024-05-28 | Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning | Phakphum Artkaew et.al. | 2405.18375 | link |
2024-05-28 | PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework | Eshaan Agarwal et.al. | 2405.18369 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs | Somnath Kumar et.al. | 2405.18359 | null |
2024-05-28 | MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning | Somnath Kumar et.al. | 2405.18358 | null |
2024-05-28 | Faithful Logical Reasoning via Symbolic Chain-of-Thought | Jundong Xu et.al. | 2405.18357 | link |
2024-05-28 | Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography | Jie Liu et.al. | 2405.18356 | link |
2024-05-28 | Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation | Anjanava Biswas et.al. | 2405.18346 | null |
2024-05-28 | The Battle of LLMs: A Comparative Study in Conversational QA Tasks | Aryan Rangapur et.al. | 2405.18344 | null |
2024-05-28 | Frustratingly Easy Test-Time Adaptation of Vision-Language Models | Matteo Farina et.al. | 2405.18330 | link |
2024-05-28 | Multi-modal Generation via Cross-Modal In-Context Learning | Amandeep Kumar et.al. | 2405.18304 | link |
2024-05-28 | Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning | Renzhi Wang et.al. | 2405.18292 | null |
2024-05-27 | Matryoshka Multimodal Models | Mu Cai et.al. | 2405.17430 | null |
2024-05-27 | NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models | Chankyu Lee et.al. | 2405.17428 | null |
2024-05-27 | Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | Kuan-Chih Huang et.al. | 2405.17427 | link |
2024-05-27 | LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | Zhuoling Li et.al. | 2405.17424 | null |
2024-05-27 | Privacy-Aware Visual Language Models | Laurens Samson et.al. | 2405.17423 | null |
2024-05-27 | Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation | Jiaming Liu et.al. | 2405.17418 | null |
2024-05-27 | THREAD: Thinking Deeper with Recursive Spawning | Philip Schroeder et.al. | 2405.17402 | link |
2024-05-27 | The Expressive Capacity of State Space Models: A Formal Language Perspective | Yash Sarrof et.al. | 2405.17394 | null |
2024-05-27 | MindMerger: Efficient Boosting LLM Reasoning in non-English Languages | Zixian Huang et.al. | 2405.17386 | link |
2024-05-27 | Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective | Zhen Qin et.al. | 2405.17383 | null |
2024-05-27 | ReMoDetect: Reward Models Recognize Aligned LLM’s Generations | Hyunseok Lee et.al. | 2405.17382 | link |
2024-05-27 | Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | Zhen Qin et.al. | 2405.17381 | link |
2024-05-27 | RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects | Ahmed Allam et.al. | 2405.17378 | link |
2024-05-28 | Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models | ShengYun Peng et.al. | 2405.17374 | link |
2024-05-27 | Prompt Optimization with Human Feedback | Xiaoqiang Lin et.al. | 2405.17346 | link |
2024-05-27 | Exploring and steering the moral compass of Large Language Models | Alejandro Tlaie et.al. | 2405.17345 | link |
2024-05-27 | Cost-efficient Knowledge-based Question Answering with Large Language Models | Junnan Dong et.al. | 2405.17337 | null |
2024-05-27 | XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser | Xianfu Cheng et.al. | 2405.17336 | link |
2024-05-27 | FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation | Yuting Ma et.al. | 2405.17267 | null |
2024-05-27 | On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao et.al. | 2405.17264 | link |
2024-05-24 | Scaling Laws for Discriminative Classification in Large Language Models | Dean Wyatte et.al. | 2405.15765 | null |
2024-05-24 | Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence | Abhinav Patil et.al. | 2405.15750 | link |
2024-05-24 | Sparse maximal update parameterization: A holistic approach to sparse training dynamics | Nolan Dey et.al. | 2405.15743 | link |
2024-05-24 | Large Language Models Reflect Human Citation Patterns with a Heightened Citation Bias | Andres Algaba et.al. | 2405.15739 | link |
2024-05-24 | LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | Boyang Zheng et.al. | 2405.15734 | link |
2024-05-24 | Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks | Jerome Sieber et.al. | 2405.15731 | link |
2024-05-24 | Optimizing Large Language Models for OpenAPI Code Completion | Bohdan Petryshyn et.al. | 2405.15729 | link |
2024-05-24 | Disease-informed Adaptation of Vision-Language Models | Jiajin Zhang et.al. | 2405.15728 | link |
2024-05-24 | The Impact of Geometric Complexity on Neural Collapse in Transfer Learning | Michael Munn et.al. | 2405.15706 | null |
2024-05-24 | Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models | Yue Zhang et.al. | 2405.15684 | null |
2024-05-24 | VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap | Sreyan Ghosh et.al. | 2405.15683 | link |
2024-05-24 | What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | Abdelrahman Abdelhamed et.al. | 2405.15668 | null |
2024-05-24 | Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning | Wenhan Chang et.al. | 2405.15662 | null |
2024-05-24 | \(\mathbf{L^2\cdot M = C^2}\) Large Language Models as Covert Channels… a Systematic Analysis | Simen Gaure et.al. | 2405.15652 | null |
2024-05-24 | LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots | Ruoyu Wang et.al. | 2405.15646 | null |
2024-05-24 | GECKO: Generative Language Model for English, Code and Korean | Sungwoo Oh et.al. | 2405.15640 | null |
2024-05-24 | M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models | Hongyu Wang et.al. | 2405.15638 | link |
2024-05-24 | GPTZoo: A Large-scale Dataset of GPTs for the Research Community | Xinyi Hou et.al. | 2405.15630 | link |
2024-05-24 | A Comparative Analysis of Distributed Training Strategies for GPT-2 | Ishan Patwardhan et.al. | 2405.15628 | null |
2024-05-24 | Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment | Hao Sun et.al. | 2405.15624 | null |
2024-05-23 | PuzzleAvatar: Assembling 3D Avatars from Personal Albums | Yuliang Xiu et.al. | 2405.14869 | link |
2024-05-23 | A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns | Asaf Yehudai et.al. | 2405.14863 | null |
2024-05-23 | Bitune: Bidirectional Instruction-Tuning | Dawid J. Kopiczko et.al. | 2405.14862 | null |
2024-05-23 | Not All Language Model Features Are Linear | Joshua Engels et.al. | 2405.14860 | link |
2024-05-23 | PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression | Vladimir Malinovskii et.al. | 2405.14852 | link |
2024-05-23 | A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis | Yue Yang et.al. | 2405.14839 | null |
2024-05-23 | From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step | Yuntian Deng et.al. | 2405.14838 | link |
2024-05-23 | HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models | Bernal Jiménez Gutiérrez et.al. | 2405.14831 | link |
2024-05-23 | Designing A Sustainable Marine Debris Clean-up Framework without Human Labels | Raymond Wang et.al. | 2405.14815 | link |
2024-05-23 | As an AI Language Model, “Yes I Would Recommend Calling the Police’’: Norm Inconsistency in LLM Decision-Making | Shomik Jain et.al. | 2405.14812 | null |
2024-05-23 | Implicit Personalization in Language Models: A Systematic Study | Zhijing Jin et.al. | 2405.14808 | link |
2024-05-23 | Can LLMs Solve longer Math Word Problems Better? | Xin Xu et.al. | 2405.14804 | null |
2024-05-23 | Lessons from the Trenches on Reproducible Evaluation of Language Models | Stella Biderman et.al. | 2405.14782 | null |
2024-05-23 | WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | Peng Wang et.al. | 2405.14768 | link |
2024-05-23 | FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Hongyang Yang et.al. | 2405.14767 | link |
2024-05-23 | Evaluating Large Language Models for Public Health Classification and Extraction Tasks | Joshua Harris et.al. | 2405.14766 | null |
2024-05-23 | Large language models can be zero-shot anomaly detectors for time series? | Sarah Alnegheimish et.al. | 2405.14755 | link |
2024-05-23 | A Transformer-Based Approach for Smart Invocation of Automatic Code Completion | Aral de Moor et.al. | 2405.14753 | link |
2024-05-23 | MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs | Georgios Chatzigeorgakidis et.al. | 2405.14748 | null |
2024-05-23 | Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View | Xuan Liu et.al. | 2405.14744 | null |
2024-05-21 | Reducing Transformer Key-Value Cache Size with Cross-Layer Attention | William Brandon et.al. | 2405.12981 | null |
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979 | link |
2024-05-21 | BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once | Theodore Zhao et.al. | 2405.12971 | null |
2024-05-21 | Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale | Shriram Chennakesavalu et.al. | 2405.12961 | link |
2024-05-21 | Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models | Zhangyue Yin et.al. | 2405.12939 | link |
2024-05-21 | Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel et.al. | 2405.12933 | null |
2024-05-21 | Code-mixed Sentiment and Hate-speech Prediction | Anjali Yadav et.al. | 2405.12929 | link |
2024-05-21 | Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples | Tim Menzies et.al. | 2405.12920 | link |
2024-05-21 | G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation | Xingyuan Pan et.al. | 2405.12915 | link |
2024-05-21 | An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation | Zhiyu Tan et.al. | 2405.12914 | link |
2024-05-21 | Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment | Holli Sargeant et.al. | 2405.12910 | link |
2024-05-21 | Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents | San Kim et.al. | 2405.12900 | null |
2024-05-21 | Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models | Abdurahmman Alzahrani et.al. | 2405.12884 | null |
2024-05-21 | LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language | James Requeima et.al. | 2405.12856 | link |
2024-05-21 | OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models | Zhaojian Yu et.al. | 2405.12843 | link |
2024-05-21 | SmartFlow: Robotic Process Automation using LLMs | Arushi Jain et.al. | 2405.12842 | null |
2024-05-21 | Large Language Models Meet NLP: A Survey | Libo Qin et.al. | 2405.12819 | link |
2024-05-21 | Test Oracle Automation in the era of LLMs | Facundo Molina et.al. | 2405.12766 | null |
2024-05-21 | C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning | Ji Ma et.al. | 2405.12752 | null |
2024-05-21 | Generative AI and Large Language Models for Cyber Security: All Insights You Need | Mohamed Amine Ferrag et.al. | 2405.12750 | null |
2024-05-20 | Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning | Guanglin Zhou et.al. | 2405.12217 | link |
2024-05-20 | MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark | Hongwei Liu et.al. | 2405.12209 | link |
2024-05-20 | Developers’ Perceptions on the Impact of ChatGPT in Software Development: A Survey | Thiago S. Vaillant et.al. | 2405.12195 | link |
2024-05-20 | CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models | Haoxiang Shi et.al. | 2405.12174 | null |
2024-05-20 | Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging | Xiaobo Liang et.al. | 2405.12163 | link |
2024-05-20 | Eliciting Problem Specifications via Large Language Models | Robert E. Wray et.al. | 2405.12147 | null |
2024-05-20 | DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM | Xuchen Li et.al. | 2405.12139 | null |
2024-05-20 | MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning | Ting Jiang et.al. | 2405.12130 | link |
2024-05-20 | Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation | Zhankui He et.al. | 2405.12119 | null |
2024-05-20 | Imp: Highly Capable Large Multimodal Models for Mobile Devices | Zhenwei Shao et.al. | 2405.12107 | link |
2024-05-20 | DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction | Hao Chen et.al. | 2405.12100 | null |
2024-05-20 | Distributional Semantics, Holism, and the Instability of Meaning | Jumbly Grindrod et.al. | 2405.12084 | null |
2024-05-20 | PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation | Zhuobin Huang et.al. | 2405.12079 | null |
2024-05-20 | CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models | Tong Zhang et.al. | 2405.12063 | link |
2024-05-20 | STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents | Yue Chen et.al. | 2405.12059 | null |
2024-05-20 | KG-RAG: Bridging the Gap Between Knowledge and Creativity | Diego Sanmartin et.al. | 2405.12035 | null |
2024-05-20 | Can AI Relate: Testing Large Language Model Response for Mental Health Support | Saadia Gabriel et.al. | 2405.12021 | link |
2024-05-20 | MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering | Jingqun Tang et.al. | 2405.11985 | link |
2024-05-20 | A review on the use of large language models as virtual tutors | Silvia García-Méndez et.al. | 2405.11983 | null |
2024-05-20 | Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays | Zhichao Sun et.al. | 2405.11976 | link |
2024-05-17 | Observational Scaling Laws and the Predictability of Language Model Performance | Yangjun Ruan et.al. | 2405.10938 | link |
2024-05-17 | A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers | Kaiyu Huang et.al. | 2405.10936 | link |
2024-05-17 | The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks | Lucius Bushnaq et.al. | 2405.10928 | link |
2024-05-17 | Blackbox Adaptation for Medical Image Segmentation | Jay N. Paranjape et.al. | 2405.10913 | link |
2024-05-17 | COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain | Dimitrios P. Panagoulias et.al. | 2405.10893 | null |
2024-05-17 | Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: Systematic Literature Review | Hongyi Yang et.al. | 2405.10883 | null |
2024-05-17 | ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains | Zhaopei Huang et.al. | 2405.10860 | link |
2024-05-17 | The Future of Large Language Model Pre-training is Federated | Lorenzo Sani et.al. | 2405.10853 | null |
2024-05-17 | Open-Vocabulary Spatio-Temporal Action Detection | Tao Wu et.al. | 2405.10832 | null |
2024-05-17 | Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities | Hao Zhou et.al. | 2405.10825 | null |
2024-05-17 | ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios | Markus Bayer et.al. | 2405.10808 | null |
2024-05-17 | The Relational Machine Calculus | Chris Barrett et.al. | 2405.10801 | null |
2024-05-17 | Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings | Albert Sawczyn et.al. | 2405.10745 | null |
2024-05-17 | Efficient Multimodal Large Language Models: A Survey | Yizhang Jin et.al. | 2405.10739 | link |
2024-05-17 | INDUS: Effective and Efficient Language Models for Scientific Applications | Bishwaranjan Bhattacharjee et.al. | 2405.10725 | null |
2024-05-17 | SignLLM: Sign Languages Production Large Language Models | Sen Fang et.al. | 2405.10718 | null |
2024-05-17 | Persian Pronoun Resolution: Leveraging Neural Networks and Language Models | Hassan Haji Mohammadi et.al. | 2405.10714 | null |
2024-05-17 | SynDy: Synthetic Dynamic Dataset Generation Framework for Misinformation Tasks | Michael Shliselberg et.al. | 2405.10700 | null |
2024-05-17 | Revolutionizing Process Mining: A Novel Architecture for ChatGPT Integration and Enhanced User Experience through Optimized Prompt Engineering | Mehrdad Agha Mohammad Ali Kermani et.al. | 2405.10689 | null |
2024-05-17 | Realistic Evaluation of Toxicity in Large Language Models | Tinh Son Luong et.al. | 2405.10659 | null |
2024-05-16 | UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models | Sahel Sharifymoghaddam et.al. | 2405.10311 | null |
2024-05-16 | 4D Panoptic Scene Graph Generation | Jingkang Yang et.al. | 2405.10305 | link |
2024-05-16 | Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Yu Gui et.al. | 2405.10301 | link |
2024-05-16 | HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | Rhea Sanjay Sukthanker et.al. | 2405.10299 | link |
2024-05-17 | Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Yuexiang Zhai et.al. | 2405.10292 | null |
2024-05-16 | Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction | Jianhao Chen et.al. | 2405.10288 | link |
2024-05-16 | FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models | Adrian Bulat et.al. | 2405.10286 | null |
2024-05-16 | Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers | Tuo Zhang et.al. | 2405.10276 | null |
2024-05-16 | Keep It Private: Unsupervised Privatization of Online Text | Calvin Bao et.al. | 2405.10260 | link |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-16 | PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology | George Shaikovski et.al. | 2405.10254 | null |
2024-05-16 | A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks | Xuanfan Ni et.al. | 2405.10251 | null |
2024-05-16 | IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers | Hao Yan et.al. | 2405.10250 | null |
2024-05-16 | A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts | Xinru Zhang et.al. | 2405.10246 | link |
2024-05-16 | DocuMint: Docstring Generation for Python using Small Language Models | Bibek Poudel et.al. | 2405.10243 | link |
2024-05-16 | Low-Rank Adaptation of Time Series Foundational Models for Out-of-Domain Modality Forecasting | Divij Gupta et.al. | 2405.10216 | null |
2024-05-16 | CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations | Jiahao Zhao et.al. | 2405.10212 | link |
2024-05-16 | LFED: A Literary Fiction Evaluation Dataset for Large Language Models | Linhao Yu et.al. | 2405.10166 | link |
2024-05-16 | PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | Jiancheng Pan et.al. | 2405.10160 | link |
2024-05-16 | Speaker Verification in Agent-Generated Conversations | Yizhe Yang et.al. | 2405.10150 | null |
2024-05-15 | Modeling Bilingual Sentence Processing: Evaluating RNN and Transformer Architectures for Cross-Language Structural Priming | Bushi Xiao et.al. | 2405.09508 | null |
2024-05-15 | Constrained Learning for Causal Inference and Semiparametric Statistics | Tiffany Tianhui Cai et.al. | 2405.09493 | null |
2024-05-15 | Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts | Donya Rooein et.al. | 2405.09482 | null |
2024-05-15 | Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models | Majid Zarharan et.al. | 2405.09454 | link |
2024-05-15 | M $^4$ oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts | Yufeng Jiang et.al. | 2405.09446 | link |
2024-05-15 | Facilitating Opinion Diversity through Hybrid NLP Approaches | Michiel van der Meer et.al. | 2405.09439 | null |
2024-05-15 | A Survey On Text-to-3D Contents Generation In The Wild | Chenhan Jiang et.al. | 2405.09431 | null |
2024-05-15 | MicroPython Testbed for Federated Learning Algorithms | Miroslav Popovic et.al. | 2405.09423 | link |
2024-05-15 | Matching domain experts by training from scratch on domain knowledge | Xiaoliang Luo et.al. | 2405.09395 | null |
2024-05-15 | Compositional imprecise probability | Jack Liell-Cock et.al. | 2405.09391 | null |
2024-05-15 | PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models | Devansh Jain et.al. | 2405.09373 | link |
2024-05-15 | SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition | Weijie L et.al. | 2405.09365 | link |
2024-05-15 | Large Language Model Bias Mitigation from the Perspective of Knowledge Editing | Ruizhe Chen et.al. | 2405.09341 | null |
2024-05-15 | Prompting-based Synthetic Data Generation for Few-Shot Question Answering | Maximilian Schmidt et.al. | 2405.09335 | link |
2024-05-15 | Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls | Pedro Miguel Sánchez Sánchez et.al. | 2405.09318 | null |
2024-05-15 | Comparing the Efficacy of GPT-4 and Chat-GPT in Mental Health Care: A Blind Assessment of Large Language Models for Psychological Support | Birger Moell et.al. | 2405.09300 | null |
2024-05-15 | Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology | Hagyeong Shin et.al. | 2405.09293 | null |
2024-05-15 | Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection | Dylan Phelps et.al. | 2405.09279 | null |
2024-05-15 | Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study | Chi Ma et.al. | 2405.09274 | null |
2024-05-15 | New Textual Corpora for Serbian Language Modeling | Mihailo Škorić et.al. | 2405.09250 | null |
2024-05-14 | Efficient Vision-Language Pre-training by Cluster Masking | Zihao Wei et.al. | 2405.08815 | link |
2024-05-14 | Towards Enhanced RAC Accessibility: Leveraging Datasets and LLMs | Edison Jair Bejarano Sepulveda et.al. | 2405.08792 | link |
2024-05-14 | Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | Tiantian Zhang et.al. | 2405.08786 | link |
2024-05-14 | Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Intent Resolution in LLMs | Akhila Yerukola et.al. | 2405.08760 | link |
2024-05-14 | Distributed Threat Intelligence at the Edge Devices: A Large Language Model-Driven Approach | Syed Mhamudul Hasan et.al. | 2405.08755 | null |
2024-05-14 | Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Zhimin Li et.al. | 2405.08748 | link |
2024-05-14 | Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory | Xueyan Niu et.al. | 2405.08707 | null |
2024-05-14 | EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera | Beilei Cui et.al. | 2405.08672 | link |
2024-05-14 | Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research | Qinglong Cao et.al. | 2405.08668 | link |
2024-05-14 | Thinking Tokens for Language Modeling | David Herel et.al. | 2405.08644 | null |
2024-05-15 | ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation | Dimitris Gkoumas et.al. | 2405.08619 | null |
2024-05-14 | A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine | Hanguang Xiao et.al. | 2405.08603 | null |
2024-05-15 | EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark | Xiaohui Zhang et.al. | 2405.08596 | link |
2024-05-14 | Open-Vocabulary Object Detection via Neighboring Region Attention Alignment | Sunyuan Qiang et.al. | 2405.08593 | null |
2024-05-14 | Improving Transformers with Dynamically Composable Multi-Head Attention | Da Xiao et.al. | 2405.08553 | link |
2024-05-14 | Self-Distillation Improves DNA Sequence Inference | Tong Yu et.al. | 2405.08538 | link |
2024-05-14 | Falcon 7b for Software Mention Detection in Scholarly Documents | AmeerAli Khan et.al. | 2405.08514 | null |
2024-05-14 | Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure | Odysseas S. Chlapanis et.al. | 2405.08502 | link |
2024-05-14 | Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models | Agne Knietaite et.al. | 2405.08497 | link |
2024-05-14 | Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models | Andrea Piergentili et.al. | 2405.08477 | null |
2024-05-13 | Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots | Chengyue Wu et.al. | 2405.07990 | null |
2024-05-13 | A Generalist Learner for Multifaceted Medical Image Interpretation | Hong-Yu Zhou et.al. | 2405.07988 | null |
2024-05-13 | The Platonic Representation Hypothesis | Minyoung Huh et.al. | 2405.07987 | link |
2024-05-13 | Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation | Kevin Stangl et.al. | 2405.07969 | null |
2024-05-13 | PyZoBot: A Platform for Conversational Information Extraction and Synthesis from Curated Zotero Reference Libraries through Advanced Retrieval-Augmented Generation | Suad Alshammari et.al. | 2405.07963 | link |
2024-05-13 | AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | Samuel Schmidgall et.al. | 2405.07960 | null |
2024-05-13 | EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning | Yinzhu Quan et.al. | 2405.07938 | link |
2024-05-14 | PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition | Ziyang Zhang et.al. | 2405.07932 | link |
2024-05-13 | Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data | Mahdi Morafah et.al. | 2405.07925 | null |
2024-05-13 | Can Better Text Semantics in Prompt Tuning Improve VLM Generalization? | Hari Chandana Kuchibhotla et.al. | 2405.07921 | null |
2024-05-13 | A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking | Ferdinand Schlatt et.al. | 2405.07920 | link |
2024-05-13 | PLUTO: Pathology-Universal Transformer | Dinkar Juyal et.al. | 2405.07905 | null |
2024-05-13 | Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers | Alena Tsanda et.al. | 2405.07886 | link |
2024-05-13 | Zero-Shot Tokenizer Transfer | Benjamin Minixhofer et.al. | 2405.07883 | link |
2024-05-13 | RLHF Workflow: From Reward Modeling to Online RLHF | Hanze Dong et.al. | 2405.07863 | link |
2024-05-13 | Can LLMs Help Predict Elections? (Counter)Evidence from the World’s Largest Democracy | Pratik Gujral et.al. | 2405.07828 | null |
2024-05-13 | A View of How Language Models Will Transform Law | Frank Fagan et.al. | 2405.07826 | null |
2024-05-13 | FreeVA: Offline MLLM as Training-Free Video Assistant | Wenhao Wu et.al. | 2405.07798 | link |
2024-05-13 | DEPTH: Discourse Education through Pre-Training Hierarchically | Zachary Bamberger et.al. | 2405.07788 | link |
2024-05-13 | Generating Human Motion in 3D Scenes from Text Descriptions | Zhi Cen et.al. | 2405.07784 | null |
2024-05-10 | Linearizing Large Language Models | Jean Mercat et.al. | 2405.06640 | link |
2024-05-10 | Value Augmented Sampling for Language Model Alignment and Personalization | Seungwook Han et.al. | 2405.06639 | link |
2024-05-10 | Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark | Evan M. Williams et.al. | 2405.06634 | link |
2024-05-10 | Characterizing the Accuracy - Efficiency Trade-off of Low-rank Decomposition in Language Models | Chakshu Moar et.al. | 2405.06626 | null |
2024-05-10 | Explaining Text Similarity in Transformer Models | Alexandros Vasileiou et.al. | 2405.06604 | link |
2024-05-10 | Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach | Elham Ravanbakhsh et.al. | 2405.06586 | null |
2024-05-10 | What Can Natural Language Processing Do for Peer Review? | Ilia Kuznetsov et.al. | 2405.06563 | link |
2024-05-10 | Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval | Mengjia Niu et.al. | 2405.06545 | null |
2024-05-10 | Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts | Wenyu Huang et.al. | 2405.06524 | null |
2024-05-10 | UniDM: A Unified Framework for Data Manipulation with Large Language Models | Yichen Qian et.al. | 2405.06510 | null |
2024-05-10 | Storypark: Leveraging Large Language Models to Enhance Children Story Learning Through Child-AI collaboration Storytelling | Lyumanshan Ye et.al. | 2405.06495 | null |
2024-05-10 | Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | Yaoqin Ye et.al. | 2405.06468 | link |
2024-05-10 | Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | JoonHo Lee et.al. | 2405.06424 | link |
2024-05-10 | Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? | Hunter McNichols et.al. | 2405.06414 | link |
2024-05-10 | Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL | Ning Cheng et.al. | 2405.06410 | null |
2024-05-10 | Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus | Filipe Marinho Rocha et.al. | 2405.06399 | null |
2024-05-10 | Memory Mosaics | Jianyu Zhang et.al. | 2405.06394 | link |
2024-05-10 | LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play | Li-Chun Lu et.al. | 2405.06373 | link |
2024-05-10 | LMD3: Language Model Data Density Dependence | John Kirchenbauer et.al. | 2405.06331 | null |
2024-05-10 | Correlation Dimension of Natural Language in a Statistical Manifold | Xin Du et.al. | 2405.06321 | null |
2024-05-09 | Natural Language Processing RELIES on Linguistics | Juri Opitz et.al. | 2405.05966 | null |
2024-05-09 | OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning | Dan Qiao et.al. | 2405.05957 | link |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning | Junzhi Chen et.al. | 2405.05955 | link |
2024-05-09 | CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | Jiachen Li et.al. | 2405.05949 | link |
2024-05-09 | DOLOMITES: Domain-Specific Long-Form Methodical Tasks | Chaitanya Malaviya et.al. | 2405.05938 | null |
2024-05-09 | Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness | Siyuan Li et.al. | 2405.05930 | null |
2024-05-09 | Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman et.al. | 2405.05904 | null |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-05-09 | FlockGPT: Guiding UAV Flocking with Linguistic Orchestration | Artem Lykov et.al. | 2405.05872 | null |
2024-05-09 | Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Gunshi Gupta et.al. | 2405.05852 | link |
2024-05-09 | Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning | Artem Lykov et.al. | 2405.05824 | link |
2024-05-09 | Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference | Zhihang Lin et.al. | 2405.05803 | link |
2024-05-09 | Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language | Ronny Paul et.al. | 2405.05777 | null |
2024-05-09 | Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions | Polina Tsvilodub et.al. | 2405.05776 | null |
2024-05-09 | Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization | Zeyi Wang et.al. | 2405.05767 | null |
2024-05-09 | Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media | Zhizhen Zhang et.al. | 2405.05760 | null |
2024-05-09 | Exploring the Potential of Human-LLM Synergy in Advancing Qualitative Analysis: A Case Study on Mental-Illness Stigma | Han Meng et.al. | 2405.05758 | null |
2024-05-09 | Can large language models understand uncommon meanings of common words? | Jinyang Wu et.al. | 2405.05741 | null |
2024-05-09 | Evaluating Dialect Robustness of Language Models via Conversation Understanding | Dipankar Srirag et.al. | 2405.05688 | link |
2024-05-08 | THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models | Prannay Kaul et.al. | 2405.05256 | null |
2024-05-09 | You Only Cache Once: Decoder-Decoder Architectures for Language Models | Yutao Sun et.al. | 2405.05254 | link |
2024-05-08 | Open Source Language Models Can Provide Feedback: Evaluating LLMs’ Ability to Help Students Using GPT-4-As-A-Judge | Charles Koutcheme et.al. | 2405.05253 | link |
2024-05-09 | LLMs with Personalities in Multi-issue Negotiation Games | Sean Noh et.al. | 2405.05248 | null |
2024-05-08 | EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning | Jingfeng Yao et.al. | 2405.05237 | link |
2024-05-08 | SuFIA: Language-Guided Augmented Dexterity for Robotic Surgical Assistants | Masoud Moghani et.al. | 2405.05226 | null |
2024-05-08 | Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers | Jiuxiang Gu et.al. | 2405.05219 | null |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-08 | MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning | Inderjeet Nair et.al. | 2405.05189 | link |
2024-05-08 | Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming | Tommaso Pasini et.al. | 2405.05176 | null |
2024-05-08 | Air Gap: Protecting Privacy-Conscious Conversational Agents | Eugene Bagdasaryan et.al. | 2405.05175 | null |
2024-05-08 | XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Peiqin Lin et.al. | 2405.05116 | link |
2024-05-08 | QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs | Weijia Zhang et.al. | 2405.05109 | null |
2024-05-08 | Concerns on Bias in Large Language Models when Creating Synthetic Personae | Helena A. Haxvig et.al. | 2405.05080 | null |
2024-05-08 | Impact of Tone-Aware Explanations in Recommender Systems | Ayano Okoso et.al. | 2405.05061 | null |
2024-05-08 | Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models | Aylin Gunal et.al. | 2405.05060 | null |
2024-05-08 | Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources | Lasse Hyldig Hansen et.al. | 2405.05049 | null |
2024-05-08 | ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields | Ning Wang et.al. | 2405.05010 | null |
2024-05-08 | ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi et.al. | 2405.05008 | link |
2024-05-08 | NAVRepair: Node-type Aware C/C++ Code Vulnerability Repair | Ruoke Wang et.al. | 2405.04994 | null |
2024-05-07 | ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning | Jing Lin et.al. | 2405.04533 | null |
2024-05-07 | QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Yujun Lin et.al. | 2405.04532 | link |
2024-05-07 | NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts | Shudan Zhang et.al. | 2405.04520 | null |
2024-05-07 | xLSTM: Extended Long Short-Term Memory | Maximilian Beck et.al. | 2405.04517 | link |
2024-05-07 | A Transformer with Stack Attention | Jiaoda Li et.al. | 2405.04515 | link |
2024-05-08 | Unveiling Disparities in Web Task Handling Between Human and Web Agent | Kihoon Son et.al. | 2405.04497 | null |
2024-05-07 | Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions | Alexis Ross et.al. | 2405.04495 | null |
2024-05-07 | Representation Learning of Daily Movement Data Using Text Encoders | Alexander Capstick et.al. | 2405.04494 | link |
2024-05-08 | DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model | DeepSeek-AI et.al. | 2405.04434 | link |
2024-05-07 | The Silicone Ceiling: Auditing GPT’s Race and Gender Biases in Hiring | Lena Armstrong et.al. | 2405.04412 | null |
2024-05-07 | Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks | Georgios Pantazopoulos et.al. | 2405.04403 | link |
2024-05-07 | Large Language Models Cannot Explain Themselves | Advait Sarkar et.al. | 2405.04382 | null |
2024-05-07 | A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI | Hannah Chafetz et.al. | 2405.04333 | null |
2024-05-07 | Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation | Atharvan Dogra et.al. | 2405.04325 | null |
2024-05-07 | Granite Code Models: A Family of Open Foundation Models for Code Intelligence | Mayank Mishra et.al. | 2405.04324 | link |
2024-05-07 | Accelerating Speculative Decoding using Dynamic Speculation Length | Jonathan Mamou et.al. | 2405.04304 | null |
2024-05-07 | Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework | Xiangpeng Wan et.al. | 2405.04294 | link |
2024-05-07 | Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | Junchao Wu et.al. | 2405.04286 | null |
2024-05-07 | On the Foundations of Earth and Climate Foundation Models | Xiao Xiang Zhu et.al. | 2405.04285 | null |
2024-05-07 | Semantic API Alignment: Linking High-level User Goals to APIs | Robert Feldt et.al. | 2405.04236 | null |
2024-05-06 | Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs | Muhammad Uzair Khattak et.al. | 2405.03690 | null |
2024-05-06 | Pose Priors from Language Models | Sanjay Subramanian et.al. | 2405.03689 | null |
2024-05-06 | Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames | Keith Burghardt et.al. | 2405.03688 | link |
2024-05-06 | Language-Image Models with 3D Understanding | Jang Hyun Cho et.al. | 2405.03685 | null |
2024-05-06 | AtomGPT: Atomistic Generative Pre-trained Transformer for Forward and Inverse Materials Design | Kamal Choudhary et.al. | 2405.03680 | link |
2024-05-06 | When LLMs Meet Cybersecurity: A Systematic Literature Review | Jie Zhang et.al. | 2405.03644 | link |
2024-05-06 | A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama | Vlad-Andrei Cursaru et.al. | 2405.03616 | null |
2024-05-06 | GREEN: Generative Radiology Report Evaluation and Error Notation | Sophie Ostmeier et.al. | 2405.03595 | null |
2024-05-06 | Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment | Abhinav Agarwalla et.al. | 2405.03594 | null |
2024-05-06 | Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing | Han Liu et.al. | 2405.03565 | null |
2024-05-07 | ID-centric Pre-training for Recommendation | Yiqing Wu et.al. | 2405.03562 | null |
2024-05-06 | AlphaMath Almost Zero: process Supervision without process | Guoxin Chen et.al. | 2405.03553 | link |
2024-05-06 | MAmmoTH2: Scaling Instructions from the Web | Xiang Yue et.al. | 2405.03548 | null |
2024-05-06 | Position Paper: Leveraging Foundational Models for Black-Box Optimization: Benefits, Challenges, and Future Directions | Xingyou Song et.al. | 2405.03547 | null |
2024-05-06 | Are Human Rules Necessary? Generating Reusable APIs with CoT Reasoning and In-Context Learning | Yubo Mai et.al. | 2405.03509 | null |
2024-05-06 | UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images | Yiting Qu et.al. | 2405.03486 | null |
2024-05-06 | LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | Haowen Sun et.al. | 2405.03485 | link |
2024-05-06 | Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search | Hideaki Joko et.al. | 2405.03480 | link |
2024-05-07 | Large Language Models (LLMs) as Agents for Augmented Democracy | Jairo Gudiño-Rosero et.al. | 2405.03452 | null |
2024-05-06 | SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence | Hangyuan Ji et.al. | 2405.03446 | link |
2024-05-03 | Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models | Piotr Padlewski et.al. | 2405.02287 | link |
2024-05-03 | Structural Pruning of Pre-trained Language Models via Neural Architecture Search | Aaron Klein et.al. | 2405.02267 | link |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-03 | Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows | Jasmine Y. Shih et.al. | 2405.02260 | null |
2024-05-03 | What matters when building vision-language models? | Hugo Laurençon et.al. | 2405.02246 | null |
2024-05-03 | REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs | Deepa Tilwani et.al. | 2405.02228 | null |
2024-05-03 | Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks | Lujing Zhang et.al. | 2405.02225 | null |
2024-05-03 | FairEvalLLM. A Comprehensive Framework for Benchmarking Fairness in Large Language Model Recommender Systems | Yashar Deldjoo et.al. | 2405.02219 | null |
2024-05-03 | Automatic Programming: Large Language Models and Beyond | Michael R. Lyu et.al. | 2405.02213 | null |
2024-05-03 | Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh et.al. | 2405.02178 | null |
2024-05-03 | Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset | Hsuvas Borkakoty et.al. | 2405.02175 | link |
2024-05-03 | Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models | Mohamad Al Mdfaa et.al. | 2405.02162 | null |
2024-05-03 | Neural Context Flows for Learning Generalizable Dynamical Systems | Roussel Desmond Nzoyem et.al. | 2405.02154 | link |
2024-05-03 | The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates | Giuseppe Russo Latona et.al. | 2405.02150 | link |
2024-05-03 | MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang et.al. | 2405.02144 | null |
2024-05-03 | Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection | Guillem Ramírez et.al. | 2405.02134 | null |
2024-05-03 | Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets | Xuelong Geng et.al. | 2405.02132 | link |
2024-05-03 | Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph | Vladyslav Nechakhin et.al. | 2405.02105 | null |
2024-05-03 | Argumentative Large Language Models for Explainable and Contestable Decision-Making | Gabriel Freedman et.al. | 2405.02079 | null |
2024-05-03 | Comparative Analysis of Retrieval Systems in the Real World | Dmytro Mozolevskyi et.al. | 2405.02048 | null |
2024-05-02 | Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim et.al. | 2405.01535 | link |
2024-05-02 | Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks | Murtaza Dalal et.al. | 2405.01534 | null |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-05-02 | FLAME: Factuality-Aware Alignment for Large Language Models | Sheng-Chieh Lin et.al. | 2405.01525 | null |
2024-05-03 | A separability-based approach to quantifying generalization: which layer is best? | Luciano Dyballa et.al. | 2405.01524 | link |
2024-05-02 | Transformer-Aided Semantic Communications | Matin Mortaheb et.al. | 2405.01521 | null |
2024-05-02 | D2PO: Discriminator-Guided DPO with Response Evaluation Models | Prasann Singhal et.al. | 2405.01511 | link |
2024-05-02 | Analyzing the Role of Semantic Representations in the Era of Large Language Models | Zhijing Jin et.al. | 2405.01502 | link |
2024-05-02 | Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models | Raymond Fok et.al. | 2405.01501 | null |
2024-05-02 | Controllable Text Generation in the Instruction-Tuning Era | Dhananjay Ashok et.al. | 2405.01490 | null |
2024-05-02 | MANTIS: Interleaved Multi-Image Instruction Tuning | Dongfu Jiang et.al. | 2405.01483 | link |
2024-05-02 | NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | Gerald Shen et.al. | 2405.01481 | link |
2024-05-02 | V-FLUTE: Visual Figurative Language Understanding with Textual Explanations | Arkadiy Saakyan et.al. | 2405.01474 | link |
2024-05-02 | Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning | Théo Moutakanni et.al. | 2405.01469 | null |
2024-05-02 | Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models | Yifei Ming et.al. | 2405.01468 | null |
2024-05-02 | A Systematic Literature Review on Large Language Models for Automated Program Repair | Quanjun Zhang et.al. | 2405.01466 | link |
2024-05-02 | Natural Language to Verilog: Design of a Recurrent Spiking Neural Network using Large Language Models and ChatGPT | Paola Vitolo et.al. | 2405.01419 | null |
2024-05-02 | MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Yuan Tang et.al. | 2405.01413 | link |
2024-05-02 | Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | Xin Quan et.al. | 2405.01379 | link |
2024-05-02 | GAIA: A General AI Assistant for Intelligent Accelerator Operations | Frank Mayet et.al. | 2405.01359 | null |
2024-05-01 | Self-Play Preference Optimization for Language Model Alignment | Yue Wu et.al. | 2405.00675 | link |
2024-05-01 | Is Bigger Edit Batch Size Always Better? – An Empirical Study on Model Editing with Llama-3 | Junsang Yoon et.al. | 2405.00664 | link |
2024-05-01 | HalluVault: A Novel Logic Programming-aided Metamorphic Testing Framework for Detecting Fact-Conflicting Hallucinations in Large Language Models | Ningke Li et.al. | 2405.00648 | null |
2024-05-01 | When Quantization Affects Confidence of Large Language Models? | Irina Proskurina et.al. | 2405.00632 | link |
2024-05-01 | “I’m Not Sure, But…”: Examining the Impact of Large Language Models’ Uncertainty Expression on User Reliance and Trust | Sunnie S. Y. Kim et.al. | 2405.00623 | null |
2024-05-01 | Causal Evaluation of Language Models | Sirui Chen et.al. | 2405.00622 | link |
2024-05-01 | Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | Yida Mu et.al. | 2405.00611 | link |
2024-05-01 | Investigating Automatic Scoring and Feedback using Large Language Models | Gloria Ashiya Katuka et.al. | 2405.00602 | null |
2024-05-01 | Are Models Biased on Text without Gender-related Language? | Catarina G Belém et.al. | 2405.00588 | link |
2024-05-01 | The Real, the Better: Aligning Large Language Models with Online Human Behaviors | Guanying Jiang et.al. | 2405.00578 | null |
2024-05-01 | EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model | Deng Li et.al. | 2405.00574 | null |
2024-05-01 | NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance | Huan-Yi Su et.al. | 2405.00566 | null |
2024-05-01 | Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment | Zhili Liu et.al. | 2405.00557 | null |
2024-05-01 | Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs | Nicolas Gorlo et.al. | 2405.00552 | link |
2024-05-01 | ChatBI: Towards Natural Language to Complex Business Intelligence SQL | Jinqing Lian et.al. | 2405.00527 | null |
2024-05-01 | CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions | Donghee Choi et.al. | 2405.00523 | null |
2024-05-01 | Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning | Lucas-Andreï Thil et.al. | 2405.00516 | null |
2024-05-01 | GOLD: Geometry Problem Solver with Natural Language Description | Jiaxin Zhang et.al. | 2405.00494 | link |
2024-05-01 | Is Temperature the Creativity Parameter of Large Language Models? | Max Peeperkorn et.al. | 2405.00492 | link |
2024-05-01 | The Pyramid of Captions | Delong Chen et.al. | 2405.00485 | null |
2024-04-30 | Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | Yunhao Ge et.al. | 2404.19752 | null |
2024-04-30 | PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification | Leon Garza et.al. | 2404.19744 | null |
2024-04-30 | Better & Faster Large Language Models via Multi-token Prediction | Fabian Gloeckle et.al. | 2404.19737 | null |
2024-04-30 | A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications | Steph Buongiorno et.al. | 2404.19729 | null |
2024-04-30 | PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games | Steph Buongiorno et.al. | 2404.19721 | null |
2024-04-30 | Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns | Constantinos Patsakis et.al. | 2404.19715 | null |
2024-04-30 | Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models | Scott Sumpter et.al. | 2404.19713 | null |
2024-04-30 | When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively | Tiziano Labruna et.al. | 2404.19705 | link |
2024-04-30 | Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners | Chun Feng et.al. | 2404.19696 | null |
2024-04-30 | Towards Generalist Robot Learning from Internet Video: A Survey | Robert McCarthy et.al. | 2404.19664 | null |
2024-04-30 | MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation | Min Zhang et.al. | 2404.19644 | link |
2024-04-30 | On Training a Neural Network to Explain Binaries | Alexander Interrante-Grant et.al. | 2404.19631 | null |
2024-04-30 | Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model | Denys Godwin et.al. | 2404.19609 | null |
2024-04-30 | Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning | Xuanli He et.al. | 2404.19597 | null |
2024-04-30 | RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing | Yucheng Hu et.al. | 2404.19543 | link |
2024-04-30 | MoST: Multi-modality Scene Tokenization for Motion Prediction | Norman Mu et.al. | 2404.19531 | null |
2024-04-30 | Do Large Language Models Understand Conversational Implicature – A case study with a chinese sitcom | Shisen Yue et.al. | 2404.19509 | link |
2024-04-30 | More Compute Is What You Need | Zhen Guo et.al. | 2404.19484 | null |
2024-05-01 | Neuro-Vision to Language: Image Reconstruction and Language enabled Interaction via Brain Recordings | Guobin Shen et.al. | 2404.19438 | null |
2024-04-30 | Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships | D. Panas et.al. | 2404.19432 | null |
2024-04-29 | Hallucination of Multimodal Large Language Models: A Survey | Zechen Bai et.al. | 2404.18930 | link |
2024-04-29 | Holmes: Benchmark the Linguistic Competence of Language Models | Andreas Waldis et.al. | 2404.18923 | null |
2024-04-29 | DPO Meets PPO: Reinforced Token Optimization for RLHF | Han Zhong et.al. | 2404.18922 | null |
2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | link |
2024-04-29 | Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting | Fangcheng Liu et.al. | 2404.18911 | link |
2024-04-29 | Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking | Hong Jin Kang et.al. | 2404.18881 | link |
2024-04-29 | More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness | Aaron J. Li et.al. | 2404.18870 | link |
2024-04-29 | Truth-value judgment in language models: belief directions are context sensitive | Stefan F. Schouten et.al. | 2404.18865 | null |
2024-04-29 | Performance-Aligned LLMs for Generating Fast Code | Daniel Nichols et.al. | 2404.18864 | null |
2024-04-29 | A Survey on Vision Mamba: Models, Applications and Challenges | Rui Xu et.al. | 2404.18861 | link |
2024-04-29 | VERT: Verified Equivalent Rust Transpilation with Few-Shot Learning | Aidan Z. H. Yang et.al. | 2404.18852 | null |
2024-04-30 | FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition | Yuxuan Yan et.al. | 2404.18848 | null |
2024-04-29 | It’s Difficult to be Neutral – Human and LLM-based Sentiment Annotation of Patient Comments | Petter Mæhlum et.al. | 2404.18832 | null |
2024-04-29 | Benchmarking Benchmark Leakage in Large Language Models | Ruijie Xu et.al. | 2404.18824 | link |
2024-04-29 | AppPoet: Large Language Model based Android malware detection via multi-view prompt engineering | Wenxiang Zhao et.al. | 2404.18816 | null |
2024-04-29 | Unknown Script: Impact of Script on Cross-Lingual Transfer | Wondimagegnhue Tsegaye Tufa et.al. | 2404.18810 | link |
2024-04-29 | Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models | Pat Verga et.al. | 2404.18796 | null |
2024-04-29 | PECC: Problem Extraction and Coding Challenges | Patrick Haller et.al. | 2404.18766 | link |
2024-04-29 | Transitive Vision-Language Prompt Learning for Domain Generalization | Liyuan Wang et.al. | 2404.18758 | null |
2024-04-29 | Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models | Hongyi Zhu et.al. | 2404.18746 | null |
2024-04-26 | Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo | Stephen Zhao et.al. | 2404.17546 | link |
2024-04-26 | Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models | Yuhang Huang et.al. | 2404.17534 | null |
2024-04-26 | Large Language Model Agent as a Mechanical Designer | Yayati Jadhav et.al. | 2404.17525 | null |
2024-04-26 | On the Use of Large Language Models to Generate Capability Ontologies | Luis Miguel Vieira da Silva et.al. | 2404.17524 | link |
2024-04-26 | Enhancing Legal Compliance and Regulation Analysis with Large Language Models | Shabnam Hassani et.al. | 2404.17522 | null |
2024-04-26 | A Comprehensive Evaluation on Event Reasoning of Large Language Models | Zhengwei Tao et.al. | 2404.17513 | link |
2024-04-26 | CEval: A Benchmark for Evaluating Counterfactual Text Generation | Van Bach Nguyen et.al. | 2404.17475 | link |
2024-04-26 | Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System | Robin Schmucker et.al. | 2404.17460 | null |
2024-04-26 | “ChatGPT Is Here to Help, Not to Replace Anybody” – An Evaluation of Students’ Opinions On Integrating ChatGPT In CS Courses | Bruno Pereira Cipriano et.al. | 2404.17443 | null |
2024-04-26 | PromptCIR: Blind Compressed Image Restoration with Prompt Learning | Bingchen Li et.al. | 2404.17433 | link |
2024-04-26 | Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations | Rémy Decoupes et.al. | 2404.17401 | null |
2024-04-26 | UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning | Maoxun Yuan et.al. | 2404.17360 | null |
2024-04-26 | InspectorRAGet: An Introspection Platform for RAG Evaluation | Kshitij Fadnis et.al. | 2404.17347 | link |
2024-04-26 | Introducing cosmosGPT: Monolingual Training for Turkish Language Models | H. Toprak Kesgin et.al. | 2404.17336 | null |
2024-04-26 | A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation | Xin Zhang et.al. | 2404.17335 | null |
2024-04-26 | An Extendable Cloud-Native Alloy Property Explorer | Zhuoyuan Li et.al. | 2404.17330 | link |
2024-04-26 | When to Trust LLMs: Aligning Confidence with Response Quality | Shuchang Tao et.al. | 2404.17287 | link |
2024-04-26 | Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM | Xuan Zhang et.al. | 2404.17283 | link |
2024-04-26 | Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot | Michelle Terblanche et.al. | 2404.17216 | null |
2024-04-26 | Low-Rank Knowledge Decomposition for Medical Foundation Models | Yuhang Zhou et.al. | 2404.17184 | link |
2024-04-25 | The Third Monocular Depth Estimation Challenge | Jaime Spencer et.al. | 2404.16831 | null |
2024-04-25 | Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials | Ye Fang et.al. | 2404.16829 | null |
2024-04-25 | V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection | Xuanyu Zhang et.al. | 2404.16824 | null |
2024-04-25 | How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | Zhe Chen et.al. | 2404.16821 | link |
2024-04-25 | IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages | Harman Singh et.al. | 2404.16816 | link |
2024-04-26 | Make Your LLM Fully Utilize the Context | Shengnan An et.al. | 2404.16811 | link |
2024-04-25 | Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning | Tianhui Zhang et.al. | 2404.16807 | link |
2024-04-25 | AAPL: Adding Attributes to Prompt Learning for Vision-Language Models | Gahyeon Kim et.al. | 2404.16804 | link |
2024-04-25 | Weak-to-Strong Extrapolation Expedites Alignment | Chujie Zheng et.al. | 2404.16792 | link |
2024-04-25 | SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension | Bohao Li et.al. | 2404.16790 | link |
2024-04-25 | Continual Learning of Large Language Models: A Comprehensive Survey | Haizhou Shi et.al. | 2404.16789 | link |
2024-04-25 | Modeling Selective Feature Attention for Representation-based Siamese Text Matching | Jianxiang Zang et.al. | 2404.16776 | link |
2024-04-25 | REBEL: Reinforcement Learning via Regressing Relative Rewards | Zhaolin Gao et.al. | 2404.16767 | link |
2024-04-25 | Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model | Runzhe Zhan et.al. | 2404.16766 | null |
2024-04-25 | RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis | Xiaoman Zhang et.al. | 2404.16754 | link |
2024-04-25 | Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class | Mazda Moayeri et.al. | 2404.16717 | null |
2024-04-25 | Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding | Mostafa Elhoushi et.al. | 2404.16710 | link |
2024-04-25 | Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents | Giorgio Piatti et.al. | 2404.16698 | link |
2024-04-25 | Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4 | Lydia Uhler et.al. | 2404.16692 | null |
2024-04-25 | EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning | Hongxia Xie et.al. | 2404.16670 | link |
2024-04-24 | Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data | Aliaksei Vertsel et.al. | 2404.15604 | null |
2024-04-24 | ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction | Henry Peng Zou et.al. | 2404.15592 | link |
2024-04-24 | MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis | Jiaxin Zhuang et.al. | 2404.15580 | null |
2024-04-24 | Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations? | Hossein Salami et.al. | 2404.15578 | null |
2024-04-24 | Retrieval Head Mechanistically Explains Long-Context Factuality | Wenhao Wu et.al. | 2404.15574 | link |
2024-04-23 | PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models | Shashi Kant Gupta et.al. | 2404.15549 | null |
2024-04-23 | BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis | Shuhang Lin et.al. | 2404.15532 | link |
2024-04-23 | Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models | Mihir Parmar et.al. | 2404.15522 | link |
2024-04-23 | Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang et.al. | 2404.15516 | null |
2024-04-23 | ToM-LM: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models | Weizhi Tang et.al. | 2404.15515 | null |
2024-04-23 | IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents | Jean-Philippe Corbeil et.al. | 2404.15488 | link |
2024-04-23 | Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance | Het Patel et.al. | 2404.15485 | null |
2024-04-23 | Can Large Language Models Learn the Physics of Metamaterials? An Empirical Study with ChatGPT | Darui Lu et.al. | 2404.15458 | null |
2024-04-23 | XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference | João Monteiro et.al. | 2404.15420 | null |
2024-04-23 | Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs | Davide Caffagni et.al. | 2404.15406 | null |
2024-04-23 | Aligning LLM Agents by Learning Latent Preference from User Edits | Ge Gao et.al. | 2404.15269 | link |
2024-04-23 | XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts | Yifeng Ding et.al. | 2404.15247 | link |
2024-04-23 | CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Weiyan Shi et.al. | 2404.15238 | link |
2024-04-23 | Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models | Aidan Z. H. Yang et.al. | 2404.15236 | null |
2024-04-23 | Re-Thinking Inverse Graphics With Large Language Models | Peter Kulits et.al. | 2404.15228 | null |
2024-04-23 | Does Instruction Tuning Make LLMs More Consistent? | Constanza Fierro et.al. | 2404.15206 | null |
2024-04-23 | Setting up the Data Printer with Improved English to Ukrainian Machine Translation | Yurii Paniv et.al. | 2404.15196 | link |
2024-04-23 | Regressive Side Effects of Training Language Models to Mimic Student Misconceptions | Shashank Sonkar et.al. | 2404.15156 | null |
2024-04-23 | Bias patterns in the application of LLMs for clinical decision support: A comprehensive study | Raphael Poulain et.al. | 2404.15149 | link |
2024-04-23 | Rethinking LLM Memorization through the Lens of Adversarial Compression | Avi Schwarzschild et.al. | 2404.15146 | null |
2024-04-23 | MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning | Sunan He et.al. | 2404.15127 | link |
2024-04-23 | Identifying Fairness Issues in Automatically Generated Testing Content | Kevin Stowe et.al. | 2404.15104 | null |
2024-04-23 | Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation | Xun Wu et.al. | 2404.15100 | null |
2024-04-23 | Detection of circular permutations by Protein Language Models | Yue Hu et.al. | 2404.15087 | link |
2024-04-23 | Multi-Head Mixture-of-Experts | Xun Wu et.al. | 2404.15045 | link |
2024-04-23 | TAXI: Evaluating Categorical Knowledge Editing for Language Models | Derek Powell et.al. | 2404.15004 | link |
2024-04-23 | Transformers Can Represent $n$ -gram Language Models | Anej Svete et.al. | 2404.14994 | null |
2024-04-23 | A Short Review for Ontology Learning from Text: Stride from Shallow Learning, Deep Learning to Large Language Models Trend | Rick Du et.al. | 2404.14991 | null |
2024-04-23 | $\texttt{MiniMol}$ : A Parameter-Efficient Foundation Model for Molecular Learning | Kerstin Kläser et.al. | 2404.14986 | null |
2024-04-23 | Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case | Muhammad Asif Auyb et.al. | 2404.14977 | null |
2024-04-22 | AutoAD III: The Prequel – Back to the Pixels | Tengda Han et.al. | 2404.14412 | null |
2024-04-22 | SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Kevin Slagle et.al. | 2404.14408 | link |
2024-04-22 | RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios? | Adrian de Wynter et.al. | 2404.14397 | link |
2024-04-22 | SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation | Yuying Ge et.al. | 2404.14396 | link |
2024-04-22 | PARAMANU-GANITA: Language Model with Mathematical Capabilities | Mitodru Niyogi et.al. | 2404.14395 | null |
2024-04-22 | A Multimodal Automated Interpretability Agent | Tamar Rott Shaham et.al. | 2404.14394 | null |
2024-04-22 | A Survey on Self-Evolution of Large Language Models | Zhengwei Tao et.al. | 2404.14387 | link |
2024-04-22 | Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph | Xiaochen Kev Gao et.al. | 2404.14372 | link |
2024-04-23 | Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data | Fahim Tajwar et.al. | 2404.14367 | link |
2024-04-22 | Better Synthetic Data by Retrieving and Transforming Existing Datasets | Saumya Gandhi et.al. | 2404.14361 | link |
2024-04-22 | Rethinking Legal Compliance Automation: Opportunities with Large Language Models | Shabnam Hassani et.al. | 2404.14356 | null |
2024-04-22 | Calc-CMU at SemEval-2024 Task 7: Pre-Calc – Learning to Use the Calculator Improves Numeracy in Language Models | Vishruth Veerendranath et.al. | 2404.14355 | link |
2024-04-22 | Automated Long Answer Grading with RiceChem Dataset | Shashank Sonkar et.al. | 2404.14316 | link |
2024-04-22 | Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels | Jan-Philipp Fränken et.al. | 2404.14313 | link |
2024-04-22 | Explaining Arguments’ Strength: Unveiling the Role of Attacks and Supports (Technical Report) | Xiang Yin et.al. | 2404.14304 | link |
2024-04-22 | Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits | Shashank Sonkar et.al. | 2404.14301 | null |
2024-04-22 | Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach | Yao Wan et.al. | 2404.14296 | link |
2024-04-22 | A Survey on Efficient Inference for Large Language Models | Zixuan Zhou et.al. | 2404.14294 | null |
2024-04-22 | LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Dongge Han et.al. | 2404.14285 | null |
2024-04-22 | Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Wenyi Xiao et.al. | 2404.14233 | null |
2024-04-19 | MoVA: Adapting Mixture of Vision Experts to Multimodal Context | Zhuofan Zong et.al. | 2404.13046 | link |
2024-04-19 | Unified Scene Representation and Reconstruction for 3D Large Language Models | Tao Chu et.al. | 2404.13044 | null |
2024-04-19 | Data Alignment for Zero-Shot Concept Generation in Dermatology AI | Soham Gadgil et.al. | 2404.13043 | null |
2024-04-19 | Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs | Biyang Guo et.al. | 2404.13033 | link |
2024-04-19 | When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering | Stephen Choi et.al. | 2404.13028 | null |
2024-04-19 | Stronger Random Baselines for In-Context Learning | Gregory Yauney et.al. | 2404.13020 | link |
2024-04-19 | Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Chuofan Ma et.al. | 2404.13013 | link |
2024-04-19 | Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs | Clemencia Siro et.al. | 2404.12994 | link |
2024-04-19 | FineRec:Exploring Fine-grained Sequential Recommendation | Xiaokun Zhang et.al. | 2404.12975 | link |
2024-04-19 | Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models | Yian Li et.al. | 2404.12966 | null |
2024-04-19 | Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction | Qinyuan Wu et.al. | 2404.12957 | null |
2024-04-19 | Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models | Konstantinos Vilouras et.al. | 2404.12920 | null |
2024-04-19 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | Large Language Models for Networking: Workflow, Advances and Challenges | Chang Liu et.al. | 2404.12901 | null |
2024-04-19 | Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning | Ahmed Elshabrawy et.al. | 2404.12897 | null |
2024-04-19 | Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation | Guanhua Chen et.al. | 2404.12879 | null |
2024-04-19 | LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Zhaodonghui Li et.al. | 2404.12872 | link |
2024-04-19 | How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo et.al. | 2404.12866 | link |
2024-04-19 | Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation | Yilong Chen et.al. | 2404.12861 | null |
2024-04-19 | TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages | Aleksei Dorkin et.al. | 2404.12845 | null |
2024-04-18 | BLINK: Multimodal Large Language Models Can See but Not Perceive | Xingyu Fu et.al. | 2404.12390 | null |
2024-04-18 | Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models | Aitor Ormazabal et.al. | 2404.12387 | null |
2024-04-18 | MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale | Xiaotang Gai et.al. | 2404.12372 | null |
2024-04-18 | When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Asaf Yehudai et.al. | 2404.12365 | link |
2024-04-18 | From $r$ to $Q^*$ : Your Language Model is Secretly a Q-Function | Rafael Rafailov et.al. | 2404.12358 | null |
2024-04-18 | Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation | Jingmin Sun et.al. | 2404.12355 | link |
2024-04-18 | V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning | Hang Hua et.al. | 2404.12353 | null |
2024-04-18 | Evaluating AI for Law: Bridging the Gap with Open-Source Solutions | Rohan Bhambhoria et.al. | 2404.12349 | null |
2024-04-18 | Large Language Models in Targeted Sentiment Analysis | Nicolay Rusnachenko et.al. | 2404.12342 | link |
2024-04-18 | Normative Requirements Operationalization with Large Language Models | Nick Feng et.al. | 2404.12335 | null |
2024-04-18 | Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu et.al. | 2404.12318 | null |
2024-04-18 | Large Language Models for Synthetic Participatory Planning of Shared Automated Electric Mobility Systems | Jiangbo Yu et.al. | 2404.12317 | null |
2024-04-18 | Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai et.al. | 2404.12299 | null |
2024-04-18 | Augmenting emotion features in irony detection with Large language modeling | Yucheng Lin et.al. | 2404.12291 | null |
2024-04-18 | Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery | Yona Falinie A. Gaus et.al. | 2404.12285 | null |
2024-04-18 | Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting | Nicholas Harris et.al. | 2404.12283 | null |
2024-04-18 | Advancing the Robustness of Large Language Models through Self-Denoised Smoothing | Jiabao Ji et.al. | 2404.12274 | link |
2024-04-18 | FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom | Yuanqin He et.al. | 2404.12273 | null |
2024-04-18 | Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences | Shreya Shankar et.al. | 2404.12272 | null |
2024-04-18 | Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM | Michelle S. Lam et.al. | 2404.12259 | link |
2024-04-18 | Private federated discovery of out-of-vocabulary words for Gboard | Ziteng Sun et.al. | 2404.11607 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-17 | A Deep Dive into Large Language Models for Automated Bug Localization and Repair | Soneya Binta Hossain et.al. | 2404.11595 | null |
2024-04-17 | Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Zezhong Fan et.al. | 2404.11589 | null |
2024-04-17 | LLMTune: Accelerate Database Knob Tuning with Large Language Models | Xinmei Huang et.al. | 2404.11581 | link |
2024-04-17 | On the Scalability of GNNs for Molecular Graphs | Maciej Sypetkowski et.al. | 2404.11568 | null |
2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | null |
2024-04-17 | Quantifying Multilingual Performance of Large Language Models Across Languages | Zihao Li et.al. | 2404.11553 | null |
2024-04-17 | Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis | Soyoung Yang et.al. | 2404.11539 | null |
2024-04-17 | FedPFT: Federated Proxy Fine-Tuning of Foundation Models | Zhaopeng Peng et.al. | 2404.11536 | link |
2024-04-17 | Select and Reorder: A Novel Approach for Neural Sign Language Production | Harry Walsh et.al. | 2404.11532 | null |
2024-04-17 | Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization | Costas Mavromatis et.al. | 2404.11531 | link |
2024-04-17 | Embedding Privacy in Computational Social Science and Artificial Intelligence Research | Keenan Jones et.al. | 2404.11515 | null |
2024-04-17 | Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models | Yushuo Chen et.al. | 2404.11502 | link |
2024-04-17 | Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models | Yue Zhou et.al. | 2404.11500 | link |
2024-04-18 | Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent | Wei Chen et.al. | 2404.11459 | null |
2024-04-17 | Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models | Sunhao Dai et.al. | 2404.11457 | link |
2024-04-17 | AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Meng Jiang et.al. | 2404.11449 | link |
2024-04-17 | Open-Ended Wargames with Large Language Models | Daniel P. Hogan et.al. | 2404.11446 | link |
2024-04-17 | DUPE: Detection Undermining via Prompt Engineering for Deepfake Text | James Weichert et.al. | 2404.11408 | null |
2024-04-16 | Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback | Qiwei Di et.al. | 2404.10776 | null |
2024-04-16 | COMBO: Compositional World Models for Embodied Multi-Agent Cooperation | Hongxin Zhang et.al. | 2404.10775 | null |
2024-04-16 | Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Yu-Yang Li et.al. | 2404.10757 | link |
2024-04-16 | Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study | Shusheng Xu et.al. | 2404.10719 | link |
2024-04-17 | Dual Modalities of Text: Visual and Textual Generative Pre-training | Yekun Chai et.al. | 2404.10710 | link |
2024-04-16 | Question Difficulty Ranking for Multiple-Choice Reading Comprehension | Vatsal Raina et.al. | 2404.10704 | null |
2024-04-16 | An empirical study on code review activity prediction in practice | Doriane Olewicki et.al. | 2404.10703 | null |
2024-04-16 | Automating REST API Postman Test Cases Using LLM | S Deepika Sri et.al. | 2404.10678 | null |
2024-04-16 | Self-playing Adversarial Language Game Enhances LLM Reasoning | Pengyu Cheng et.al. | 2404.10642 | link |
2024-04-16 | HLAT: High-quality Large Language Model Pre-trained on AWS Trainium | Haozheng Fan et.al. | 2404.10630 | link |
2024-04-16 | Private Attribute Inference from Images with Vision-Language Models | Batuhan Tömekçe et.al. | 2404.10618 | null |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-16 | Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training | Masanori Hirano et.al. | 2404.10555 | null |
2024-04-16 | Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning | Xiao Wang et.al. | 2404.10552 | null |
2024-04-16 | Capturing the Macroscopic Behaviour of Molecular Dynamics with Membership Functions | Alexander Sikorski et.al. | 2404.10523 | link |
2024-04-16 | CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity | Moshe Berchansky et.al. | 2404.10513 | null |
2024-04-16 | White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency | Yixin Wan et.al. | 2404.10508 | null |
2024-04-16 | Self-Supervised Visual Preference Alignment | Ke Zhu et.al. | 2404.10501 | link |
2024-04-16 | When Emotional Stimuli meet Prompt Designing: An Auto-Prompt Graphical Paradigm | Chenggian Ma et.al. | 2404.10500 | null |
2024-04-16 | Spiral of Silences: How is Large Language Model Killing Information Retrieval? – A Case Study on Open Domain Question Answering | Xiaoyang Chen et.al. | 2404.10496 | link |
2024-04-15 | KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models | Avinash Anand et.al. | 2404.09763 | null |
2024-04-15 | Resilience of Large Language Models for Noisy Instructions | Bin Wang et.al. | 2404.09754 | null |
2024-04-15 | Personalized Collaborative Fine-Tuning for On-Device Large Language Models | Nicolas Wagner et.al. | 2404.09753 | link |
2024-04-15 | AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides | Kewei Li et.al. | 2404.09738 | link |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | link |
2024-04-15 | Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model | Hyunsoo Cho et.al. | 2404.09717 | null |
2024-04-15 | Enhancing Robot Explanation Capabilities through Vision-Language Models: a Preliminary Study by Interpreting Visual Inputs for Improved Human-Robot Interaction | David Sobrín-Hidalgo et.al. | 2404.09705 | null |
2024-04-15 | Generative AI for Game Theory-based Mobile Networking | Long He et.al. | 2404.09699 | null |
2024-04-15 | Are Large Language Models Reliable Argument Quality Annotators? | Nailia Mirzakhmedova et.al. | 2404.09696 | link |
2024-04-15 | LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models | Guangyan Li et.al. | 2404.09695 | null |
2024-04-15 | Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi et.al. | 2404.09682 | link |
2024-04-15 | Learn Your Reference Model for Real Good Alignment | Alexey Gorbatovski et.al. | 2404.09656 | null |
2024-04-15 | Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection | Jiaqi Zhu et.al. | 2404.09654 | null |
2024-04-15 | Bridging Vision and Language Spaces with Assignment Prediction | Jungin Park et.al. | 2404.09632 | link |
2024-04-15 | AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception | Yipo Huang et.al. | 2404.09624 | link |
2024-04-15 | UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark | Zhaokun Zhou et.al. | 2404.09619 | null |
2024-04-15 | A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions | Pengfei Liu et.al. | 2404.09606 | link |
2024-04-15 | Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction | Zepeng Ding et.al. | 2404.09593 | null |
2024-04-15 | Modelling Language | Jumbly Grindrod et.al. | 2404.09579 | null |
2024-04-15 | Transformers, Contextualism, and Polysemy | Jumbly Grindrod et.al. | 2404.09577 | link |
2024-04-15 | Large language models and linguistic intentionality | Jumbly Grindrod et.al. | 2404.09576 | null |
2024-04-12 | Probing the 3D Awareness of Visual Foundation Models | Mohamed El Banani et.al. | 2404.08636 | link |
2024-04-12 | Pre-training Small Base LMs with Fewer Tokens | Sunny Sanyal et.al. | 2404.08634 | link |
2024-04-12 | FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models | Yanting Wang et.al. | 2404.08631 | link |
2024-04-12 | Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Yanhao Zheng et.al. | 2404.08603 | link |
2024-04-12 | Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts | Övgü Özdemir et.al. | 2404.08589 | link |
2024-04-12 | Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation | Abu Bakor Hayat Arnob et.al. | 2404.08584 | link |
2024-04-12 | FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation | Riza Velioglu et.al. | 2404.08582 | link |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation | Hanlin Tian et.al. | 2404.08570 | link |
2024-04-12 | RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs | Shreyas Chaudhari et.al. | 2404.08555 | null |
2024-04-12 | Memory Traces: Are Transformers Tulving Machines? | Jean-Marie Chauvet et.al. | 2404.08543 | null |
2024-04-12 | Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward | Xuan Xie et.al. | 2404.08517 | null |
2024-04-12 | ChatGPT and general-purpose AI count fruits in pictures surprisingly well | Konlavach Mengsuwan et.al. | 2404.08515 | null |
2024-04-12 | Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | Haoran Qiu et.al. | 2404.08509 | link |
2024-04-12 | LaSagnA: Language-based Segmentation Assistant for Complex Queries | Cong Wei et.al. | 2404.08506 | link |
2024-04-12 | Strategic Interactions between Large Language Models-based Agents in Beauty Contests | Siting Lu et.al. | 2404.08492 | null |
2024-04-12 | Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation | Haozhe Zhao et.al. | 2404.08491 | link |
2024-04-12 | Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian | Stefano De Paoli et.al. | 2404.08488 | null |
2024-04-12 | Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | Hassan Ali et.al. | 2404.08424 | null |
2024-04-12 | Adapting the Segment Anything Model During Usage in Novel Situations | Robin Schön et.al. | 2404.08421 | null |
2024-04-11 | OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2404.07990 | link |
2024-04-11 | Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Yiwen Tang et.al. | 2404.07989 | link |
2024-04-11 | Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning | Simon Schrodi et.al. | 2404.07983 | null |
2024-04-11 | Language Imbalance Can Boost Cross-lingual Generalisation | Anton Schäfer et.al. | 2404.07982 | link |
2024-04-11 | Manipulating Large Language Models to Increase Product Visibility | Aounon Kumar et.al. | 2404.07981 | link |
2024-04-11 | LLoCO: Learning Long Contexts Offline | Sijun Tan et.al. | 2404.07979 | link |
2024-04-11 | Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Haotian Zhang et.al. | 2404.07973 | null |
2024-04-11 | Rho-1: Not All Tokens Are What You Need | Zhenghao Lin et.al. | 2404.07965 | link |
2024-04-11 | On Unified Prompt Tuning for Request Quality Assurance in Public Code Review | Xinyu Chen et.al. | 2404.07942 | null |
2024-04-11 | Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation | Jinkyung Park et.al. | 2404.07926 | null |
2024-04-11 | LaVy: Vietnamese Multimodal Large Language Model | Chi Tran et.al. | 2404.07922 | link |
2024-04-11 | AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs | Zeyi Liao et.al. | 2404.07921 | link |
2024-04-11 | DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering Documentation | Anna C. Doris et.al. | 2404.07917 | link |
2024-04-11 | HGRN2: Gated Linear RNNs with State Expansion | Zhen Qin et.al. | 2404.07904 | link |
2024-04-11 | High-Dimension Human Value Representation in Large Language Models | Samuel Cahyawijaya et.al. | 2404.07900 | link |
2024-04-11 | Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations | Dayeon Ki et.al. | 2404.07851 | link |
2024-04-11 | On Training Data Influence of GPT Models | Qingyi Liu et.al. | 2404.07840 | link |
2024-04-11 | RecurrentGemma: Moving Past Transformers for Efficient Open Language Models | Aleksandar Botev et.al. | 2404.07839 | link |
2024-04-11 | Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution | Handi Deng et.al. | 2404.07833 | null |
2024-04-11 | Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese | Yuichi Inoue et.al. | 2404.07824 | link |
2024-04-10 | BRAVE: Broadening the visual encoding of vision-language models | Oğuzhan Fatih Kar et.al. | 2404.07204 | null |
2024-04-10 | UMBRAE: Unified Multimodal Decoding of Brain Signals | Weihao Xia et.al. | 2404.07202 | link |
2024-04-10 | Scaling Laws for Data Filtering – Data Curation cannot be Compute Agnostic | Sachin Goyal et.al. | 2404.07177 | link |
2024-04-10 | Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention | Tsendsuren Munkhdalai et.al. | 2404.07143 | null |
2024-04-10 | Open reaction-diffusion systems: bridging probabilistic theory across scales | Mauricio J. del Razo et.al. | 2404.07119 | null |
2024-04-10 | Continuous Language Model Interpolation for Dynamic and Controllable Text Generation | Sara Kangaslahti et.al. | 2404.07117 | link |
2024-04-11 | From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications | Yongqiang Ma et.al. | 2404.07108 | null |
2024-04-10 | Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs | Bowen Jin et.al. | 2404.07103 | link |
2024-04-10 | Dynamic Generation of Personalities with Large Language Models | Jianzhi Liu et.al. | 2404.07084 | link |
2024-04-10 | VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning | Alexandros Xenos et.al. | 2404.07078 | link |
2024-04-10 | Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? | Mingyu Jin et.al. | 2404.07066 | link |
2024-04-10 | Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study | Alessandro Stolfo et.al. | 2404.07060 | null |
2024-04-10 | Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation | Elisa Sanchez-Bayona et.al. | 2404.07053 | link |
2024-04-10 | ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling | Ege Özsoy et.al. | 2404.07031 | link |
2024-04-10 | Improving Language Model Reasoning with Self-motivated Learning | Yunlong Feng et.al. | 2404.07017 | null |
2024-04-10 | A Mathematical Theory for Learning Semantic Languages by Abstract Learners | Kuo-Yu Liao et.al. | 2404.07009 | null |
2024-04-10 | WordDecipher: Enhancing Digital Workspace Communication with Explainable AI for Non-native English Speakers | Yuexi Chen et.al. | 2404.07005 | null |
2024-04-10 | LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models | Igor Tufanov et.al. | 2404.07004 | null |
2024-04-10 | Event Grounded Criminal Court View Generation withCooperative (Large) Language Models | Linan Yue et.al. | 2404.07001 | link |
2024-04-10 | Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study | Hongru Du et.al. | 2404.06962 | link |
2024-04-09 | InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD | Xiaoyi Dong et.al. | 2404.06512 | link |
2024-04-09 | Can Feedback Enhance Semantic Grounding in Large Vision-Language Models? | Yuan-Hong Liao et.al. | 2404.06510 | null |
2024-04-09 | On the Effect of (Near) Duplicate Subwords in Language Modelling | Anton Schäfer et.al. | 2404.06508 | link |
2024-04-09 | Pitfalls of Conversational LLMs on News Debiasing | Ipek Baris Schlicht et.al. | 2404.06488 | null |
2024-04-10 | Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks | Chonghua Wang et.al. | 2404.06480 | link |
2024-04-10 | Text-Based Reasoning About Vector Graphics | Zhenhailong Wang et.al. | 2404.06479 | null |
2024-04-09 | Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models | Zihan Fang et.al. | 2404.06448 | null |
2024-04-09 | Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems | Kunal Garg et.al. | 2404.06413 | null |
2024-04-09 | AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents | Luca Gioacchini et.al. | 2404.06411 | link |
2024-04-09 | Take a Look at it! Rethinking How to Evaluate Language Model Jailbreak | Hongyu Cai et.al. | 2404.06407 | link |
2024-04-09 | Apprentices to Research Assistants: Advancing Research with Large Language Models | M. Namvarpour et.al. | 2404.06404 | null |
2024-04-09 | MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies | Shengding Hu et.al. | 2404.06395 | link |
2024-04-10 | MuPT: A Generative Symbolic Music Pretrained Transformer | Xingwei Qu et.al. | 2404.06393 | null |
2024-04-09 | Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis | Mikel Zubillaga et.al. | 2404.06392 | null |
2024-04-09 | Latent Distance Guided Alignment Training for Large Language Models | Haotian Luo et.al. | 2404.06390 | null |
2024-04-09 | Model Generation from Requirements with LLMs: an Exploratory Study | Alessio Ferrari et.al. | 2404.06371 | null |
2024-04-09 | Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Valdecy Pereira et.al. | 2404.06370 | link |
2024-04-09 | VISION2UI: A Real-World Dataset with Layout for Code Generation from UI Designs | Yi Gui et.al. | 2404.06369 | null |
2024-04-09 | ClinLinker: Medical Entity Linking of Clinical Concept Mentions in Spanish | Fernando Gallego et.al. | 2404.06367 | null |
2024-04-09 | Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Sidra Aleem et.al. | 2404.06362 | link |
2024-04-08 | MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Bo He et.al. | 2404.05726 | link |
2024-04-08 | Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | Keen You et.al. | 2404.05719 | null |
2024-04-08 | Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding | Ahmad Idrissi-Yaghir et.al. | 2404.05694 | null |
2024-04-08 | Evaluating Mathematical Reasoning Beyond Accuracy | Shijie Xia et.al. | 2404.05692 | link |
2024-04-08 | Retrieval-Augmented Open-Vocabulary Object Detection | Jooyeon Kim et.al. | 2404.05687 | link |
2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | link |
2024-04-08 | CoReS: Orchestrating the Dance of Reasoning and Segmentation | Xiaoyi Bao et.al. | 2404.05673 | null |
2024-04-09 | Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data | Haitham Hammami et.al. | 2404.05632 | link |
2024-04-08 | LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking | Faren Yan et.al. | 2404.05624 | null |
2024-04-08 | MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning | Matteo Farina et.al. | 2404.05621 | link |
2024-04-08 | SpeechAlign: Aligning Speech Generation to Human Preferences | Dong Zhang et.al. | 2404.05600 | link |
2024-04-08 | MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering | Iñigo Alonso et.al. | 2404.05590 | null |
2024-04-08 | Enhancing Software Related Information Extraction with Generative Language Models through Single-Choice Question Answering | Wolfgang Otto et.al. | 2404.05587 | null |
2024-04-08 | Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model | Yue-Hua Han et.al. | 2404.05583 | null |
2024-04-08 | 360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System | Shen Gao et.al. | 2404.05569 | link |
2024-04-08 | Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models | Bowen Pan et.al. | 2404.05567 | null |
2024-04-08 | Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training | Longhui Zhang et.al. | 2404.05560 | link |
2024-04-08 | Evaluating Interventional Reasoning Capabilities of Large Language Models | Tejas Kasetty et.al. | 2404.05545 | null |
2024-04-08 | OPSD: an Offensive Persian Social media Dataset and its baseline evaluations | Mehran Safayani et.al. | 2404.05540 | null |
2024-04-08 | Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data | Tim Baumgärtner et.al. | 2404.05530 | null |
2024-04-05 | Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) | Michael Saxon et.al. | 2404.04251 | link |
2024-04-05 | Physical Property Understanding from Language-Embedded Feature Fields | Albert J. Zhai et.al. | 2404.04242 | null |
2024-04-05 | Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents | Harsh Kohli et.al. | 2404.04237 | null |
2024-04-05 | player2vec: A Language Modeling Approach to Understand Player Behavior in Games | Tianze Wang et.al. | 2404.04234 | null |
2024-04-05 | Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation | Ji-Jia Wu et.al. | 2404.04231 | link |
2024-04-05 | Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation | Tong Su et.al. | 2404.04212 | null |
2024-04-05 | Social Skill Training with Large Language Models | Diyi Yang et.al. | 2404.04204 | null |
2024-04-05 | Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? | Ilya Ilyankou et.al. | 2404.04169 | null |
2024-04-05 | Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model | Xinrun Du et.al. | 2404.04167 | null |
2024-04-05 | Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval | João Coelho et.al. | 2404.04163 | link |
2024-04-05 | BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Jacek Wiland et.al. | 2404.04113 | link |
2024-04-05 | Large language models as oracles for instantiating ontologies with domain-specific knowledge | Giovanni Ciatto et.al. | 2404.04108 | link |
2024-04-05 | Robust Preference Optimization with Provable Noise Tolerance for LLMs | Xize Liang et.al. | 2404.04102 | null |
2024-04-05 | Label Propagation for Zero-shot Classification with Vision-Language Models | Vladan Stojnić et.al. | 2404.04072 | link |
2024-04-05 | Assessing the quality of information extraction | Filip Seitl et.al. | 2404.04068 | null |
2024-04-05 | CLUE: A Clinical Language Understanding Evaluation for LLMs | Amin Dada et.al. | 2404.04067 | link |
2024-04-05 | VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots | Akhil Padmanabha et.al. | 2404.04066 | null |
2024-04-05 | A Comparison of Methods for Evaluating Generative IR | Negar Arabzadeh et.al. | 2404.04044 | link |
2024-04-05 | Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer | Hele-Andra Kuulmets et.al. | 2404.04042 | link |
2024-04-05 | Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds | Annerose Eichel et.al. | 2404.04031 | link |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-04 | AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent | Hanyu Lai et.al. | 2404.03648 | link |
2024-04-04 | Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra | Darioush Kevian et.al. | 2404.03647 | null |
2024-04-04 | Locating and Editing Factual Associations in Mamba | Arnab Sen Sharma et.al. | 2404.03646 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Standardizing Knowledge Engineering Practices with a Reference Architecture | Bradley P. Allen et.al. | 2404.03624 | null |
2024-04-04 | Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph | Marco Bronzini et.al. | 2404.03623 | link |
2024-04-04 | Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models | Wenshan Wu et.al. | 2404.03622 | null |
2024-04-04 | DeViDe: Faceted medical knowledge for improved medical vision-language pre-training | Haozhe Luo et.al. | 2404.03618 | null |
2024-04-04 | Sailor: Open Language Models for South-East Asia | Longxu Dou et.al. | 2404.03608 | link |
2024-04-04 | Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Aniruddha Nrusimha et.al. | 2404.03605 | link |
2024-04-04 | Evaluating LLMs at Detecting Errors in LLM Responses | Ryo Kamoi et.al. | 2404.03602 | link |
2024-04-04 | Intent Detection and Entity Extraction from BioMedical Literature | Ankan Mullick et.al. | 2404.03598 | link |
2024-04-04 | ReFT: Representation Finetuning for Language Models | Zhengxuan Wu et.al. | 2404.03592 | link |
2024-04-04 | SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Kailin Li et.al. | 2404.03590 | null |
2024-04-04 | Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models | Yantao Liu et.al. | 2404.03577 | link |
2024-04-04 | Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity | Jake Varley et.al. | 2404.03570 | null |
2024-04-04 | Personalized LLM Response Generation with Parameterized Memory Injection | Kai Zhang et.al. | 2404.03565 | null |
2024-04-04 | Select and Summarize: Scene Saliency for Movie Script Summarization | Rohit Saxena et.al. | 2404.03561 | link |
2024-04-04 | How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes | Harmon Bhasin et.al. | 2404.03558 | link |
2024-04-03 | ALOHa: A New Measure for Hallucination in Captioning Models | Suzanne Petryk et.al. | 2404.02904 | null |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899 | null |
2024-04-03 | ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline | Yifan Xu et.al. | 2404.02893 | link |
2024-04-03 | MODNO: Multi Operator Learning With Distributed Neural Operators | Zecheng Zhang et.al. | 2404.02892 | null |
2024-04-03 | Linear Attention Sequence Parallelism | Weigao Sun et.al. | 2404.02882 | link |
2024-04-03 | Integrating Explanations in Learning LTL Specifications from Demonstrations | Ashutosh Gupta et.al. | 2404.02872 | null |
2024-04-03 | Toward Inference-optimal Mixture-of-Expert Large Language Models | Longfei Yun et.al. | 2404.02852 | null |
2024-04-03 | I-Design: Personalized LLM Interior Designer | Ata Çelen et.al. | 2404.02838 | null |
2024-04-03 | Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models | Wanyun Cui et.al. | 2404.02837 | null |
2024-04-03 | Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison | Maxime Bouthors et.al. | 2404.02835 | null |
2024-04-03 | Empowering Biomedical Discovery with AI Agents | Shanghua Gao et.al. | 2404.02831 | null |
2024-04-03 | BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models | Qijun Luo et.al. | 2404.02827 | link |
2024-04-03 | Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models | Haoran Sun et.al. | 2404.02823 | link |
2024-04-03 | A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches | Zhigen Zhao et.al. | 2404.02817 | null |
2024-04-03 | The RealHumanEval: Evaluating Large Language Models’ Abilities to Support Programmers | Hussein Mozannar et.al. | 2404.02806 | link |
2024-04-03 | Efficient Multi-Vector Dense Retrieval Using Bit Vectors | Franco Maria Nardini et.al. | 2404.02805 | link |
2024-04-03 | AI and personalized learning: bridging the gap with modern educational goals | Kristjan-Julius Laak et.al. | 2404.02798 | null |
2024-04-03 | CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Jaehyeon Kim et.al. | 2404.02781 | null |
2024-04-03 | FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Ziyang Wang et.al. | 2404.02772 | link |
2024-04-03 | DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement | Hao Wu et.al. | 2404.02755 | null |
2024-04-02 | Segment Any 3D Object with Language | Seungjun Lee et.al. | 2404.02157 | null |
2024-04-02 | Iterated Learning Improves Compositionality in Large Vision-Language Models | Chenhao Zheng et.al. | 2404.02145 | null |
2024-04-02 | Topic-based Watermarks for LLM-Generated Text | Alexander Nemecek et.al. | 2404.02138 | null |
2024-04-02 | ViTamin: Designing Scalable Vision Models in the Vision-Language Era | Jienneg Chen et.al. | 2404.02132 | link |
2024-04-02 | FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning | Joel Niklaus et.al. | 2404.02127 | link |
2024-04-02 | Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Wanyong Feng et.al. | 2404.02124 | link |
2024-04-02 | GINopic: Topic Modeling with Graph Isomorphism Network | Suman Adhya et.al. | 2404.02115 | link |
2024-04-02 | CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Sara Rosenthal et.al. | 2404.02103 | link |
2024-04-02 | Advancing LLM Reasoning Generalists with Preference Trees | Lifan Yuan et.al. | 2404.02078 | link |
2024-04-02 | Red-Teaming Segment Anything Model | Krzysztof Jankowski et.al. | 2404.02067 | link |
2024-04-02 | Digital Forgetting in Large Language Models: A Survey of Unlearning Methods | Alberto Blanco-Justicia et.al. | 2404.02062 | null |
2024-04-02 | Long-context LLMs Struggle with Long In-context Learning | Tianle Li et.al. | 2404.02060 | link |
2024-04-02 | IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT | Junchen Fu et.al. | 2404.02059 | link |
2024-04-02 | Deconstructing In-Context Learning: Understanding Prompts via Corruption | Namrata Shivagunde et.al. | 2404.02054 | link |
2024-04-02 | A Survey on Large Language Model-Based Game Agents | Sihao Hu et.al. | 2404.02039 | link |
2024-04-02 | MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages | Daryna Dementieva et.al. | 2404.02037 | null |
2024-04-02 | Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts | Zhuo Chen et.al. | 2404.02022 | link |
2024-04-02 | Large Language Models for Orchestrating Bimanual Robots | Kun Chu et.al. | 2404.02018 | link |
2024-04-02 | MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving | Jiangfei Duan et.al. | 2404.02015 | link |
2024-04-02 | Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models | Stephan Linzbach et.al. | 2404.01992 | null |
2024-03-29 | Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models | Atsuyuki Miyai et.al. | 2403.20331 | link |
2024-03-29 | Are We on the Right Way for Evaluating Large Vision-Language Models? | Lin Chen et.al. | 2403.20330 | link |
2024-03-29 | ReALM: Reference Resolution As Language Modeling | Joel Ruben Antony Moniz et.al. | 2403.20329 | null |
2024-03-29 | Gecko: Versatile Text Embeddings Distilled from Large Language Models | Jinhyuk Lee et.al. | 2403.20327 | null |
2024-03-29 | Convolutional Prompting meets Language Models for Continual Learning | Anurag Roy et.al. | 2403.20317 | null |
2024-03-29 | Learn “No” to Say “Yes” Better: Improving Vision-Language Models via Negations | Jaisidh Singh et.al. | 2403.20312 | link |
2024-03-29 | Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference | Jovan Stojkovic et.al. | 2403.20306 | null |
2024-03-29 | Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain | Burcu Sayin et.al. | 2403.20288 | link |
2024-03-29 | LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang et.al. | 2403.20279 | link |
2024-04-01 | Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want | Weifeng Lin et.al. | 2403.20271 | link |
2024-03-29 | Latxa: An Open Language Model and Evaluation Suite for Basque | Julen Etxaniz et.al. | 2403.20266 | link |
2024-03-29 | ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models | Thibaut Thonet et.al. | 2403.20262 | link |
2024-03-29 | MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Taha Koleilat et.al. | 2403.20253 | link |
2024-03-29 | Using LLMs to Model the Beliefs and Preferences of Targeted Populations | Keiichi Namikoshi et.al. | 2403.20252 | null |
2024-03-29 | Long-Tailed Anomaly Detection with Learnable Class Names | Chih-Hui Ho et.al. | 2403.20236 | null |
2024-03-29 | H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model | Chao Pang et.al. | 2403.20213 | link |
2024-03-29 | Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science | Yazheng Yang et.al. | 2403.20208 | null |
2024-03-29 | The Future of Combating Rumors? Retrieval, Discrimination, and Generation | Junhao Xu et.al. | 2403.20204 | null |
2024-03-29 | ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models | Shuo Liu et.al. | 2403.20194 | null |
2024-03-29 | HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM | Shuangjian Li et.al. | 2403.20183 | null |
2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Kai Zhang et.al. | 2403.19651 | link |
2024-03-28 | Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Samuel Marks et.al. | 2403.19647 | link |
2024-03-28 | Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning | Chenyang Liu et.al. | 2403.19646 | link |
2024-03-28 | Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models | Yucheng Shi et.al. | 2403.19631 | link |
2024-03-28 | RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents | Zeren Chen et.al. | 2403.19622 | null |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation | Zhongliang Zhou et.al. | 2403.19584 | link |
2024-03-28 | Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics | Norman Di Palo et.al. | 2403.19578 | null |
2024-03-28 | WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models | Piotr Molenda et.al. | 2403.19548 | null |
2024-03-28 | Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models | Ang Lv et.al. | 2403.19521 | link |
2024-03-28 | Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data | Shan Chen et.al. | 2403.19511 | link |
2024-03-28 | LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae | Celia Chen et.al. | 2403.19506 | null |
2024-03-28 | Evolving Assembly Code in an Adversarial Environment | Irina Maliukov et.al. | 2403.19489 | link |
2024-03-28 | JDocQA: Japanese Document Question Answering Dataset for Generative Language Models | Eri Onami et.al. | 2403.19454 | link |
2024-03-28 | Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model | Qi Gou et.al. | 2403.19443 | null |
2024-03-28 | OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion | Xinyu Zhan et.al. | 2403.19417 | null |
2024-03-28 | BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation | Yuhong He et.al. | 2403.19414 | null |
2024-03-28 | Checkpoint Merging via Bayesian Optimization in LLM Pretraining | Deyuan Liu et.al. | 2403.19390 | null |
2024-03-27 | Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Yanwei Li et.al. | 2403.18814 | link |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807 | link |
2024-03-27 | Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation | Mateusz Klimaszewski et.al. | 2403.18804 | link |
2024-03-27 | Projective Methods for Mitigating Gender Bias in Pre-trained Language Models | Hillary Dawkins et.al. | 2403.18803 | link |
2024-03-27 | Long-form factuality in large language models | Jerry Wei et.al. | 2403.18802 | link |
2024-03-27 | Towards a World-English Language Model for On-Device Virtual Assistants | Rricha Jalota et.al. | 2403.18783 | null |
2024-03-27 | 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation | Ehsan Latif et.al. | 2403.18778 | null |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775 | link |
2024-03-27 | CheckEval: Robust Evaluation Framework using Large Language Model via Checklist | Yukyung Lee et.al. | 2403.18771 | null |
2024-03-27 | MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model | Yike Wu et.al. | 2403.18760 | link |
2024-03-27 | CYCLE: Learning to Self-Refine the Code Generation | Yangruibo Ding et.al. | 2403.18746 | link |
2024-03-27 | Understanding the Learning Dynamics of Alignment with Human Feedback | Shawn Im et.al. | 2403.18742 | link |
2024-03-27 | PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations | Ehsan Latif et.al. | 2403.18721 | null |
2024-03-27 | Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Xintong Wang et.al. | 2403.18715 | link |
2024-03-27 | The Invalsi Benchmark: measuring Language Models Mathematical and Language understanding in Italian | Andrea Esuli et.al. | 2403.18697 | null |
2024-03-27 | NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method | Jakub Hoscilowicz et.al. | 2403.18680 | link |
2024-03-27 | An Exploratory Study on Upper-Level Computing Students’ Use of Large Language Models as Tools in a Semester-Long Project | Ben Arie Tanay et.al. | 2403.18679 | null |
2024-03-27 | SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens | Chengbo Liu et.al. | 2403.18647 | link |
2024-03-27 | To Recommend or Not: Recommendability Identification in Conversations with Pre-trained Language Models | Zhefan Wang et.al. | 2403.18628 | link |
2024-03-27 | Vulnerability Detection with Code Language Models: How Far Are We? | Yangruibo Ding et.al. | 2403.18624 | link |
2024-03-26 | OmniVid: A Generative Framework for Universal Video Understanding | Junke Wang et.al. | 2403.17935 | link |
2024-03-26 | Track Everything Everywhere Fast and Robustly | Yunzhou Song et.al. | 2403.17931 | null |
2024-03-26 | MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution | Wei Tao et.al. | 2403.17927 | null |
2024-03-26 | LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Rui Pan et.al. | 2403.17919 | link |
2024-03-26 | Large scale paired antibody language models | Henry Kenlay et.al. | 2403.17889 | null |
2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation | Andreea Iana et.al. | 2403.17876 | link |
2024-03-26 | Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach | Andrea Ferrario et.al. | 2403.17873 | null |
2024-03-26 | Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications | Philip Lippmann et.al. | 2403.17860 | null |
2024-03-26 | ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages | Bhawna Piryani et.al. | 2403.17859 | link |
2024-03-26 | Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs | David R. Mortensen et.al. | 2403.17856 | null |
2024-03-26 | ArabicaQA: A Comprehensive Dataset for Arabic Question Answering | Abdelrahman Abdallah et.al. | 2403.17848 | link |
2024-03-26 | Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation | Abdelrhman Werby et.al. | 2403.17846 | null |
2024-03-26 | Mechanistic Design and Scaling of Hybrid Architectures | Michael Poli et.al. | 2403.17844 | link |
2024-03-26 | ReMamber: Referring Image Segmentation with Mamba Twister | Yuhuan Yang et.al. | 2403.17839 | link |
2024-03-26 | A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities | Ibrahim Ethem Hamamci et.al. | 2403.17834 | link |
2024-03-26 | Assessment of Multimodal Large Language Models in Alignment with Human Values | Zhelun Shi et.al. | 2403.17830 | null |
2024-03-26 | Accelerating Radio Spectrum Regulation Workflows with Large Language Models (LLMs) | Amir Ghasemi et.al. | 2403.17819 | null |
2024-03-26 | Graph Language Model (GLM): A new graph-based approach to detect social instabilities | Wallyson Lemes de Oliveira et.al. | 2403.17816 | null |
2024-03-26 | Are Compressed Language Models Less Subgroup Robust? | Leonidas Gee et.al. | 2403.17811 | link |
2024-03-25 | Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making | Shuai Ma et.al. | 2403.16812 | null |
2024-03-25 | An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems | Hanqing Yang et.al. | 2403.16809 | link |
2024-03-25 | Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback | Zhangqian Bi et.al. | 2403.16792 | link |
2024-03-25 | All Artificial, Less Intelligence: GenAI through the Lens of Formal Verification | Deepak Narayan Gadde et.al. | 2403.16750 | null |
2024-03-25 | A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models | Nils Ingelhag et.al. | 2403.16730 | null |
2024-03-25 | ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search | Zehan Li et.al. | 2403.16702 | link |
2024-03-25 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography | Jiayue Zhang et.al. | 2403.16687 | null |
2024-03-26 | RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict | Yirong Zeng et.al. | 2403.16662 | link |
2024-03-25 | Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT | Rohit Raju et.al. | 2403.16655 | null |
2024-03-26 | CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment | Feiteng Fang et.al. | 2403.16649 | link |
2024-03-25 | Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations | Fan Li et.al. | 2403.16645 | null |
2024-03-25 | Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts | Rabindra Lamsal et.al. | 2403.16614 | null |
2024-03-25 | Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units | Biswesh Mohapatra et.al. | 2403.16609 | null |
2024-03-25 | TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques | Ashok Urlana et.al. | 2403.16592 | null |
2024-03-25 | Can Large Language Models (or Humans) Distill Text? | Nicolas Audinet de Pieuchon et.al. | 2403.16584 | link |
2024-03-25 | NSINA: A News Corpus for Sinhala | Hansi Hettiarachchi et.al. | 2403.16571 | link |
2024-03-25 | Elysium: Exploring Object-level Perception in Videos via MLLM | Han Wang et.al. | 2403.16558 | link |
2024-03-25 | DOrA: 3D Visual Grounding with Order-Aware Referring | Tung-Yu Wu et.al. | 2403.16539 | null |
2024-03-25 | Open-Set Recognition in the Age of Vision-Language Models | Dimity Miller et.al. | 2403.16528 | link |
2024-03-25 | Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art | Neeloy Chakraborty et.al. | 2403.16527 | null |
2024-03-25 | Harnessing the power of LLMs for normative reasoning in MASs | Bastin Tony Roy Savarimuthu et.al. | 2403.16524 | null |
2024-03-25 | Norm Violation Detection in Multi-Agent Systems using Large Language Models: A Pilot Study | Shawn He et.al. | 2403.16517 | null |
2024-03-25 | Linguistically Differentiating Acts and Recalls of Racial Microaggressions on Social Media | Uma Sushmitha Gunturi et.al. | 2403.16514 | null |
2024-03-25 | LLMs Are Few-Shot In-Context Low-Resource Language Learners | Samuel Cahyawijaya et.al. | 2403.16512 | link |
2024-03-22 | LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Yuzhang Shang et.al. | 2403.15388 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | link |
2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377 | link |
2024-03-22 | Can large language models explore in-context? | Akshay Krishnamurthy et.al. | 2403.15371 | null |
2024-03-22 | CoLLEGe: Concept Embedding Generation for Large Language Models | Ryan Teehan et.al. | 2403.15362 | null |
2024-03-22 | Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities | Zhitong Xiong et.al. | 2403.15356 | link |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | null |
2024-03-22 | Sphere Neural-Networks for Rational Reasoning | Tiansi Dong et.al. | 2403.15297 | null |
2024-03-22 | Measuring Gender and Racial Biases in Large Language Models | Jiafu An et.al. | 2403.15281 | null |
2024-03-22 | Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review | Jinge Wang et.al. | 2403.15274 | null |
2024-03-22 | Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs | Xiaobin Zhang et.al. | 2403.15273 | null |
2024-03-22 | Imagination Augmented Generation: Learning to Imagine Richer Context for Question Answering over Large Language Models | Huanxuan Liao et.al. | 2403.15268 | link |
2024-03-22 | AI Exposure and Strategic Positioning on an Online Work Platform | Shun Yiu et.al. | 2403.15262 | null |
2024-03-22 | FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions | Orion Weller et.al. | 2403.15246 | link |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234 | link |
2024-03-22 | An Exploratory Investigation into Code License Infringements in Large Language Model Training Datasets | Jonathan Katzy et.al. | 2403.15230 | link |
2024-03-22 | Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models | Qiong Wu et.al. | 2403.15226 | link |
2024-03-22 | Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations | Pranav Kulkarni et.al. | 2403.15218 | link |
2024-03-22 | InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection | Thales Bertaglia et.al. | 2403.15214 | link |
2024-03-22 | MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection | Taeheon Kim et.al. | 2403.15209 | null |
2024-03-21 | MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Renrui Zhang et.al. | 2403.14624 | null |
2024-03-21 | Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey | Zeyu Han et.al. | 2403.14608 | null |
2024-03-21 | MyVLM: Personalizing VLMs for User-Specific Queries | Yuval Alaluf et.al. | 2403.14599 | null |
2024-03-21 | ReAct Meets ActRe: Autonomous Annotations of Agent Trajectories for Contrastive Self-Training | Zonghan Yang et.al. | 2403.14589 | null |
2024-03-21 | Large Language Models for Multi-Choice Question Classification of Medical Subjects | Víctor Ponce-López et.al. | 2403.14582 | null |
2024-03-21 | RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain | William James Bolton et.al. | 2403.14578 | link |
2024-03-21 | A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students’ Formative Assessment Responses in Science | Clayton Cohn et.al. | 2403.14565 | null |
2024-03-21 | The Era of Semantic Decoding | Maxime Peyrard et.al. | 2403.14562 | null |
2024-03-21 | Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Chengxu Zhuang et.al. | 2403.14551 | null |
2024-03-21 | EDT: Improving Large Language Models’ Generation by Entropy-based Dynamic Temperature Sampling | Shimao Zhang et.al. | 2403.14541 | link |
2024-03-21 | Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Han Zhao et.al. | 2403.14520 | link |
2024-03-21 | The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs) | Joschka Haltaufderheide et.al. | 2403.14473 | null |
2024-03-21 | Detoxifying Large Language Models via Knowledge Editing | Mengru Wang et.al. | 2403.14472 | link |
2024-03-21 | ChatGPT Alternative Solutions: Large Language Models Survey | Hanieh Alipour et.al. | 2403.14469 | null |
2024-03-21 | Recourse for reclamation: Chatting with generative language models | Jennifer Chien et.al. | 2403.14467 | null |
2024-03-21 | Towards Single-System Illusion in Software-Defined Vehicles – Automated, AI-Powered Workflow | Krzysztof Lebioda et.al. | 2403.14460 | null |
2024-03-21 | Multi-Level Explanations for Generative Language Models | Lucas Monteiro Paes et.al. | 2403.14459 | null |
2024-03-21 | gTBLS: Generating Tables from Text by Conditional Question Answering | Anirudh Sundar et.al. | 2403.14457 | null |
2024-03-21 | Language Models Can Reduce Asymmetry in Information Markets | Nasim Rahaman et.al. | 2403.14443 | null |
2024-03-21 | A Multimodal Approach to Device-Directed Speech Detection with Large Language Models | Dominik Wager et.al. | 2403.14438 | null |
2024-03-20 | RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition | Ziyu Liu et.al. | 2403.13805 | link |
2024-03-20 | Learning from Models and Data for Visual Grounding | Ruozhen He et.al. | 2403.13804 | null |
2024-03-20 | Reverse Training to Nurse the Reversal Curse | Olga Golovneva et.al. | 2403.13799 | null |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797 | null |
2024-03-20 | RewardBench: Evaluating Reward Models for Language Modeling | Nathan Lambert et.al. | 2403.13787 | link |
2024-03-20 | Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts | Guangzeng Han et.al. | 2403.13786 | link |
2024-03-20 | Information-Theoretic Distillation for Reference-less Summarization | Jaehun Jung et.al. | 2403.13780 | null |
2024-03-20 | Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation | Hugues Thomas et.al. | 2403.13777 | null |
2024-03-20 | Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models | Nicholas Bai et.al. | 2403.13771 | link |
2024-03-20 | Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model | Diwei Wang et.al. | 2403.13756 | null |
2024-03-20 | Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement | Catherine Arnett et.al. | 2403.13754 | null |
2024-03-20 | EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation | Atnafu Lambebo Tonja et.al. | 2403.13737 | null |
2024-03-20 | Large Language Models meet Network Slicing Management and Orchestration | Abdulhalim Dandoush et.al. | 2403.13721 | null |
2024-03-20 | SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning | Hongjun Wang et.al. | 2403.13684 | null |
2024-03-20 | PARAMANU-AYN: An Efficient Novel Generative and Instruction-tuned Language Model for Indian Legal Case Documents | Mitodru Niyogi et.al. | 2403.13681 | null |
2024-03-21 | RoleInteract: Evaluating the Social Interaction of Role-Playing Agents | Hongzhan Chen et.al. | 2403.13679 | link |
2024-03-20 | Grounding Spatial Relations in Text-Only Language Models | Gorka Azkune et.al. | 2403.13666 | link |
2024-03-21 | Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi et.al. | 2403.13638 | null |
2024-03-20 | VL-Mamba: Exploring State Space Models for Multimodal Learning | Yanyuan Qiao et.al. | 2403.13600 | null |
2024-03-20 | No more optimization rules: LLM-enabled policy-based multi-modal query optimizer (version 1) | Yifan Wang et.al. | 2403.13597 | null |
2024-03-19 | LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Zhuoshi Pan et.al. | 2403.12968 | link |
2024-03-19 | Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models | Zuyan Liu et.al. | 2403.12966 | link |
2024-03-19 | Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models | Ce Zhang et.al. | 2403.12964 | link |
2024-03-19 | Dated Data: Tracing Knowledge Cutoffs in Large Language Models | Jeffrey Cheng et.al. | 2403.12958 | link |
2024-03-19 | Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Elaine Sui et.al. | 2403.12952 | link |
2024-03-19 | Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models | Joana Ribeiro de Faria et.al. | 2403.12936 | null |
2024-03-19 | Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties | Efrain Torres-Lomas et.al. | 2403.12935 | null |
2024-03-19 | Rapid AIdeation: Generating Ideas With the Self and in Collaboration With Large Language Models | Gionnieve Lim et.al. | 2403.12928 | null |
2024-03-19 | Supporting Energy Policy Research with Large Language Models | Grant Buster et.al. | 2403.12924 | null |
2024-03-19 | Contextual AD Narration with Interleaved Multimodal Sequence | Hanlin Wang et.al. | 2403.12922 | null |
2024-03-19 | Semantic Layering in Room Segmentation via LLMs | Taehyeon Kim et.al. | 2403.12920 | null |
2024-03-19 | Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts | Sai Ashish Somayajula et.al. | 2403.12918 | link |
2024-03-19 | Yell At Your Robot: Improving On-the-Fly from Language Corrections | Lucy Xiaoyang Shi et.al. | 2403.12910 | null |
2024-03-19 | Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference | Baolin Li et.al. | 2403.12900 | null |
2024-03-19 | mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding | Anwen Hu et.al. | 2403.12895 | link |
2024-03-20 | MEDBind: Unifying Language and Multimodal Medical Data Embeddings | Yuan Gao et.al. | 2403.12894 | null |
2024-03-19 | HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning | Fucai Ke et.al. | 2403.12884 | link |
2024-03-19 | Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Zehui Chen et.al. | 2403.12881 | link |
2024-03-19 | Epistemology of Language Models: Do Language Models Have Holistic Knowledge? | Minsu Kim et.al. | 2403.12862 | null |
2024-03-19 | RASP: A Drone-based Reconfigurable Actuation and Sensing Platform Towards Ambient Intelligent Systems | Minghui Zhao et.al. | 2403.12853 | null |
2024-03-18 | Modality-Agnostic fMRI Decoding of Vision and Language | Mitja Nikolaus et.al. | 2403.11771 | null |
2024-03-18 | Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | M. Jehanzeb Mirza et.al. | 2403.11755 | link |
2024-03-18 | Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems | Aditya Narayan Sankaran et.al. | 2403.11752 | link |
2024-03-18 | Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovič et.al. | 2403.11747 | link |
2024-03-18 | TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | Lisa Weijler et.al. | 2403.11691 | null |
2024-03-18 | HDLdebugger: Streamlining HDL debugging with Large Language Models | Xufeng Yao et.al. | 2403.11671 | null |
2024-03-18 | Prioritized Semantic Learning for Zero-shot Instance Navigation | Xander Sun et.al. | 2403.11650 | link |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641 | link |
2024-03-18 | Compositional Kronecker Context Optimization for Vision-Language Models | Kun Ding et.al. | 2403.11631 | null |
2024-03-18 | Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model | Haoyun Xu et.al. | 2403.11621 | null |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines | Ekaterina Trofimova et.al. | 2403.11585 | null |
2024-03-18 | Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Wendi Li et.al. | 2403.11558 | link |
2024-03-18 | LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Shu Wang et.al. | 2403.11552 | link |
2024-03-18 | Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters | Jiazuo Yu et.al. | 2403.11549 | link |
2024-03-18 | DEE: Dual-stage Explainable Evaluation Method for Text Generation | Shenyu Zhang et.al. | 2403.11509 | null |
2024-03-18 | Do CLIPs Always Generalize Better than ImageNet Models? | Qizhou Wang et.al. | 2403.11497 | null |
2024-03-18 | VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding | Yue Fan et.al. | 2403.11481 | null |
2024-03-18 | HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models | Huy Nghiem et.al. | 2403.11456 | link |
2024-03-18 | Zero-shot Compound Expression Recognition with Visual Language Model at the 6th ABAW Challenge | Jiahe Wang et.al. | 2403.11450 | null |
2024-03-15 | VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Xiaohan Wang et.al. | 2403.10517 | null |
2024-03-15 | Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization | Ratnadira Widyasari et.al. | 2403.10507 | null |
2024-03-15 | ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment | Xiaofeng Wu et.al. | 2403.10504 | null |
2024-03-15 | Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study | Chenguang Wang et.al. | 2403.10499 | link |
2024-03-15 | Reconfigurable Robot Identification from Motion Data | Yuhang Hu et.al. | 2403.10496 | null |
2024-03-15 | Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst? | Bruno de Melo et.al. | 2403.10482 | null |
2024-03-15 | Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases | Jiarui Li et.al. | 2403.10446 | link |
2024-03-15 | Optimal Block-Level Draft Verification for Accelerating Speculative Decoding | Ziteng Sun et.al. | 2403.10444 | null |
2024-03-15 | Using an LLM to Turn Sign Spottings into Spoken Language Sentences | Ozge Mercanoglu Sincan et.al. | 2403.10434 | null |
2024-03-15 | SocialGenPod: Privacy-Friendly Generative AI Social Web Applications with Decentralised Personal Data Stores | Vidminas Vizgirda et.al. | 2403.10408 | link |
2024-03-15 | A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE | Hervé Déjean et.al. | 2403.10407 | null |
2024-03-15 | Monotonic Representation of Numeric Properties in Language Models | Benjamin Heinzerling et.al. | 2403.10381 | link |
2024-03-15 | EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models | Rocktim Jyoti Das et.al. | 2403.10378 | link |
2024-03-15 | TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale | Pengcheng Jiang et.al. | 2403.10351 | null |
2024-03-15 | Investigating grammatical abstraction in language models using few-shot learning of novel noun gender | Priyanka Sukumaran et.al. | 2403.10338 | null |
2024-03-15 | CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Shang-Hsuan Chiang et.al. | 2403.10326 | link |
2024-03-15 | NetBench: A Large-Scale and Comprehensive Network Traffic Benchmark Dataset for Foundation Models | Chen Qian et.al. | 2403.10319 | link |
2024-03-15 | Uni-SMART: Universal Science Multimodal Analysis and Research Transformer | Hengxing Cai et.al. | 2403.10301 | null |
2024-03-15 | Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models | Tian Meng et.al. | 2403.10287 | null |
2024-03-15 | Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning | Shang-Hsuan Chiang et.al. | 2403.10281 | link |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Piotr Nawrot et.al. | 2403.09636 | null |
2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635 | link |
2024-03-14 | OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning | Lingyi Hong et.al. | 2403.09634 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking | Eric Zelikman et.al. | 2403.09629 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616 | null |
2024-03-14 | MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training | Brandon McKinzie et.al. | 2403.09611 | null |
2024-03-14 | Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey | Xiaoyu Liu et.al. | 2403.09606 | null |
2024-03-14 | Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis | Gregory Coppola et.al. | 2403.09599 | null |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models | Runyu Ma et.al. | 2403.09583 | null |
2024-03-14 | Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation | Yunhao Gou et.al. | 2403.09572 | null |
2024-03-14 | Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models | Laura Fernández-Becerra et.al. | 2403.09567 | null |
2024-03-14 | Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models | Ali Nouri et.al. | 2403.09565 | null |
2024-03-14 | PreCurious: How Innocent Pre-Trained Language Models Turn into Privacy Traps | Ruixuan Liu et.al. | 2403.09562 | null |
2024-03-14 | Less is More: Data Value Estimation for Visual Instruction Tuning | Zikang Liu et.al. | 2403.09559 | null |
2024-03-14 | Logits of API-Protected LLMs Leak Proprietary Information | Matthew Finlayson et.al. | 2403.09539 | null |
2024-03-14 | VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding | Chris Kelly et.al. | 2403.09530 | null |
2024-03-14 | WavCraft: Audio Editing and Generation with Natural Language Prompts | Jinhua Liang et.al. | 2403.09527 | link |
2024-03-13 | Simple and Scalable Strategies to Continually Pre-train Large Language Models | Adam Ibrahim et.al. | 2403.08763 | link |
2024-03-13 | Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework | Jingling Li et.al. | 2403.08743 | null |
2024-03-13 | The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models | Carlo Nicolini et.al. | 2403.08739 | null |
2024-03-13 | ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation | Sayar Ghosh Roy et.al. | 2403.08737 | link |
2024-03-13 | Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Renjie Pi et.al. | 2403.08730 | null |
2024-03-14 | SOTOPIA- $π$ : Interactive Learning of Socially Intelligent Language Agents | Ruiyi Wang et.al. | 2403.08715 | link |
2024-03-13 | Review of Generative AI Methods in Cybersecurity | Yagmur Yigit et.al. | 2403.08701 | null |
2024-03-13 | TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning | Shangding Gu et.al. | 2403.08694 | link |
2024-03-13 | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages | Rik van Noord et.al. | 2403.08693 | null |
2024-03-13 | Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records | Erlend Frayling et.al. | 2403.08664 | null |
2024-03-13 | Self-Supervised Learning for Covariance Estimation | Tzvi Diskin et.al. | 2403.08662 | null |
2024-03-13 | Human Alignment of Large Language Models through Online Preference Optimisation | Daniele Calandriello et.al. | 2403.08635 | null |
2024-03-13 | MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models | Subash Neupane et.al. | 2403.08607 | null |
2024-03-14 | Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation | Daniel Honerkamp et.al. | 2403.08605 | link |
2024-03-13 | DevBench: A Comprehensive Benchmark for Software Development | Bowen Li et.al. | 2403.08604 | link |
2024-03-13 | Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments | Sitao Cheng et.al. | 2403.08593 | null |
2024-03-13 | Non-discrimination Criteria for Generative Language Models | Sara Sterlie et.al. | 2403.08564 | link |
2024-03-13 | AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Yifei Gao et.al. | 2403.08542 | link |
2024-03-13 | Language models scale reliably with over-training and on downstream tasks | Samir Yitzhak Gadre et.al. | 2403.08540 | link |
2024-03-13 | Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Christos Papadimitriou et.al. | 2403.08502 | link |
2024-03-12 | Beyond Text: Frozen Large Language Models in Visual Signal Comprehension | Lei Zhu et.al. | 2403.07874 | link |
2024-03-12 | Rethinking Generative Large Language Model Evaluation for Semantic Comprehension | Fangyun Wei et.al. | 2403.07872 | null |
2024-03-12 | Exploring Safety Generalization Challenges of Large Language Models via Code | Qibing Ren et.al. | 2403.07865 | link |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-12 | MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric | Haokun Lin et.al. | 2403.07839 | null |
2024-03-12 | DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies | William Xie et.al. | 2403.07832 | null |
2024-03-12 | The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing | Jianchen Wang et.al. | 2403.07825 | null |
2024-03-12 | Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Sainbayar Sukhbaatar et.al. | 2403.07816 | null |
2024-03-12 | Chronos: Learning the Language of Time Series | Abdul Fatir Ansari et.al. | 2403.07815 | link |
2024-03-12 | Beyond Memorization: The Challenge of Random Memory Access in Language Models | Tongyao Zhu et.al. | 2403.07805 | link |
2024-03-12 | Fine-tuning Large Language Models with Sequential Instructions | Hanxu Hu et.al. | 2403.07794 | link |
2024-03-12 | Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Carlos Jose Xavier Cruz et.al. | 2403.07769 | link |
2024-03-12 | Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings | Sahand Sharifzadeh et.al. | 2403.07750 | null |
2024-03-12 | FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models | Yan Liu et.al. | 2403.07747 | null |
2024-03-12 | Multi-modal Auto-regressive Modeling via Visual Words | Tianshuo Peng et.al. | 2403.07720 | link |
2024-03-12 | WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Alexandre Drouin et.al. | 2403.07718 | link |
2024-03-12 | StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models | Zhicheng Guo et.al. | 2403.07714 | link |
2024-03-12 | Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards | Wei Shen et.al. | 2403.07708 | null |
2024-03-12 | Large, Small or Both: A Novel Data Augmentation Framework Based on Language Models for Debiasing Opinion Summarization | Yanyue Zhang et.al. | 2403.07693 | null |
2024-03-12 | Reference-free Monolithic Preference Optimization with Odds Ratio | Jiwoo Hong et.al. | 2403.07691 | link |
2024-03-11 | Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena | Leonie Weissweiler et.al. | 2403.06965 | null |
2024-03-11 | Materials science in the era of large language models: a perspective | Ge Lei et.al. | 2403.06949 | null |
2024-03-11 | Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation | Xinyao Li et.al. | 2403.06946 | link |
2024-03-11 | Naming, Describing, and Quantifying Visual Objects in Humans and LLMs | Alberto Testoni et.al. | 2403.06935 | link |
2024-03-11 | ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Yanming Liu et.al. | 2403.06932 | link |
2024-03-11 | MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning | Yichuan Li et.al. | 2403.06914 | link |
2024-03-11 | Application of Quantum Tensor Networks for Protein Classification | Debarshi Kundu et.al. | 2403.06890 | null |
2024-03-11 | Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents | Nishchal Prasad et.al. | 2403.06872 | link |
2024-03-11 | Semantic Residual Prompts for Continual Learning | Martin Menabue et.al. | 2403.06870 | link |
2024-03-11 | Learning with Noisy Foundation Models | Hao Chen et.al. | 2403.06869 | null |
2024-03-11 | A Geospatial Approach to Predicting Desert Locust Breeding Grounds in Africa | Ibrahim Salihu Yusuf et.al. | 2403.06860 | null |
2024-03-11 | Development of a Reliable and Accessible Caregiving Language Model (CaLM) | Bambang Parmanto et.al. | 2403.06857 | null |
2024-03-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback | Yanming Liu et.al. | 2403.06840 | link |
2024-03-11 | ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts | Lyuye Zhang et.al. | 2403.06838 | null |
2024-03-11 | Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? | Egor Zverev et.al. | 2403.06833 | link |
2024-03-11 | The Power of Noise: Toward a Unified Multi-modal Knowledge Graph Representation Framework | Zhuo Chen et.al. | 2403.06832 | link |
2024-03-11 | ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Zhiwei Liu et.al. | 2403.06765 | link |
2024-03-11 | An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models | Liang Chen et.al. | 2403.06764 | link |
2024-03-11 | ALaRM: Align Language Models via Hierarchical Rewards Modeling | Yuhang Lai et.al. | 2403.06754 | link |
2024-03-08 | Bayesian Preference Elicitation with Language Models | Kunal Handa et.al. | 2403.05534 | null |
2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Machel Reid et.al. | 2403.05530 | null |
2024-03-08 | GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM | Hao Kang et.al. | 2403.05527 | link |
2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | Haoyu Lu et.al. | 2403.05525 | link |
2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | Yijiang Li et.al. | 2403.05523 | null |
2024-03-08 | Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT | Aisha Khatun et.al. | 2403.05519 | null |
2024-03-08 | Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | James Chua et.al. | 2403.05518 | link |
2024-03-08 | To Err Is Human, but Llamas Can Learn It Too | Agnes Luhtaru et.al. | 2403.05493 | link |
2024-03-08 | Will GPT-4 Run DOOM? | Adrian de Wynter et.al. | 2403.05468 | null |
2024-03-08 | Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs | Arijit Nag et.al. | 2403.05434 | null |
2024-03-08 | Towards Real-World Stickers Use: A New Dataset for Multi-Tag Sticker Recognition | Bingbing Wang et.al. | 2403.05428 | null |
2024-03-08 | FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Yuxi Liu et.al. | 2403.05408 | link |
2024-03-08 | Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | Xavier Bou et.al. | 2403.05381 | link |
2024-03-08 | VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model | Junsu Kim et.al. | 2403.05346 | null |
2024-03-08 | Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings | Wei Zhou et.al. | 2403.05338 | null |
2024-03-08 | ChatASU: Evoking LLM’s Reflexion to Truly Understand Aspect Sentiment in Dialogues | Yiding Liu et.al. | 2403.05326 | null |
2024-03-08 | RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Zihao Wang et.al. | 2403.05313 | null |
2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Jinyang Li et.al. | 2403.05307 | link |
2024-03-08 | ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications | Sotaro Takeshita et.al. | 2403.05303 | link |
2024-03-08 | Modeling Dynamic (De)Allocations of Local Memory for Translation Validation | Abhishek Rose et.al. | 2403.05302 | null |
2024-03-07 | iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries | Adam Coscia et.al. | 2403.04760 | link |
2024-03-07 | KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts | Adam Coscia et.al. | 2403.04758 | link |
2024-03-07 | LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error | Boshi Wang et.al. | 2403.04746 | link |
2024-03-08 | How Far Are We from Intelligent Visual Deductive Reasoning? | Yizhe Zhang et.al. | 2403.04732 | link |
2024-03-07 | Common 7B Language Models Already Possess Strong Math Capabilities | Chen Li et.al. | 2403.04706 | link |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701 | link |
2024-03-07 | Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification | Ekaterina Fadeeva et.al. | 2403.04696 | link |
2024-03-07 | Telecom Language Models: Must They Be Large? | Nicola Piovesan et.al. | 2403.04666 | null |
2024-03-07 | Yi: Open Foundation Models by 01.AI | 01. AI et.al. | 2403.04652 | link |
2024-03-07 | Teaching Large Language Models to Reason with Reinforcement Learning | Alex Havrilla et.al. | 2403.04642 | null |
2024-03-07 | CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Qilang Ye et.al. | 2403.04640 | link |
2024-03-07 | A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds | Xuenan Xu et.al. | 2403.04594 | link |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-07 | Wiki-TabNER:Advancing Table Interpretation Through Named Entity Recognition | Aneta Koleva et.al. | 2403.04577 | link |
2024-03-07 | Reducing self-supervised learning complexity improves weakly-supervised classification performance in computational pathology | Tim Lenz et.al. | 2403.04558 | null |
2024-03-07 | Enhancing Data Quality in Federated Fine-Tuning of Foundation Models | Wanru Zhao et.al. | 2403.04529 | null |
2024-03-07 | Where does In-context Translation Happen in Large Language Models | Suzanna Sia et.al. | 2403.04510 | null |
2024-03-07 | GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability | Zihan Luo et.al. | 2403.04483 | link |
2024-03-08 | Do Large Language Model Understand Multi-Intent Spoken Language ? | Shangjian Yin et.al. | 2403.04481 | link |
2024-03-08 | Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset | Minjin Kim et.al. | 2403.04460 | link |
2024-03-06 | Backtracing: Retrieving the Cause of the Query | Rose E. Wang et.al. | 2403.03956 | link |
2024-03-06 | Bridging Language and Items for Retrieval and Recommendation | Yupeng Hou et.al. | 2403.03952 | link |
2024-03-06 | The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models | Adithya Bhaskar et.al. | 2403.03942 | link |
2024-03-06 | Did Translation Models Get More Robust Without Anyone Even Noticing? | Ben Peters et.al. | 2403.03923 | null |
2024-03-06 | Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing | Asmita et.al. | 2403.03897 | link |
2024-03-06 | IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Indraneil Paul et.al. | 2403.03894 | link |
2024-03-06 | From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models | Luiza Pozzobon et.al. | 2403.03893 | link |
2024-03-06 | FaaF: Facts as a Function for the evaluation of RAG systems | Vasileios Katranidis et.al. | 2403.03888 | link |
2024-03-06 | SaulLM-7B: A pioneering Large Language Model for Law | Pierre Colombo et.al. | 2403.03883 | null |
2024-03-06 | Learning to Decode Collaboratively with Multiple Language Models | Shannon Zejiang Shen et.al. | 2403.03870 | link |
2024-03-06 | On the Origins of Linear Representations in Large Language Models | Yibo Jiang et.al. | 2403.03867 | null |
2024-03-06 | KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions | Fangyuan Xu et.al. | 2403.03866 | null |
2024-03-06 | Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning | Deepanway Ghosal et.al. | 2403.03864 | link |
2024-03-06 | X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification | Hanzi Xu et.al. | 2403.03863 | link |
2024-03-06 | Designing Informative Metrics for Few-Shot Example Selection | Rishabh Adiga et.al. | 2403.03861 | null |
2024-03-06 | Emojinize : Enriching Any Text with Emoji Translations | Lars Henning Klein et.al. | 2403.03857 | null |
2024-03-06 | ShortGPT: Layers in Large Language Models are More Redundant Than You Expect | Xin Men et.al. | 2403.03853 | null |
2024-03-06 | Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ | Carolin Holtermann et.al. | 2403.03814 | link |
2024-03-06 | Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery | Wei Zhang et.al. | 2403.03790 | null |
2024-03-06 | PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion | Zekai Zhang et.al. | 2403.03788 | link |
2024-03-05 | The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning | Nathaniel Li et.al. | 2403.03218 | null |
2024-03-05 | CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments | Savitha Sam Abraham et.al. | 2403.03203 | null |
2024-03-05 | Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement | Rafaela Martelo et.al. | 2403.03188 | link |
2024-03-05 | Reliable, Adaptable, and Attributable Language Models with Retrieval | Akari Asai et.al. | 2403.03187 | null |
2024-03-05 | MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting | Fangchen Liu et.al. | 2403.03174 | null |
2024-03-05 | SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Peng Qi et.al. | 2403.03170 | null |
2024-03-05 | PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset | Arda Uzunoğlu et.al. | 2403.03167 | link |
2024-03-05 | Quantum Many-Body Physics Calculations with Large Language Models | Haining Pan et.al. | 2403.03154 | null |
2024-03-05 | Language Guided Exploration for RL Agents in Text Environments | Hitesh Golchha et.al. | 2403.03141 | null |
2024-03-05 | CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following | Kaiyan Zhang et.al. | 2403.03129 | null |
2024-03-05 | Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution | Flor Miriam Plaza-del-Arco et.al. | 2403.03121 | link |
2024-03-05 | “In Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng et.al. | 2403.03102 | null |
2024-03-05 | KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents | Yuqi Zhu et.al. | 2403.03101 | link |
2024-03-05 | Learning to Use Tools via Cooperative and Interactive Agents | Zhengliang Shi et.al. | 2403.03031 | link |
2024-03-05 | Socratic Reasoning Improves Positive Text Rewriting | Anmol Goel et.al. | 2403.03029 | null |
2024-03-05 | Word Importance Explains How Prompts Affect Language Model Outputs | Stefan Hackmann et.al. | 2403.03028 | null |
2024-03-05 | OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following | Haochen Shi et.al. | 2403.03017 | null |
2024-03-05 | Knowledge Graphs as Context Sources for LLM-Based Explanations of Learning Recommendations | Hasan Abu-Rasheed et.al. | 2403.03008 | null |
2024-03-05 | Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models | Gen Luo et.al. | 2403.03003 | link |
2024-03-05 | Localized Zeroth-Order Prompt Optimization | Wenyang Hu et.al. | 2403.02993 | null |
2024-03-02 | LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems | Tasnim Ahmed et.al. | 2403.01342 | null |
2024-03-02 | Making Hybrid Languages: A Recipe | Leif Andersen et.al. | 2403.01335 | null |
2024-03-02 | Chaining thoughts and LLMs to learn DNA structural biophysics | Tyler D. Ross et.al. | 2403.01332 | link |
2024-03-02 | VBART: The Turkish LLM | Meliksah Turker et.al. | 2403.01308 | null |
2024-03-02 | ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation | Moran Yanuka et.al. | 2403.01306 | link |
2024-03-02 | Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Alexander Scarlatos et.al. | 2403.01304 | link |
2024-03-02 | NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Tianyi Zhang et.al. | 2403.01273 | link |
2024-03-02 | Employing LLMs for Incident Response Planning and Review | Sam Hays et.al. | 2403.01271 | null |
2024-03-02 | Dissecting Language Models: Machine Unlearning via Selective Pruning | Nicholas Pochinkov et.al. | 2403.01267 | link |
2024-03-02 | Accelerating Greedy Coordinate Gradient via Probe Sampling | Yiran Zhao et.al. | 2403.01251 | link |
2024-03-02 | SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Ziniu Hu et.al. | 2403.01248 | null |
2024-03-02 | Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal | Jianheng Huang et.al. | 2403.01244 | link |
2024-03-02 | IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Ruikang Liu et.al. | 2403.01241 | link |
2024-03-02 | Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy | Jamie Hayes et.al. | 2403.01218 | null |
2024-03-02 | API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access | Jiayuan Su et.al. | 2403.01216 | null |
2024-03-02 | Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Shuo Yang et.al. | 2403.01209 | null |
2024-03-02 | The Case for Animal-Friendly AI | Sankalpa Ghose et.al. | 2403.01199 | null |
2024-03-02 | DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Shanghaoran Quan et.al. | 2403.01197 | link |
2024-03-02 | RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots | Philip Feldman. James R. Foulds et.al. | 2403.01193 | null |
2024-03-02 | Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding | Ha-Thanh Nguyen et.al. | 2403.01185 | null |
2024-02-29 | The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? | Alex Gu et.al. | 2402.19475 | null |
2024-02-29 | The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Weiyun Wang et.al. | 2402.19474 | link |
2024-02-29 | Retrieval-Augmented Generation for AI-Generated Content: A Survey | Penghao Zhao et.al. | 2402.19473 | link |
2024-02-29 | Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling | Gabriel Grand et.al. | 2402.19471 | null |
2024-03-01 | TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders et.al. | 2402.19467 | null |
2024-02-29 | Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models | Chen Qian et.al. | 2402.19465 | link |
2024-02-29 | Curiosity-driven Red-teaming for Large Language Models | Zhang-Wei Hong et.al. | 2402.19464 | link |
2024-02-29 | Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap | Saurabh Srivastava et.al. | 2402.19450 | link |
2024-02-29 | Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models | Frederik Kunstner et.al. | 2402.19449 | null |
2024-02-29 | ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Yifei Zhou et.al. | 2402.19446 | link |
2024-02-29 | Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation | Jonathan Yang et.al. | 2402.19432 | null |
2024-02-29 | Compositional API Recommendation for Library-Oriented Code Generation | Zexiong Ma et.al. | 2402.19431 | null |
2024-02-29 | Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | Soham De et.al. | 2402.19427 | null |
2024-02-29 | Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines | Lijia Ma et.al. | 2402.19421 | null |
2024-02-29 | PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Mainak Ghosh et.al. | 2402.19411 | null |
2024-02-29 | On the Scaling Laws of Geographical Representation in Language Models | Nathan Godey et.al. | 2402.19406 | null |
2024-02-29 | Entity-Aware Multimodal Alignment Framework for News Image Captioning | Junzhe Zhang et.al. | 2402.19404 | null |
2024-02-29 | Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Match Human Crowd Accuracy | Philipp Schoenegger et.al. | 2402.19379 | null |
2024-02-29 | OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models | Jenish Maharjan et.al. | 2402.19371 | null |
2024-02-29 | SoK: Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency | Akila Wickramasekara et.al. | 2402.19366 | null |
2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571 | link |
2024-02-28 | Diffusion Language Models Are Versatile Protein Learners | Xinyou Wang et.al. | 2402.18567 | link |
2024-02-28 | A Categorization of Complexity Classes for Information Retrieval and Synthesis Using Natural Logic | Gregory Coppola et.al. | 2402.18566 | null |
2024-02-28 | Approaching Human-Level Forecasting with Language Models | Danny Halawi et.al. | 2402.18563 | null |
2024-02-28 | Implicit Bias of Next-Token Prediction | Christos Thrampoulidis et.al. | 2402.18551 | null |
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
2024-02-28 | Few-Shot Fairness: Unveiling LLM’s Potential for Fairness-Aware Classification | Garima Chhikara et.al. | 2402.18502 | null |
2024-02-28 | Language Models Represent Beliefs of Self and Others | Wentao Zhu et.al. | 2402.18496 | null |
2024-02-28 | IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding | Lanyun Zhu et.al. | 2402.18476 | null |
2024-02-28 | Meta-Task Prompting Elicits Embedding from Large Language Models | Yibin Lei et.al. | 2402.18458 | link |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-28 | Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication | Weize Chen et.al. | 2402.18439 | link |
2024-02-28 | A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models | Xiujie Song et.al. | 2402.18409 | link |
2024-02-28 | Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning | Hanyao Wang et.al. | 2402.18400 | null |
2024-02-28 | Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models | Ercong Nie et.al. | 2402.18397 | null |
2024-02-28 | The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA | Yiming Li et.al. | 2402.18385 | link |
2024-02-28 | Large Language Models As Evolution Strategies | Robert Tjarko Lange et.al. | 2402.18381 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | link |
2024-02-28 | VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models | Seoyeon Kim et.al. | 2402.18374 | link |
2024-02-28 | Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning | Jiachun Li et.al. | 2402.18344 | link |
2024-02-27 | ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Zekun Qi et.al. | 2402.17766 | link |
2024-02-27 | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | Shuming Ma et.al. | 2402.17764 | null |
2024-02-27 | Massive Activations in Large Language Models | Mingjie Sun et.al. | 2402.17762 | link |
2024-02-27 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | Evaluating Very Long-Term Conversational Memory of LLM Agents | Adyasha Maharana et.al. | 2402.17753 | null |
2024-02-27 | Tower: An Open Multilingual Large Language Model for Translation-Related Tasks | Duarte M. Alves et.al. | 2402.17733 | link |
2024-02-27 | AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa et.al. | 2402.17717 | link |
2024-02-27 | Case-Based or Rule-Based: How Do Transformers Do the Math? | Yi Hu et.al. | 2402.17709 | link |
2024-02-27 | RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Jing Huang et.al. | 2402.17700 | link |
2024-02-27 | NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents | Tamara Czinczoll et.al. | 2402.17682 | link |
2024-02-27 | The Emergence of Large Language Models in Static Analysis: A First Look through Micro-Benchmarks | Ashwin Prasad Shivarpatna Venkatesh et.al. | 2402.17679 | null |
2024-02-27 | CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention | Mohammad Sadil Khan et.al. | 2402.17678 | null |
2024-02-27 | Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models | Yunpeng Huang et.al. | 2402.17671 | null |
2024-02-27 | Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs | Tanise Ceron et.al. | 2402.17649 | null |
2024-02-27 | SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation | Shuangrui Ding et.al. | 2402.17645 | link |
2024-02-27 | Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data | Xiao Liu et.al. | 2402.17644 | link |
2024-02-27 | Variational Learning is Effective for Large Deep Networks | Yuesong Shen et.al. | 2402.17641 | link |
2024-02-27 | Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling | David S. W. Williams et.al. | 2402.17622 | null |
2024-02-27 | Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | Wenqi Zhang et.al. | 2402.17574 | link |
2024-02-27 | Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with Gradient-based Model Optimizers | Xinyu Tang et.al. | 2402.17564 | link |
2024-02-26 | Integrating Large Language Models with Graphical Session-Based Recommendation | Naicheng Guo et.al. | 2402.16539 | null |
2024-02-26 | LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments | Junzhe Chen et.al. | 2402.16499 | link |
2024-02-26 | On Languaging a Simulation Engine | Han Liu et.al. | 2402.16482 | null |
2024-02-26 | Unveiling ChatGPT’s Usage in Open Source Projects: A Mining-based Study | Rosalia Tufano et.al. | 2402.16480 | null |
2024-02-26 | mEdIT: Multilingual Text Editing via Instruction Tuning | Vipul Raheja et.al. | 2402.16472 | link |
2024-02-26 | Unveiling Vulnerability of Self-Attention | Khai Jiet Liong et.al. | 2402.16470 | link |
2024-02-26 | Defending LLMs against Jailbreaking Attacks via Backtranslation | Yihan Wang et.al. | 2402.16459 | link |
2024-02-26 | ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing | Liuzhenghao Lv et.al. | 2402.16445 | link |
2024-02-26 | ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors | Zhexin Zhang et.al. | 2402.16444 | link |
2024-02-26 | Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Tianyi Tang et.al. | 2402.16438 | link |
2024-02-26 | RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions | Yuansen Zhang et.al. | 2402.16431 | null |
2024-02-26 | Predicting Sustainable Development Goals Using Course Descriptions – from LLMs to Conventional Foundation Models | Lev Kharlashkin et.al. | 2402.16420 | null |
2024-02-26 | From RAGs to riches: Using large language models to write documents for clinical trials | Nigel Markey et.al. | 2402.16406 | null |
2024-02-26 | MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Shiwen Ni et.al. | 2402.16389 | link |
2024-02-26 | Immunization against harmful fine-tuning attacks | Domenic Rosati et.al. | 2402.16382 | null |
2024-02-26 | Improving LLM-based Machine Translation with Systematic Self-Correction | Zhaopeng Feng et.al. | 2402.16379 | link |
2024-02-26 | Unraveling Babel: Exploring Multilingual Activation Patterns within Large Language Models | Weize Liu et.al. | 2402.16367 | null |
2024-02-26 | LLM Inference Unveiled: Survey and Roofline Model Insights | Zhihang Yuan et.al. | 2402.16363 | link |
2024-02-26 | Layer-wise Regularized Dropout for Neural Language Models | Shiwen Ni et.al. | 2402.16361 | null |
2024-02-26 | An Integrated Data Processing Framework for Pretraining Foundation Models | Yiding Sun et.al. | 2402.16358 | link |
2024-02-23 | AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning | Jianguo Zhang et.al. | 2402.15506 | link |
2024-02-23 | API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Kinjal Basu et.al. | 2402.15491 | link |
2024-02-23 | Prejudice and Caprice: A Statistical Framework for Measuring Social Discrimination in Large Language Models | Yiran Liu et.al. | 2402.15481 | null |
2024-02-23 | Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization | Swaroop Nath et.al. | 2402.15473 | link |
2024-02-23 | Repetition Improves Language Model Embeddings | Jacob Mitchell Springer et.al. | 2402.15449 | link |
2024-02-23 | A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models | Stefan Hegselmann et.al. | 2402.15422 | link |
2024-02-23 | PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Simon Holk et.al. | 2402.15420 | null |
2024-02-23 | Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy? | Nader Asadi et.al. | 2402.15414 | null |
2024-02-23 | Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Kechun Xu et.al. | 2402.15402 | link |
2024-02-23 | Explorations of Self-Repair in Language Models | Cody Rushing et.al. | 2402.15390 | link |
2024-02-23 | Safe Task Planning for Language-Instructed Multi-Robot Systems using Conformal Prediction | Jun Wang et.al. | 2402.15368 | null |
2024-02-23 | Farsight: Fostering Responsible AI Awareness During AI Application Prototyping | Zijie J. Wang et.al. | 2402.15350 | link |
2024-02-23 | NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov et.al. | 2402.15343 | link |
2024-02-23 | Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies | Nitesh Kumar et.al. | 2402.15337 | null |
2024-02-23 | GPTVQ: The Blessing of Dimensionality for LLM Quantization | Mart van Baalen et.al. | 2402.15319 | null |
2024-02-23 | ArabianGPT: Native Arabic GPT-based Large Language | Anis Koubaa et.al. | 2402.15313 | null |
2024-02-23 | Counterfactual Generation with Identifiability Guarantees | Hanqi Yan et.al. | 2402.15309 | link |
2024-02-23 | Representing Online Handwriting for Recognition in Large Vision-Language Models | Anastasiia Fadeeva et.al. | 2402.15307 | null |
2024-02-23 | How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries | Somnath Banerjee et.al. | 2402.15302 | link |
2024-02-23 | Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models | Yuzhe Zhang et.al. | 2402.15301 | null |
2024-02-22 | PALO: A Polyglot Large Multimodal Model for 5B People | Muhammad Maaz et.al. | 2402.14818 | link |
2024-02-22 | Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging | Yuzhe Yang et.al. | 2402.14815 | link |
2024-02-22 | WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Lianghui Zhu et.al. | 2402.14812 | link |
2024-02-22 | Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking | Nikhil Prakash et.al. | 2402.14811 | null |
2024-02-22 | CriticBench: Benchmarking LLMs for Critique-Correct Reasoning | Zicheng Lin et.al. | 2402.14809 | link |
2024-02-22 | RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Lei Zhu et.al. | 2402.14808 | link |
2024-02-22 | A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Nikhil Behari et.al. | 2402.14807 | null |
2024-02-22 | Identifying Multiple Personalities in Large Language Models with External Evaluation | Xiaoyang Song et.al. | 2402.14805 | null |
2024-02-22 | Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu et.al. | 2402.14800 | link |
2024-02-22 | Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir et.al. | 2402.14798 | null |
2024-02-22 | Zero-shot cross-lingual transfer in instruction tuning of large language model | Nadezhda Chirkova et.al. | 2402.14778 | null |
2024-02-22 | 2D Matryoshka Sentence Embeddings | Xianming Li et.al. | 2402.14776 | link |
2024-02-22 | DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models | Yuhang Cao et.al. | 2402.14767 | link |
2024-02-22 | MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues | Ge Bai et.al. | 2402.14762 | link |
2024-02-22 | Generalizing Reward Modeling for Out-of-Distribution Preference Learning | Chen Jia et.al. | 2402.14760 | link |
2024-02-22 | Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation | Jiawei Wang et.al. | 2402.14744 | link |
2024-02-22 | Dependency Annotation of Ottoman Turkish with Multilingual BERT | Şaziye Betül Özateş et.al. | 2402.14743 | null |
2024-02-22 | Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs | Arash Ahmadian et.al. | 2402.14740 | null |
2024-02-22 | Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models | Seungduk Kim et.al. | 2402.14714 | link |
2024-02-22 | IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Honghao Gui et.al. | 2402.14710 | link |
2024-02-21 | Coercing LLMs to do and reveal (almost) anything | Jonas Geiping et.al. | 2402.14020 | link |
2024-02-21 | Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina et.al. | 2402.14016 | link |
2024-02-21 | OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Chaoqun He et.al. | 2402.14008 | link |
2024-02-21 | Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models | Zhiwei He et.al. | 2402.14007 | link |
2024-02-21 | Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models | Aline Ioste et.al. | 2402.14002 | null |
2024-02-21 | Analysing The Impact of Sequence Composition on Language Model Pre-Training | Yu Zhao et.al. | 2402.13991 | link |
2024-02-21 | Towards Building Multilingual Language Model for Medicine | Pengcheng Qiu et.al. | 2402.13963 | link |
2024-02-21 | Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Rahul Zalkikar et.al. | 2402.13954 | link |
2024-02-21 | Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning | Debjit Paul et.al. | 2402.13950 | null |
2024-02-21 | Do Efficient Transformers Really Save Computation? | Kai Yang et.al. | 2402.13934 | null |
2024-02-21 | Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content | Federico Bianchi et.al. | 2402.13926 | null |
2024-02-21 | SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra et.al. | 2402.13919 | link |
2024-02-21 | What Linguistic Features and Languages are Important in LLM Translation? | Ryandito Diandaru et.al. | 2402.13917 | null |
2024-02-21 | Calibrating Large Language Models with Sample Consistency | Qing Lyu et.al. | 2402.13904 | null |
2024-02-21 | Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models | Chenyang Lyu et.al. | 2402.13887 | null |
2024-02-21 | $\texttt{Se}^2$: $\textit{Se}$quential Example $\textit{Se}$ lection for In-Context Learning | Haoyu Liu et.al. | 2402.13874 | link |
2024-02-21 | An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | Mohammad Amaz Uddin et.al. | 2402.13871 | null |
2024-02-21 | Kuaiji: the First Chinese Accounting Large Language Model | Jiayuan Luo et.al. | 2402.13866 | null |
2024-02-21 | RealDex: Towards Human-like Grasping for Robotic Dexterous Hand | Yumeng Liu et.al. | 2402.13853 | null |
2024-02-21 | VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models | Jiawei Liang et.al. | 2402.13851 | null |
2024-02-20 | Towards audio language modeling – an overview | Haibin Wu et.al. | 2402.13236 | null |
2024-02-20 | Unlocking Insights: Semantic Search in Jupyter Notebooks | Lan Li et.al. | 2402.13234 | null |
2024-02-20 | A Touch, Vision, and Language Dataset for Multimodal Alignment | Letian Fu et.al. | 2402.13232 | link |
2024-02-20 | Investigating Cultural Alignment of Large Language Models | Badr AlKhamissi et.al. | 2402.13231 | link |
2024-02-20 | Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive | Arka Pal et.al. | 2402.13228 | link |
2024-02-20 | AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning | Qiao Jin et.al. | 2402.13225 | null |
2024-02-20 | RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian | Adrian Cosma et.al. | 2402.13222 | link |
2024-02-20 | How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts | Yusu Qian et.al. | 2402.13220 | null |
2024-02-20 | Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A | Benjamin Plaut et.al. | 2402.13213 | link |
2024-02-20 | Soft Self-Consistency Improves Language Model Agents | Han Wang et.al. | 2402.13212 | link |
2024-02-20 | Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation | Dongjin Kang et.al. | 2402.13211 | null |
2024-02-20 | Bayesian Reward Models for LLM Alignment | Adam X. Yang et.al. | 2402.13210 | null |
2024-02-20 | How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Marco Gaido et.al. | 2402.13208 | link |
2024-02-20 | Question Calibration and Multi-Hop Modeling for Temporal Question Answering | Chao Xue et.al. | 2402.13188 | null |
2024-02-20 | What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents | Mingyu Jin et.al. | 2402.13184 | link |
2024-02-20 | DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models | Norman Di Palo et.al. | 2402.13181 | null |
2024-02-20 | Benchmarking Retrieval-Augmented Generation for Medicine | Guangzhi Xiong et.al. | 2402.13178 | link |
2024-02-20 | Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou et.al. | 2402.13148 | null |
2024-02-20 | OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog | Adnen Abdessaied et.al. | 2402.13146 | null |
2024-02-20 | The Hidden Space of Transformer Language Adapters | Jesujoba O. Alabi et.al. | 2402.13137 | link |
2024-02-19 | Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Zhuoming Chen et.al. | 2402.12374 | link |
2024-02-19 | AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye et.al. | 2402.12370 | link |
2024-02-19 | A Critical Evaluation of AI Feedback for Aligning Large Language Models | Archit Sharma et.al. | 2402.12366 | link |
2024-02-19 | Emergent Word Order Universals from Cognitively-Motivated Language Models | Tatsuki Kuribayashi et.al. | 2402.12363 | link |
2024-02-19 | Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge | Julien Delile et.al. | 2402.12352 | null |
2024-02-19 | GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations | Jinhao Duan et.al. | 2402.12348 | link |
2024-02-19 | Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Zhanhui Zhou et.al. | 2402.12343 | link |
2024-02-19 | Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Christian Schlarmann et.al. | 2402.12336 | link |
2024-02-19 | Query-Based Adversarial Prompt Generation | Jonathan Hayase et.al. | 2402.12329 | null |
2024-02-19 | Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents | Zengqing Wu et.al. | 2402.12327 | link |
2024-02-19 | ARKS: Active Retrieval in Knowledge Soup for Code Generation | Hongjin Su et.al. | 2402.12317 | link |
2024-02-19 | Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports | Felix J. Dorfner et.al. | 2402.12298 | null |
2024-02-19 | KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu et.al. | 2402.12291 | null |
2024-02-19 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Adaptive Skeleton Graph Decoding | Shuowei Jin et.al. | 2402.12280 | null |
2024-02-19 | Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks | Nadezhda Chirkova et.al. | 2402.12279 | null |
2024-02-19 | Explain then Rank: Scale Calibration of Neural Rankers Using Natural Language Explanations from Large Language Models | Puxuan Yu et.al. | 2402.12276 | link |
2024-02-19 | High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models | Michela Lorandi et.al. | 2402.12267 | link |
2024-02-19 | Uncertainty quantification in fine-tuned LLMs using LoRA ensembles | Oleksandr Balabanov et.al. | 2402.12264 | null |
2024-02-19 | NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Jonathan Zheng et.al. | 2402.12261 | link |
2024-02-16 | PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter | Junfei Xiao et.al. | 2402.10896 | null |
2024-02-16 | RLVF: Learning from Verbal Feedback without Overgeneralization | Moritz Stephan et.al. | 2402.10893 | link |
2024-02-16 | Instruction Diversity Drives Generalization To Unseen Tasks | Dylan Zhang et.al. | 2402.10891 | null |
2024-02-16 | When is Tree Search Useful for LLM Planning? It Depends on the Discriminator | Ziru Chen et.al. | 2402.10890 | link |
2024-02-16 | Multi-modal preference alignment remedies regression of visual instruction tuning on language model | Shengzhi Li et.al. | 2402.10884 | link |
2024-02-16 | EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models | Muhammad Shihab Rashid et.al. | 2402.10866 | link |
2024-02-16 | Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities | Mingyu Jin et.al. | 2402.10835 | null |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Quantifying the Persona Effect in LLM Simulations | Tiancheng Hu et.al. | 2402.10811 | null |
2024-02-16 | Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond | Yongqi Li et.al. | 2402.10805 | null |
2024-02-16 | EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge | Xuan Shen et.al. | 2402.10787 | link |
2024-02-16 | A Condensed Transition Graph Framework for Zero-shot Link Prediction with Large Language Models | Mingchen Li et.al. | 2402.10779 | null |
2024-02-16 | AutoGPT+P: Affordance-based Task Planning with Large Language Models | Timo Birr et.al. | 2402.10778 | null |
2024-02-16 | How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs? | Ehsan Doostmohammadi et.al. | 2402.10770 | null |
2024-02-16 | Distillation Enhanced Generative Retrieval | Yongqi Li et.al. | 2402.10769 | null |
2024-02-16 | Inference to the Best Explanation in Large Language Models | Dhairya Dalal et.al. | 2402.10767 | null |
2024-02-16 | When Dataflow Analysis Meets Large Language Models | Chengpeng Wang et.al. | 2402.10754 | link |
2024-02-16 | ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages | Junjie Ye et.al. | 2402.10753 | link |
2024-02-16 | GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models | Pengcheng Jiang et.al. | 2402.10744 | link |
2024-02-16 | Let’s Learn Step by Step: Enhancing In-Context Learning Ability with Curriculum Learning | Yinpeng Liu et.al. | 2402.10738 | link |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210 | null |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207 | link |
2024-02-15 | Chain-of-Thought Reasoning Without Prompting | Xuezhi Wang et.al. | 2402.10200 | null |
2024-02-15 | A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents | Lingbo Mo et.al. | 2402.10196 | link |
2024-02-15 | BitDelta: Your Fine-Tune May Only Be Worth One Bit | James Liu et.al. | 2402.10193 | link |
2024-02-15 | Uncertainty Decomposition and Quantification for In-Context Learning of Large Language Models | Chen Ling et.al. | 2402.10189 | link |
2024-02-15 | Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective | Tianyi Qiu et.al. | 2402.10184 | null |
2024-02-15 | TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation | Yaoxiang Wang et.al. | 2402.10178 | null |
2024-02-15 | OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Shubham Toshniwal et.al. | 2402.10176 | link |
2024-02-15 | Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence | Yinhong Liu et.al. | 2402.10175 | link |
2024-02-15 | OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Ali AhmadiTeshnizi et.al. | 2402.10172 | link |
2024-02-15 | Data Engineering for Scaling Language Models to 128K Context | Yao Fu et.al. | 2402.10171 | link |
2024-02-15 | Knowledge-Infused LLM-Powered Conversational Health Agent: A Case Study for Diabetes Patients | Mahyar Abbasian et.al. | 2402.10153 | null |
2024-02-15 | ControlLM: Crafting Diverse Personalities for Language Models | Yixuan Weng et.al. | 2402.10151 | link |
2024-02-15 | TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles | Yinhong Liu et.al. | 2402.10137 | null |
2024-02-15 | Zero-Shot Reasoning: Personalized Content Generation Without the Cold Start Problem | Davor Hafnar et.al. | 2402.10133 | link |
2024-02-15 | Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning | Ming Li et.al. | 2402.10110 | link |
2024-02-15 | Quantized Embedding Vectors for Controllable Diffusion Language Models | Cheng Kang et.al. | 2402.10107 | null |
2024-02-15 | GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving | Jiaxin Zhang et.al. | 2402.10104 | link |
2024-02-15 | Any-Shift Prompting for Generalization over Distributions | Zehao Xiao et.al. | 2402.10099 | null |
2024-02-14 | AQA-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability | Siwei Yang et.al. | 2402.09404 | link |
2024-02-14 | Reinforcement Learning from Human Feedback with Active Queries | Kaixuan Ji et.al. | 2402.09401 | null |
2024-02-14 | Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference | Harry Dong et.al. | 2402.09398 | link |
2024-02-14 | LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset | Botao Yu et.al. | 2402.09391 | link |
2024-02-14 | HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation | Yihao Fang et.al. | 2402.09390 | link |
2024-02-14 | Transformers Can Achieve Length Generalization But Not Robustly | Yongchao Zhou et.al. | 2402.09371 | null |
2024-02-14 | Pseudorandom Error-Correcting Codes | Miranda Christ et.al. | 2402.09370 | null |
2024-02-14 | Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking | Yi Fung et.al. | 2402.09369 | link |
2024-02-14 | Copyright Traps for Large Language Models | Matthieu Meeus et.al. | 2402.09363 | link |
2024-02-14 | HiRE: High Recall Approximate Top- $k$ Estimation for Efficient LLM Inference | Yashas Samaga B L et.al. | 2402.09360 | null |
2024-02-14 | Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop | Maryam Amirizaniani et.al. | 2402.09346 | null |
2024-02-14 | Mitigating Reward Hacking via Information-Theoretic Reward Modeling | Yuchun Miao et.al. | 2402.09345 | link |
2024-02-14 | AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach | Maryam Amirizaniani et.al. | 2402.09334 | null |
2024-02-14 | ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization | Feifan Song et.al. | 2402.09320 | link |
2024-02-14 | Embracing the black box: Heading towards foundation models for causal discovery from time series data | Gideon Stein et.al. | 2402.09305 | link |
2024-02-14 | Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code | Vahid Majdinasab et.al. | 2402.09299 | link |
2024-02-14 | Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey | Zhichen Dong et.al. | 2402.09283 | link |
2024-02-14 | Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies | Yining Huang et.al. | 2402.09282 | null |
2024-02-14 | Personalized Large Language Models | Stanisław Woźniak et.al. | 2402.09269 | null |
2024-02-14 | Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation | Xiaoying Zhang et.al. | 2402.09267 | null |
2024-02-13 | Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance | Linxi Zhao et.al. | 2402.08680 | null |
2024-02-13 | COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability | Xingang Guo et.al. | 2402.08679 | link |
2024-02-13 | Human Curriculum Effects Emerge with In-Context Learning in Neural Networks | Jacob Russin et.al. | 2402.08674 | null |
2024-02-13 | Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models | Yuqing Liu et.al. | 2402.08670 | null |
2024-02-13 | Improving Generalization in Semantic Parsing by Increasing Natural Language Variation | Irina Saparina et.al. | 2402.08666 | link |
2024-02-13 | The Last JITAI? The Unreasonable Effectiveness of Large Language Models in Issuing Just-in-Time Adaptive Interventions: Fostering Physical Activity in a Prospective Cardiac Rehabilitation Setting | David Haag et.al. | 2402.08658 | null |
2024-02-13 | PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs | Michael Dorkenwald et.al. | 2402.08657 | null |
2024-02-13 | Tandem Transformers for Inference Efficient LLMs | Aishwarya P S et.al. | 2402.08644 | null |
2024-02-13 | SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages | Nedjma Ousidhoum et.al. | 2402.08638 | null |
2024-02-13 | Knowledge Editing on Black-box Large Language Models | Xiaoshuai Song et.al. | 2402.08631 | link |
2024-02-13 | Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning | Haeju Lee et.al. | 2402.08594 | link |
2024-02-13 | Test-Time Backdoor Attacks on Multimodal Large Language Models | Dong Lu et.al. | 2402.08577 | link |
2024-02-13 | Online Foundation Model Selection in Robotics | Po-han Li et.al. | 2402.08570 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Artificial Intelligence for Literature Reviews: Opportunities and Challenges | Francisco Bolanos et.al. | 2402.08565 | null |
2024-02-13 | Higher Layers Need More LoRA Experts | Chongyang Gao et.al. | 2402.08562 | link |
2024-02-13 | Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback | Vineet Bhat et.al. | 2402.08546 | null |
2024-02-13 | The Application of ChatGPT in Responding to Questions Related to the Boston Bowel Preparation Scale | Xiaoqiang Liu et.al. | 2402.08492 | null |
2024-02-13 | Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models | Shaeke Salman et.al. | 2402.08473 | null |
2024-02-13 | Large Language Models for the Automated Analysis of Optimization Algorithms | Camilo Chacón Sartori et.al. | 2402.08472 | link |
2024-02-12 | A systematic investigation of learnability from single child linguistic input | Yulu Qin et.al. | 2402.07899 | link |
2024-02-12 | Suppressing Pink Elephants with Direct Principle Feedback | Louis Castricato et.al. | 2402.07896 | null |
2024-02-12 | WildfireGPT: Tailored Large Language Model for Wildfire Analysis | Yangxinyu Xie et.al. | 2402.07877 | null |
2024-02-12 | Policy Improvement using Language Feedback Models | Victor Zhong et.al. | 2402.07876 | link |
2024-02-12 | PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Soroush Nasiriany et.al. | 2402.07872 | null |
2024-02-12 | Scaling Laws for Fine-Grained Mixture of Experts | Jakub Krajewski et.al. | 2402.07871 | link |
2024-02-12 | PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models | Wei Zou et.al. | 2402.07867 | link |
2024-02-12 | Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Siddharth Karamcheti et.al. | 2402.07865 | link |
2024-02-12 | AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy | Philipp Schoenegger et.al. | 2402.07862 | null |
2024-02-12 | Lissard: Long and Simple Sequential Reasoning Datasets | Mirelle Bueno et.al. | 2402.07859 | link |
2024-02-12 | Mercury: An Efficiency Benchmark for LLM Code Synthesis | Mingzhe Du et.al. | 2402.07844 | link |
2024-02-12 | Do Membership Inference Attacks Work on Large Language Models? | Michael Duan et.al. | 2402.07841 | link |
2024-02-12 | Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model | Ahmet Üstün et.al. | 2402.07827 | null |
2024-02-12 | Differentially Private Zeroth-Order Methods for Scalable Large Language Model Finetuning | Z Liu et.al. | 2402.07818 | null |
2024-02-12 | Injecting Wiktionary to improve token-level contextual representations using contrastive learning | Anna Mosolova et.al. | 2402.07817 | null |
2024-02-12 | Retrieval-Augmented Thought Process as Sequential Decision Making | Thomas Pouplin et.al. | 2402.07812 | null |
2024-02-12 | Empowering Federated Learning for Massive Models with NVIDIA FLARE | Holger R. Roth et.al. | 2402.07792 | null |
2024-02-12 | TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Hui Liu et.al. | 2402.07776 | link |
2024-02-12 | Quantitative knowledge retrieval from large language models | David Selby et.al. | 2402.07770 | link |
2024-02-12 | Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model | Mikail Khona et.al. | 2402.07757 | null |
2024-02-09 | Feedback Loops With Language Models Drive In-Context Reward Hacking | Alexander Pan et.al. | 2402.06627 | link |
2024-02-09 | Understanding the Effects of Iterative Prompting on Truthfulness | Satyapriya Krishna et.al. | 2402.06625 | null |
2024-02-09 | Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Shivalika Singh et.al. | 2402.06619 | null |
2024-02-09 | FaBERT: Pre-training BERT on Persian Blogs | Mostafa Masumi et.al. | 2402.06617 | null |
2024-02-09 | On the Out-Of-Distribution Generalization of Multimodal Large Language Models | Xingxuan Zhang et.al. | 2402.06599 | null |
2024-02-09 | CigaR: Cost-efficient Program Repair with LLMs | Dávid Hidvégi et.al. | 2402.06598 | link |
2024-02-09 | Understanding the Weakness of Large Language Model Agents within a Complex Android Environment | Mingzhe Xing et.al. | 2402.06596 | link |
2024-02-09 | Self-consistent context aware conformer transducer for speech recognition | Konstantin Kolokolov et.al. | 2402.06592 | null |
2024-02-09 | G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German | Ehsan Latif et.al. | 2402.06584 | link |
2024-02-09 | Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning | Amir Ziai et.al. | 2402.06560 | link |
2024-02-09 | The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical Model | Gregory Coppola et.al. | 2402.06557 | link |
2024-02-09 | Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Marek Šuppa et.al. | 2402.06549 | link |
2024-02-09 | Calibrating Long-form Generations from Large Language Models | Yukun Huang et.al. | 2402.06544 | link |
2024-02-09 | Introspective Planning: Guiding Language-Enabled Agents to Refine Their Own Uncertainty | Kaiqu Liang et.al. | 2402.06529 | link |
2024-02-09 | Multimodal Clinical Trial Outcome Prediction with Large Language Models | Wenhao Zheng et.al. | 2402.06512 | link |
2024-02-09 | Iris-SAM: Iris Segmentation Using a Foundational Model | Parisa Farmanifard et.al. | 2402.06497 | link |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-09 | V-STaR: Training Verifiers for Self-Taught Reasoners | Arian Hosseini et.al. | 2402.06457 | null |
2024-02-09 | StruQ: Defending Against Prompt Injection with Structured Queries | Sizhe Chen et.al. | 2402.06363 | link |
2024-02-09 | CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models | Peiyuan Gong et.al. | 2402.06360 | link |
2024-02-08 | SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Peng Gao et.al. | 2402.05935 | link |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-02-08 | WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Xing Han Lù et.al. | 2402.05930 | link |
2024-02-08 | An Interactive Agent Foundation Model | Zane Durante et.al. | 2402.05929 | null |
2024-02-08 | On the Convergence of Zeroth-Order Federated Tuning in Large Language Models | Zhenqing Ling et.al. | 2402.05926 | link |
2024-02-08 | Efficient Stagewise Pretraining via Progressive Subnetworks | Abhishek Panigrahi et.al. | 2402.05913 | null |
2024-02-08 | FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs | Eun Cheol Choi et.al. | 2402.05904 | link |
2024-02-08 | Large Language Model Meets Graph Neural Network in Knowledge Distillation | Shengxiang Hu et.al. | 2402.05894 | null |
2024-02-08 | Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking | Nikhil Sharma et.al. | 2402.05880 | null |
2024-02-08 | PromptCrypt: Prompt Encryption for Secure Communication with Large Language Models | Guo Lin et.al. | 2402.05868 | link |
2024-02-08 | How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis | Federico Bianchi et.al. | 2402.05863 | link |
2024-02-08 | Let Your Graph Do the Talking: Encoding Structured Data for LLMs | Bryan Perozzi et.al. | 2402.05862 | link |
2024-02-08 | Learning to Route Among Specialized Experts for Zero-Shot Generalization | Mohammed Muqeeth et.al. | 2402.05859 | link |
2024-02-08 | Limitations of Agents Simulated by Predictive Models | Raymond Douglas et.al. | 2402.05829 | null |
2024-02-08 | Is it Possible to Edit Large Language Models Robustly? | Xinbei Ma et.al. | 2402.05827 | link |
2024-02-08 | Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models | Lingzhi Wang et.al. | 2402.05813 | null |
2024-02-08 | Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning | Zhiheng Xi et.al. | 2402.05808 | link |
2024-02-08 | How do Transformers perform In-Context Autoregressive Learning? | Michael E. Sander et.al. | 2402.05787 | null |
2024-02-08 | Limits of Transformer Language Models on Algorithmic Learning | Jonathan Thomm et.al. | 2402.05785 | link |
2024-02-08 | Text-to-Code Generation with Modality-relative Pre-training | Fenia Christopoulou et.al. | 2402.05783 | null |
Autonomous Driving
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-10 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768 | null |
2024-12-10 | Predictive Modeling of Homeless Service Assignment: A Representation Learning Approach | Khandker Sadia Rahman et.al. | 2412.07747 | null |
2024-12-10 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-10 | Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Jonas Nüßlein et.al. | 2412.07686 | null |
2024-12-10 | Automating Business Intelligence Requirements with Generative AI and Semantic Search | Nimrod Busany et.al. | 2412.07668 | null |
2024-12-10 | Swarm Behavior Cloning | Jonas Nüßlein et.al. | 2412.07617 | null |
2024-12-10 | Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Jiaqi Fan et.al. | 2412.07518 | link |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-10 | A Robust Sustainability Assessment Methodology for Aircraft Parts: Application to a Fuselage Panel | Aikaterini A. Anagnostopoulou et.al. | 2412.07421 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-10 | Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency | Yael Itzhakev et.al. | 2412.07326 | null |
2024-12-10 | HARP: Hesitation-Aware Reframing in Transformer Inference Pass | Romain Storaï et.al. | 2412.07282 | link |
2024-12-10 | Human-Computer Interaction and Human-AI Collaboration in Advanced Air Mobility: A Comprehensive Review | Fatma Yamac Sagirli et.al. | 2412.07241 | null |
2024-12-10 | Epidemiological Model Calibration via Graybox Bayesian Optimization | Puhua Niu et.al. | 2412.07193 | null |
2024-12-10 | Effective Reward Specification in Deep Reinforcement Learning | Julien Roy et.al. | 2412.07177 | null |
2024-12-10 | Fast Occupancy Network | Mingjie Lu et.al. | 2412.07163 | null |
2024-12-09 | A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Yichen Li et.al. | 2412.07057 | null |
2024-12-09 | GenAI4UQ: A Software for Inverse Uncertainty Quantification Using Conditional Generative Models | Ming Fan et.al. | 2412.07026 | link |
2024-12-09 | Creating a Cooperative AI Policymaking Platform through Open Source Collaboration | Aiden Lewington et.al. | 2412.06936 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-09 | 3D Graph Attention Networks for High Fidelity Pediatric Glioma Segmentation | Harish Thangaraj et.al. | 2412.06743 | null |
2024-12-09 | Digital Transformation in the Water Distribution System based on the Digital Twins Concept | MohammadHossein Homaei et.al. | 2412.06694 | null |
2024-12-09 | Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone | Max Sobol Mark et.al. | 2412.06685 | null |
2024-12-09 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | Generalized Design of Basket Trials with P-value Combination Test | Heng Zhou et.al. | 2412.06622 | null |
2024-12-09 | Prediction of Occluded Pedestrians in Road Scenes using Human-like Reasoning: Insights from the OccluRoads Dataset | Melo Castillo Angie Nataly et.al. | 2412.06549 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2024-12-09 | Towards Civic Digital Twins: Co-Design the Citizen-Centric Future of Bologna | Massimiliano Luca et.al. | 2412.06328 | null |
2024-12-09 | World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Mingliang Zhai et.al. | 2412.06324 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Towards a Comprehensive Framework for Cyber-Incident Response Decision Support in Smart Grids | Omer Sen et.al. | 2412.06254 | null |
2024-12-09 | LLMs as Debate Partners: Utilizing Genetic Algorithms and Adversarial Search for Adaptive Arguments | Prakash Aryan et.al. | 2412.06229 | null |
2024-12-09 | Discrete-Time Distribution Steering using Monte Carlo Tree Search | Alexandros E. Tzikas et.al. | 2412.06220 | null |
2024-12-09 | Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Fei Yu et.al. | 2412.06208 | null |
2024-12-09 | Conservative Contextual Bandits: Beyond Linear Representations | Rohan Deb et.al. | 2412.06165 | null |
2024-12-09 | AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | Zonglin Meng et.al. | 2412.06142 | null |
2024-12-09 | HSDA: High-frequency Shuffle Data Augmentation for Bird’s-Eye-View Map Segmentation | Calvin Glisson et.al. | 2412.06127 | link |
2024-12-08 | Multifidelity Uncertainty Quantification for Ice Sheet Simulations | Nicole Aretz et.al. | 2412.06110 | null |
2024-12-06 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-06 | Reinforcement Learning: An Overview | Kevin Murphy et.al. | 2412.05265 | null |
2024-12-06 | Uncertainty Quantification for Transformer Models for Dark-Pattern Detection | Javier Muñoz et.al. | 2412.05251 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-06 | Explingo: Explaining AI Predictions using Large Language Models | Alexandra Zytek et.al. | 2412.05145 | null |
2024-12-06 | A Parametric, Second-Order Cone Representable Model of Fairness for Decision-Making Problems | Kaarthik Sundar et.al. | 2412.05143 | null |
2024-12-06 | Constructing optimal treatment length strategies to maximize quality-adjusted lifetimes | Hao Sun et.al. | 2412.05108 | null |
2024-12-06 | Integrating Semantic Communication and Human Decision-Making into an End-to-End Sensing-Decision Framework | Edgar Beck et.al. | 2412.05103 | null |
2024-12-06 | Backdooring Outlier Detection Methods: A Novel Attack Approach | ZeinabSadat Taghavi et.al. | 2412.05010 | null |
2024-12-06 | Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Ryota Nonomura et.al. | 2412.04937 | null |
2024-12-06 | Nonmyopic Global Optimisation via Approximate Dynamic Programming | Filippo Airaldi et.al. | 2412.04882 | null |
2024-12-06 | Self-Organizing Complex Networks with AI-Driven Adaptive Nodes for Optimized Connectivity and Energy Efficiency | Azra Seyyedi et.al. | 2412.04874 | null |
2024-12-06 | Using Machine Learning to Discover Parsimonious and Physically-Interpretable Representations of Catchment-Scale Rainfall-Runoff Dynamics | Yuan-Heng Wang et.al. | 2412.04845 | null |
2024-12-06 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | null |
2024-12-06 | Automatic Prediction of Stroke Treatment Outcomes: Latest Advances and Perspectives | Zeynel A. Samak et.al. | 2412.04812 | null |
2024-12-06 | Question Answering for Decisionmaking in Green Building Design: A Multimodal Data Reasoning Method Driven by Large Language Models | Yihui Li et.al. | 2412.04741 | null |
2024-12-05 | Multiclass Post-Earthquake Building Assessment Integrating Optical and SAR Satellite Imagery, Ground Motion, and Soil Data with Transformers | Deepank Singh et.al. | 2412.04664 | null |
2024-12-05 | Fairness-aware Principal Component Analysis for Mortality Forecasting and Annuity Pricing | Fei Huang et.al. | 2412.04663 | null |
2024-12-05 | Game-Theoretic Foundations for Cyber Resilience Against Deceptive Information Attacks in Intelligent Transportation Systems | Ya-Ting Yang et.al. | 2412.04627 | null |
2024-12-05 | Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Yuanhui Huang et.al. | 2412.04384 | link |
2024-12-05 | Sensor-Driven Predictive Vehicle Maintenance and Routing Problem with Time Windows | Iman Kazemian et.al. | 2412.04350 | null |
2024-12-05 | Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles | Ke Sun et.al. | 2412.04341 | null |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | YOLO-CCA: A Context-Based Approach for Traffic Sign Detection | Linfeng Jiang et.al. | 2412.04289 | link |
2024-12-05 | Deep Causal Inference for Point-referenced Spatial Data with Continuous Treatments | Ziyang Jiang et.al. | 2412.04285 | link |
2024-12-05 | On Extrapolation of Treatment Effects in Multiple-Cutoff Regression Discontinuity Designs | Yuta Okamoto et.al. | 2412.04265 | null |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-05 | Towards Comprehensive Legislative Requirements for Cyber Physical Systems Testing in the European Union | Guillaume Nguyen et.al. | 2412.04132 | null |
2024-12-05 | Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Xiaowen Ye et.al. | 2412.04074 | null |
2024-12-05 | AI4EF: Artificial Intelligence for Energy Efficiency in the Building Sector | Alexandros Menelaos Tzortzis et.al. | 2412.04045 | null |
2024-12-05 | Considerations Influencing Offense-Defense Dynamics From Artificial Intelligence | Giulio Corsi et.al. | 2412.04029 | null |
2024-12-05 | A Model of the Sidewalk Salsa | Olger Siebinga et.al. | 2412.04023 | null |
2024-12-05 | Computing diverse pair of solutions for tractable SAT | Tatsuya Gima et.al. | 2412.04016 | null |
2024-12-05 | Quality Control in Open-Ended Crowdsourcing: A Survey | Lei Chai et.al. | 2412.03991 | null |
2024-12-05 | UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time | Lars Schmarje et.al. | 2412.03986 | null |
2024-12-05 | Electronic Health Records-Based Data-Driven Diabetes Knowledge Unveiling and Risk Prognosis | Huadong Pang et.al. | 2412.03961 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-05 | A Unified Framework for Evaluating the Effectiveness and Enhancing the Transparency of Explainable AI Methods in Real-World Applications | Md. Ariful Islam et.al. | 2412.03884 | null |
2024-12-05 | Learning Based MPC for Autonomous Driving Using a Low Dimensional Residual Model | Yaoyu Li et.al. | 2412.03874 | null |
2024-12-04 | Streaming Detection of Queried Event Start | Cristobal Eyzaguirre et.al. | 2412.03567 | link |
2024-12-04 | FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes | Lue Fan et.al. | 2412.03566 | null |
2024-12-04 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | From Words to Workflows: Automating Business Processes | Laura Minkova et.al. | 2412.03446 | null |
2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
2024-12-04 | Governance as a complex, networked, democratic, satisfiability problem | Laurent Hébert-Dufresne et.al. | 2412.03421 | null |
2024-12-04 | Learning Semantic Association Rules from Internet of Things Data | Erkan Karabulut et.al. | 2412.03417 | link |
2024-12-04 | Risk-aware Classification via Uncertainty Quantification | Murat Sensoy et.al. | 2412.03391 | null |
2024-12-04 | AI-Driven Day-to-Day Route Choice | Leizhen Wang et.al. | 2412.03338 | null |
2024-12-04 | Are Explanations Helpful? A Comparative Analysis of Explainability Methods in Skin Lesion Classifiers | Rosa Y. G. Paccotacya-Yanque et.al. | 2412.03166 | link |
2024-12-04 | LLM-Twin: A Generated-Persona Approach for Survey Pre-Testing | Sunwoong Kim et.al. | 2412.03162 | null |
2024-12-04 | LEP-QNN: Loan Eligibility Prediction Using Quantum Neural Networks | Nouhaila Innan et.al. | 2412.03158 | null |
2024-12-04 | Hybrid deep learning-based strategy for the hepatocellular carcinoma cancer grade classification of H&E stained liver histopathology images | Ajinkya Deshpande et.al. | 2412.03084 | null |
2024-12-04 | Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Francesc Wilhelmi et.al. | 2412.03076 | null |
2024-12-04 | A Survey of Wireless Sensing Security from a Role-Based View: Victim, Weapon, and Shield | Ruixu Geng et.al. | 2412.03064 | link |
2024-12-04 | Lightweight Stochastic Video Prediction via Hybrid Warping | Kazuki Kotoyori et.al. | 2412.03061 | null |
2024-12-04 | Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies | Junchao Fan et.al. | 2412.03051 | null |
2024-12-04 | Data Acquisition for Improving Model Fairness using Reinforcement Learning | Jahid Hasan et.al. | 2412.03009 | null |
2024-12-04 | Data-driven Koopman Operator-based Prediction and Control Using Model Averaging | Daisuke Uchida et.al. | 2412.02984 | null |
2024-12-03 | Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving | Yupeng Zheng et.al. | 2412.02689 | null |
2024-12-03 | Wasserstein Markets for Differentially-Private Data | Saurab Chhachhi et.al. | 2412.02609 | link |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | link |
2024-12-03 | Semantic Tokens in Retrieval Augmented Generation | Joel Suro et.al. | 2412.02563 | null |
2024-12-03 | Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Sebastian Hirt et.al. | 2412.02423 | null |
2024-12-03 | OMENN: One Matrix to Explain Neural Networks | Adam Wróbel et.al. | 2412.02399 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Social patch foraging theory in an egalitarian group | Lisa Blum Moyse et.al. | 2412.02381 | null |
2024-12-03 | Use of surrogate endpoints in health technology assessment: a review of selected NICE technology appraisals in oncology | Lorna Wheaton et.al. | 2412.02380 | null |
2024-12-03 | Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions | Eerik Alamikkotervo et.al. | 2412.02370 | link |
2024-12-03 | Step-by-Step Guidance to Differential Anemia Diagnosis with Real-World Data and Deep Reinforcement Learning | Lillian Muyama et.al. | 2412.02273 | link |
2024-12-03 | Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Maximilian Schenke et.al. | 2412.02264 | null |
2024-12-03 | Selective Reviews of Bandit Problems in AI via a Statistical View | Pengjie Zhou et.al. | 2412.02251 | null |
2024-12-03 | An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction | Yaxin Liang et.al. | 2412.02211 | null |
2024-12-03 | DataLab: A Unifed Platform for LLM-Powered Business Intelligence | Luoxuan Weng et.al. | 2412.02205 | null |
2024-12-03 | Self-Supervised Learning-Based Path Planning and Obstacle Avoidance Using PPO and B-Splines in Unknown Environments | Shahab Shokouhi et.al. | 2412.02176 | null |
2024-12-03 | Underload: Defending against Latency Attacks for Object Detectors on Edge Devices | Tianyi Wang et.al. | 2412.02171 | null |
2024-12-03 | CausalMob: Causal Human Mobility Prediction with LLMs-derived Human Intentions toward Public Events | Xiaojie Yang et.al. | 2412.02155 | link |
2024-12-03 | Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals | Harrison Delecki et.al. | 2412.02154 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | Collective decision-making with heterogeneous biases: Role of network topology and susceptibility | Yunus Sevinchan et.al. | 2411.19829 | null |
2024-11-29 | A Multi-Loss Strategy for Vehicle Trajectory Prediction: Combining Off-Road, Diversity, and Directional Consistency Losses | Ahmad Rahimi et.al. | 2411.19747 | link |
2024-11-29 | Graph Neural Networks for Heart Failure Prediction on an EHR-Based Patient Similarity Graph | Heloisa Oss Boll et.al. | 2411.19742 | link |
2024-11-29 | The Streetscape Application Services Stack (SASS): Towards a Distributed Sensing Architecture for Urban Applications | Navid Salami Pargoo et.al. | 2411.19714 | null |
2024-11-29 | RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents | Shi Zifeng et.al. | 2411.19639 | null |
2024-11-29 | AdvFuzz: Finding More Violations Caused by the EGO Vehicle in Simulation Testing by Adversarial NPC Vehicles | You Lu et.al. | 2411.19567 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation | Yang Lv et.al. | 2411.19526 | null |
2024-11-29 | Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Tian Yu et.al. | 2411.19443 | link |
2024-11-28 | Mapping Public Perception of Artificial Intelligence: Expectations, Risk-Benefit Tradeoffs, and Value As Determinants for Societal Acceptance | Philipp Brauner et.al. | 2411.19356 | null |
2024-11-28 | UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation | Yichong Lu et.al. | 2411.19292 | null |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning | Jianming Pan et.al. | 2411.19285 | null |
2024-11-28 | On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.19274 | null |
2024-11-28 | Contrastive representations of high-dimensional, structured treatments | Oriol Corcoll Andreu et.al. | 2411.19245 | null |
2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
2024-11-28 | Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints | Pekka Malo et.al. | 2411.19193 | null |
2024-11-28 | Per-event Uncertainty Quantification for Flow Cytometry using Calibration Beads | Prajakta Bedekar et.al. | 2411.19191 | null |
2024-11-27 | Collective decision making by embodied neural agents | Nicolas Coucke et.al. | 2411.18498 | null |
2024-11-27 | Bhirkuti’s Test of Bias Acceptance: Examining in Psychometric Simulations | Aneel Bhusal et.al. | 2411.18481 | null |
2024-11-27 | An End-to-End Smart Predict-then-Optimize Framework for Vehicle Relocation Problems in Large-Scale Vehicle Crowd Sensing | Xinyu Wang et.al. | 2411.18432 | null |
2024-11-27 | Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields | Leonhard Rist et.al. | 2411.18415 | null |
2024-11-27 | Two-Timescale Digital Twin Assisted Model Interference and Retraining over Wireless Network | Jiayi Cong et.al. | 2411.18329 | null |
2024-11-27 | Learning optimal objective values for MILP | Lara Scavuzzo et.al. | 2411.18321 | link |
2024-11-27 | MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement | Xiwei Deng et.al. | 2411.18309 | null |
2024-11-27 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Visual Adversarial Attack on Vision-Language Models for Autonomous Driving | Tianyuan Zhang et.al. | 2411.18275 | null |
2024-11-27 | Dynamic Retail Pricing via Q-Learning – A Reinforcement Learning Framework for Enhanced Revenue Management | Mohit Apte et.al. | 2411.18261 | null |
2024-11-27 | From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Zizhao Li et.al. | 2411.18207 | link |
2024-11-27 | Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning | Di Zhang et.al. | 2411.18203 | null |
2024-11-27 | SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment | Jie Wang et.al. | 2411.18162 | null |
2024-11-27 | Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models | Jingming Liu et.al. | 2411.18142 | null |
2024-11-27 | Edge-Assisted Accelerated Cooperative Sensing for CAVs: Task Placement and Resource Allocation | Yuxuan Wang et.al. | 2411.18129 | null |
2024-11-27 | A Machine Learning-based Framework towards Assessment of Decision-Makers’ Biases | Wanxue Dong et.al. | 2411.18122 | null |
2024-11-27 | Large Scale Evaluation of Deep Learning-based Explainable Solar Flare Forecasting Models with Attribution-based Proximity Analysis | Temitope Adeyeha et.al. | 2411.18070 | null |
2024-11-27 | Heterogeneous Relationships of Subjects and Shapelets for Semi-supervised Multivariate Series Classification | Mingsen Du et.al. | 2411.18043 | null |
2024-11-27 | FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2411.18013 | null |
2024-11-26 | Stealthy Multi-Task Adversarial Attacks | Jiacheng Guo et.al. | 2411.17936 | null |
2024-11-26 | Explainable AI for Classifying UTI Risk Groups Using a Real-World Linked EHR and Pathology Lab Dataset | Yujie Dai et.al. | 2411.17645 | null |
2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | null |
2024-11-26 | Belief patterns with information processing | Federico Vaccari et.al. | 2411.17597 | null |
2024-11-26 | What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics | Jordan J. Bird et.al. | 2411.17593 | null |
2024-11-26 | Decision making in stochastic extensive form II: Stochastic extensive forms and games | E. Emanuel Rapsch et.al. | 2411.17587 | null |
2024-11-26 | Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence | Ross O’Driscoll et.al. | 2411.17585 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-11-26 | AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments | Haitham S. Al-Sinani et.al. | 2411.17539 | null |
2024-11-26 | HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17530 | null |
2024-11-26 | Confidence-Aware Deep Learning for Load Plan Adjustments in the Parcel Service Industry | Thomas Bruys et.al. | 2411.17502 | null |
2024-11-26 | A Graph Neural Network deep-dive into successful counterattacks | Joris Bekkers et.al. | 2411.17450 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | LHPF: Look back the History and Plan for the Future in Autonomous Driving | Sheng Wang et.al. | 2411.17253 | null |
2024-11-26 | DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance | Shahriar Soudeep et.al. | 2411.17251 | null |
2024-11-26 | Fault Localization from the Semantic Code Search Perspective | Yihao Qin et.al. | 2411.17230 | null |
2024-11-26 | Interval-based validation of a nonlinear estimator | Maël Godard et.al. | 2411.17215 | null |
2024-11-26 | Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors | Peijin Jia et.al. | 2411.17161 | null |
2024-11-26 | Fast, Precise Thompson Sampling for Bayesian Optimization | David Sweet et.al. | 2411.17071 | null |
2024-11-26 | Conformalised Conditional Normalising Flows for Joint Prediction Regions in time series | Eshant English et.al. | 2411.17042 | null |
2024-11-25 | Explainable AI Approach using Near Misses Analysis | Eran Kaufman et.al. | 2411.16895 | null |
2024-11-25 | Winning opinion: Following Your Friends’ Advice or That of Their Friends? | Francisco J. Muñoz et.al. | 2411.16671 | null |
2024-11-25 | CatNet: Effective FDR Control in LSTM with Gaussian Mirrors and SHAP Feature Importance | Jiaan Han et.al. | 2411.16666 | null |
2024-11-25 | Large Language Model-based Decision-making for COLREGs and the Control of Autonomous Surface Vehicles | Klinsmann Agyei et.al. | 2411.16587 | null |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Responsible forecasting: identifying and typifying forecasting harms | Bahman Rostami-Tabar et.al. | 2411.16531 | null |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | null |
2024-11-25 | A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation | M. M. A. Valiuddin et.al. | 2411.16370 | null |
2024-11-25 | Monocular Lane Detection Based on Deep Learning: A Survey | Xin He et.al. | 2411.16316 | link |
2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | link |
2024-11-25 | FinML-Chain: A Blockchain-Integrated Dataset for Enhanced Financial Machine Learning | Jingfeng Chen et.al. | 2411.16277 | null |
2024-11-25 | Efficient pooling of predictions via kernel embeddings | Sam Allen et.al. | 2411.16246 | null |
2024-11-25 | Interpreting Object-level Foundation Models via Visual Precision Search | Ruoyu Chen et.al. | 2411.16198 | null |
2024-11-25 | The Critical Canvas–How to regain information autonomy in the AI era | Dong Chen et.al. | 2411.16193 | null |
2024-11-25 | Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks | Zhuoyuan Yu et.al. | 2411.16134 | link |
2024-11-25 | End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning | Mahmoud M. Kishky et.al. | 2411.16131 | null |
2024-11-25 | Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion | Jongseong Bae et.al. | 2411.16129 | null |
2024-11-25 | Ensemble Learning via Knowledge Transfer for CTR Prediction | Honghao Li et.al. | 2411.16122 | link |
2024-11-25 | DP-CDA: An Algorithm for Enhanced Privacy Preservation in Dataset Synthesis Through Randomized Mixing | Utsab Saha et.al. | 2411.16121 | null |
2024-11-25 | Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks | Rui Zuo et.al. | 2411.16120 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-22 | Enhancing Autonomous Driving Safety through World Model-Based Predictive Navigation and Adaptive Learning Algorithms for 5G Wireless Applications | Hong Ding et.al. | 2411.15042 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | Optimization Strategies for Parallel Computation of Skylines | Paolo Ciaccia et.al. | 2411.14968 | null |
2024-11-22 | LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation | Zhenwei Yang et.al. | 2411.14927 | null |
2024-11-22 | Exploring Kolmogorov-Arnold Networks for Interpretable Time Series Classification | Irina Barašin et.al. | 2411.14904 | link |
2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | null |
2024-11-22 | Jovis: A Visualization Tool for PostgreSQL Query Optimizer | Yoojin Choi et.al. | 2411.14788 | null |
2024-11-22 | Resolution-Agnostic Transformer-based Climate Downscaling | Declan Curran et.al. | 2411.14774 | null |
2024-11-22 | TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior | Sen Yang et.al. | 2411.14751 | null |
2024-11-22 | Universal and Context-Independent Triggers for Precise Control of LLM Outputs | Jiashuo Liang et.al. | 2411.14738 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | Enhancing GeoAI and location encoding with spatial point pattern statistics: A Case Study of Terrain Feature Classification | Sizhe Wang et.al. | 2411.14560 | null |
2024-11-21 | Combining missing data imputation and internal validation in clinical risk prediction models | Junhui Mi et.al. | 2411.14542 | link |
2024-11-21 | GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Tianbin Li et.al. | 2411.14522 | link |
2024-11-21 | Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio et.al. | 2411.14520 | null |
2024-11-21 | Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think! | Rong Gu et.al. | 2411.14375 | null |
2024-11-21 | Formal Simulation and Visualisation of Hybrid Programs | Pedro Mendes et.al. | 2411.14365 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI | Natenaile Asmamaw Shiferaw et.al. | 2411.14254 | link |
2024-11-21 | Natural Language Reinforcement Learning | Xidong Feng et.al. | 2411.14251 | null |
2024-11-21 | Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data | Paul Fergus et.al. | 2411.14219 | null |
2024-11-21 | Grand Challenges in the Verification of Autonomous Systems | Kevin Leahy et.al. | 2411.14155 | null |
2024-11-21 | Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling | Daehoon Gwak et.al. | 2411.14042 | null |
2024-11-21 | Dual-Arm Telerobotic Platform for Robotic Hotbox Operations for Nuclear Waste Disposition in EM Sites | Joong-Ku Lee et.al. | 2411.13994 | null |
2024-11-21 | Market Making without Regret | Nicolò Cesa-Bianchi et.al. | 2411.13993 | null |
2024-11-21 | FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai et.al. | 2411.13979 | link |
2024-11-21 | Breadboarding the European Moon Rover System: discussion and results of the analogue field test campaign | Cristina Luna et.al. | 2411.13978 | null |
2024-11-21 | ICODE: Modeling Dynamical Systems with Extrinsic Input Information | Zhaoyi Li et.al. | 2411.13914 | null |
2024-11-21 | Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning | Song Jiang et.al. | 2411.13904 | null |
2024-11-21 | Trajectory Tracking Using Frenet Coordinates with Deep Deterministic Policy Gradient | Tongzhou Jiang et.al. | 2411.13885 | null |
2024-11-21 | Interactive and Expressive Code-Augmented Planning with Large Language Models | Anthony Z. Liu et.al. | 2411.13826 | null |
2024-11-21 | Dynamic spatial interaction models for a leader’s resource allocation and followers’ multiple activities | Hanbat Jeong et.al. | 2411.13810 | null |
2024-11-21 | MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Ruiyuan Gao et.al. | 2411.13807 | null |
2024-11-21 | A Survey on Adversarial Robustness of LiDAR-based Machine Learning Perception in Autonomous Vehicles | Junae Kim et.al. | 2411.13778 | null |
2024-11-20 | Exploring Large Language Models for Climate Forecasting | Yang Wang et.al. | 2411.13724 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models | Chanseo Lee et.al. | 2411.13518 | null |
2024-11-20 | Disentangling Memory and Reasoning Ability in Large Language Models | Mingyu Jin et.al. | 2411.13504 | link |
2024-11-20 | Neural machine translation of seismic waves for petrophysical inversion | José Cunha Teixeira et.al. | 2411.13491 | null |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes | Muqsit Azeem et.al. | 2411.13365 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | A Resource Efficient Fusion Network for Object Detection in Bird’s-Eye View using Camera and Raw Radar Data | Kavin Chandrasekaran et.al. | 2411.13311 | link |
2024-11-20 | A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) | Antonino Visalli et.al. | 2411.13203 | link |
2024-11-20 | Guided Object-Oriented Development | Harrie Passier et.al. | 2411.13200 | null |
2024-11-20 | Quantitative Fairness – A Framework For The Design Of Equitable Cybernetic Societies | Kevin Riehl et.al. | 2411.13184 | null |
2024-11-20 | YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Thomas Pöllabauer et.al. | 2411.13149 | link |
2024-11-20 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Zhi Luo et.al. | 2411.13116 | null |
2024-11-20 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving | Hao Zhou et.al. | 2411.13076 | null |
2024-11-20 | MEGL: Multimodal Explanation-Guided Learning | Yifei Zhang et.al. | 2411.13053 | null |
2024-11-20 | Study of Group III-V Waveguides on Sapphire Platform for Photonic Integrated Circuits | Manoj Kumar Shah et.al. | 2411.13035 | null |
2024-11-20 | Hierarchical Diffusion Policy: manipulation trajectory generation via contact guidance | Dexin Wang et.al. | 2411.12982 | link |
2024-11-20 | LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement | Siwen Jiao et.al. | 2411.12980 | null |
2024-11-19 | Dimensions of Generative AI Evaluation Design | P. Alex Dow et.al. | 2411.12709 | null |
2024-11-19 | OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data | Yiwen Lu et.al. | 2411.12674 | null |
2024-11-19 | Smart Predict-then-Optimize Method with Dependent Data: Risk Bounds and Calibration of Autoregression | Jixian Liu et.al. | 2411.12653 | null |
2024-11-19 | DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Vinay Kumar Sankarapu et.al. | 2411.12643 | link |
2024-11-19 | M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction | Luoxi Zhang et.al. | 2411.12635 | link |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-19 | Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph | Ziyang Chen et.al. | 2411.12426 | link |
2024-11-19 | A general modeling and simulation framework for dynamic vehicle routing | Markó Horváth et.al. | 2411.12406 | link |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | Could Humans Outshine AI in Visual Data Analysis? | Ratanond Koonchanok et.al. | 2411.12299 | null |
2024-11-19 | A Survey of Medical Vision-and-Language Applications and Their Techniques | Qi Chen et.al. | 2411.12195 | link |
2024-11-19 | Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines | Siyu Wang et.al. | 2411.12183 | link |
2024-11-19 | Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation | Zhuangwei Zhuang et.al. | 2411.12177 | link |
2024-11-19 | SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks | Yongyan Wen et.al. | 2411.12173 | null |
2024-11-18 | Coverage-Constrained Human-AI Cooperation with Multiple Experts | Zheng Zhang et.al. | 2411.11976 | null |
2024-11-19 | Generative World Explorer | Taiming Lu et.al. | 2411.11844 | null |
2024-11-18 | Exploring the Requirements of Clinicians for Explainable AI Decision Support Systems in Intensive Care | Jeffrey N. Clark et.al. | 2411.11774 | null |
2024-11-18 | Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang et.al. | 2411.11697 | null |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | On the Incorporation of Stability Constraints into Sequential Operational Scheduling | Wangkun Xu et.al. | 2411.11652 | null |
2024-11-18 | ST-Tree with Interpretability for Multivariate Time Series Classification | Mingsen Du et.al. | 2411.11620 | link |
2024-11-18 | VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation | Bangguo Yu et.al. | 2411.11609 | null |
2024-11-18 | Transformer networks for Heavy flavor jet tagging | A. Hammad et.al. | 2411.11519 | null |
2024-11-18 | Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Théophile Champion et.al. | 2411.11511 | null |
2024-11-18 | SignEye: Traffic Sign Interpretation from Vehicle First-Person View | Chuang Yang et.al. | 2411.11507 | null |
2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet | Marnix Suilen et.al. | 2411.11451 | null |
2024-11-18 | Deliberative XAI: How Explanations Impact Understanding and Decision-Making of AI Novices in Collective and Individual Settings | Timothée Schmude et.al. | 2411.11449 | null |
2024-11-18 | Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing | Navita Goyal et.al. | 2411.11437 | null |
2024-11-18 | Cross-Patient Pseudo Bags Generation and Curriculum Contrastive Learning for Imbalanced Multiclassification of Whole Slide Image | Yonghuang Wu et.al. | 2411.11262 | null |
2024-11-18 | DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Tianyi Yan et.al. | 2411.11252 | null |
2024-11-17 | DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images | Zhen Yuan et.al. | 2411.11190 | null |
2024-11-17 | Integrated Ising Model with global inhibition for decision making | Olga Tapinova et.al. | 2411.11143 | null |
2024-11-17 | Financial News-Driven LLM Reinforcement Learning for Portfolio Management | Ananya Unnikrishnan et.al. | 2411.11059 | null |
2024-11-15 | Emotion Detection in Reddit: Comparative Study of Machine Learning and Deep Learning Techniques | Maliheh Alaeddini et.al. | 2411.10328 | null |
2024-11-15 | Moving Forward: A Review of Autonomous Driving Software and Hardware Systems | Xu Wang et.al. | 2411.10291 | null |
2024-11-15 | From Score-Driven to Value-Sharing: Understanding Chinese Family Use of AI to Support Decision Making of College Applications | Si Chen et.al. | 2411.10280 | null |
2024-11-15 | Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Hossein Hassani et.al. | 2411.10268 | null |
2024-11-15 | Artificial Intelligence in Pediatric Echocardiography: Exploring Challenges, Opportunities, and Clinical Applications with Explainable AI and Federated Learning | Mohammed Yaseen Jabarulla et.al. | 2411.10255 | null |
2024-11-15 | Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach | Abdullah Abdullah et.al. | 2411.10254 | null |
2024-11-15 | Learning Generalizable 3D Manipulation With 10 Demonstrations | Yu Ren et.al. | 2411.10203 | link |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks | Marco Matarese et.al. | 2411.10176 | null |
2024-11-15 | Imagine-2-Drive: High-Fidelity World Modeling in CARLA for Autonomous Vehicles | Anant Garg et.al. | 2411.10171 | null |
2024-11-15 | Better Safe Than Sorry: Enhancing Arbitration Graphs for Safe and Robust Autonomous Decision-Making | Piotr Spieker et.al. | 2411.10170 | link |
2024-11-15 | Adapting the Biological SSVEP Response to Artificial Neural Networks | Emirhan Böge et.al. | 2411.10084 | null |
2024-11-15 | Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Shota Yamazaki et.al. | 2411.09971 | null |
2024-11-15 | Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving | Tian Niu et.al. | 2411.09887 | null |
2024-11-15 | Fair Secretaries with Unfair Predictions | Eric Balkanski et.al. | 2411.09854 | null |
2024-11-14 | Robustness Assessment of Static Structures for Efficient Object Handling | Philippe Nadeau et.al. | 2411.09810 | null |
2024-11-14 | Fair Resource Allocation in Weakly Coupled Markov Decision Processes | Xiaohui Tu et.al. | 2411.09804 | null |
2024-11-14 | Modular Fault Diagnosis Framework for Complex Autonomous Driving Systems | Stefan Orf et.al. | 2411.09643 | null |
2024-11-14 | The Moral Foundations Weibo Corpus | Renjie Cao et.al. | 2411.09612 | null |
2024-11-14 | Expert Study on Interpretable Machine Learning Models with Missing Data | Lena Stempfle et.al. | 2411.09591 | null |
2024-11-14 | An Approach to Twinning and Mining Collaborative Network of Construction Projects | Jia-Rui Lin et.al. | 2411.09486 | null |
2024-11-14 | Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches | Carlos J. Costa et.al. | 2411.09313 | null |
2024-11-14 | LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Zhenshi Li et.al. | 2411.09301 | link |
2024-11-14 | SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI | Spencer Giddens et.al. | 2411.09178 | link |
2024-11-14 | Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging | Bo Wang et.al. | 2411.09176 | null |
2024-11-13 | A probabilistic reduced-order modeling framework for patient-specific cardio-mechanical analysis | Robin Willems et.al. | 2411.08822 | null |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Yifei Jin et.al. | 2411.08767 | null |
2024-11-13 | Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces | Arabinda Ghosh et.al. | 2411.08754 | null |
2024-11-13 | Polymetis:Large Language Modeling for Multiple Material Domains | Chao Huang et.al. | 2411.08728 | null |
2024-11-13 | High-resolution optical and acoustic remote sensing datasets of the Puck Lagoon, Southern Baltic | Łukasz Janowski et.al. | 2411.08712 | null |
2024-11-13 | TRACE: Transformer-based Risk Assessment for Clinical Evaluation | Dionysis Christopoulos et.al. | 2411.08701 | null |
2024-11-13 | UniMat: Unifying Materials Embeddings through Multi-modal Learning | Janghoon Ock et.al. | 2411.08664 | null |
2024-11-13 | Robot See, Robot Do: Imitation Reward for Noisy Financial Environments | Sven Goluža et.al. | 2411.08637 | null |
2024-11-13 | Zero-shot capability of SAM-family models for bone segmentation in CT scans | Caroline Magg et.al. | 2411.08629 | null |
2024-11-13 | An Empirical Examination of the Evaluative AI Framework | Jaroslaw Kornowicz et.al. | 2411.08583 | null |
2024-11-13 | TimeLess: A Vision for the Next Generation of Software Development | Zeeshan Rasheed et.al. | 2411.08507 | null |
2024-11-13 | Towards Objective and Unbiased Decision Assessments with LLM-Enhanced Hierarchical Attention Networks | Junhua Liu et.al. | 2411.08504 | link |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | Learning Dynamic Cognitive Map with Autonomous Navigation | Daria de Tinguy et.al. | 2411.08447 | null |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-13 | Hybrid Vector Auto Regression and Neural Network Model for Order Flow Imbalance Prediction in High Frequency Trading | Abdul Rahman et.al. | 2411.08382 | link |
2024-11-13 | A Fuzzy Reinforcement LSTM-based Long-term Prediction Model for Fault Conditions in Nuclear Power Plants | Siwei Li et.al. | 2411.08370 | null |
2024-11-13 | How Transit Countries Become Refugee Destinations: Insights from Central and Eastern Europe | Liliana Harding et.al. | 2411.08350 | null |
2024-11-13 | TowerDebias: A Novel Debiasing Method based on the Tower Property | Norman Matloff et.al. | 2411.08297 | null |
2024-11-12 | Investigating the Effectiveness of Explainability Methods in Parkinson’s Detection from Speech | Eleonora Mancini et.al. | 2411.08013 | null |
2024-11-12 | Optimal Control of Mechanical Ventilators with Learned Respiratory Dynamics | Isaac Ronald Ward et.al. | 2411.07971 | link |
2024-11-12 | Learning Memory Mechanisms for Decision Making through Demonstrations | William Yue et.al. | 2411.07954 | link |
2024-11-12 | CryptoLLM: Unleashing the Power of Prompted LLMs for SmartQnA and Classification of Crypto Posts | Aniket Deroy et.al. | 2411.07917 | null |
2024-11-12 | Evidential time-to-event prediction model with well-calibrated uncertainty estimation | Ling Huang et.al. | 2411.07853 | null |
2024-11-12 | Impact of R&D and AI Investments on Economic Growth and Credit Rating | Davit Gondauri et.al. | 2411.07817 | null |
2024-11-12 | PatchCTG: Patch Cardiotocography Transformer for Antepartum Fetal Health Monitoring | M. Jaleed Khan et.al. | 2411.07796 | link |
2024-11-12 | ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction | Dubing Chen et.al. | 2411.07725 | link |
2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-12 | xCG: Explainable Cell Graphs for Survival Prediction in Non-Small Cell Lung Cancer | Marvin Sextro et.al. | 2411.07643 | link |
2024-11-12 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-12 | Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Raed Al Kontar et.al. | 2411.07523 | null |
2024-11-11 | Towards a criteria-based approach to selecting human-AI interaction mode | Jessica Irons et.al. | 2411.07406 | null |
2024-11-11 | Advancements in Constitutive Model Calibration: Leveraging the Power of Full-Field DIC Measurements and In-Situ Load Path Selection for Reliable Parameter Inference | Denielle Ricciardi et.al. | 2411.07310 | null |
2024-11-11 | RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration | Young-Min Cho et.al. | 2411.07161 | null |
2024-11-12 | OCMDP: Observation-Constrained Markov Decision Process | Taiyi Wang et.al. | 2411.07087 | null |
2024-11-11 | HeteroSample: Meta-path Guided Sampling for Heterogeneous Graph Representation Learning | Ao Liu et.al. | 2411.07022 | null |
2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | null |
2024-11-11 | Cancer-Answer: Empowering Cancer Care with Advanced Large Language Models | Aniket Deroy et.al. | 2411.06946 | null |
2024-11-11 | Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks | Guilherme Ramos et.al. | 2411.06880 | link |
2024-11-11 | Classification of residential and non-residential buildings based on satellite data using deep learning | Jai G Singla et.al. | 2411.06879 | null |
2024-11-11 | Multi-Modal interpretable automatic video captioning | Antoine Hanna-Asaad et.al. | 2411.06872 | null |
2024-11-11 | Learning Interpretable Network Dynamics via Universal Neural Symbolic Regression | Jiao Hu et.al. | 2411.06833 | null |
2024-11-11 | Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Aditya Soni et.al. | 2411.06815 | null |
2024-11-11 | AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Yujia Zhou et.al. | 2411.06805 | link |
2024-11-11 | Large-scale moral machine experiment on large language models | Muhammad Shahrul Zaim bin Ahmad et.al. | 2411.06790 | link |
2024-11-11 | Model Partition and Resource Allocation for Split Learning in Vehicular Edge Networks | Lu Yu et.al. | 2411.06773 | null |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-10 | SequentialSamplingModels.jl: Simulating and Evaluating Cognitive Models of Response Times in Julia | Kianté Fernandez et.al. | 2411.06631 | null |
2024-11-10 | Towards Graph Neural Network Surrogates Leveraging Mechanistic Expert Knowledge for Pandemic Response | Agatha Schmidt et.al. | 2411.06500 | null |
2024-11-10 | ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? | Canyu Chen et.al. | 2411.06469 | null |
2024-11-10 | Mastering NIM and Impartial Games with Weak Neural Networks: An AlphaZero-inspired Multi-Frame Approach | Søren Riis et.al. | 2411.06403 | null |
2024-11-10 | Local vs. Global Models for Hierarchical Forecasting | Zhao Yingjie et.al. | 2411.06394 | null |
2024-11-10 | Regret Minimization and Statistical Inference in Online Decision Making with High-dimensional Covariates | Congyuan Duan et.al. | 2411.06329 | null |
2024-11-08 | GazeSearch: Radiology Findings Search Benchmark | Trong Thang Pham et.al. | 2411.05780 | null |
2024-11-08 | Multi-armed Bandits with Missing Outcome | Ilia Mahrooghi et.al. | 2411.05661 | link |
2024-11-08 | WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making | Zhilong Zhang et.al. | 2411.05619 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential Equations | Hamed Karami et.al. | 2411.05371 | link |
2024-11-08 | Stochastic games of parental vaccination decision making and bounded rationality | Andras Balogh et.al. | 2411.05369 | null |
2024-11-08 | Agricultural Landscape Understanding At Country-Scale | Radhika Dua et.al. | 2411.05359 | null |
2024-11-08 | LLM-PySC2: Starcraft II learning environment for Large Language Models | Zongyuan Li et.al. | 2411.05348 | link |
2024-11-08 | Differentiable Calibration of Inexact Stochastic Simulation Models via Kernel Score Minimization | Ziwei Su et.al. | 2411.05315 | null |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-08 | Decoding Report Generators: A Cyclic Vision-Language Adapter for Counterfactual Explanations | Yingying Fang et.al. | 2411.05261 | null |
2024-11-07 | Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning | Inko Bovenzi et.al. | 2411.05237 | null |
2024-11-07 | Bootstrap Pettitt test for detecting change point in hydroclimatological data: a case study for Itaipu hydroelectric plant in Brazil | Luiza Chiarelli Conte et.al. | 2411.05233 | null |
2024-11-07 | AGE2HIE: Transfer Learning from Brain Age to Predicting Neurocognitive Outcome for Infant Brain Injury | Rina Bao et.al. | 2411.05188 | null |
2024-11-07 | Inverse Transition Learning: Learning Dynamics from Demonstrations | Leo Benac et.al. | 2411.05174 | null |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | Yanjun Gao et.al. | 2411.04962 | null |
2024-11-07 | Orbit: A Framework for Designing and Evaluating Multi-objective Rankers | Chenyang Yang et.al. | 2411.04798 | null |
2024-11-07 | Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Xuewen Han et.al. | 2411.04788 | link |
2024-11-07 | From CNN to ConvRNN: Adapting Visualization Techniques for Time-Series Anomaly Detection | Fabien Poirier et.al. | 2411.04707 | null |
2024-11-07 | Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning | Zhiyu Shao et.al. | 2411.04672 | link |
2024-11-07 | IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Clémence Grislain et.al. | 2411.04653 | link |
2024-11-07 | DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models | Zijian Zhang et.al. | 2411.04649 | null |
2024-11-07 | Bayesian reconstruction of sparse raster-scanned mid-infrared optoacoustic signals enables fast, label-free chemical microscopy | Constantin Berger et.al. | 2411.04648 | null |
2024-11-07 | Dynamic Detection of Relevant Objectives and Adaptation to Preference Drifts in Interactive Evolutionary Multi-Objective Optimization | Seyed Mahdi Shavarani et.al. | 2411.04547 | null |
2024-11-07 | Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity | Robby Costales et.al. | 2411.04466 | null |
2024-11-07 | GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection | Prashank Kadam et.al. | 2411.04459 | null |
2024-11-07 | Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera | Yu Hu et.al. | 2411.04413 | null |
2024-11-07 | LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Yeong-Seung Baek et.al. | 2411.04351 | null |
2024-11-07 | Survival of the Notable: Gender Asymmetry in Wikipedia Collective Deliberations | Khandaker Tasnim Huq et.al. | 2411.04340 | null |
2024-11-07 | CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models | Jierui Li et.al. | 2411.04329 | null |
2024-11-06 | Multimodal Structure-Aware Quantum Data Processing | Hala Hawashin et.al. | 2411.04242 | link |
2024-11-06 | Using Linked Micromaps for Evidence-Based Policy | Randall Powers et.al. | 2411.04211 | link |
2024-11-06 | A Capacitated Collection-and-Delivery-Point Location Problem with Random Utility Maximizing Customers | David Pinzon Ulloa et.al. | 2411.04200 | null |
2024-11-06 | A Comparative Study of Deep Reinforcement Learning for Crop Production Management | Joseph Balderas et.al. | 2411.04106 | null |
2024-11-06 | Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability | Bharat Chandra Yalavarthi et.al. | 2411.04008 | null |
2024-11-06 | Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging | Yuan Bi et.al. | 2411.04004 | null |
2024-11-06 | Fine-tuning – a Transfer Learning approach | Joseph Arul Raj et.al. | 2411.03941 | null |
2024-11-06 | A Causal Framework for Precision Rehabilitation | R. James Cotton et.al. | 2411.03919 | null |
2024-11-06 | AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Yizhe Huang et.al. | 2411.03865 | link |
2024-11-06 | A Comparative Study of Recent Large Language Models on Generating Hospital Discharge Summaries for Lung Cancer Patients | Yiming Li et.al. | 2411.03805 | null |
2024-11-06 | Navigating the landscape of multimodal AI in medicine: a scoping review on technical challenges and clinical applications | Daan Schouten et.al. | 2411.03782 | null |
2024-11-06 | Human-in-the-Loop Feature Selection Using Interpretable Kolmogorov-Arnold Network-based Double Deep Q-Network | Md Abrar Jahin et.al. | 2411.03740 | null |
2024-11-06 | Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures | Felix Tempel et.al. | 2411.03714 | link |
2024-11-06 | Generalized Trusted Multi-view Classification Framework with Hierarchical Opinion Aggregation | Long Shi et.al. | 2411.03713 | link |
2024-11-06 | Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving | Depanshu Sani et.al. | 2411.03702 | null |
2024-11-06 | OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2411.03696 | null |
2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
2024-11-06 | Evaluating Moral Beliefs across LLMs through a Pluralistic Framework | Xuelin Liu et.al. | 2411.03665 | link |
2024-11-06 | RTify: Aligning Deep Neural Networks with Human Behavioral Decisions | Yu-Ang Cheng et.al. | 2411.03630 | null |
2024-11-06 | Hiring as Exploration | Danielle Li et.al. | 2411.03616 | null |
2024-11-06 | Can Robotic Cues Manipulate Human Decisions? Exploring Consensus Building via Bias-Controlled Non-linear Opinion Dynamics and Robotic Eye Gaze Mediated Interaction in Human-Robot Teaming | Rajul Kumar et.al. | 2411.03581 | null |
2024-11-06 | Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions | Arunkumar Rathinam et.al. | 2411.03576 | null |
2024-11-05 | Digital Twin for Autonomous Surface Vessels: Enabler for Safe Maritime Navigation | Daniel Menges et.al. | 2411.03465 | null |
2024-11-05 | Causal Responsibility Attribution for Human-AI Collaboration | Yahang Qi et.al. | 2411.03275 | link |
2024-11-05 | Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI | Ruwan Wickramarachchi et.al. | 2411.03225 | null |
2024-11-05 | GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Temitope Akinboyewa et.al. | 2411.03205 | link |
2024-11-05 | Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care | Christel Sirocchi et.al. | 2411.03105 | link |
2024-11-05 | Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge | Bin Huang et.al. | 2411.02999 | null |
2024-11-05 | Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning | Yang Zhao et.al. | 2411.02983 | null |
2024-11-05 | Region-Guided Attack on the Segment Anything Model (SAM) | Xiaoliang Liu et.al. | 2411.02974 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-05 | Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery | Mohammad Kakooei et.al. | 2411.02935 | link |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | A new family of ladder operators for macroscopic systems, with applications | Fabio Bagarello et.al. | 2411.02879 | null |
2024-11-05 | Safety Verification for Evasive Collision Avoidance in Autonomous Vehicles with Enhanced Resolutions | Aliasghar Arab et.al. | 2411.02706 | null |
2024-11-04 | Geometry of naturalistic object representations in recurrent neural network models of working memory | Xiaoxuan Lei et.al. | 2411.02685 | null |
2024-11-04 | Visually Analyze SHAP Plots to Diagnose Misclassifications in ML-based Intrusion Detection | Maraz Mia et.al. | 2411.02670 | null |
2024-11-04 | Designing and Evaluating Sampling Strategies for Multiple-Forecast Visualization (MFV) | Ruishi Zou et.al. | 2411.02576 | null |
2024-11-04 | Enhancing Risk Assessment in Transformers with Loss-at-Risk Functions | Jinghan Zhang et.al. | 2411.02558 | null |
2024-11-04 | Imagining and building wise machines: The centrality of AI metacognition | Samuel G. B. Johnson et.al. | 2411.02478 | null |
2024-11-04 | Energy-Aware Dynamic Neural Inference | Marcello Bullo et.al. | 2411.02471 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | Federated GNNs for EEG-Based Stroke Assessment | Andrea Protani et.al. | 2411.02286 | null |
2024-11-04 | Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage | Eric Pilling et.al. | 2411.02211 | null |
2024-11-04 | Learning Multiple Initial Solutions to Optimization Problems | Elad Sharony et.al. | 2411.02158 | link |
2024-11-04 | Optimizing AoI at Query in Multiuser Wireless Uplink Networks: A Whittle Index Approach | Jingwei Liu et.al. | 2411.02108 | null |
2024-11-04 | Amortized Bayesian Experimental Design for Decision-Making | Daolang Huang et.al. | 2411.02064 | link |
2024-11-04 | Probability of Error Analysis for NOMA Systems in Rayleigh Fading Channels: Enabling IoT in Civil Engineering | Amr Abdelbari et.al. | 2411.01977 | null |
2024-11-04 | The Certainty Ratio $C_ρ$ : a novel metric for assessing the reliability of classifier predictions | Jesus S. Aguilar-Ruiz et.al. | 2411.01973 | null |
2024-11-04 | Advancing DeFi Analytics: Efficiency Analysis with Decentralized Exchanges Comparison Service | Evgenii Onishchuk et.al. | 2411.01950 | null |
2024-11-04 | Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy | Xinlin Wang et.al. | 2411.01928 | null |
2024-11-04 | Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations | Michael Kurenkov et.al. | 2411.01909 | null |
2024-11-04 | Towards the Industrial Metaverse: A Game-Based VR Application for Fire Drill and Evacuation Training for Ships and Shipbuilding | Musaab H. Hamed-Ahmed et.al. | 2411.01895 | null |
2024-11-04 | Causal Discovery and Classification Using Lempel-Ziv Complexity | Dhruthi et.al. | 2411.01881 | link |
2024-11-04 | Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification | Kapilan Balagopalan et.al. | 2411.01808 | null |
2024-11-03 | Nash equilibria in four-strategy quantum game extensions of the Prisoner’s Dilemma | Piotr Frąckiewicz et.al. | 2411.01711 | null |
2024-11-03 | Understanding the decision-making process of choice modellers | Gabriel Nova et.al. | 2411.01704 | null |
2024-11-03 | Co-clustering for Federated Recommender System | Xinrui He et.al. | 2411.01690 | link |
2024-11-03 | ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Salman Khan et.al. | 2411.01683 | link |
2024-11-03 | Autoformulation of Mathematical Optimization Models Using LLMs | Nicolás Astorga et.al. | 2411.01679 | null |
2024-11-03 | Know Where You’re Uncertain When Planning with Multimodal Foundation Models: A Formal Framework | Neel P. Bhatt et.al. | 2411.01639 | null |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Attention is All You Need to Optimize Wind Farm Operations and Maintenance | Iman Kazemian et.al. | 2410.24052 | null |
2024-10-31 | Representative Social Choice: From Learning Theory to AI Alignment | Tianyi Qiu et.al. | 2410.23953 | null |
2024-10-31 | Responsible Retrieval Augmented Generation for Climate Decision Making from Documents | Matyas Juhasz et.al. | 2410.23902 | null |
2024-10-31 | Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Liyi Chen et.al. | 2410.23875 | link |
2024-10-31 | Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map | Xinyuan Chang et.al. | 2410.23780 | null |
2024-10-31 | Features characterizing safe aerial-aquatic robots | Andrea Giordano et.al. | 2410.23722 | null |
2024-10-31 | Automatically Learning Hybrid Digital Twins of Dynamical Systems | Samuel Holt et.al. | 2410.23691 | link |
2024-10-31 | Coach Reservation for Groups Requests | Carlos H. Cardonha et.al. | 2410.23542 | null |
2024-10-30 | Development and Comparative Analysis of Machine Learning Models for Hypoxemia Severity Triage in CBRNE Emergency Scenarios Using Physiological and Demographic Data from Medical-Grade Devices | Santino Nanini et.al. | 2410.23503 | null |
2024-10-30 | Venire: A Machine Learning-Guided Panel Review System for Community Content Moderation | Vinay Koshy et.al. | 2410.23448 | null |
2024-10-30 | Estimating Neural Network Robustness via Lipschitz Constant and Architecture Sensitivity | Abulikemu Abuduweili et.al. | 2410.23382 | null |
2024-10-30 | OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Hongbo Zhao et.al. | 2410.23278 | null |
2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-31 | Enhancing Autonomous Driving Safety Analysis with Generative AI: A Comparative Study on Automated Hazard and Risk Assessment | Alireza Abbaspour et.al. | 2410.23207 | null |
2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback | Qinqing Zheng et.al. | 2410.23022 | null |
2024-10-31 | DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data | Hanyang Chen et.al. | 2410.22938 | link |
2024-11-01 | Multi-Agent Large Language Models for Conversational Task-Solving | Jonas Becker et.al. | 2410.22932 | null |
2024-10-30 | Self-optimization in distributed manufacturing systems using Modular State-based Stackelberg Games | Steve Yuwono et.al. | 2410.22912 | null |
2024-10-30 | YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif et.al. | 2410.22898 | null |
2024-10-30 | A Graph-Based Model for Vehicle-Centric Data Sharing Ecosystem | Haiyue Yuan et.al. | 2410.22897 | null |
2024-10-30 | Reliability Assessment of Information Sources Based on Random Permutation Set | Juntao Xu et.al. | 2410.22772 | null |
2024-10-30 | Self-Driving Car Racing: Application of Deep Reinforcement Learning | Florentiana Yuwono et.al. | 2410.22766 | null |
2024-10-30 | A Game-Theoretic Approach for Security Control Selection | Dylan Léveillé et.al. | 2410.22762 | null |
2024-10-30 | SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh et.al. | 2410.22752 | null |
2024-10-30 | Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets | Andoni Cortés et.al. | 2410.22748 | null |
2024-10-30 | Clustering Computer Mouse Tracking Data with Informed Hierarchical Shrinkage Partition Priors | Ziyi Song et.al. | 2410.22675 | link |
2024-10-30 | CoGS: Model Agnostic Causality Constrained Counterfactual Explanations using goal-directed ASP | Sopam Dasgupta et.al. | 2410.22615 | null |
2024-10-29 | Pre-Trained Vision Models as Perception Backbones for Safety Filters in Autonomous Driving | Yuxuan Yang et.al. | 2410.22585 | null |
2024-10-29 | Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Jaekyeom Kim et.al. | 2410.22552 | null |
2024-10-29 | An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion | Minghao Ning et.al. | 2410.22314 | null |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-29 | Fourier Head: Helping Large Language Models Learn Complex Probability Distributions | Nate Gillman et.al. | 2410.22269 | null |
2024-10-29 | MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation | Ovais Iqbal Shah et.al. | 2410.22223 | null |
2024-10-29 | Democratizing Reward Design for Personal and Representative Value-Alignment | Carter Blair et.al. | 2410.22203 | null |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-10-29 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | link |
2024-10-29 | Markov Stochastic Choice | Kremena Valkanova et.al. | 2410.22001 | null |
2024-10-29 | ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Yuetao Li et.al. | 2410.21955 | null |
2024-10-29 | On the Robustness of Adversarial Training Against Uncertainty Attacks | Emanuele Ledda et.al. | 2410.21952 | link |
2024-10-29 | Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation | Halil Utku Unlu et.al. | 2410.21926 | null |
2024-10-29 | Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation | Hong-fu Chou et.al. | 2410.21916 | null |
2024-10-29 | Bayesian Stability Selection and Inference on Inclusion Probabilities | Mahdi Nouraie et.al. | 2410.21914 | link |
2024-10-29 | Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms | Feifei Zhao et.al. | 2410.21882 | null |
2024-10-29 | Enhanced Survival Prediction in Head and Neck Cancer Using Convolutional Block Attention and Multimodal Data Fusion | Aiman Farooq et.al. | 2410.21831 | null |
2024-10-30 | First-in-human spinal cord tumor imaging with fast adaptive focus tracking robotic-OCT | Bin He et.al. | 2410.21809 | null |
2024-10-29 | SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Yubin Hu et.al. | 2410.21739 | null |
2024-10-30 | Enhancing Safety and Robustness of Vision-Based Controllers via Reachability Analysis | Kaustav Chakraborty et.al. | 2410.21736 | null |
2024-10-28 | Adaptive Self-Calibration for Minimalistic Collective Perception by Imperfect Robot Swarms | Khai Yi Chin et.al. | 2410.21546 | link |
2024-10-28 | Bayesian Regression for Predicting Subscription to Bank Term Deposits in Direct Marketing Campaigns | Muhammad Farhan Tanvir et.al. | 2410.21539 | null |
2024-10-28 | Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Xiang Wei et.al. | 2410.21240 | null |
2024-10-28 | Belief in the Machine: Investigating Epistemological Blind Spots of Language Models | Mirac Suzgun et.al. | 2410.21195 | link |
2024-10-28 | Towards Human-centered Design of Explainable Artificial Intelligence (XAI): A Survey of Empirical Studies | Shuai Ma et.al. | 2410.21183 | null |
2024-10-28 | coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM | Emiliano Höss et.al. | 2410.21149 | link |
2024-10-28 | Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments | Marharyta Domnich et.al. | 2410.21131 | null |
2024-10-28 | CloudHeatMap: Heatmap-Based Monitoring for Large-Scale Cloud Systems | Sarah Sohana et.al. | 2410.21092 | link |
2024-10-28 | Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving | Jiyao Wang et.al. | 2410.21086 | null |
2024-10-28 | Exploring the Reliability of Foundation Model-Based Frontier Selection in Zero-Shot Object Goal Navigation | Shuaihang Yuan et.al. | 2410.21037 | null |
2024-10-28 | Edge Perception: Intelligent Wireless Sensing at Network Edge | Yuanhao Cui et.al. | 2410.21017 | null |
2024-10-28 | A Review of Graph-Powered Data Quality Applications for IoT Monitoring Sensor Networks | Pau Ferrer-Cid et.al. | 2410.21006 | null |
2024-10-28 | Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering | Zhilin Zhang et.al. | 2410.21000 | null |
2024-10-28 | BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment | Mehdi Hosseinzadeh et.al. | 2410.20969 | null |
2024-10-28 | Active Legibility in Multiagent Reinforcement Learning | Yanyu Liu et.al. | 2410.20954 | null |
2024-10-28 | On Spatio-Temporal Stochastic Frontier Models | Elisa Fusco et.al. | 2410.20915 | null |
2024-10-28 | Explainability in AI Based Applications: A Framework for Comparing Different Techniques | Arne Grobrugge et.al. | 2410.20873 | null |
2024-10-28 | Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation | Jaechang Kim et.al. | 2410.20811 | null |
2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et.al. | 2410.20790 | null |
2024-10-27 | Language Models And A Second Opinion Use Case: The Pocket Professional | David Noever et.al. | 2410.20636 | null |
2024-10-27 | Toward Conditional Distribution Calibration in Survival Prediction | Shi-ang Qi et.al. | 2410.20579 | link |
2024-10-27 | Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations | Eduardo C. Garrido-Merchán et.al. | 2410.20550 | link |
2024-10-25 | Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | Yinglun Xu et.al. | 2410.19705 | null |
2024-10-25 | Optimizing Hearthstone Agents using an Evolutionary Algorithm | Pablo García-Sánchez et.al. | 2410.19681 | link |
2024-10-25 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Multi-modal Motion Prediction using Temporal Ensembling with Learning-based Aggregation | Kai-Yin Hong et.al. | 2410.19606 | null |
2024-10-25 | AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design | Francisco Erivaldo Fernandes Junior et.al. | 2410.19528 | link |
2024-10-25 | COR-MP: Conservation of Resources Model for Maneuver Planning | Karim Essalmi et.al. | 2410.19510 | null |
2024-10-25 | Robust Time Series Causal Discovery for Agent-Based Model Validation | Gene Yu et.al. | 2410.19412 | null |
2024-10-25 | Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Reachsak Ly et.al. | 2410.19262 | null |
2024-10-25 | Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models | Shuchen Meng et.al. | 2410.19241 | null |
2024-10-25 | Designing LLM-Agents with Personalities: A Psychometric Approach | Muhua Huang et.al. | 2410.19238 | null |
2024-10-24 | Context-Aware Trajectory Anomaly Detection | Haoji Hu et.al. | 2410.19136 | null |
2024-10-24 | Learning to Look: Seeking Information for Decision Making via Policy Factorization | Shivin Dass et.al. | 2410.18964 | null |
2024-10-24 | Context is Key: A Benchmark for Forecasting with Essential Textual Information | Andrew Robert Williams et.al. | 2410.18959 | link |
2024-10-24 | From Efficiency to Equity: Measuring Fairness in Preference Learning | Shreeyash Gowaikar et.al. | 2410.18841 | null |
2024-10-24 | Large Generative AI Models meet Open Networks for 6G: Integration, Platform, and Monetization | Peizheng Li et.al. | 2410.18790 | null |
2024-10-24 | A Joint Representation Using Continuous and Discrete Features for Cardiovascular Diseases Risk Prediction on Chest CT Scans | Minfeng Xu et.al. | 2410.18610 | link |
2024-10-24 | Learning Transparent Reward Models via Unsupervised Feature Selection | Daulet Baimukashev et.al. | 2410.18608 | null |
2024-10-24 | Aligning CodeLLMs with Direct Preference Optimization | Yibo Miao et.al. | 2410.18585 | null |
2024-10-24 | Resilience-based post disaster recovery optimization for infrastructure system via Deep Reinforcement Learning | Huangbin Liang et.al. | 2410.18577 | null |
2024-10-24 | Zero-shot Object Navigation with Vision-Language Models Reasoning | Congcong Wen et.al. | 2410.18570 | null |
2024-10-24 | Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning | Lachlan Mares et.al. | 2410.18462 | null |
2024-10-23 | Augmenting Training Data with Vector-Quantized Variational Autoencoder for Classifying RF Signals | Srihari Kamesh Kompella et.al. | 2410.18283 | null |
2024-10-23 | Real-Time Integrated Learning and Decision-Making for Asset Networks | Peter Verleijsdonk et.al. | 2410.18246 | null |
2024-10-23 | Characterising Open Source Co-opetition in Company-hosted Open Source Software Projects: The Cases of PyTorch, TensorFlow, and Transformers | Cailean Osborne et.al. | 2410.18241 | null |
2024-10-23 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | link |
2024-10-23 | WorldSimBench: Towards Video Generation Models as World Simulators | Yiran Qin et.al. | 2410.18072 | null |
2024-10-25 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-23 | Lightweight Neural App Control | Filippos Christianos et.al. | 2410.17883 | null |
2024-10-23 | Identifiable Representation and Model Learning for Latent Dynamic Systems | Congxi Zhang et.al. | 2410.17882 | null |
2024-10-23 | ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting | Shaofei Cai et.al. | 2410.17856 | link |
2024-10-23 | Exploiting Text-Image Latent Spaces for the Description of Visual Concepts | Laines Schmalwasser et.al. | 2410.17832 | null |
2024-10-23 | PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation | Feiyan Feng et.al. | 2410.17812 | null |
2024-10-23 | e-Values for Real-Time Residential Electricity Demand Forecast Model Selection | Fabian Backhaus et.al. | 2410.17800 | null |
2024-10-23 | Pointer: An Energy-Efficient ReRAM-based Point Cloud Recognition Accelerator with Inter-layer and Intra-layer Optimizations | Qijun Zhang et.al. | 2410.17782 | null |
2024-10-23 | Learning Versatile Skills with Curriculum Masking | Yao Tang et.al. | 2410.17744 | link |
2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et.al. | 2410.17734 | null |
2024-10-23 | Longitudinal Causal Image Synthesis | Yujia Li et.al. | 2410.17691 | link |
2024-10-23 | Integrating Large Language Models for UAV Control in Simulated Environments: A Modular Interaction Approach | Abhishek Phadke et.al. | 2410.17602 | null |
2024-10-23 | Predicting Company Growth by Econophysics informed Machine Learning | Ruyi Tao et.al. | 2410.17587 | null |
2024-10-23 | Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads | Xinwen Zhu et.al. | 2410.17576 | link |
2024-10-23 | Bridging Swarm Intelligence and Reinforcement Learning | Karthik Soma et.al. | 2410.17517 | null |
2024-10-23 | Detecting fake review buyers using network structure: Direct evidence from Amazon | Sherry He et.al. | 2410.17507 | null |
2024-10-23 | Learning Fair and Preferable Allocations through Neural Network | Ryota Maruo et.al. | 2410.17500 | null |
2024-10-22 | Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Dongsu Lee et.al. | 2410.17373 | null |
2024-10-22 | Literature Meets Data: A Synergistic Approach to Hypothesis Generation | Haokun Liu et.al. | 2410.17309 | link |
2024-10-22 | Hierarchical Upper Confidence Bounds for Constrained Online Learning | Ali Baheri et.al. | 2410.17216 | null |
2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et.al. | 2410.17144 | null |
2024-10-22 | Trustworthy XAI and Application | MD Abdullah Al Nasim et.al. | 2410.17139 | null |
2024-10-22 | Impact of Cognitive Dissonance on Social Hysteresis: Insights fromthe Expressed and Private Opinions Model | Kamińska Barbara et.al. | 2410.16934 | null |
2024-10-22 | EnvBridge: Bridging Diverse Environments with Cross-Environment Knowledge Transfer for Embodied AI | Tomoyuki Kagaya et.al. | 2410.16919 | null |
2024-10-22 | Distribution of Responsibility During the Usage of AI-Based Exoskeletons for Upper Limb Rehabilitation | Huaxi et.al. | 2410.16887 | null |
2024-10-22 | Contrasting Attitudes Towards Current and Future AI Applications for Computerised Interpretation of ECG: A Clinical Stakeholder Interview Study | Lukas Hughes-Noehrer et.al. | 2410.16879 | null |
2024-10-22 | Pedestrian motion prediction evaluation for urban autonomous driving | Dmytro Zabolotnii et.al. | 2410.16864 | link |
2024-10-22 | Dynamic graph neural networks for enhanced volatility prediction in financial markets | Pulikandala Nithish Kumar et.al. | 2410.16858 | null |
2024-10-22 | Safe Load Balancing in Software-Defined-Networking | Lam Dinh et.al. | 2410.16846 | null |
2024-10-22 | Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization | Sindhu Nair et.al. | 2410.16842 | null |
2024-10-22 | SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition | Jiaqi Chen et.al. | 2410.16746 | link |
2024-10-22 | Efficient Scheduling of Vehicular Tasks on Edge Systems with Green Energy and Battery Storage | Suvarthi Sarkar et.al. | 2410.16724 | null |
2024-10-22 | Resource-Efficient Sensor Fusion via System-Wide Dynamic Gated Neural Networks | Chetna Singhal et.al. | 2410.16723 | null |
2024-10-22 | SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments | Jumman Hossain et.al. | 2410.16686 | null |
2024-10-22 | Improving Causal Reasoning in Large Language Models: A Survey | Siheng Xiong et.al. | 2410.16676 | link |
2024-10-22 | Convex Markov Games: A Framework for Fairness, Imitation, and Creativity in Multi-Agent Learning | Ian Gemp et.al. | 2410.16600 | null |
2024-10-22 | Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models | Hongcheng Ding et.al. | 2410.16589 | null |
2024-10-21 | How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making? | Kenza Benkirane et.al. | 2410.16574 | link |
2024-10-21 | Raising the Stakes: Performance Pressure Improves AI-Assisted Decision Making | Nikita Haduong et.al. | 2410.16560 | null |
2024-10-21 | Reflection-Bench: probing AI intelligence with reflection | Lingyu Li et.al. | 2410.16270 | link |
2024-10-22 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving | Alexander Krentsel et.al. | 2410.16227 | null |
2024-10-21 | CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking | Nishat Raihan et.al. | 2410.16211 | null |
2024-10-22 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency | Aidan Boyd et.al. | 2410.16115 | null |
2024-10-21 | Fine-Tuning LLMs for Reliable Medical Question-Answering Services | Ali Anaissi et.al. | 2410.16088 | null |
2024-10-21 | A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Yue Deng et.al. | 2410.16024 | link |
2024-10-21 | Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization | Yugandhar Reddy Gogireddy et.al. | 2410.15962 | null |
2024-10-21 | Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles | Zhengming Wang et.al. | 2410.15912 | null |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-21 | Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images | Yiming Li et.al. | 2410.15879 | null |
2024-10-21 | WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction | Heng Zhai et.al. | 2410.15792 | null |
2024-10-21 | High-Fidelity Transfer of Functional Priors for Wide Bayesian Neural Networks by Learning Activations | Marcin Sendera et.al. | 2410.15777 | link |
2024-10-21 | Generalizing Motion Planners with Mixture of Experts for Autonomous Driving | Qiao Sun et.al. | 2410.15774 | link |
2024-10-21 | Solving Sparse \& High-Dimensional-Output Regression via Compression | Renyuan Li et.al. | 2410.15762 | null |
2024-10-21 | Learning-to-Defer for Extractive Question Answering | Montreuil Yannis et.al. | 2410.15761 | null |
2024-10-21 | SPARC: Prediction-Based Safe Control for Coupled Controllable and Uncontrollable Agents with Conformal Predictions | Shuqi Wang et.al. | 2410.15660 | null |
2024-10-21 | How to Find the Exact Pareto Front for Multi-Objective MDPs? | Yining Li et.al. | 2410.15557 | null |
2024-10-21 | A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM | ByungOk Han et.al. | 2410.15549 | null |
2024-10-18 | Enhancing AI Accessibility in Veterinary Medicine: Linking Classifiers and Electronic Health Records | Chun Yin Kong et.al. | 2410.14625 | null |
2024-10-18 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et.al. | 2410.14612 | null |
2024-10-18 | Towards Unsupervised Validation of Anomaly-Detection Models | Lihi Idan et.al. | 2410.14579 | null |
2024-10-18 | Spectral Representations for Accurate Causal Uncertainty Quantification with Gaussian Processes | Hugh Dance et.al. | 2410.14483 | null |
2024-10-18 | From Simple to Complex: Knowledge Transfer in Safe and Efficient Reinforcement Learning for Autonomous Driving | Rongliang Zhou et.al. | 2410.14468 | null |
2024-10-18 | Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning | Boning Zhang et.al. | 2410.14390 | null |
2024-10-18 | A Model Checker for Natural Strategic Ability | Marco Aruta et.al. | 2410.14374 | null |
2024-10-18 | Assistive AI for Augmenting Human Decision-making | Natabara Máté Gyöngyössy et.al. | 2410.14353 | null |
2024-10-18 | Continuous models combining slacks-based measures of efficiency and super-efficiency | Vicente J. Bolos et.al. | 2410.14303 | null |
2024-10-18 | Optimizing Collaborative Robotics since Pre-Deployment via Cyber-Physical Systems’ Digital Twins | Christian Cella et.al. | 2410.14298 | null |
2024-10-18 | Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning | Gaurav Arora et.al. | 2410.14235 | null |
2024-10-18 | LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs | Yujun Zhou et.al. | 2410.14182 | null |
2024-10-18 | XForecast: Evaluating Natural Language Explanations for Time Series Forecasting | Taha Aksu et.al. | 2410.14180 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | Auto Detecting Cognitive Events Using Machine Learning on Pupillary Data | Quang Dang et.al. | 2410.14174 | null |
2024-10-17 | Interpreting Inflammation Prediction Model via Tag-based Cohort Explanation | Fanyu Meng et.al. | 2410.14082 | null |
2024-10-17 | Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning | Bryan L. M. de Oliveira et.al. | 2410.14038 | link |
2024-10-17 | Recurrent Neural Goodness-of-Fit Test for Time Series | Aoran Zhang et.al. | 2410.13986 | null |
2024-10-17 | FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline | Kuldeep Singh et.al. | 2410.13959 | null |
2024-10-17 | Identifying High Consideration E-Commerce Search Queries | Zhiyu Chen et.al. | 2410.13951 | null |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864 | link |
2024-10-17 | MobA: A Two-Level Agent System for Efficient Mobile Task Automation | Zichen Zhu et.al. | 2410.13757 | link |
2024-10-17 | Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores | Minxing Zheng et.al. | 2410.13735 | null |
2024-10-17 | The Subtlety of Optimal Paternalism in a Population with Bounded Rationality | Charles F. Manski et.al. | 2410.13658 | null |
2024-10-17 | A Sequential Game Framework for Target Tracking | Daniel Leal et.al. | 2410.13587 | null |
2024-10-17 | Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation | Kuan-Ying Lee et.al. | 2410.13585 | null |
2024-10-17 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | RGB to Hyperspectral: Spectral Reconstruction for Enhanced Surgical Imaging | Tobias Czempiel et.al. | 2410.13570 | null |
2024-10-17 | Interactive Navigation with Adaptive Non-prehensile Mobile Manipulation | Cunxi Dai et.al. | 2410.13418 | null |
2024-10-17 | Accurate Checkerboard Corner Detection under Defoucs | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-17 | Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval | Ingeol Baek et.al. | 2410.13339 | null |
2024-10-17 | Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning | Minseok Choi et.al. | 2410.13274 | null |
2024-10-17 | FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling | Jintao Zhang et.al. | 2410.13253 | link |
2024-10-17 | Annealed Stein Variational Gradient Descent for Improved Uncertainty Estimation in Full-Waveform Inversion | Miguel Corrales et.al. | 2410.13249 | null |
2024-10-17 | Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation | Hyungjoo Chae et.al. | 2410.13232 | null |
2024-10-17 | LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch | Caigao Jiang et.al. | 2410.13213 | link |
2024-10-17 | Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations | Aryan Shrivastava et.al. | 2410.13204 | link |
2024-10-16 | Future of Algorithmic Organization: Large-Scale Analysis of Decentralized Autonomous Organizations (DAOs) | Tanusree Sharma et.al. | 2410.13095 | null |
2024-10-16 | Double-Bayesian Learning | Stefan Jaeger et.al. | 2410.12984 | null |
2024-10-16 | Multi-modal graph neural networks for localized off-grid weather forecasting | Qidong Yang et.al. | 2410.12938 | link |
2024-10-16 | Machine Learning-Augmented Ontology-Based Data Access for Renewable Energy Data | Marco Calautti et.al. | 2410.12734 | null |
2024-10-16 | Best-Worst Disaggregation: An approach to the preference disaggregation problem | Matteo Brunelli et.al. | 2410.12678 | null |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-16 | Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control | Koen de Vos et.al. | 2410.12651 | null |
2024-10-16 | Rethinking Visual Counterfactual Explanations Through Region Constraint | Bartlomiej Sobieski et.al. | 2410.12591 | null |
2024-10-16 | Self-DenseMobileNet: A Robust Framework for Lung Nodule Classification using Self-ONN and Stacking-based Meta-Classifier | Md. Sohanur Rahman et.al. | 2410.12584 | null |
2024-10-16 | STRUX: An LLM for Decision-Making with Structured Explanations | Yiming Lu et.al. | 2410.12583 | null |
2024-10-16 | Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Sihao Wu et.al. | 2410.12568 | null |
2024-10-16 | Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Stelios Triantafyllou et.al. | 2410.12539 | link |
2024-10-16 | Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL | Jared Joselowitz et.al. | 2410.12491 | null |
2024-10-16 | SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Loris Gaven et.al. | 2410.12481 | null |
2024-10-16 | ConLUX: Concept-Based Local Unified Explanations | Junhao Liu et.al. | 2410.12439 | null |
2024-10-16 | Conformity in Large Language Models | Xiaochen Zhu et.al. | 2410.12428 | null |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance | Yaxi Lu et.al. | 2410.12361 | link |
2024-10-16 | TPFL: A Trustworthy Personalized Federated Learning Framework via Subjective Logic | Jinqian Chen et.al. | 2410.12316 | null |
2024-10-16 | Consistency Calibration: Improving Uncertainty Calibration via Consistency among Perturbed Neighbors | Linwei Tao et.al. | 2410.12295 | null |
2024-10-16 | Implementation of EMR System in Indonesian Health Facilities: Benefits and Constraints | Rasyid Juliansyah et.al. | 2410.12226 | null |
2024-10-16 | Sparse Prototype Network for Explainable Pedestrian Behavior Prediction | Yan Feng et.al. | 2410.12195 | link |
2024-10-16 | ExoTST: Exogenous-Aware Temporal Sequence Transformer for Time Series Prediction | Kshitij Tayal et.al. | 2410.12184 | null |
2024-10-15 | Technical Report of 1:10 Scale Autonomous Vehicle Robot | Amirhossein Kheiri Holighi et.al. | 2410.11746 | null |
2024-10-15 | MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models | Pei Wang et.al. | 2410.11710 | link |
2024-10-15 | Fully-discrete provably Lyapunov consistent discretizations for convection-diffusion-reaction PDE systems | Rasha Al Jahdali et.al. | 2410.11669 | null |
2024-10-15 | Black-box Uncertainty Quantification Method for LLM-as-a-Judge | Nico Wagner et.al. | 2410.11594 | null |
2024-10-15 | A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction | Zhouheng Li et.al. | 2410.11570 | link |
2024-10-15 | Effect modification and non-collapsibility leads to conflicting treatment decisions: a review of marginal and conditional estimands and recommendations for decision-making | David M. Phillippo et.al. | 2410.11438 | null |
2024-10-15 | DODT: Enhanced Online Decision Transformer Learning through Dreamer’s Actor-Critic Trajectory Forecasting | Eric Hanchen Jiang et.al. | 2410.11359 | null |
2024-10-15 | DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Jaehyun Park et.al. | 2410.11338 | null |
2024-10-15 | Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Yunho Kim et.al. | 2410.11324 | null |
2024-10-15 | Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Gaoyang Pang et.al. | 2410.11316 | null |
2024-10-15 | Process Reward Model with Q-Value Rankings | Wendi Li et.al. | 2410.11287 | link |
2024-10-15 | Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Jiayu Chen et.al. | 2410.11234 | null |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-14 | Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts | Sharon Levy et.al. | 2410.11084 | link |
2024-10-14 | SGUQ: Staged Graph Convolution Neural Network for Alzheimer’s Disease Diagnosis using Multi-Omics Data | Liang Tao et.al. | 2410.11046 | link |
2024-10-14 | Persistent Topological Features in Large Language Models | Yuri Gardinazzi et.al. | 2410.11042 | link |
2024-10-14 | ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | Jing Liang et.al. | 2410.11019 | null |
2024-10-14 | 6G RIS-aided Single-LEO Localization with Slow and Fast Doppler Effects | Sharief Saleh et.al. | 2410.11010 | null |
2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | null |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null |
2024-10-14 | Towards Calibrated Losses for Adversarial Robust Reject Option Classification | Vrund Shah et.al. | 2410.10736 | link |
2024-10-14 | Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems | Ran Wei et.al. | 2410.10653 | null |
2024-10-14 | Echo State Networks for Spatio-Temporal Area-Level Data | Zhenhua Wang et.al. | 2410.10641 | null |
2024-10-14 | Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty | John Mern et.al. | 2410.10610 | null |
2024-10-14 | Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Juan Sebastian Rojas et.al. | 2410.10578 | null |
2024-10-14 | Words to Wheels: Vision-Based Autonomous Driving Understanding Human Language Instructions Using Foundation Models | Chanhoe Ryu et.al. | 2410.10577 | null |
2024-10-14 | When Precedents Clash | Cecilia Di Florio et.al. | 2410.10567 | null |
2024-10-14 | Graph Classification Gaussian Processes via Hodgelet Spectral Features | Mathieu Alain et.al. | 2410.10546 | null |
2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | link |
2024-10-15 | Ada-K Routing: Boosting the Efficiency of MoE-based LLMs | Tongtian Yue et.al. | 2410.10456 | null |
2024-10-14 | QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader et.al. | 2410.10449 | null |
2024-10-14 | In-Materia Speech Recognition | Mohamadreza Zolfagharinejad et.al. | 2410.10434 | null |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-15 | Improved Depth Estimation of Bayesian Neural Networks | Bart van Erp et.al. | 2410.10395 | link |
2024-10-14 | MentalGLM Series: Explainable Large Language Models for Mental Health Analysis on Chinese Social Media | Wei Zhai et.al. | 2410.10323 | link |
2024-10-14 | Preliminary Evaluation of an Ultrasound-Guided Robotic System for Autonomous Percutaneous Intervention | Pratima Mohan et.al. | 2410.10299 | null |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-11 | Variance reduction combining pre-experiment and in-experiment data | Zhexiao Lin et.al. | 2410.09027 | null |
2024-10-11 | Learning Representations of Instruments for Partial Identification of Treatment Effects | Jonas Schweisthal et.al. | 2410.08976 | link |
2024-10-11 | Transferable Belief Model on Quantum Circuits | Qianli Zhou et.al. | 2410.08949 | null |
2024-10-11 | DiffPO: A causal diffusion model for learning distributions of potential outcomes | Yuchen Ma et.al. | 2410.08924 | null |
2024-10-11 | Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving | Zijiang Yan et.al. | 2410.08854 | null |
2024-10-11 | Online Learning for Intelligent Thermal Management of Interference-coupled and Passively Cooled Base Stations | Zhanwei Yu et.al. | 2410.08799 | null |
2024-10-11 | Integrating Expert Judgment and Algorithmic Decision Making: An Indistinguishability Framework | Rohan Alur et.al. | 2410.08783 | link |
2024-10-11 | VideoSAM: Open-World Video Segmentation | Pinxue Guo et.al. | 2410.08781 | null |
2024-10-11 | Causal machine learning for predicting treatment outcomes | Stefan Feuerriegel et.al. | 2410.08770 | null |
2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et.al. | 2410.08739 | null |
2024-10-11 | Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models | Yunchao Wang et.al. | 2410.08723 | null |
2024-10-11 | Impact of Surface Reflections in Maritime Obstacle Detection | Samed Yalçın et.al. | 2410.08713 | link |
2024-10-11 | Opacity Enforcement by Edit Functions Under Incomparable Observations | Wei Duan et.al. | 2410.08471 | null |
2024-10-11 | AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion | Yuting Xie et.al. | 2410.08453 | null |
2024-10-11 | JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles | Dom Nasrabadi et.al. | 2410.08442 | null |
2024-10-10 | Can LLMs advance democratic values? | Seth Lazar et.al. | 2410.08418 | null |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | Large Legislative Models: Towards Efficient AI Policymaking in Economic Simulations | Henry Gasztowtt et.al. | 2410.08345 | link |
2024-10-10 | Towards Foundation Models for Mixed Integer Linear Programming | Sirui Li et.al. | 2410.08288 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | Mars: Situated Inductive Reasoning in an Open-World Environment | Xiaojuan Tang et.al. | 2410.08126 | null |
2024-10-10 | A Generative AI Technique for Synthesizing a Digital Twin for U.S. Residential Solar Adoption and Generation | Aparna Kishore et.al. | 2410.08098 | null |
2024-10-10 | Gaussian Process Thompson Sampling via Rootfinding | Taiwo A. Adebiyi et.al. | 2410.08071 | null |
2024-10-10 | Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread | David Kerkmann et.al. | 2410.08050 | link |
2024-10-10 | Harmonic Oscillator based Particle Swarm Optimization | Yury Chernyak et.al. | 2410.08043 | null |
2024-10-10 | APOLLO: A GPT-based tool to detect phishing emails and generate explanations that warn users | Giuseppe Desolda et.al. | 2410.07997 | null |
2024-10-10 | Octopus Inspired Optimization Algorithm: Multi-Level Structures and Parallel Computing Strategies | Xu Wang et.al. | 2410.07968 | link |
2024-10-10 | Eco-driving Incentive Mechanisms for Mitigating Emissions in Urban Transportation | M. Umar B. Niazi et.al. | 2410.07952 | null |
2024-10-10 | AI Surrogate Model for Distributed Computing Workloads | David K. Park et.al. | 2410.07940 | null |
2024-10-10 | Offline Hierarchical Reinforcement Learning via Inverse Optimization | Carolin Schmidt et.al. | 2410.07933 | null |
2024-10-10 | Decision-Aware Predictive Model Selection for Workforce Allocation | Eric G. Stratman et.al. | 2410.07932 | null |
2024-10-10 | Efficient Reinforcement Learning with Large Language Model Priors | Xue Yan et.al. | 2410.07927 | null |
2024-10-10 | Understanding Human Activity with Uncertainty Measure for Novelty in Graph Convolutional Networks | Hao Xing et.al. | 2410.07917 | null |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | Autonomous Vehicles Path Planning under Temporal Logic Specifications | Akshay Dhonthi et.al. | 2410.07845 | null |
2024-10-10 | Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses | Pranav Senthilkumar et.al. | 2410.07826 | null |
2024-10-10 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-10-10 | Give Me a Choice: The Consequences of Restricting Choices Through AI-Support for Perceived Autonomy, Motivational Variables, and Decision Performance | Cedric Faas et.al. | 2410.07728 | null |
2024-10-10 | Autonomous Driving in Unstructured Environments: How Far Have We Come? | Chen Min et.al. | 2410.07701 | link |
2024-10-09 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-09 | Identifying and Addressing Delusions for Target-Directed Decision-Making | Mingde Zhao et.al. | 2410.07096 | link |
2024-10-09 | Optimizing Estimators of Squared Calibration Errors in Classification | Sebastian G. Gruber et.al. | 2410.07014 | null |
2024-10-09 | Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models | Daniel Albert et.al. | 2410.06932 | null |
2024-10-09 | How hard can it be? Quantifying MITRE attack campaigns with attack trees and cATM logic | Stefano M. Nicoletti et.al. | 2410.06692 | null |
2024-10-09 | $β$ -calibration of Language Model Confidence Scores for Generative QA | Putra Manggala et.al. | 2410.06615 | null |
2024-10-09 | Decentralized Clinical Trials in the Era of Real-World Evidence: A Statistical Perspective | Jie Chen et.al. | 2410.06591 | null |
2024-10-09 | Use of Real-World Data and Real-World Evidence in Rare Disease Drug Development: A Statistical Perspective | Jie Chen et.al. | 2410.06586 | null |
2024-10-09 | Challenges and Possible Strategies to Address Them in Rare Disease Drug Development: A Statistical Perspective | Jie Chen et.al. | 2410.06585 | null |
2024-10-10 | When Does Interference Matter? Decision-Making in Platform Experiments | Ramesh Johari et.al. | 2410.06580 | null |
2024-10-09 | Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare | Pardis Sadat Zahraei et.al. | 2410.06566 | null |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-09 | Overcoming Autoware-Ubuntu Incompatibility in Autonomous Driving Systems-Equipped Vehicles: Lessons Learned | Dada Zhang et.al. | 2410.06492 | null |
2024-10-09 | Flipping-based Policy for Chance-Constrained Markov Decision Processes | Xun Shen et.al. | 2410.06474 | null |
2024-10-09 | Modeling chaotic Lorenz ODE System using Scientific Machine Learning | Sameera S Kashyap et.al. | 2410.06452 | null |
2024-10-08 | Biased AI can Influence Political Decision-Making | Jillian Fisher et.al. | 2410.06415 | null |
2024-10-08 | BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis | Christopher Klammer et.al. | 2410.06410 | link |
2024-10-08 | Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots | Milad Farjadnasab et.al. | 2410.06372 | link |
2024-10-08 | HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian Aid | Hemank Lamba et.al. | 2410.06370 | link |
2024-10-10 | Context-Aware Command Understanding for Tabletop Scenarios | Paul Gajewski et.al. | 2410.06355 | null |
2024-10-07 | LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation | Zhijie Wang et.al. | 2410.05191 | null |
2024-10-07 | ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | Yuelyu Ji et.al. | 2410.05168 | null |
2024-10-07 | Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability | Fan Chen et.al. | 2410.05117 | null |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | HE-Drive: Human-Like End-to-End Driving with Vision Language Models | Junming Wang et.al. | 2410.05051 | null |
2024-10-07 | Real-time Ship Recognition and Georeferencing for the Improvement of Maritime Situational Awareness | Borja Carrillo Perez et.al. | 2410.04946 | null |
2024-10-07 | PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion | Sijie Wang et.al. | 2410.04939 | link |
2024-10-07 | Why am I seeing this: Democratizing End User Auditing for Online Content Recommendations | Chaoran Chen et.al. | 2410.04917 | null |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-07 | WTCL-Dehaze: Rethinking Real-world Image Dehazing via Wavelet Transform and Contrastive Learning | Divine Joseph Appiah et.al. | 2410.04762 | null |
2024-10-07 | Driving with Regulation: Interpretable Decision-Making for Autonomous Vehicles with Retrieval-Augmented Reasoning via LLM | Tianhui Cai et.al. | 2410.04759 | null |
2024-10-07 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-07 | Does the Infamous Pie Chart Really Hurt Decision-Making in the Real World? Assessing the Role of Visualization in High-Level Academic Decisions | Yixuan Li et.al. | 2410.04686 | null |
2024-10-06 | VISTA: A Visual and Textual Attention Dataset for Interpreting Multimodal Models | Harshit et.al. | 2410.04609 | null |
2024-10-06 | CardioAI: A Multimodal AI-based System to Support Symptom Monitoring and Risk Detection of Cancer Treatment-Induced Cardiotoxicity | Siyi Wu et.al. | 2410.04592 | null |
2024-10-06 | Ranking Policy Learning via Marketplace Expected Value Estimation From Observational Data | Ehsan Ebrahimzadeh et.al. | 2410.04568 | null |
2024-10-06 | Bisimulation metric for Model Predictive Control | Yutaka Shimizu et.al. | 2410.04553 | link |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-06 | A Reinforcement Learning Engine with Reduced Action and State Space for Scalable Cyber-Physical Optimal Response | Shining Sun et.al. | 2410.04518 | null |
2024-10-06 | Two-fund separation under hyperbolically distributed returns and concave utility function | Nuerxiati Abudurexiti et.al. | 2410.04459 | null |
2024-10-04 | Minimax-optimal trust-aware multi-armed bandits | Changxiao Cai et.al. | 2410.03651 | null |
2024-10-04 | Open-World Reinforcement Learning over Long Short-Term Imagination | Jiajian Li et.al. | 2410.03618 | null |
2024-10-04 | A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development | Jesper Knapp et.al. | 2410.03580 | null |
2024-10-04 | MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation | Hongcheng Wang et.al. | 2410.03488 | null |
2024-10-04 | Predictive Coding for Decision Transformer | Tung M. Luu et.al. | 2410.03408 | link |
2024-10-04 | Make Interval Bound Propagation great again | Patryk Krukowski et.al. | 2410.03373 | link |
2024-10-04 | SELU: Self-Learning Embodied MLLMs in Unknown Environments | Boyu Li et.al. | 2410.03303 | null |
2024-10-04 | Deliberate Reasoning for LLMs as Structure-aware Planning with Accurate World Model | Siheng Xiong et.al. | 2410.03136 | null |
2024-10-04 | Spatial-aware decision-making with ring attractors in reinforcement learning systems | Marcos Negre Saura et.al. | 2410.03119 | null |
2024-10-04 | Strategic Insights from Simulation Gaming of AI Race Dynamics | Ross Gruetzemacher et.al. | 2410.03092 | null |
2024-10-04 | MetaOOD: Automatic Selection of OOD Detection Models | Yuehan Qin et.al. | 2410.03074 | null |
2024-10-03 | Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory | Alexander Levine et.al. | 2410.03016 | link |
2024-10-03 | Harm Ratio: A Novel and Versatile Fairness Criterion | Soroush Ebadian et.al. | 2410.02977 | null |
2024-10-03 | Acoustic signaling enables collective perception and control in active matter systems | Alexander Ziepke et.al. | 2410.02940 | null |
2024-10-03 | ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Ahmad Elawady et.al. | 2410.02751 | link |
2024-10-03 | DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life | Yu Ying Chiu et.al. | 2410.02683 | null |
2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | null |
2024-10-03 | Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Hanrong Zhang et.al. | 2410.02644 | link |
2024-10-03 | Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking | Fabian Herzog et.al. | 2410.02638 | link |
2024-10-03 | Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning | Olivier Lepel et.al. | 2410.02605 | null |
2024-10-03 | Expected Maximin Fairness in Max-Cut and other Combinatorial Optimization Problems | Jad Salem et.al. | 2410.02589 | null |
2024-10-03 | Spontaneous Symmetry Breaking, Group Decision Making and Beyond 1. Echo Chambers and Random Polarization | Serge Galam et.al. | 2410.02582 | null |
2024-10-03 | ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Zixiang Wang et.al. | 2410.02551 | null |
2024-10-03 | Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language | Anthony Costarelli et.al. | 2410.02472 | link |
2024-10-03 | Behavior Trees in Functional Safety Supervisors for Autonomous Vehicles | Carlos Conejo et.al. | 2410.02469 | link |
2024-10-03 | Aggregation of Constrained Crowd Opinions for Urban Planning | Akanksha Das et.al. | 2410.02454 | null |
2024-10-03 | Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks | Junlin Hou et.al. | 2410.02331 | null |
2024-10-03 | Selection Guidelines for Geographical SMR Protocols: A Communication Pattern-based Latency Modeling Approach | Kohya Shiozaki et.al. | 2410.02295 | null |
2024-10-03 | Perfect Counterfactuals in Imperfect Worlds: Modelling Noisy Implementation of Actions in Sequential Algorithmic Recourse | Yueqing Xuan et.al. | 2410.02273 | null |
2024-10-03 | End-to-end Driving in High-Interaction Traffic Scenarios with Reinforcement Learning | Yueyuan Li et.al. | 2410.02253 | null |
2024-10-03 | Probabilistic road classification in historical maps using synthetic data and deep learning | Dominik J. Mühlematter et.al. | 2410.02250 | link |
2024-10-03 | SEAL: SEmantic-Augmented Imitation Learning via Language Model | Chengyang Gu et.al. | 2410.02231 | null |
2024-10-03 | Measuring, Evaluating and Improving Logical Consistency in Large Language Models | Yinhong Liu et.al. | 2410.02205 | null |
2024-10-03 | Remember and Recall: Associative-Memory-based Trajectory Prediction | Hang Guo et.al. | 2410.02201 | null |
2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796 | null |
2024-10-02 | DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning | Yebowen Hu et.al. | 2410.01772 | null |
2024-10-02 | Decision-Focused Uncertainty Quantification | Santiago Cortes-Gomez et.al. | 2410.01767 | null |
2024-10-02 | Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning | Xingrui Gu et.al. | 2410.01739 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-10-02 | Moral Alignment for LLM Agents | Elizaveta Tennant et.al. | 2410.01639 | null |
2024-10-02 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-02 | AI-Native Network Digital Twin for Intelligent Network Management in 6G | Wen Wu et.al. | 2410.01584 | null |
2024-10-02 | Uncertainty quantification in neutron and gamma time correlation measurements | Paul Lartaud et.al. | 2410.01522 | null |
2024-10-02 | One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability | Gabriel Kasmi et.al. | 2410.01482 | null |
2024-10-02 | Adaptive teachers for amortized samplers | Minsu Kim et.al. | 2410.01432 | null |
2024-10-02 | Regularized e-processes: anytime valid inference with knowledge-based efficiency gains | Ryan Martin et.al. | 2410.01427 | null |
2024-10-02 | CSLens: Towards Better Deploying Charging Stations via Visual Analytics – A Coupled Networks Perspective | Yutian Zhang et.al. | 2410.01384 | null |
2024-10-02 | MARLens: Understanding Multi-agent Reinforcement Learning for Traffic Signal Control via Visual Analytics | Yutian Zhang et.al. | 2410.01364 | null |
2024-10-02 | Detecting Viral Social Events through Censored Observation with Deep Survival Analysis | Maryam Ramezani et.al. | 2410.01320 | null |
2024-10-02 | FanCric : Multi-Agentic Framework for Crafting Fantasy 11 Cricket Teams | Mohit Bhatnagar et.al. | 2410.01307 | null |
2024-10-02 | What Did I Say Again? Relating User Needs to Search Outcomes in Conversational Commerce | Kevin Schott et.al. | 2410.01291 | null |
2024-10-02 | Uncertainty-aware Human Mobility Modeling and Anomaly Detection | Haomin Wen et.al. | 2410.01281 | null |
2024-10-02 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et.al. | 2410.01225 | link |
2024-10-02 | An uncertainty-aware Digital Shadow for underground multimodal CO2 storage monitoring | Abhinav Prakash Gahlot et.al. | 2410.01218 | null |
2024-09-30 | Maia-2: A Unified Model for Human-AI Alignment in Chess | Zhenwei Tang et.al. | 2409.20553 | link |
2024-09-30 | Best Practices for Responsible Machine Learning in Credit Scoring | Giovani Valdrighi et.al. | 2409.20536 | link |
2024-09-30 | End-to-End Conformal Calibration for Optimization Under Uncertainty | Christopher Yeh et.al. | 2409.20534 | link |
2024-09-30 | Quantifying Metrics for Wildfire Ignition Risk from Geographic Data in Power Shutoff Decision-Making | Ryan Piansky et.al. | 2409.20511 | null |
2024-09-30 | Online Decision Deferral under Budget Constraints | Mirabel Reid et.al. | 2409.20489 | null |
2024-09-30 | The Secretary Problem with Predicted Additive Gap | Alexander Braun et.al. | 2409.20460 | null |
2024-09-30 | Sufficient and Necessary Explanations (and What Lies in Between) | Beepul Bharti et.al. | 2409.20427 | null |
2024-09-30 | Conformal Prediction for Dose-Response Models with Continuous Treatments | Jarne Verhaeghe et.al. | 2409.20412 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation | Tillmann Rheude et.al. | 2409.20287 | link |
2024-09-30 | Learning to Ground Existentially Quantified Goals | Martin Funkquist et.al. | 2409.20259 | null |
2024-09-30 | Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning | Junlin Lu et.al. | 2409.20258 | link |
2024-09-30 | Feature Extractor or Decision Maker: Rethinking the Role of Visual Encoders in Visuomotor Policies | Ruiyu Wang et.al. | 2409.20248 | null |
2024-09-30 | Customized Information and Domain-centric Knowledge Graph Construction with Large Language Models | Frank Wawrzik et.al. | 2409.20010 | null |
2024-10-01 | OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity | Junming Wang et.al. | 2409.19987 | null |
2024-09-30 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | link |
2024-09-30 | Data-driven decision-making under uncertainty with entropic risk measure | Utsav Sadana et.al. | 2409.19926 | null |
2024-10-01 | On The Planning Abilities of OpenAI’s o1 Models: Feasibility, Optimality, and Generalizability | Kevin Wang et.al. | 2409.19924 | link |
2024-09-30 | ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Ezra Karger et.al. | 2409.19839 | link |
2024-09-29 | Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning | Shreyas Muthusamy et.al. | 2409.19829 | null |
2024-09-27 | LML: Language Model Learning a Dataset for Data-Augmented Prediction | Praneeth Vadlapati et.al. | 2409.18957 | link |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Moldable Development Patterns | Oscar Nierstrasz et.al. | 2409.18811 | null |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | Enhancing Explainability in Multimodal Large Language Models Using Ontological Context | Jihen Amara et.al. | 2409.18753 | null |
2024-09-27 | Renewal equations for vector-borne diseases | Cathal Mills et.al. | 2409.18726 | null |
2024-09-27 | The Craft of Selective Prediction: Towards Reliable Case Outcome Classification – An Empirical Study on European Court of Human Rights Cases | T. Y. S. S. Santosh et.al. | 2409.18645 | null |
2024-09-27 | Incorporating Precedents for Legal Judgement Prediction on European Court of Human Rights Cases | T. Y. S. S. Santosh et.al. | 2409.18644 | null |
2024-09-27 | DP-SCC-PL:Differentially Private Decentralized Byzantine-Resilient Stochastic Optimization via Self-Centered Clipping Under Polyak-Łojasiewicz Condition | Jinhui Hu et.al. | 2409.18632 | null |
2024-09-27 | Unsupervised Cognition | Alfredo Ibias et.al. | 2409.18624 | null |
2024-09-27 | Analysis of Truncated Singular Value Decomposition for Koopman Operator-Based Lane Change Model | Chinnawut Nantabut et.al. | 2409.18586 | null |
2024-09-27 | Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen | Miguel Costa et.al. | 2409.18574 | link |
2024-09-27 | BoT-Drive: Hierarchical Behavior and Trajectory Planning for Autonomous Driving using POMDPs | Xuanjin Jin et.al. | 2409.18411 | null |
2024-09-27 | Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network | Lei Li et.al. | 2409.18399 | null |
2024-09-27 | ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data | Shiyi He et.al. | 2409.18386 | null |
2024-09-27 | Robo-CSK-Organizer: Commonsense Knowledge to Organize Detected Objects for Multipurpose Robots | Rafael Hidalgo et.al. | 2409.18385 | null |
2024-09-27 | A model-constrained Discontinuous Galerkin Network (DGNet) for Compressible Euler Equations with Out-of-Distribution Generalization | Hai Van Nguyen et.al. | 2409.18371 | null |
2024-09-26 | Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Zhenghao Peng et.al. | 2409.18343 | null |
2024-09-26 | Does End-to-End Autonomous Driving Really Need Perception Tasks? | Peidong Li et.al. | 2409.18341 | link |
2024-09-26 | Spatial Visibility and Temporal Dynamics: Revolutionizing Field of View Prediction in Adaptive Point Cloud Video Streaming | Chen Li et.al. | 2409.18236 | null |
2024-09-26 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | HARMONIC: A Framework for Explanatory Cognitive Robots | Sanjay Oruganti et.al. | 2409.18037 | null |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning | Song Wang et.al. | 2409.18026 | null |
2024-09-26 | Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Jialin Li et.al. | 2409.18000 | null |
2024-09-26 | A Decision-Making Method in Polyhedral Convex Set Optimization | Andreas Löhne et.al. | 2409.17998 | null |
2024-09-26 | Adaptive Stream Processing on Edge Devices through Active Inference | Boris Sedlak et.al. | 2409.17937 | null |
2024-09-26 | PhantomLiDAR: Cross-modality Signal Injection Attacks against LiDAR | Zizhi Jin et.al. | 2409.17907 | null |
2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | null |
2024-09-26 | CASPFormer: Trajectory Prediction from BEV Images with Deformable Attention | Harsh Yadav et.al. | 2409.17790 | null |
2024-09-26 | AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking | Shiqi Sun et.al. | 2409.17728 | null |
2024-09-26 | Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning | Siyi Lu et.al. | 2409.17659 | null |
2024-09-26 | Intervention strategies for misinformation sharing on social media: A bibliometric analysis | Juanita Zainudin et.al. | 2409.17637 | null |
2024-09-27 | Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception | Jie Jia et.al. | 2409.17618 | null |
2024-09-26 | Good Data Is All Imitation Learning Needs | Amir Samadi et.al. | 2409.17605 | null |
2024-09-26 | Planned behavior, perceptual biases, and the dynamics of collective action | Alice C Schwarze et.al. | 2409.17573 | null |
2024-09-26 | Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs | Deniz Gündüz et.al. | 2409.17557 | null |
2024-09-26 | GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent | Hongtai Zeng et.al. | 2409.17500 | link |
2024-09-26 | How Do Observational Astronomers Learn to Inspect Imaging Data | Hugo Walsh et.al. | 2409.17468 | null |
2024-09-25 | Learning with Dynamics: Autonomous Regulation of UAV Based Communication Networks with Dynamic UAV Crew | Ran Zhang et.al. | 2409.17139 | null |
2024-09-25 | Enhancing robot reliability for health-care facilities by means of Human-Aware Navigation Planning | Olga E. Sorokoletova et.al. | 2409.17131 | null |
2024-09-25 | On-orbit Servicing for Spacecraft Collision Avoidance With Autonomous Decision Making | Susmitha Patnala et.al. | 2409.17125 | null |
2024-09-25 | Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer | Benji Peng et.al. | 2409.17120 | null |
2024-09-25 | Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion | Vineet Punyamoorty et.al. | 2409.16950 | null |
2024-09-25 | Quantifying Visual Properties of GAM Shape Plots: Impact on Perceived Cognitive Load and Interpretability | Sven Kruschel et.al. | 2409.16870 | null |
2024-09-25 | The Role of Language Models in Modern Healthcare: A Comprehensive Review | Amna Khalid et.al. | 2409.16860 | null |
2024-09-25 | Dispute resolution in legal mediation with quantitative argumentation | Xiao Chi et.al. | 2409.16854 | null |
2024-09-25 | Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability | Carlos E. Luis et.al. | 2409.16824 | null |
2024-09-25 | PeerArg: Argumentative Peer Review with LLMs | Purin Sukpanichnant et.al. | 2409.16813 | null |
2024-09-25 | Spacewalker: Traversing Representation Spaces for Fast Interactive Exploration and Annotation of Unstructured Data | Lukas Heine et.al. | 2409.16793 | link |
2024-09-25 | MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu et.al. | 2409.16686 | null |
2024-09-25 | Skyeyes: Ground Roaming using Aerial View Images | Zhiyuan Gao et.al. | 2409.16685 | null |
2024-09-25 | An Integrated Machine Learning and Deep Learning Framework for Credit Card Approval Prediction | Kejian Tong et.al. | 2409.16676 | null |
2024-09-25 | Stochastic Shortest Path Problem with Failure Probability | Ritsusamuel Otsubo et.al. | 2409.16672 | null |
2024-09-26 | Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models | Alexander Popov et.al. | 2409.16663 | null |
2024-09-25 | Examining the Rat in the Tunnel: Interpretable Multi-Label Classification of Tor-based Malware | Ishan Karunanayake et.al. | 2409.16639 | null |
2024-09-25 | Optimized Monte Carlo Tree Search for Enhanced Decision Making in the FrozenLake Environment | Esteban Aldana Guerra et.al. | 2409.16620 | null |
2024-09-25 | CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models | Xin Jing et.al. | 2409.16619 | null |
2024-09-25 | EMIT- Event-Based Masked Auto Encoding for Irregular Time Series | Hrishikesh Patel et.al. | 2409.16554 | link |
2024-09-18 | Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | Arslan Chaudhry et.al. | 2409.12180 | null |
2024-09-18 | Publishing Instincts: An Exploration-Exploitation Framework for Studying Academic Publishing Behavior and “Home Venues” | Teddy Lazebnik et.al. | 2409.12158 | null |
2024-09-18 | Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference | Najmeh Forouzandehmehr et.al. | 2409.12150 | null |
2024-09-18 | Pareto Data Framework: Steps Towards Resource-Efficient Decision Making Using Minimum Viable Data (MVD) | Tashfain Ahmed et.al. | 2409.12112 | null |
2024-09-18 | Unveiling the Black Box: Independent Functional Module Evaluation for Bird’s-Eye-View Perception Model | Ludan Zhang et.al. | 2409.11969 | null |
2024-09-18 | Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics | Malte Schneevogt et.al. | 2409.11820 | null |
2024-09-18 | Conformal Prediction for Manifold-based Source Localization with Gaussian Processes | Vadim Rozenfeld et.al. | 2409.11804 | null |
2024-09-18 | Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic | Zhe Yu et.al. | 2409.11780 | null |
2024-09-18 | RopeBEV: A Multi-Camera Roadside Perception Network in Bird’s-Eye-View | Jinrang Jia et.al. | 2409.11706 | null |
2024-09-18 | From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving | Xu Han et.al. | 2409.11694 | null |
2024-09-18 | Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach | Abeer Alshehri et.al. | 2409.11675 | null |
2024-09-18 | Blockchain-Enabled IoV: Secure Communication and Trustworthy Decision-Making | Jingyi Sun et.al. | 2409.11621 | null |
2024-09-17 | Exploring Dimensions of Expertise in AR-Guided Psychomotor Tasks | Steven Yoo et.al. | 2409.11599 | null |
2024-09-17 | Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning | Qingqing Wang et.al. | 2409.11576 | null |
2024-09-17 | Balancing Optimality and Diversity: Human-Centered Decision Making through Generative Curation | Michael Lingzhi Li et.al. | 2409.11535 | null |
2024-09-17 | Leveraging AI-Generated Emotional Self-Voice to Nudge People towards their Ideal Selves | Cathy Mengying Fang et.al. | 2409.11531 | null |
2024-09-17 | Partially Observable Contextual Bandits with Linear Payoffs | Sihan Zeng et.al. | 2409.11521 | null |
2024-09-17 | Beyond Algorithmic Fairness: A Guide to Develop and Deploy Ethical AI-Enabled Decision-Support Tools | Rosemarie Santa Gonzalez et.al. | 2409.11489 | null |
2024-09-24 | Consensus decision making on a complete graph: complex behaviour from simple assumptions | P. Sarkanych et.al. | 2409.11475 | null |
2024-09-17 | UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta et.al. | 2409.11403 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem | M. Esat Kalfaoglu et.al. | 2409.11325 | null |
2024-09-17 | Navigating Process Mining: A Case study using pm4py | Ali Jlidi et.al. | 2409.11294 | null |
2024-09-17 | Cost-informed dimensionality reduction for structural digital twin technologies | Aidan J. Hughes et.al. | 2409.11236 | null |
2024-09-18 | High-Order Evolving Graphs for Enhanced Representation of Traffic Dynamics | Aditya Humnabadkar et.al. | 2409.11206 | null |
2024-09-17 | Optimization of Rulebooks via Asymptotically Representing Lexicographic Hierarchies for Autonomous Vehicles | Matteo Penlington et.al. | 2409.11199 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | null |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images | Jieyun Bai et.al. | 2409.10980 | null |
2024-09-17 | Beyond Rationality: Unveiling the Role of Animal Spirits and Inflation Extrapolation in Central Bank Communication of the US | Arpan Chakraborty et.al. | 2409.10938 | null |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-17 | DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation | Shiwei Feng et.al. | 2409.10832 | null |
2024-09-16 | NaviQAte: Functionality-Guided Web Application Navigation | Mobina Shahbandeh et.al. | 2409.10741 | null |
2024-09-16 | Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making | Som Sagar et.al. | 2409.10733 | link |
2024-09-16 | Aligning Judgment Using Task Context and Explanations to Improve Human-Recommender System Performance | Divya Srivastava et.al. | 2409.10717 | null |
2024-09-16 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-16 | Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning | Daniel Flögel et.al. | 2409.10655 | null |
2024-09-16 | Development of Data Evaluation Benchmark for Data Wrangling Recommendation System | Yuqing Wang et.al. | 2409.10635 | null |
2024-09-16 | MusicLIME: Explainable Multimodal Music Understanding | Theodoros Sotirou et.al. | 2409.10496 | link |
2024-09-16 | Radar Teach and Repeat: Architecture and Initial Field Testing | Xinyuan Qiao et.al. | 2409.10491 | link |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-16 | Quantile Fourier regressions for decision making under uncertainty | Arash Khojaste et.al. | 2409.10455 | null |
2024-09-16 | Stretchable Arduinos embedded in soft robots | Stephanie J. Woodman et.al. | 2409.10333 | link |
2024-09-16 | DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving | Songning Lai et.al. | 2409.10330 | null |
2024-09-16 | InfoDisent: Explainability of Image Classification Models by Information Disentanglement | Łukasz Struski et.al. | 2409.10329 | null |
2024-09-16 | Fairness, not Emotion, Drives Socioeconomic Decision Making | Rudra Mukhopadhyay et.al. | 2409.10322 | null |
2024-09-16 | SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation | Benjamin Stoler et.al. | 2409.10320 | link |
2024-09-16 | A Note on Piecewise Affine Decision Rules for Robust, Stochastic, and Data-Driven Optimization | Simon Thomä et.al. | 2409.10295 | link |
2024-09-16 | ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | Jiahao Yuan et.al. | 2409.10289 | link |
2024-09-16 | Questioning AI: Promoting Decision-Making Autonomy Through Reflection | Simon WS Fischer et.al. | 2409.10250 | null |
2024-09-16 | Robust Bird’s Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-16 | LLMs for clinical risk prediction | Mohamed Rezk et.al. | 2409.10191 | null |
2024-09-16 | ExelMap: Explainable Element-based HD-Map Change Detection and Update | Lena Wild et.al. | 2409.10178 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | AALF: Almost Always Linear Forecasting | Matthias Jakobs et.al. | 2409.10142 | link |
2024-09-16 | Advancing Towards a Marine Digital Twin Platform: Modeling the Mar Menor Coastal Lagoon Ecosystem in the South Western Mediterranean | Yu Ye et.al. | 2409.10134 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-16 | LeGEND: A Top-Down Approach to Scenario Generation of Autonomous Driving Systems Assisted by Large Language Models | Shuncheng Tang et.al. | 2409.10066 | link |
2024-09-13 | Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis | Xiaoyu Chu et.al. | 2409.08949 | link |
2024-09-13 | Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Xuchen Li et.al. | 2409.08887 | null |
2024-09-13 | Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Jialu Tang et.al. | 2409.08788 | null |
2024-09-13 | Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Yunus Bilge Kurt et.al. | 2409.08769 | link |
2024-09-13 | GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction | Siyu Li et.al. | 2409.08688 | link |
2024-09-13 | xTED: Cross-Domain Policy Adaptation via Diffusion-Based Trajectory Editing | Haoyi Niu et.al. | 2409.08687 | link |
2024-09-13 | Agile Decision-Making and Safety-Critical Motion Planning for Emergency Autonomous Vehicles | Yiming Shu et.al. | 2409.08665 | null |
2024-09-13 | Optimizing Item-based Marketing Promotion Efficiency in C2C Marketplace with Dynamic Sequential Coupon Allocation Framework | Jie Yang et.al. | 2409.08609 | null |
2024-09-13 | Common revenue allocation in DMUs with two stages based on DEA cross-efficiency and cooperative game | Xinyu Wang et.al. | 2409.08502 | null |
2024-09-12 | An Experimental Study of Competitive Market Behavior Through LLMs | Jingru Jia et.al. | 2409.08357 | null |
2024-09-13 | The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting | Ashwini Gundappa et.al. | 2409.08253 | null |
2024-09-12 | How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games | Gokce Dayanikli et.al. | 2409.08235 | null |
2024-09-12 | Model Ensemble for Brain Tumor Segmentation in Magnetic Resonance Imaging | Daniel Capellán-Martín et.al. | 2409.08232 | link |
2024-09-12 | Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Xiang Huo et.al. | 2409.08132 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-12 | Value of Communication: Data-Driven Topology Optimization for Distributed Linear Cyber-Physical Systems | Michael Nestor et.al. | 2409.08116 | null |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | null |
2024-09-12 | LED: Light Enhanced Depth Estimation at Night | Simon de Moreau et.al. | 2409.08031 | link |
2024-09-12 | Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols | Charlie Griffin et.al. | 2409.07985 | link |
2024-09-12 | WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Jingwen Tong et.al. | 2409.07964 | link |
2024-09-12 | On an optimization model for firefighting helicopter planning | Marta Rodríguez Barreiro et.al. | 2409.07937 | null |
2024-09-12 | Conformal Distributed Remote Inference in Sensor Networks Under Reliability and Communication Constraints | Meiyi Zhu et.al. | 2409.07902 | null |
2024-09-12 | Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes | Ming Li et.al. | 2409.07843 | null |
2024-09-12 | ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable | Yuan Yin et.al. | 2409.07830 | link |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-12 | ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation | Shiwei Feng et.al. | 2409.07774 | link |
2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
2024-09-12 | Attack End-to-End Autonomous Driving through Module-Wise Noise | Lu Wang et.al. | 2409.07706 | null |
2024-09-11 | Gaussian Process Upper Confidence Bounds in Distributed Point Target Tracking over Wireless Sensor Networks | Xingchi Liu et.al. | 2409.07652 | null |
2024-09-11 | A Survey of Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Guiliang Liu et.al. | 2409.07569 | null |
2024-09-11 | Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation | Luo Ji et.al. | 2409.07416 | null |
2024-09-11 | Dynamic Bayesian Networks, Elicitation and Data Embedding for Secure Environments | Kieran Drury et.al. | 2409.07389 | null |
2024-09-11 | Multi-source Stable Variable Importance Measure via Adversarial Machine Learning | Zitao Wang et.al. | 2409.07380 | null |
2024-09-11 | Policy consequences of the new neuroeconomic framework | A. David Redish et.al. | 2409.07373 | null |
2024-09-11 | The Role of Explainable AI in Revolutionizing Human Health Monitoring | Abdullah Alharthi et.al. | 2409.07347 | null |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving | Tianyuan Zhang et.al. | 2409.07321 | null |
2024-09-11 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-11 | Behavioral Cloning Models Reality Check for Autonomous Driving | Mustafa Yildirim et.al. | 2409.07218 | null |
2024-09-11 | Quantum Monte Carlo methods for Newsvendor problem with Multiple Unreliable Suppliers | Monit Sharma et.al. | 2409.07183 | null |
2024-09-11 | Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations | Gaia Romana De Paolis et.al. | 2409.07100 | null |
2024-09-11 | A Novel Voting System for Medical Catalogues in National Health Insurance | Xingyuan Liang et.al. | 2409.07057 | null |
2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702 | null |
2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
2024-09-10 | Designing Resource Allocation Tools to Promote Fair Allocation: Do Visualization and Information Framing Matter? | Arnav Verma et.al. | 2409.06688 | null |
2024-09-10 | Memory and Personality in Ideological Polarization: The Politico-physics of Mnemomatter | Shengkai Li et.al. | 2409.06660 | null |
2024-09-10 | Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception | Xiang Zhang et.al. | 2409.06584 | null |
2024-09-10 | MAGDA: Multi-agent guideline-driven diagnostic assistance | David Bani-Harouni et.al. | 2409.06351 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | Towards Robust Uncertainty-Aware Incomplete Multi-View Classification | Mulin Chen et.al. | 2409.06270 | null |
2024-09-10 | UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised | Tao Ni et.al. | 2409.06197 | null |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-10 | HierLLM: Hierarchical Large Language Model for Question Recommendation | Yuxuan Liu et.al. | 2409.06177 | null |
2024-09-09 | Coarse Descriptions and Cautious Preferences | Evan Piermont et.al. | 2409.06054 | null |
2024-09-09 | Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting | Gianmarco Genalti et.al. | 2409.05980 | null |
2024-09-09 | Predicting Electricity Consumption with Random Walks on Gaussian Processes | Chloé Hashimoto-Cullen et.al. | 2409.05934 | null |
2024-09-09 | A Framework for Evaluating PM2.5 Forecasts from the Perspective of Individual Decision Making | Renato Berlinghieri et.al. | 2409.05866 | link |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-09 | An Introduction to Quantum Reinforcement Learning (QRL) | Samuel Yen-Chi Chen et.al. | 2409.05846 | null |
2024-09-09 | Vision-Driven 2D Supervised Fine-Tuning Framework for Bird’s Eye View Perception | Lei He et.al. | 2409.05834 | null |
2024-09-09 | Limits on the computational expressivity of non-equilibrium biophysical processes | Carlos Floyd et.al. | 2409.05827 | null |
2024-09-09 | Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors | Jiaqi Liu et.al. | 2409.05712 | null |
2024-09-09 | Quantum Volunteer’s Dilemma | Dax Enshan Koh et.al. | 2409.05708 | null |
2024-09-09 | Replay Consolidation with Label Propagation for Continual Object Detection | Riccardo De Monte et.al. | 2409.05650 | null |
2024-09-09 | Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Zhao Shan et.al. | 2409.05622 | null |
2024-09-09 | StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation | Muraleekrishna Gopinathan et.al. | 2409.05593 | null |
2024-09-09 | Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning | Arda Sarp Yenicesu et.al. | 2409.05586 | link |
2024-09-10 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-09 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | link |
2024-09-09 | Common or specific source, features or scores; it is all a matter of information | Aafko Boonstra et.al. | 2409.05403 | null |
2024-09-09 | Diagnostic Reasoning in Natural Language: Computational Model and Application | Nils Dycke et.al. | 2409.05367 | null |
2024-09-09 | Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping | Shuang Zeng et.al. | 2409.05352 | null |
2024-09-09 | ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions | Furqan Ahmed Shaik et.al. | 2409.05327 | null |
2024-09-09 | Developing Trajectory Planning with Behavioral Cloning and Proximal Policy Optimization for Path-Tracking and Static Obstacle Nudging | Mingyan Zhou et.al. | 2409.05289 | link |
2024-09-08 | Sliding-Window Thompson Sampling for Non-Stationary Settings | Marco Fiandri et.al. | 2409.05181 | null |
2024-09-08 | Enhancing the Performance of Multi-Vehicle Navigation in Unstructured Environments using Hard Sample Mining | Yining Ma et.al. | 2409.05119 | link |
2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | null |
2024-09-06 | Evaluating Fairness in Transaction Fraud Models: Fairness Metrics, Bias Audits, and Challenges | Parameswaran Kamalaruban et.al. | 2409.04373 | null |
2024-09-06 | A naive aggregation algorithm for improving generalization in a class of learning problems | Getachew K Befekadu et.al. | 2409.04352 | null |
2024-09-06 | Active learning for regression in engineering populations: A risk-informed approach | Daniel R. Clarkson et.al. | 2409.04328 | null |
2024-09-06 | Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields | Felix Herrmann et.al. | 2409.04306 | null |
2024-09-06 | SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation Algorithms | Inmo Jang et.al. | 2409.04230 | link |
2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | null |
2024-09-06 | Algorithmic Collusion Without Threats | Eshwar Ram Arunachaleswaran et.al. | 2409.03956 | link |
2024-09-05 | DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment | Kangtong Mo et.al. | 2409.03930 | null |
2024-09-05 | Understanding Fairness Metrics in Recommender Systems: A Healthcare Perspective | Veronica Kecki et.al. | 2409.03893 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization | Federico Berto et.al. | 2409.03811 | link |
2024-09-05 | A Deep Generative Learning Approach for Two-stage Adaptive Robust Optimization | Aron Brenner et.al. | 2409.03731 | null |
2024-09-05 | A Fused Large Language Model for Predicting Startup Success | Abdurahman Maarouf et.al. | 2409.03668 | null |
2024-09-05 | Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift | Fabian Diet et.al. | 2409.03543 | null |
2024-09-05 | Distributionally Robust Optimisation with Bayesian Ambiguity Sets | Charita Dellaporta et.al. | 2409.03492 | null |
2024-09-05 | Neural HD Map Generation from Multiple Vectorized Tiles Locally Produced by Autonomous Vehicles | Miao Fan et.al. | 2409.03445 | null |
2024-09-05 | F3T: A soft tactile unit with 3D force and temperature mathematical decoupling ability for robots | Xiong Yang et.al. | 2409.03421 | null |
2024-09-06 | CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks | Yongxin Deng et.al. | 2409.03381 | null |
2024-09-05 | YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving | Jingyu Zhang et.al. | 2409.03320 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration | Jeremy Qin et.al. | 2409.03225 | link |
2024-09-05 | InfraLib: Enabling Reinforcement Learning and Decision Making for Large Scale Infrastructure Management | Pranay Thangeda et.al. | 2409.03167 | null |
2024-09-05 | Autonomous Drifting Based on Maximal Safety Probability Learning | Hikaru Hoshino et.al. | 2409.03160 | link |
2024-09-05 | Non-stationary and Sparsely-correlated Multi-output Gaussian Process with Spike-and-Slab Prior | Wang Xinming et.al. | 2409.03149 | null |
2024-09-04 | Developing, Analyzing, and Evaluating Self-Drive Algorithms Using Drive-by-Wire Electric Vehicles | Beñat Froemming-Aldanondo et.al. | 2409.03114 | link |
2024-09-04 | Explainable AI for computational pathology identifies model limitations and tissue biomarkers | Jakub R. Kaczmarzyk et.al. | 2409.03080 | link |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-04 | Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Moein Shahiki Tash et.al. | 2409.02836 | null |
2024-09-04 | Towards Edge-Based Data Lake Architecture for Intelligent Transportation System | Danilo Fernandes et.al. | 2409.02808 | null |
2024-09-04 | Beyond Nash Equilibrium: Achieving Bayesian Perfect Equilibrium with Belief Update Fictitious Play | Qi Ju et.al. | 2409.02706 | link |
2024-09-04 | Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Constantin Waubert de Puiseau et.al. | 2409.02697 | null |
2024-09-04 | The Role of Artificial Intelligence and Machine Learning in Software Testing | Ahmed Ramadan et.al. | 2409.02693 | null |
2024-09-04 | Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch et.al. | 2409.02676 | null |
2024-09-04 | PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation | Aneta Pawelec et.al. | 2409.02617 | null |
2024-09-04 | AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation | Jinfeng Xu et.al. | 2409.02580 | link |
2024-09-05 | Assembling the Puzzle: Exploring Collaboration and Data Sensemaking in Nursing Practices for Remote Patient Monitoring | Mihnea Calota et.al. | 2409.02579 | null |
2024-09-04 | How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations | Florian Blume et.al. | 2409.02566 | null |
2024-09-04 | Want a Ride? Attitudes Towards Autonomous Driving and Behavior in Autonomous Vehicles | Enrico Del Re et.al. | 2409.02556 | null |
2024-09-05 | A Sequential Decision-Making Model for Perimeter Identification | Ayal Taitler et.al. | 2409.02549 | null |
2024-09-04 | A Joint Time and Energy-Efficient Federated Learning-based Computation Offloading Method for Mobile Edge Computing | Anwesha Mukherjee et.al. | 2409.02548 | null |
2024-09-04 | Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Zhiyuan Li et.al. | 2409.02522 | null |
2024-09-04 | TLD: A Vehicle Tail Light signal Dataset and Benchmark | Jinhao Chai et.al. | 2409.02508 | null |
2024-09-04 | eRSS-RAMP: A Rule-Adherence Motion Planner Based on Extended Responsibility-Sensitive Safety for Autonomous Driving | Pengfei Lin et.al. | 2409.02503 | null |
2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | null |
2024-09-04 | TASAR: Transferable Attack on Skeletal Action Recognition | Yunfeng Diao et.al. | 2409.02483 | null |
2024-08-30 | Dual-criterion Dose Finding Designs Based on Dose-Limiting Toxicity and Tolerability | Yunlong Yang et.al. | 2408.17392 | null |
2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | null |
2024-08-30 | An Integer Linear Programming Model for Earth Observation Missions | Vincenzo Basco et.al. | 2408.17288 | null |
2024-08-30 | How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception | Mert Keser et.al. | 2408.17222 | null |
2024-08-30 | NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar | Runwei Guan et.al. | 2408.17207 | null |
2024-08-30 | Modelling Growth, Remodelling and Damage of a Thick-walled Fibre-reinforced Artery with Active Response: Application to Cerebral Vasospasm and Treatment | Giulia Pederzani et.al. | 2408.17206 | null |
2024-08-30 | Towards Symbolic XAI – Explanation Through Human Understandable Logical Relationships Between Features | Thomas Schnake et.al. | 2408.17198 | null |
2024-09-03 | Controllable Edge-Type-Specific Interpretation in Multi-Relational Graph Neural Networks for Drug Response Prediction | Xiaodi Li et.al. | 2408.17129 | link |
2024-08-30 | A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation | Camila Martinez Parra et.al. | 2408.17113 | null |
2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | link |
2024-08-30 | Reasoning AI Performance Degradation in 6G Networks with Large Language Models | Liming Huang et.al. | 2408.17097 | null |
2024-08-30 | PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics | Zhengru Fang et.al. | 2408.17047 | link |
2024-08-30 | Tonal Cognition in Sonification: Exploring the Needs of Practitioners in Sonic Interaction Design | Minsik Choi et.al. | 2408.17012 | null |
2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | link |
2024-08-29 | Enhancing Autism Spectrum Disorder Early Detection with the Parent-Child Dyads Block-Play Protocol and an Attention-enhanced GCN-xLSTM Hybrid Deep Learning Framework | Xiang Li et.al. | 2408.16924 | null |
2024-08-29 | Auricular Vagus Nerve Stimulation for Enhancing Remote Pilot Training and Operations | William J. Tyler et.al. | 2408.16755 | null |
2024-08-29 | Enhanced forecasting of stock prices based on variational mode decomposition, PatchTST, and adaptive scale-weighted layer | Xiaorui Xue et.al. | 2408.16707 | null |
2024-08-29 | RoboMNIST: A Multimodal Dataset for Multi-Robot Activity Recognition Using WiFi Sensing, Video, and Audio | Kian Behzad et.al. | 2408.16703 | link |
2024-08-29 | A Catalog of Fairness-Aware Practices in Machine Learning Engineering | Gianmario Voria et.al. | 2408.16683 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-29 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-08-29 | CooTest: An Automated Testing Approach for V2X Communication Systems | An Guo et.al. | 2408.16470 | null |
2024-08-29 | Consensus Planning with Primal, Dual, and Proximal Agents | Alvaro Maggiar et.al. | 2408.16462 | null |
2024-08-29 | BEVal: A Cross-dataset Evaluation Study of BEV Segmentation Models for Autononomous Driving | Manuel Alejandro Diaz-Zapata et.al. | 2408.16322 | link |
2024-08-29 | Passenger hazard perception based on EEG signals for highly automated driving vehicles | Ashton Yu Xuan Tan et.al. | 2408.16315 | null |
2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | link |
2024-08-28 | Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models | Roderick Seow et.al. | 2408.16147 | null |
2024-08-28 | EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao et.al. | 2408.16090 | link |
2024-08-28 | Logic-Enhanced Language Model Agents for Trustworthy Social Simulations | Agnieszka Mensfelt et.al. | 2408.16081 | link |
2024-08-28 | WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration | Yao Zhang et.al. | 2408.15978 | null |
2024-08-28 | SLAM2REF: Advancing Long-Term Mapping with 3D LiDAR and Reference Map Integration for Precise 6-DoF Trajectory Estimation and Map Extension | Miguel Arturo Vega Torres et.al. | 2408.15948 | link |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | FlowAct: A Proactive Multimodal Human-robot Interaction System with Continuous Flow of Perception and Modular Action Sub-systems | Timothée Dhaussy et.al. | 2408.15864 | null |
2024-08-28 | Network transferability of adversarial patches in real-time object detection | Jens Bayer et.al. | 2408.15833 | link |
2024-08-28 | Emulating Brain-like Rapid Learning in Neuromorphic Edge Computing | Kenneth Stewart et.al. | 2408.15800 | link |
2024-08-28 | LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models | Jiayi Gui et.al. | 2408.15778 | link |
2024-08-28 | Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | Zherong Zhang et.al. | 2408.15750 | null |
2024-08-28 | Comparing diversity, negativity, and stereotypes in Chinese-language AI technologies: a case study on Baidu, Ernie and Qwen | Geng Liu et.al. | 2408.15696 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-08-28 | Correlation-Adjusted Simultaneous Testing for Ultra High-dimensional Grouped Data | Iris Ivy Gauran et.al. | 2408.15623 | null |
2024-08-28 | Latent Relationship Mining of Glaucoma Biomarkers: a TRI-LSTM based Deep Learning | Cheng Huang et.al. | 2408.15555 | null |
2024-08-28 | Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems | Farzaneh Dehghani et.al. | 2408.15550 | null |
2024-08-28 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | link |
2024-08-28 | MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning | Yifu Yuan et.al. | 2408.15501 | null |
2024-08-28 | PersonalizedUS: Interpretable Breast Cancer Risk Assessment with Local Coverage Uncertainty Quantification | Alek Fröhlich et.al. | 2408.15458 | null |
2024-08-27 | Understanding GNNs for Boolean Satisfiability through Approximation Algorithms | Jan Hůla et.al. | 2408.15418 | null |
2024-08-27 | Panoptic Perception for Autonomous Driving: A Survey | Yunge Li et.al. | 2408.15388 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | Using LLMs for Explaining Sets of Counterfactual Examples to Final Users | Arturo Fredes et.al. | 2408.15133 | link |
2024-08-27 | T-FAKE: Synthesizing Thermal Images for Facial Landmarking | Philipp Flotho et.al. | 2408.15127 | link |
2024-08-27 | Subgroup Analysis via Model-based Rule Forest | I-Ling Cheng et.al. | 2408.15057 | null |
2024-08-27 | Cross-subject Brain Functional Connectivity Analysis for Multi-task Cognitive State Evaluation | Jun Chen et.al. | 2408.15018 | null |
2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | link |
2024-08-27 | Unsupervised-to-Online Reinforcement Learning | Junsu Kim et.al. | 2408.14785 | null |
2024-08-27 | Optimization model for electric aircraft tow tractors considering operator coalition | Dan-Wen Bao et.al. | 2408.14748 | null |
2024-08-26 | Artificial Intelligence in Landscape Architecture: A Survey | Yue Xing et.al. | 2408.14700 | null |
2024-08-26 | Enhancing Neural Network Interpretability Through Conductance-Based Information Plane Analysis | Jaouad Dabounou et.al. | 2408.14681 | null |
2024-08-26 | Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web | Kate Lin et.al. | 2408.14636 | link |
2024-08-26 | EVINCE: Optimizing Adversarial LLM Dialogues via Conditional Statistics and Information Theory | Edward Y. Chang et.al. | 2408.14575 | null |
2024-08-26 | Aiding Humans in Financial Fraud Decision Making: Toward an XAI-Visualization Framework | Angelos Chatzimparmpas et.al. | 2408.14552 | null |
2024-08-26 | Taxicab distance based best-worst method for multi-criteria decision-making: An analytical approach | Harshit Ratandhara et.al. | 2408.14452 | null |
2024-08-26 | Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving | Yu Yang et.al. | 2408.14197 | null |
2024-08-26 | EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection | Pengyu Li et.al. | 2408.14189 | null |
2024-08-26 | DynamicRouteGPT: A Real-Time Multi-Vehicle Dynamic Navigation Framework Based on Large Language Models | Ziai Zhou et.al. | 2408.14185 | null |
2024-08-26 | Dynamic Pricing for Electric Vehicle Charging | Arun Kumar Kalakanti et.al. | 2408.14169 | null |
2024-08-26 | Crowd-Calibrator: Can Annotator Disagreement Inform Calibration in Subjective Tasks? | Urja Khurana et.al. | 2408.14141 | null |
2024-08-26 | Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search | Shuo Yang et.al. | 2408.14000 | null |
2024-08-26 | FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation | Daixun Li et.al. | 2408.13980 | null |
2024-08-26 | Speeding Ticket: Unveiling the Energy and Emission Burden of AI-Accelerated Distributed and Decentralized Power Dispatch Models | Meiyi Li et.al. | 2408.13968 | null |
2024-08-25 | Optimizing Luxury Vehicle Dealership Networks: A Graph Neural Network Approach to Site Selection | Luca Silvano Carocci et.al. | 2408.13961 | link |
2024-08-27 | Time Series Analysis for Education: Methods, Applications, and Future Directions | Shengzhong Mao et.al. | 2408.13960 | link |
2024-08-25 | Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2408.13950 | null |
2024-08-25 | CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction | Guangya Wan et.al. | 2408.13940 | null |
2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | null |
2024-08-25 | Making Large Language Models Better Planners with Reasoning-Decision Alignment | Zhijian Huang et.al. | 2408.13890 | null |
2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | null |
2024-08-25 | Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Duy Khoa Pham et.al. | 2408.13808 | null |
2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
2024-08-25 | CV-MOS: A Cross-View Model for Motion Segmentation | Xiaoyu Tang et.al. | 2408.13790 | link |
2024-08-25 | Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion | Xu Zhang et.al. | 2408.13744 | link |
2024-08-23 | Temporal Fairness in Decision Making Problems | Manuel R. Torres et.al. | 2408.13208 | null |
2024-08-23 | Causal machine learning for sustainable agroecosystems | Vasileios Sitokonstantinou et.al. | 2408.13155 | null |
2024-08-23 | Interpretable breast cancer classification using CNNs on mammographic images | Ann-Kristin Balve et.al. | 2408.13154 | link |
2024-08-23 | Analysis of child development facts and myths using text mining techniques and classification models | Mehedi Tajrian et.al. | 2408.13091 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Fair Pairs: Fairness-Aware Ranking Recovery from Pairwise Comparisons | Georg Ahnert et.al. | 2408.13034 | link |
2024-08-23 | Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Adnan Haider et.al. | 2408.13008 | null |
2024-08-23 | Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates | Hui Wei et.al. | 2408.13006 | link |
2024-08-23 | MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries | Mohamed Elgaar et.al. | 2408.12980 | link |
2024-08-23 | iSee: Advancing Multi-Shot Explainable AI Using Case-based Recommendations | Anjana Wijekoon et.al. | 2408.12941 | null |
2024-08-23 | ml_edm package: a Python toolkit for Machine Learning based Early Decision Making | Aurélien Renault et.al. | 2408.12925 | link |
2024-08-23 | Structural Representation Learning and Disentanglement for Evidential Chinese Patent Approval Prediction | Jinzhi Shan et.al. | 2408.12852 | null |
2024-08-23 | Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence | Purushothaman Natarajan et.al. | 2408.12837 | link |
2024-08-23 | Courteous MPC for Autonomous Driving with CBF-inspired Risk Assessment | Yanze Zhang et.al. | 2408.12822 | null |
2024-08-23 | VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models | Purushothaman Natarajan et.al. | 2408.12808 | link |
2024-08-23 | A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Shuo Yang et.al. | 2408.12805 | null |
2024-08-22 | Does Spatial Information Improve Influenza Forecasting? | Gabrielle Thivierge et.al. | 2408.12722 | link |
2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | null |
2024-08-22 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-22 | A Monte Carlo Tree Search approach to QAOA: finding a needle in the haystack | Andoni Agirre et.al. | 2408.12648 | null |
2024-08-22 | The Importance of Cognitive Biases in the Recommendation Ecosystem | Markus Schedl et.al. | 2408.12492 | null |
2024-08-22 | Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition | Bozheng Li et.al. | 2408.12475 | null |
2024-08-22 | Multi-Knowledge Fusion Network for Time Series Representation Learning | Sagar Srinivas Sakhinana et.al. | 2408.12423 | null |
2024-08-22 | Advancing Strategic Planning and Dynamic Control of Complex Projects | L. G. Teuber et.al. | 2408.12422 | null |
2024-08-22 | Multi-Source Knowledge-Based Hybrid Neural Framework for Time Series Representation Learning | Sagar Srinivas Sakhinana et.al. | 2408.12409 | null |
2024-08-22 | Enhancing Uncertainty Communication in Time Series Predictions: Insights and Recommendations | Apoorva Karagappa et.al. | 2408.12365 | null |
2024-08-22 | Graph Retrieval Augmented Trustworthiness Reasoning | Ying Zhu et.al. | 2408.12333 | link |
2024-08-22 | Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection | Tamás Matuszka et.al. | 2408.12322 | null |
2024-08-22 | A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model | Shuo Yang et.al. | 2408.12190 | null |
2024-08-22 | A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems | Shuo Yang et.al. | 2408.12187 | null |
2024-08-22 | DRExplainer: Quantifiable Interpretability in Drug Response Prediction with Directed Graph Convolutional Network | Haoyuan Shi et.al. | 2408.12139 | link |
2024-08-22 | Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Woo Kyung Kim et.al. | 2408.12110 | null |
2024-08-22 | Enhancing Sampling Protocol for Robust Point Cloud Classification | Chongshou Li et.al. | 2408.12062 | null |
2024-08-21 | Reasoning and Tools for Human-Level Forecasting | Elvis Hsieh et.al. | 2408.12036 | null |
2024-08-21 | Let Community Rules Be Reflected in Online Content Moderation | Wangjiaxuan Xin et.al. | 2408.12035 | null |
2024-08-21 | Sentiment and Emotion-aware Multi-criteria Fuzzy Group Decision Making System | Adilet Yerkin et.al. | 2408.11976 | null |
2024-08-21 | Valuing an Engagement Surface using a Large Scale Dynamic Causal Model | Abhimanyu Mukerji et.al. | 2408.11967 | null |
2024-08-21 | Decoding SEC Actions: Enforcement Trends through Analyzing Blockchain litigation using LLM-based Thematic Factor Mapping | Junliang Luo et.al. | 2408.11961 | null |
2024-08-21 | Decoding Pedestrian Stress on Urban Streets using Electrodermal Activity Monitoring in Virtual Immersive Reality | Mohsen Nazemi et.al. | 2408.11769 | null |
2024-08-21 | Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction | CJ Finnegan et.al. | 2408.11740 | null |
2024-08-21 | Explainable Deep Learning Framework for Human Activity Recognition | Yiran Huang et.al. | 2408.11552 | null |
2024-08-21 | MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering | Yonglin Tian et.al. | 2408.11464 | null |
2024-08-21 | Probabilistic Medical Predictions of Large Language Models | Bowen Gu et.al. | 2408.11316 | null |
2024-08-21 | Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models | Sepehr Kamahi et.al. | 2408.11252 | link |
2024-08-20 | Optimal Guarantees for Online Selection Over Time | Sebastian Perez-Salazar et.al. | 2408.11224 | null |
2024-08-20 | Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images | Josh Goldman et.al. | 2408.11160 | null |
2024-08-20 | Experimentation, deployment and monitoring Machine Learning models: Approaches for applying MLOps | Diego Nogare et.al. | 2408.11112 | null |
2024-08-20 | ISLES’24: Improving final infarct prediction in ischemic stroke using multimodal imaging and clinical data | Ezequiel de la Rosa et.al. | 2408.10966 | null |
2024-08-20 | Conformalized Interval Arithmetic with Symmetric Calibration | Rui Luo et.al. | 2408.10939 | link |
2024-08-20 | Enhancing End-to-End Autonomous Driving Systems Through Synchronized Human Behavior Data | Yiqun Duan et.al. | 2408.10908 | null |
2024-08-20 | Leveraging LLMs for the Quality Assurance of Software Requirements | Sebastian Lubos et.al. | 2408.10886 | null |
2024-08-20 | Open 3D World in Autonomous Driving | Xinlong Cheng et.al. | 2408.10880 | null |
2024-08-20 | Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network | Kristoffer Christensen et.al. | 2408.10783 | null |
2024-08-20 | Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model | Aliza Subedi et.al. | 2408.10733 | null |
2024-08-20 | Towards reliable real-time trajectory optimization | Fatemeh Rastgar et.al. | 2408.10731 | null |
2024-08-20 | On NVD Users’ Attitudes, Experiences, Hopes and Hurdles | Julia Wunder et.al. | 2408.10695 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series | Udo Schlegel et.al. | 2408.10628 | null |
2024-08-20 | Safety Metric Aware Trajectory Repairing for Automated Driving | Kailin Tong et.al. | 2408.10622 | null |
2024-08-20 | MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation | Jintao Cheng et.al. | 2408.10602 | null |
2024-08-20 | Constrained Behavior Cloning for Robotic Learning | Wensheng Liang et.al. | 2408.10568 | null |
2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | null |
2024-08-20 | Approximate Estimation of High-dimension Execution Skill for Dynamic Agents in Continuous Domains | Delma Nieves-Rivera et.al. | 2408.10512 | null |
2024-08-20 | An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Xinlang Yue et.al. | 2408.10479 | null |
2024-08-19 | System-Level Design Space Exploration for High-Level Synthesis under End-to-End Latency Constraints | Yuchao Liao et.al. | 2408.10431 | null |
2024-08-19 | Real-Time Digital Twin Platform: A Case Study on Core Network Selection in Aeronautical Ad-Hoc Networks | Lal Verda Cakir et.al. | 2408.10409 | null |
2024-08-19 | Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy | Jialin Dong et.al. | 2408.10391 | null |
2024-08-19 | FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Zhengchao Huang et.al. | 2408.10072 | link |
2024-08-19 | Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models | Jiao Chen et.al. | 2408.09972 | null |
2024-08-19 | Control by Adding Players to Change or Maintain the Shapley-Shubik or the Penrose-Banzhaf Power Index in Weighted Voting Games Is Complete for NP^PP | Joanna Kaczmarek et.al. | 2408.09953 | null |
2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | link |
2024-08-19 | Automated Vehicle Driver Monitoring Dataset from Real-World Scenarios | Mohamed Sabry et.al. | 2408.09833 | null |
2024-08-19 | GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making | Arsham Gholamzadeh Khoee et.al. | 2408.09785 | null |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-19 | Optimal Replenishment Strategy for Satellite Constellation with Dual Supply Modes | Taehyun Sung et.al. | 2408.09696 | null |
2024-08-19 | Continuous-Time Dynamic Decision Making with Costly Information | Christoph Knochenhauer et.al. | 2408.09693 | null |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-19 | BLADE: Benchmarking Language Model Agents for Data-Driven Science | Ken Gu et.al. | 2408.09667 | link |
2024-08-19 | Contextual Bandits for Unbounded Context Distributions | Puning Zhao et.al. | 2408.09655 | null |
2024-08-18 | Experimental Design For Causal Inference Through An Optimization Lens | Jinglong Zhao et.al. | 2408.09607 | null |
2024-08-18 | Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication | Tengyang Gong et.al. | 2408.09602 | null |
2024-08-18 | Sample-Optimal Large-Scale Optimal Subset Selection | Zaile Li et.al. | 2408.09537 | null |
2024-08-18 | Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework | Chengkai Xu et.al. | 2408.09468 | null |
2024-08-18 | In-Memory Learning Automata Architecture using Y-Flash Cell | Omar Ghazal et.al. | 2408.09456 | null |
2024-08-18 | Retina-inspired Object Motion Segmentation | Victoria Clerico et.al. | 2408.09454 | null |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-18 | Value-Enriched Population Synthesis: Integrating a Motivational Layer | Alba Aguilera et.al. | 2408.09407 | link |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862 | link |
2024-08-16 | HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis | Zhi-Bo Liu et.al. | 2408.08847 | link |
2024-08-16 | Shapley Marginal Surplus for Strong Models | Daniel de Marchi et.al. | 2408.08845 | null |
2024-08-16 | Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models | Lin Zhao et.al. | 2408.08813 | null |
2024-08-16 | Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors | Felipe A. Csaszar et.al. | 2408.08811 | null |
2024-08-16 | PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors | Rongxuan Wang et.al. | 2408.08802 | null |
2024-08-16 | A Transparency Paradox? Investigating the Impact of Explanation Specificity and Autonomous Vehicle Perceptual Inaccuracies on Passengers | Daniel Omeiza et.al. | 2408.08785 | null |
2024-08-16 | Multi-task Learning Approach for Intracranial Hemorrhage Prognosis | Miriam Cobo et.al. | 2408.08784 | link |
2024-08-16 | Beyond Proportional Individual Guarantees for Binary Perpetual Voting | Yotam Gafni et.al. | 2408.08767 | null |
2024-08-16 | SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning | Sascha Marton et.al. | 2408.08761 | link |
2024-08-16 | SE-SGformer: A Self-Explainable Signed Graph Transformer for Link Sign Prediction | Lu Li et.al. | 2408.08754 | null |
2024-08-16 | Quantifying the Effectiveness of Student Organization Activities using Natural Language Processing | Lyberius Ennio F. Taruc et.al. | 2408.08694 | null |
2024-08-16 | Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm | Hongcheng Liu et.al. | 2408.08693 | link |
2024-08-16 | A survey on secure decentralized optimization and learning | Changxin Liu et.al. | 2408.08628 | null |
2024-08-16 | RPLUW/M: Enabling RPL on the Internet of Underwater Things | Mohammadhossein Homaei et.al. | 2408.08607 | null |
2024-08-16 | S-RAF: A Simulation-Based Robustness Assessment Framework for Responsible Autonomous Driving | Daniel Omeiza et.al. | 2408.08584 | link |
2024-08-16 | AgentSimulator: An Agent-based Approach for Data-driven Business Process Simulation | Lukas Kirchdorfer et.al. | 2408.08571 | link |
2024-08-16 | Multilevel Graph Reinforcement Learning for Consistent Cognitive Decision-making in Heterogeneous Mixed Autonomy | Xin Gao et.al. | 2408.08516 | null |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-16 | The Limitations of Model Retraining in the Face of Performativity | Anmol Kabra et.al. | 2408.08499 | null |
2024-08-15 | Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model | Jin Wang et.al. | 2408.08282 | null |
2024-08-15 | A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Zhihao Lin et.al. | 2408.08242 | null |
2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | null |
2024-08-15 | Confidence-weighted integration of human and machine judgments for superior decision-making | Felipe Yáñez et.al. | 2408.08083 | link |
2024-08-15 | A Survey on Integrated Sensing, Communication, and Computation | Dingzhu Wen et.al. | 2408.08074 | null |
2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | link |
2024-08-15 | Capturing the Complexity of Human Strategic Decision-Making with Machine Learning | Jian-Qiao Zhu et.al. | 2408.07865 | null |
2024-08-14 | From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction | Sadra Zargarzadeh et.al. | 2408.07806 | null |
2024-08-14 | NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri Retrieval | Giuseppe De Gregorio et.al. | 2408.07785 | null |
2024-08-14 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Nimeesha Chan et.al. | 2408.07773 | link |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | A Nested Graph Reinforcement Learning-based Decision-making Strategy for Eco-platooning | Xin Gao et.al. | 2408.07578 | null |
2024-08-14 | Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments | Seungjun Han et.al. | 2408.07531 | null |
2024-08-14 | LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Fan Yang et.al. | 2408.07422 | null |
2024-08-14 | The Restaurant Meal Delivery Problem with Ghost Kitchens | Gal Neria et.al. | 2408.07417 | null |
2024-08-14 | Risk Occupancy: A New and Efficient Paradigm through Vehicle-Road-Cloud Collaboration | Jiaxing Chen et.al. | 2408.07367 | null |
2024-08-14 | Towards Few-shot Self-explaining Graph Neural Networks | Jingyu Peng et.al. | 2408.07340 | link |
2024-08-14 | Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs | Minxia Chen et.al. | 2408.07305 | null |
2024-08-14 | NL2OR: Solve Complex Operations Research Problems Using Natural Language Inputs | Junxuan Li et.al. | 2408.07272 | null |
2024-08-13 | Neural embedding of beliefs reveals the role of relative dissonance in human decision-making | Byunghwee Lee et.al. | 2408.07237 | link |
2024-08-13 | Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents | Pranav Putta et.al. | 2408.07199 | null |
2024-08-13 | Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision | Tsung-Shan Yang et.al. | 2408.07018 | null |
2024-08-14 | Automatic Feature Recognition and Dimensional Attributes Extraction From CAD Models for Hybrid Additive-Subtractive Manufacturing | Muhammad Tayyab Khan et.al. | 2408.06891 | null |
2024-08-13 | Geotree of Geodetector: An Anatomy of Knowledge Diffusion of a Novel Statistic | Yuting Liang et.al. | 2408.06839 | null |
2024-08-13 | FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving | Yutao Zhu et.al. | 2408.06832 | null |
2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | null |
2024-08-13 | Adaptive Data Quality Scoring Operations Framework using Drift-Aware Mechanism for Industrial Applications | Firas Bayram et.al. | 2408.06724 | null |
2024-08-13 | MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs’ Cooperative Decision-Making | Yicheng Guo et.al. | 2408.06656 | link |
2024-08-13 | Dynamic Pricing of Electric Vehicle Charging Station Alliances Under Information Asymmetry | Zeyu Liu et.al. | 2408.06645 | null |
2024-08-13 | A lightweight YOLOv5-FFM model for occlusion pedestrian detection | Xiangjie Luo et.al. | 2408.06633 | null |
2024-08-13 | IFShip: A Large Vision-Language Model for Interpretable Fine-grained Ship Classification via Domain Knowledge-Enhanced Instruction Tuning | Mingning Guo et.al. | 2408.06631 | null |
2024-08-14 | OpenEP: Open-Ended Future Event Prediction | Yong Guan et.al. | 2408.06578 | null |
2024-08-13 | Value of Information and Reward Specification in Active Inference and POMDPs | Ran Wei et.al. | 2408.06542 | null |
2024-08-12 | Hierarchical in-Context Reinforcement Learning with Hindsight Modular Reflections for Planning | Chuanneng Sun et.al. | 2408.06520 | null |
2024-08-12 | Decentralized Cooperation in Heterogeneous Multi-Agent Reinforcement Learning via Graph Neural Network-Based Intrinsic Motivation | Jahir Sadik Monon et.al. | 2408.06503 | link |
2024-08-12 | Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models | Yen-Che Hsiao et.al. | 2408.06458 | link |
2024-08-12 | Finding Patterns in Ambiguity: Interpretable Stress Testing in the Decision~Boundary | Inês Gomes et.al. | 2408.06302 | link |
2024-08-12 | A Digital Twin Framework Utilizing Machine Learning for Robust Predictive Maintenance: Enhancing Tire Health Monitoring | Vispi Karkaria et.al. | 2408.06220 | null |
2024-08-12 | IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI | Yash Rampuria et.al. | 2408.06113 | null |
2024-08-12 | Building Decision Making Models Through Language Model Regime | Yu Zhang et.al. | 2408.06087 | null |
2024-08-12 | Sequential sampling without comparison to boundary through model-free reinforcement learning | Jamal Esmaily et.al. | 2408.06080 | null |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in Games | Chiu-Chou Lin et.al. | 2408.06051 | link |
2024-08-12 | Exploring and Learning Structure: Active Inference Approach in Navigational Agents | Daria de Tinguy et.al. | 2408.05982 | null |
2024-08-12 | Match Point AI: A Novel AI Framework for Evaluating Data-Driven Tennis Strategies | Carlo Nübel et.al. | 2408.05960 | link |
2024-08-12 | Statistically Optimal Uncertainty Quantification for Expensive Black-Box Models | Shengyi He et.al. | 2408.05887 | null |
2024-08-12 | Multi-Agent Deep Reinforcement Learning Framework for Wireless MAC Protocol Design and Optimization | Navid Keshtiarast et.al. | 2408.05884 | null |
2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
2024-08-11 | Root Cause Attribution of Delivery Risks via Causal Discovery with Reinforcement Learning | Shi Bo et.al. | 2408.05860 | null |
2024-08-11 | Egocentric Vision Language Planning | Zhirui Fang et.al. | 2408.05802 | null |
2024-08-11 | Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots | Victor Augusto Kich et.al. | 2408.05744 | link |
2024-08-11 | ICSFuzz: Collision Detector Bug Discovery in Autonomous Driving Simulators | Weiwei Fu et.al. | 2408.05694 | null |
2024-08-10 | Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation | Hanqiu Chen et.al. | 2408.05617 | link |
2024-08-10 | Meta Clustering of Neural Bandits | Yikun Ban et.al. | 2408.05586 | null |
2024-08-10 | What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon | Utkarsh Tiwari et.al. | 2408.05562 | link |
2024-08-10 | S-SIRUS: an explainability algorithm for spatial regression Random Forest | Luca Patelli et.al. | 2408.05537 | link |
2024-08-09 | Modeling Transit in a Fully Integrated Agent-Based Framework: Methodology and Large-Scale Application | Omer Verbas et.al. | 2408.05176 | null |
2024-08-09 | Cautious Calibration in Binary Classification | Mari-Liis Allikivi et.al. | 2408.05120 | link |
2024-08-09 | Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection | Xincheng Pang et.al. | 2408.05107 | null |
2024-08-09 | Evaluating Layout Dimensionalities in PC+VR Asymmetric Collaborative Decision Making | Daniel Enriquez et.al. | 2408.05105 | null |
2024-08-09 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | link |
2024-08-09 | Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery | Long Bai et.al. | 2408.04958 | link |
2024-08-12 | Unleashing Artificial Cognition: Integrating Multiple AI Systems | Muntasir Adnan et.al. | 2408.04910 | link |
2024-08-09 | CTE-MLO: Continuous-time and Efficient Multi-LiDAR Odometry with Localizability-aware Point Cloud Sampling | Hongming Shen et.al. | 2408.04901 | null |
2024-08-09 | VLM-MPC: Vision Language Foundation Model (VLM)-Guided Model Predictive Controller (MPC) for Autonomous Driving | Keke Long et.al. | 2408.04821 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing – A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Eliminating Backdoors in Neural Code Models via Trigger Inversion | Weisong Sun et.al. | 2408.04683 | null |
2024-08-08 | Field Testing and Detection of Camera Interference for Autonomous Driving | Ki Beom Park et.al. | 2408.04524 | null |
2024-08-08 | Model-Based Transfer Learning for Contextual Reinforcement Learning | Jung-Hoon Cho et.al. | 2408.04498 | link |
2024-08-08 | Multi-Objective LQR with Linear Scalarization | Ali Jadbabaie et.al. | 2408.04488 | null |
2024-08-09 | Achieving Robust Data-driven Contextual Decision Making in a Data Augmentation Way | Zhaoen Li et.al. | 2408.04469 | null |
2024-08-08 | Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic | Yuting Wang et.al. | 2408.04447 | null |
2024-08-08 | Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Simon Dima et.al. | 2408.04385 | null |
2024-08-08 | Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform | Daniel Vargas et.al. | 2408.04195 | null |
2024-08-08 | The Data Addition Dilemma | Judy Hanwen Shen et.al. | 2408.04154 | link |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives | Aida Afshar et.al. | 2408.04046 | link |
2024-08-07 | How Well Can Vision Language Models See Image Details? | Chenhui Gou et.al. | 2408.03940 | null |
2024-08-07 | MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems | Renzhi Wang et.al. | 2408.03892 | null |
2024-08-07 | GAIA – A Large Language Model for Advanced Power Dispatch | Yuheng Cheng et.al. | 2408.03847 | null |
2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | link |
2024-08-07 | Methodological Explainability Evaluation of an Interpretable Deep Learning Model for Post-Hepatectomy Liver Failure Prediction Incorporating Counterfactual Explanations and Layerwise Relevance Propagation: A Prospective In Silico Trial | Xian Zhong et.al. | 2408.03771 | null |
2024-08-07 | Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification | Georgia Sovatzidi et.al. | 2408.03745 | null |
2024-08-07 | MS-Mapping: An Uncertainty-Aware Large-Scale Multi-Session LiDAR Mapping System | Xiangcheng Hu et.al. | 2408.03723 | link |
2024-08-07 | Asynchronous Credit Assignment Framework for Multi-Agent Reinforcement Learning | Yongheng Liang et.al. | 2408.03692 | null |
2024-08-07 | AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging | Senkang Hu et.al. | 2408.03624 | null |
2024-08-07 | DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba | Chengran Yuan et.al. | 2408.03601 | null |
2024-08-07 | Clinical Challenges and AI Opportunities in Decision-Making for Cancer Treatment-Induced Cardiotoxicity | Siyi Wu et.al. | 2408.03586 | null |
2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | null |
2024-08-06 | Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles | Nazish Tahir et.al. | 2408.03435 | null |
2024-08-06 | Probabilistic Scores of Classifiers, Calibration is not Enough | Agathe Fernandes Machado et.al. | 2408.03421 | link |
2024-08-07 | Adversarial Safety-Critical Scenario Generation using Naturalistic Human Driving Priors | Kunkun Hao et.al. | 2408.03200 | null |
2024-08-06 | RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning | Jiapeng Zhu et.al. | 2408.03195 | link |
2024-08-06 | Integrated Intention Prediction and Decision-Making with Spectrum Attention Net and Proximal Policy Optimization | Xiao Zhou et.al. | 2408.03191 | null |
2024-08-06 | QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction | Siddhant Dutta et.al. | 2408.03088 | null |
2024-08-06 | Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning | Zixiang Wang et.al. | 2408.03084 | null |
2024-08-06 | Considerations on free-surface detachment and bed entrainment of fluvial plastics | Matthias Kramer et.al. | 2408.03081 | null |
2024-08-06 | SCOPE: A Synthetic Multi-Modal Dataset for Collective Perception Including Physical-Correct Weather Conditions | Jörg Gamerdinger et.al. | 2408.03065 | null |
2024-08-06 | Social Behavior as a Key to Learning-based Multi-Agent Pathfinding Dilemmas | Chengyang He et.al. | 2408.03063 | null |
2024-08-06 | Uniqueness Analysis of Controllability Scores and Their Application to Brain Networks | Kazuhiro Sato et.al. | 2408.03023 | null |
2024-08-06 | Cross-cultural analysis of pedestrian group behaviour influence on crossing decisions in interactions with autonomous vehicles | Sergio Martín Serrano et.al. | 2408.03003 | null |
2024-08-06 | Accuracy and Consistency of LLMs in the Registered Dietitian Exam: The Impact of Prompt Engineering and Knowledge Retrieval | Iman Azimi et.al. | 2408.02964 | link |
2024-08-06 | Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Yifan Zhu et.al. | 2408.02949 | null |
2024-08-06 | Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti et.al. | 2408.02938 | null |
2024-08-06 | Compromising Embodied Agents with Contextual Backdoor Attacks | Aishan Liu et.al. | 2408.02882 | null |
2024-08-05 | On The Stability of Moral Preferences: A Problem with Computational Elicitation Methods | Kyle Boerstler et.al. | 2408.02862 | null |
2024-08-05 | Nash Equilibrium in Games on Graphs with Incomplete Preferences | Abhishek N. Kulkarni et.al. | 2408.02860 | null |
2024-08-05 | SiCo: A Size-Controllable Virtual Try-On Approach for Informed Decision-Making | Sherry X. Chen et.al. | 2408.02803 | link |
2024-08-05 | LLM economicus? Mapping the Behavioral Biases of LLMs via Utility Theory | Jillian Ross et.al. | 2408.02784 | null |
2024-08-05 | Enhancing Medical Learning and Reasoning Systems: A Boxology-Based Comparative Analysis of Design Patterns | Chi Him Ng et.al. | 2408.02709 | null |
2024-08-05 | From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Haolin Jin et.al. | 2408.02479 | null |
2024-08-05 | An Integrated Approach to Importance Sampling and Machine Learning for Efficient Monte Carlo Estimation of Distortion Risk Measures in Black Box Models | Sören Bettels et.al. | 2408.02401 | null |
2024-08-05 | Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts | Andong Tan et.al. | 2408.02265 | null |
2024-08-05 | Is Large Language Model Good at Database Knob Tuning? A Comprehensive Experimental Evaluation | Yiyan Li et.al. | 2408.02213 | null |
2024-08-04 | SPINEX-TimeSeries: Similarity-based Predictions with Explainable Neighbors Exploration for Time Series and Forecasting Problems | Ahmed Z Naser et.al. | 2408.02159 | null |
2024-08-04 | Model Hijacking Attack in Federated Learning | Zheng Li et.al. | 2408.02131 | null |
2024-08-04 | Value-Based Rationales Improve Social Experience: A Multiagent Simulation Study | Sz-Ting Tzeng et.al. | 2408.02117 | null |
2024-08-04 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | null |
2024-08-04 | Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response | Dipo Dunsin et.al. | 2408.01999 | null |
2024-08-04 | Optimal and efficient text counterfactuals using Graph Neural Networks | Dimitris Lymperopoulos et.al. | 2408.01969 | link |
2024-08-04 | Bilateral Trade Flow Prediction by Gravity-informed Graph Auto-encoder | Naoto Minakawa et.al. | 2408.01938 | null |
2024-08-03 | Impact of Major Health Events on Pharmaceutical Stocks: A Comprehensive Analysis Using Macroeconomic and Market Indicators | Morteza Maleki et.al. | 2408.01883 | null |
2024-08-03 | ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification | Mridula Vijendran et.al. | 2408.01827 | link |
2024-08-03 | STDA: Spatio-Temporal Dual-Encoder Network Incorporating Driver Attention to Predict Driver Behaviors Under Safety-Critical Scenarios | Dongyang Xu et.al. | 2408.01774 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-03 | Self-Emotion Blended Dialogue Generation in Social Simulation Agents | Qiang Zhang et.al. | 2408.01633 | null |
2024-08-03 | A Comparative Analysis of Wealth Index Predictions in Africa between three Multi-Source Inference Models | Márton Karsai et.al. | 2408.01631 | link |
2024-08-03 | Weighted Brier Score – an Overall Summary Measure for Risk Prediction Models with Clinical Utility Consideration | Kehao Zhu et.al. | 2408.01626 | null |
2024-08-03 | Data-Driven Machine Learning Approaches for Predicting In-Hospital Sepsis Mortality | Arseniy Shumilov et.al. | 2408.01612 | null |
2024-08-02 | Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder | Matan Atad et.al. | 2408.01571 | link |
2024-08-02 | NeuralBeta: Estimating Beta Using Deep Learning | Yuxin Liu et.al. | 2408.01387 | null |
2024-08-02 | A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes | Vito Mengers et.al. | 2408.01322 | null |
2024-08-02 | PsybORG+: Cognitive Modeling for Triggering and Detection of Cognitive Biases of Advanced Persistent Threats | Shuo Huang et.al. | 2408.01310 | null |
2024-08-02 | A Decision-driven Methodology for Designing Uncertainty-aware AI Self-Assessment | Gregory Canal et.al. | 2408.01301 | null |
2024-08-02 | Assessing Robustness of Machine Learning Models using Covariate Perturbations | Arun Prakash R et.al. | 2408.01300 | null |
2024-08-02 | The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | Hannah Chen et.al. | 2408.01285 | null |
2024-08-02 | Metareasoning in uncertain environments: a meta-BAMDP framework | Prakhar Godara et.al. | 2408.01253 | null |
2024-08-02 | Game Theory Based Community-Aware Opinion Dynamics | Shanfan Zhang et.al. | 2408.01196 | link |
2024-08-02 | A Short-Term Planning Framework for the Operation of Tanker-Based Water Distribution Systems in Urban Areas | Abhilasha Maheshwari et.al. | 2408.01184 | null |
2024-08-02 | CommonUppRoad: A Framework of Formal Modelling, Verifying, Learning, and Visualisation of Autonomous Vehicles | Rong Gu et.al. | 2408.01093 | null |
2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | null |
2024-08-02 | MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Xiangbo Gao et.al. | 2408.01037 | link |
2024-08-02 | Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Yang Luo et.al. | 2408.01000 | link |
2024-08-02 | A Quantal Response Analysis of Defender-Attacker Sequential Security Games | Md Reya Shad Azim et.al. | 2408.00964 | null |
2024-08-01 | Generalisation of Total Uncertainty in AI: A Theoretical Study | Keivan Shariatmadar et.al. | 2408.00946 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-02 | Reinforcement Learning applied to Insurance Portfolio Pursuit | Edward James Young et.al. | 2408.00713 | link |
2024-08-01 | Future of Artificial Intelligence in Agile Software Development | Mariyam Mahboob et.al. | 2408.00703 | null |
2024-08-01 | Learning in Multi-Objective Public Goods Games with Non-Linear Utilities | Nicole Orzan et.al. | 2408.00682 | null |
2024-08-01 | Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | Xiaoyi Liu et.al. | 2408.00636 | null |
2024-08-01 | MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Xiangyuan Peng et.al. | 2408.00565 | null |
2024-08-01 | Spatial Weather, Socio-Economic and Political Risks in Probabilistic Load Forecasting | Monika Zimmermann et.al. | 2408.00507 | null |
2024-08-01 | Explainable Emotion Decoding for Human and Computer Vision | Alessio Borriero et.al. | 2408.00493 | null |
2024-08-01 | An Operational Scheduling Framework for Tanker-based Water Distribution System under Uncertainty | Abhilasha Maheshwari et.al. | 2408.00431 | null |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-01 | Enabling Next-Generation V2X Perception: Wireless Rigid Body Localization and Tracking | Niclas Führling et.al. | 2408.00349 | null |
2024-08-01 | RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Zhe Huang et.al. | 2408.00257 | link |
2024-08-01 | Joint Vehicle Connection and Beamforming Optimization in Digital Twin Assisted Integrated Sensing and Communication Vehicular Networks | Weihang Ding et.al. | 2408.00248 | null |
2024-08-02 | Bringing Data into the Conversation: Adapting Content from Business Intelligence Dashboards for Threaded Collaboration Platforms | Hyeok Kim et.al. | 2408.00242 | null |
2024-08-01 | Invariant Discovery of Features Across Multiple Length Scales: Applications in Microscopy and Autonomous Materials Characterization | Aditya Raghavan et.al. | 2408.00229 | null |
2024-08-01 | Load Balancing in Federated Learning | Alireza Javani et.al. | 2408.00217 | null |
2024-07-31 | Areas of Improvement for Autonomous Vehicles: A Machine Learning Analysis of Disengagement Reports | Tyler Ward et.al. | 2408.00051 | null |
2024-07-31 | Algorithms for Collaborative Machine Learning under Statistical Heterogeneity | Seok-Ju Hahn et.al. | 2408.00050 | null |
2024-07-31 | Coordinating Decisions via Quantum Telepathy | Dawei Ding et.al. | 2407.21723 | null |
2024-07-31 | An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification | Aswini Kumar Patra et.al. | 2407.21666 | null |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-07-31 | Voxel Scene Graph for Intracranial Hemorrhage | Antoine P. Sanner et.al. | 2407.21580 | link |
2024-08-01 | Analysis of Functional Insufficiencies and Triggering Conditions to Improve the SOTIF of an MPC-based Trajectory Planner | Mirko Conrad et.al. | 2407.21569 | null |
2024-07-31 | Interpreting and learning voice commands with a Large Language Model for a robot system | Stanislau Stankevich et.al. | 2407.21512 | null |
2024-07-31 | Mitral Regurgitation Recogniton based on Unsupervised Out-of-Distribution Detection with Residual Diffusion Amplification | Zhe Liu et.al. | 2407.21497 | null |
2024-07-31 | KemenkeuGPT: Leveraging a Large Language Model on Indonesia’s Government Financial Data and Regulations to Enhance Decision Making | Gilang Fajar Febrian et.al. | 2407.21459 | null |
2024-07-31 | Cost-Effective Hallucination Detection for LLMs | Simon Valentin et.al. | 2407.21424 | null |
2024-07-31 | Pathology Foundation Models | Mieko Ochi et.al. | 2407.21317 | null |
2024-07-31 | Who should I trust? A Visual Analytics Approach for Comparing Net Load Forecasting Models | Kaustav Bhattacharjee et.al. | 2407.21299 | null |
2024-07-31 | SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Peiru Zheng et.al. | 2407.21293 | null |
2024-07-30 | Towards an Integrated Performance Framework for Fire Science and Management Workflows | H. Ahmed et.al. | 2407.21231 | null |
2024-07-30 | Algorithm-Assisted Decision Making and Racial Disparities in Housing: A Study of the Allegheny Housing Assessment Tool | Lingwei Cheng et.al. | 2407.21209 | null |
2024-07-30 | Deduction Game Framework and Information Set Entropy Search | Fandi Meng et.al. | 2407.21178 | null |
2024-07-30 | Extending choice assessments to choice functions: An algorithm for computing the natural extension | Arne Decadt et.al. | 2407.21164 | null |
2024-07-30 | Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving | Bernard Lange et.al. | 2407.21126 | null |
2024-07-30 | Zero Shot Health Trajectory Prediction Using Transformer | Pawel Renc et.al. | 2407.21124 | link |
2024-07-30 | Integrating Agent-Based and Compartmental Models for Infectious Disease Modeling: A Novel Hybrid Approach | Inan Bostanci et.al. | 2407.20993 | null |
2024-07-30 | From Feature Importance to Natural Language Explanations Using LLMs with RAG | Sule Tekkesinoglu et.al. | 2407.20990 | link |
2024-07-30 | Learning Ordinality in Semantic Segmentation | Rafael Cristino et.al. | 2407.20959 | null |
2024-07-30 | Non-linear inhibitory responses enhance performance in collective decision-making | David March-Pons et.al. | 2407.20927 | null |
2024-07-30 | How to Choose a Reinforcement-Learning Algorithm | Fabian Bongratz et.al. | 2407.20917 | null |
2024-07-30 | Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S | Guangjin Pan et.al. | 2407.20852 | null |
2024-07-30 | Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception | Jiawei Shao et.al. | 2407.20748 | null |
2024-07-30 | Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization | Michael Kölle et.al. | 2407.20739 | null |
2024-07-30 | Practices and Strategies in Responsive Thematic Map Design: A Report from Design Workshops with Experts | Sarah Schöttler et.al. | 2407.20735 | null |
2024-07-30 | Scene-Specific Trajectory Sets: Maximizing Representation in Motion Forecasting | Abhishek Vivekanandan et.al. | 2407.20732 | null |
2024-07-30 | Exploring Loss Landscapes through the Lens of Spin Glass Theory | Hao Liao et.al. | 2407.20724 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-30 | Powerful A/B-Testing Metrics and Where to Find Them | Olivier Jeunen et.al. | 2407.20665 | null |
2024-07-30 | Enhancing Agricultural Machinery Management through Advanced LLM Integration | Emily Johnson et.al. | 2407.20588 | null |
2024-07-30 | Laplace approximation for Bayesian variable selection via Le Cam’s one-step procedure | Tianrui Hou et.al. | 2407.20580 | null |
2024-07-30 | DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations | Jiageng Zhu et.al. | 2407.20553 | null |
2024-07-30 | Evaluating Fairness in Black-box Algorithmic Markets: A Case Study of Ride Sharing in Chicago | Yuhan Liu et.al. | 2407.20522 | null |
2024-07-29 | Domain Adaptable Prescriptive AI Agent for Enterprise | Piero Orderique et.al. | 2407.20447 | null |
2024-07-29 | Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World | Hari Prasad et.al. | 2407.20383 | null |
2024-07-29 | SAPG: Split and Aggregate Policy Gradients | Jayesh Singla et.al. | 2407.20230 | null |
2024-07-29 | Time series forecasting with high stakes: A field study of the air cargo industry | Abhinav Garg et.al. | 2407.20192 | null |
2024-07-29 | An Interpretable Rule Creation Method for Black-Box Models based on Surrogate Trees – SRules | Mario Parrón Verdasco et.al. | 2407.20070 | null |
2024-07-29 | Collision Probability Distribution Estimation via Temporal Difference Learning | Thomas Steinecker et.al. | 2407.20000 | link |
2024-07-29 | Private and Secure Fuzzy Name Matching | Harsh Kasyap et.al. | 2407.19979 | null |
2024-07-29 | Hydrodynamics of pulsating active liquids | Tirthankar Banerjee et.al. | 2407.19955 | null |
2024-07-29 | AOTree: Aspect Order Tree-based Model for Explainable Recommendation | Wenxin Zhao et.al. | 2407.19937 | null |
2024-07-29 | Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning | Leen Kweider et.al. | 2407.19860 | null |
2024-07-29 | Evolution of cooperation in the public goods game with Q-learning | Guozhong Zheng et.al. | 2407.19851 | null |
2024-07-29 | Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios | Camilla Bignotti et.al. | 2407.19760 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | Towards Detecting IoT Event Spoofing Attacks Using Time-Series Classification | Uzma Maroof et.al. | 2407.19662 | null |
2024-07-29 | AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias | Sribala Vidyadhari Chinta et.al. | 2407.19655 | null |
2024-07-29 | “A Good Bot Always Knows Its Limitations”: Assessing Autonomous System Decision-making Competencies through Factorized Machine Self-confidence | Brett Israelsen et.al. | 2407.19631 | link |
2024-07-28 | Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload | Limin Ma et.al. | 2407.19517 | null |
2024-07-28 | EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024 | Letian Shi et.al. | 2407.19510 | link |
2024-07-28 | HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments | Waqas Ali et.al. | 2407.19463 | null |
2024-07-28 | Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Weiliang Chen et.al. | 2407.19428 | null |
2024-07-28 | The influence of Automated Decision-Making systems in the context of street-level bureaucrats’ practices | Manuel Portela et.al. | 2407.19427 | null |
2024-07-28 | Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Dong Chen et.al. | 2407.19405 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces | Seunghyeop Nam et.al. | 2407.18892 | null |
2024-07-26 | Agent-Based Insight into Eco-Choices: Simulating the Fast Fashion Shift | Daria Soboleva et.al. | 2407.18814 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-07-26 | Set risk measures | Marcelo Righi et.al. | 2407.18687 | null |
2024-07-26 | Reinforcement Learning for Sustainable Energy: A Survey | Koen Ponse et.al. | 2407.18597 | null |
2024-07-26 | PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning | Fangze Lin et.al. | 2407.18569 | link |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-26 | Socially efficient mechanism on the minimum budget | Hirota Kinoshita et.al. | 2407.18515 | null |
2024-07-26 | Design Spaces and How Software Designers Use Them: a sampler | Mary Shaw et.al. | 2407.18502 | null |
2024-07-26 | Gaussian Lane Keeping: A Robust Prediction Baseline | David Isele et.al. | 2407.18451 | null |
2024-07-26 | Impact of Recurrent Neural Networks and Deep Learning Frameworks on Real-time Lightweight Time Series Anomaly Detection | Ming-Chang Lee et.al. | 2407.18439 | null |
2024-07-25 | Adversarial Robust Decision Transformer: Enhancing Robustness of RvS via Minimax Returns-to-go | Xiaohang Tang et.al. | 2407.18414 | null |
2024-07-25 | Large Language Model Integrated Healthcare Cyber-Physical Systems Architecture | Malithi Wanniarachchi Kankanamge et.al. | 2407.18407 | null |
2024-07-25 | Phase transition in a kinetic mean-field game model of inertial self-propelled agents | Piyush Grover et.al. | 2407.18400 | null |
2024-07-25 | Galaxy Mergers in UNIONS – I: A Simulation-driven Hybrid Deep Learning Ensemble for Pure Galaxy Merger Classification | Leonardo Ferreira et.al. | 2407.18396 | null |
2024-07-25 | Automated Ensemble Multimodal Machine Learning for Healthcare | Fergus Imrie et.al. | 2407.18227 | null |
2024-07-25 | Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Samuel Yen-Chi Chen et.al. | 2407.18202 | null |
2024-07-25 | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Julia Hindel et.al. | 2407.18145 | null |
2024-07-25 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
2024-07-25 | ECG Arrhythmia Detection Using Disease-specific Attention-based Deep Learning Model | Linpeng Jin et.al. | 2407.18033 | null |
2024-07-25 | Network Inversion of Convolutional Neural Nets | Pirzada Suhail et.al. | 2407.18002 | null |
2024-07-25 | StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory | Zhiheng Li et.al. | 2407.17905 | link |
2024-07-25 | Financial Statement Analysis with Large Language Models | Alex Kim et.al. | 2407.17866 | null |
2024-07-26 | MDS-ED: Multimodal Decision Support in the Emergency Department – a Benchmark Dataset for Diagnoses and Deterioration Prediction in Emergency Medicine | Juan Miguel Lopez Alcaraz et.al. | 2407.17856 | link |
2024-07-25 | Long-term Fairness in Ride-Hailing Platform | Yufan Kang et.al. | 2407.17839 | null |
2024-07-25 | Image Segmentation via Divisive Normalization: dealing with environmental diversity | Pablo Hernández-Cámara et.al. | 2407.17829 | null |
2024-07-25 | CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions | Haicheng Liao et.al. | 2407.17757 | null |
2024-07-25 | Control Informed Design of the IAC Autonomous Racecar for Operation at the Dynamic Envelope | Qilun Zhu et.al. | 2407.17737 | null |
2024-07-25 | Enhancing Agent Learning through World Dynamics Modeling | Zhiyuan Sun et.al. | 2407.17695 | link |
2024-07-24 | Towards Neural Network based Cognitive Models of Dynamic Decision-Making by Humans | Changyu Chen et.al. | 2407.17622 | link |
2024-07-24 | Toward human-centered shared autonomy AI paradigms for human-robot teaming in healthcare | Reza Abiri et.al. | 2407.17464 | null |
2024-07-24 | Hidden or Inferred: Fair Learning-To-Rank with Unknown Demographics | Oluseun Olulana et.al. | 2407.17459 | link |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | Five reasons against assuming a data-generating distribution in Machine Learning | Benedikt Höltgen et.al. | 2407.17395 | null |
2024-07-24 | Causal modelling without counterfactuals and individualised effects | Benedikt Höltgen et.al. | 2407.17385 | null |
2024-07-24 | Gradient-based inference of abstract task representations for generalization in neural networks | Ali Hummos et.al. | 2407.17356 | null |
2024-07-25 | Enhanced Deep Learning Methodologies and MRI Selection Techniques for Dementia Diagnosis in the Elderly Population | Nikolaos Ntampakis et.al. | 2407.17324 | null |
2024-07-24 | Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches | Chenxing Zhao et.al. | 2407.17312 | null |
2024-07-24 | An MDP-Based Approach for Distribution System Control with PV Generation and Battery Storage | Robert Sosnowski et.al. | 2407.17257 | null |
2024-07-24 | Testing Large Language Models on Driving Theory Knowledge and Skills for Connected Autonomous Vehicles | Zuoyin Tang et.al. | 2407.17211 | null |
2024-07-24 | Semantic Vehicle-to-Everything (V2X) Communications Towards 6G | Tengfei Lyu et.al. | 2407.17186 | null |
2024-07-24 | Generalized Ordinal Priority Approach for Multi-Attribute Decision-Making under Incomplete Preference Information | Renlong Wang et.al. | 2407.17099 | null |
2024-07-24 | NewsUnfold: Creating a News-Reading Application That Indicates Linguistic Media Bias and Collects Feedback | Smi Hinterreiter et.al. | 2407.17045 | null |
2024-07-24 | Applications of Multi-Agent Deep Reinforcement Learning Communication in Network Management: A Survey | Yue Pi et.al. | 2407.17030 | null |
2024-07-25 | Simulation in discrete choice models evaluation: SDCM, a simulation tool for performance evaluation of DCMs | Amirreza Talebi et.al. | 2407.17014 | null |
2024-07-24 | Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images | Dooseop Choi et.al. | 2407.17003 | link |
2024-07-24 | Toward an Integrated Decision Making Framework for Optimized Stroke Diagnosis with DSA and Treatment under Uncertainty | Nur Ahmad Khatim et.al. | 2407.16962 | link |
2024-07-23 | On the Separability of Vector-Valued Risk Measures | Çağın Ararat et.al. | 2407.16878 | null |
2024-07-23 | Trust Your Gut: Comparing Human and Machine Inference from Noisy Visualizations | Ratanond Koonchanok et.al. | 2407.16871 | null |
2024-07-23 | SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees | Tianyu Shi et.al. | 2407.16857 | null |
2024-07-24 | A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Adrian Remonda et.al. | 2407.16680 | link |
2024-07-23 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving | Anam Manzoor et.al. | 2407.16647 | null |
2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses | Haojun Yu et.al. | 2407.16634 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | Can time series forecasting be automated? A benchmark and analysis | Anvitha Thirthapura Sreedhara et.al. | 2407.16445 | null |
2024-07-23 | Evaluating Uncertainties in Electricity Markets via Machine Learning and Quantum Computing | Shuyang Zhu et.al. | 2407.16404 | null |
2024-07-23 | Cleaning Robots in Public Spaces: A Survey and Proposal for Benchmarking Based on Stakeholders Interviews | Raphael Memmesheimer et.al. | 2407.16393 | null |
2024-07-23 | PhenoFlow: A Human-LLM Driven Visual Analytics System for Exploring Large and Complex Stroke Datasets | Jaeyoung Kim et.al. | 2407.16329 | null |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Youqian Zhang et.al. | 2407.16327 | null |
2024-07-23 | MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Florian Felten et.al. | 2407.16312 | link |
2024-07-23 | Optimizing Robotic Manipulation with Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning | Yujian Dong et.al. | 2407.16306 | link |
2024-07-23 | On the Use of Immersive Digital Technologies for Designing and Operating UAVs | Yousef Emami et.al. | 2407.16288 | null |
2024-07-23 | When, Where, and What? An Novel Benchmark for Accident Anticipation and Localization with Large Language Models | Haicheng Liao et.al. | 2407.16277 | null |
2024-07-23 | Identifiable latent bandits: Combining observational data and exploration for personalized healthcare | Ahmet Zahid Balcıoğlu et.al. | 2407.16239 | null |
2024-07-23 | Strategy and Skill Learning for Physics-based Table Tennis Animation | Jiashun Wang et.al. | 2407.16210 | null |
2024-07-23 | LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera | Yukai Ma et.al. | 2407.16197 | null |
2024-07-23 | Advanced AI Framework for Enhanced Detection and Assessment of Abdominal Trauma: Integrating 3D Segmentation with 2D CNN and RNN Models | Liheng Jiang et.al. | 2407.16165 | null |
2024-07-23 | Diffusion Models as Optimizers for Efficient Planning in Offline RL | Renming Huang et.al. | 2407.16142 | link |
2024-07-22 | MILAN: Milli-Annotations for Lidar Semantic Segmentation | Nermin Samet et.al. | 2407.15797 | null |
2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | null |
2024-07-22 | Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels | Zhuorui Ye et.al. | 2407.15786 | null |
2024-07-22 | CrashEventLLM: Predicting System Crashes with Large Language Models | Priyanka Mudgal et.al. | 2407.15716 | null |
2024-07-22 | Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps | Rabbia Asghar et.al. | 2407.15675 | null |
2024-07-22 | DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving | Jiahang Tu et.al. | 2407.15661 | link |
2024-07-22 | Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN | Norman Becker et.al. | 2407.15656 | null |
2024-07-22 | Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models | Joy He-Yueya et.al. | 2407.15645 | link |
2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
2024-07-22 | Towards a Universal Evaluation Model for Careful and Competent Autonomous Driving | Kethan Reddy et.al. | 2407.15596 | null |
2024-07-22 | Empowering Agile-Based Generative Software Development through Human-AI Teamwork | Sai Zhang et.al. | 2407.15568 | link |
2024-07-22 | Interpretable Concept-Based Memory Reasoning | David Debot et.al. | 2407.15527 | link |
2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null |
2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | link |
2024-07-21 | Explaining Decisions of Agents in Mixed-Motive Games | Maayan Orner et.al. | 2407.15255 | null |
2024-07-21 | Decoding Multilingual Moral Preferences: Unveiling LLM’s Biases Through the Moral Machine Experiment | Karina Vida et.al. | 2407.15184 | link |
2024-07-20 | Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Dylan J. Foster et.al. | 2407.15007 | null |
2024-07-20 | A Measure for Level of Autonomy Based on Observable System Behavior | Jason M. Pittman et.al. | 2407.14975 | null |
2024-07-20 | (Non-)Commutative Aggregation | Yuzhao Yang et.al. | 2407.14959 | null |
2024-07-20 | CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation | Chen Wei et.al. | 2407.14949 | link |
2024-07-19 | Quantifying the value of positive transfer: An experimental case study | Aidan J. Hughes et.al. | 2407.14342 | null |
2024-07-19 | Complementary Learning for Real-World Model Failure Detection | Daniel Bogdoll et.al. | 2407.14306 | link |
2024-07-19 | Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning | Nihal Acharya Adde et.al. | 2407.14262 | null |
2024-07-19 | KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang et.al. | 2407.14239 | null |
2024-07-19 | Domain Adaptation for Industrial Time-series Forecasting via Counterfactual Inference | Chao Min et.al. | 2407.14214 | null |
2024-07-19 | Achieving Well-Informed Decision-Making in Drug Discovery: A Comprehensive Calibration Study using Neural Network-Based Structure-Activity Models | Hannah Rosa Friesacher et.al. | 2407.14185 | link |
2024-07-19 | Integrated Push-and-Pull Update Model for Goal-Oriented Effective Communication | Pouya Agheli et.al. | 2407.14092 | null |
2024-07-19 | Data Guards: Challenges and Solutions for Fostering Trust in Data | Nicole Sultanum et.al. | 2407.14042 | null |
2024-07-19 | Causal Inference with Complex Treatments: A Survey | Yingrong Wang et.al. | 2407.14022 | link |
2024-07-19 | A trustworthy blockchain-based energy trading scheme for V2G operations in distributed power grids via integrated scheduling and trading framework | Yunwang Chen et.al. | 2407.13988 | null |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-07-18 | Unmasking Social Bots: How Confident Are We? | James Giroux et.al. | 2407.13929 | link |
2024-07-18 | PRAGyan – Connecting the Dots in Tweets | Rahul Ravi et.al. | 2407.13909 | null |
2024-07-18 | A review of handcrafted and deep radiomics in neurological diseases: transitioning from oncology to clinical neuroimaging | Elizaveta Lavrova et.al. | 2407.13813 | null |
2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | null |
2024-07-18 | Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management | Yoontae Hwang et.al. | 2407.13751 | null |
2024-07-18 | Managing Risk using Rolling Forecasts in Energy-Limited and Stochastic Energy Systems | Thomas Mortimer et.al. | 2407.13626 | null |
2024-07-18 | The Storage Location Assignment and Picker Routing Problem: A Generic Branch-Cut-and-Price Algorithm | Thibault Prunet et.al. | 2407.13570 | null |
2024-07-19 | Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation | Guido Maria D’Amely di Melendugno et.al. | 2407.13567 | link |
2024-07-18 | Fundamental Visual Navigation Algorithms: Indirect Sequential, Biased Diffusive, & Direct Pathing | Patrick Govoni et.al. | 2407.13535 | null |
2024-07-19 | Mask2Map: Vectorized HD Map Construction Using Bird’s Eye View Segmentation Masks | Sehwan Choi et.al. | 2407.13517 | link |
2024-07-18 | Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios | Qingfan Wang et.al. | 2407.13480 | null |
2024-07-18 | Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations | Yue Yao et.al. | 2407.13431 | link |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-18 | Why do you cite? An investigation on citation intents and decision-making classification processes | Lorenzo Paolini et.al. | 2407.13329 | null |
2024-07-18 | CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Junying Chen et.al. | 2407.13301 | link |
2024-07-18 | $μ$ Drive: User-Controlled Autonomous Driving | Kun Wang et.al. | 2407.13201 | null |
2024-07-18 | Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation | Yingru Li et.al. | 2407.13195 | link |
2024-07-18 | Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement | Yulin He et.al. | 2407.13155 | null |
2024-07-19 | PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods | WooJae Jeon et.al. | 2407.13146 | null |
2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
2024-07-18 | PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving | Jiyuan Fu et.al. | 2407.13111 | link |
2024-07-18 | On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems | Siyu Wang et.al. | 2407.13091 | null |
2024-07-17 | Fighting Sampling Bias: A Framework for Training and Evaluating Credit Scoring Models | Nikita Kozodoi et.al. | 2407.13009 | null |
2024-07-17 | AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Zhaorun Chen et.al. | 2407.12784 | link |
2024-07-17 | Bayesian spatial functional data clustering: applications in disease surveillance | Ruiman Zhong et.al. | 2407.12633 | null |
2024-07-17 | Continuous reasoning for adaptive container image distribution in the cloud-edge continuum | Damiano Azzolini et.al. | 2407.12605 | link |
2024-07-17 | Policies Grow on Trees: Model Checking Families of MDPs | Roman Andriushchenko et.al. | 2407.12552 | null |
2024-07-17 | Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving | Yuqi Dai et.al. | 2407.12491 | null |
2024-07-17 | What’s Distributive Justice Got to Do with It? Rethinking Algorithmic Fairness from the Perspective of Approximate Justice | Corinna Hertweck et.al. | 2407.12488 | null |
2024-07-17 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models | Thao Minh Nguyen Phan et.al. | 2407.12309 | null |
2024-07-16 | CLUE: Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation | Xianzhong Ding et.al. | 2407.12195 | link |
2024-07-16 | Satisficing Exploration for Deep Reinforcement Learning | Dilip Arumugam et.al. | 2407.12185 | null |
2024-07-16 | Exploration Unbound | Dilip Arumugam et.al. | 2407.12178 | null |
2024-07-16 | Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent | Karolis Jucys et.al. | 2407.12161 | null |
2024-07-16 | Predicting Emotion Intensity in Polish Political Texts: Comparing Supervised Models and Large Language Models in a Resource-Poor Language | Hubert Plisiecki et.al. | 2407.12141 | link |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-16 | Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Olga Zatsarynna et.al. | 2407.11954 | link |
2024-07-16 | Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain | Marco Huber et.al. | 2407.11941 | null |
2024-07-16 | InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback | Haishuo Fang et.al. | 2407.11843 | null |
2024-07-16 | MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Xiaoshuai Hao et.al. | 2407.11682 | null |
2024-07-16 | Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures | Guoliang You et.al. | 2407.11644 | null |
2024-07-16 | Rethinking Fair Graph Neural Networks from Re-balancing | Zhixun Li et.al. | 2407.11624 | link |
2024-07-16 | DRL-based Joint Resource Scheduling of eMBB and URLLC in O-RAN | Rana M. Sohaib et.al. | 2407.11558 | null |
2024-07-16 | How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models | Yin Jou Huang et.al. | 2407.11549 | link |
2024-07-16 | Generally-Occurring Model Change for Robust Counterfactual Explanations | Ao Xu et.al. | 2407.11426 | null |
2024-07-16 | Incremental high average-utility itemset mining: survey and challenges | Jing Chen et.al. | 2407.11425 | null |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-16 | InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Yinzhu Quan et.al. | 2407.11384 | link |
2024-07-17 | Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li et.al. | 2407.11382 | null |
2024-07-16 | Adaptive Environment-Aware Robotic Arm Reaching Based on a Bio-Inspired Neurodynamical Computational Framework | Dimitrios Chatziparaschis et.al. | 2407.11377 | null |
2024-07-16 | Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain | Hyeon Bae Kim et.al. | 2407.11375 | link |
2024-07-16 | Continuity Preserving Online CenterLine Graph Learning | Yunhui Han et.al. | 2407.11337 | link |
2024-07-15 | Novel Approach for Predicting the Air Quality Index of Megacities through Attention-Enhanced Deep Multitask Spatiotemporal Learning | Harun Khan et.al. | 2407.11283 | null |
2024-07-15 | Intelligent Cross-Organizational Process Mining: A Survey and New Perspectives | Yiyuan Yang et.al. | 2407.11280 | null |
2024-07-15 | CICAPT-IIOT: A provenance-based APT attack dataset for IIoT environment | Erfan Ghiasvand et.al. | 2407.11278 | null |
2024-07-15 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | link |
2024-07-15 | Enhancing Cyber Security through Predictive Analytics: Real-Time Threat Detection and Response | Muhammad Danish et.al. | 2407.10864 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent | Yannick Metz et.al. | 2407.10791 | null |
2024-07-15 | The Missing Link: Allocation Performance in Causal Machine Learning | Unai Fischer-Abaigar et.al. | 2407.10779 | null |
2024-07-15 | Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Alessandro Montenegro et.al. | 2407.10775 | null |
2024-07-15 | Multi-Objective Optimization and Multi-Criteria Decision-Making Approach to Design Multi-Tubular Packed-Bed Membrane Reactor in Oxidative Dehydrogenation of Ethane | Seyed Reza Nabavi et.al. | 2407.10774 | null |
2024-07-15 | Globally-Constrained Decentralized Optimization with Variable Coupling | Dandan Wang et.al. | 2407.10770 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | XEQ Scale for Evaluating XAI Experience Quality Grounded in Psychometric Theory | Anjana Wijekoon et.al. | 2407.10662 | null |
2024-07-15 | Exploring incentive strategies and predicting development trends for new energy vehicles | Tao Jin et.al. | 2407.10611 | null |
2024-07-15 | Leveraging Hybrid Intelligence Towards Sustainable and Energy-Efficient Machine Learning | Daniel Geissler et.al. | 2407.10580 | null |
2024-07-15 | Understanding the Dependence of Perception Model Competency on Regions in an Image | Sara Pohland et.al. | 2407.10543 | link |
2024-07-15 | Communication- and Computation-Efficient Distributed Decision-Making in Multi-Robot Networks | Zirui Xu et.al. | 2407.10382 | null |
2024-07-14 | Mapping the Scholarship of Dark Pattern Regulation: A Systematic Review of Concepts, Regulatory Paradigms, and Solutions from an Interdisciplinary Perspective | Weiwei Yi et.al. | 2407.10340 | null |
2024-07-14 | Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Yuchen Yang et.al. | 2407.10299 | link |
2024-07-14 | Next-Generation 6G Networks: Deploying Cybertwin Technology for Enhanced Healthcare Solutions | Alinafe Kaliwo et.al. | 2407.10292 | null |
2024-07-14 | Towards detailed and interpretable hybrid modeling of continental-scale bird migration | Fiona Lippert et.al. | 2407.10259 | null |
2024-07-14 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | link |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475 | null |
2024-07-12 | TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety | Sandeep Thalapanane et.al. | 2407.09466 | null |
2024-07-12 | Neuroevolution of Decentralized Decision-Making in N-Bead Swimmers Leads to Scalable and Robust Collective Locomotion | Benedikt Hartl et.al. | 2407.09438 | null |
2024-07-12 | Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses | Marios Constantinides et.al. | 2407.09322 | link |
2024-07-12 | Sample size for developing a prediction model with a binary outcome: targeting precise individual risk estimates to improve clinical decisions and fairness | Richard D Riley et.al. | 2407.09293 | null |
2024-07-12 | Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Thuy Ngoc Nguyen et.al. | 2407.09281 | null |
2024-07-12 | GNN with Model-based RL for Multi-agent Systems | Hanxiao Chen et.al. | 2407.09249 | null |
2024-07-12 | Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network | Shun Kotoku et.al. | 2407.09124 | null |
2024-07-12 | KUNPENG: An Embodied Large Model for Intelligent Maritime | Naiyao Wang et.al. | 2407.09048 | link |
2024-07-12 | Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision | Zahra Rahmani et.al. | 2407.09004 | null |
2024-07-12 | Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control | Sicong Jiang et.al. | 2407.08964 | null |
2024-07-12 | Bora: Biomedical Generalist Video Generation Model | Weixiang Sun et.al. | 2407.08944 | null |
2024-07-12 | Deep Attention Driven Reinforcement Learning (DAD-RL) for Autonomous Vehicle Decision-Making in Dynamic Environment | Jayabrata Chowdhury et.al. | 2407.08932 | link |
2024-07-11 | DeepCodeProbe: Towards Understanding What Models Trained on Code Learn | Vahid Majdinasab et.al. | 2407.08890 | link |
2024-07-11 | Generalizable Physics-informed Learning for Stochastic Safety-critical Systems | Zhuoyuan Wang et.al. | 2407.08868 | null |
2024-07-11 | Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans | Edward Wang et.al. | 2407.08650 | link |
2024-07-11 | A Review of Nine Physics Engines for Reinforcement Learning Research | Michael Kaup et.al. | 2407.08590 | null |
2024-07-11 | MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps | Hang Wu et.al. | 2407.08561 | null |
2024-07-11 | BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight | Hang Wu et.al. | 2407.08526 | null |
2024-07-11 | Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents | Haoyi Xiong et.al. | 2407.08516 | null |
2024-07-11 | Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning | Shulin Song et.al. | 2407.08458 | link |
2024-07-11 | CLEO: Continual Learning of Evolving Ontologies | Shishir Muralidhara et.al. | 2407.08411 | null |
2024-07-11 | Specialist vision-language models for clinical ophthalmology | Robbie Holland et.al. | 2407.08410 | link |
2024-07-11 | Data-Driven Model Predictive Control for Autonomous Vehicle Steering | Jiarui Zhang et.al. | 2407.08401 | null |
2024-07-11 | Accurate Cooperative Localization Utilizing LiDAR-equipped Roadside Infrastructure for Autonomous Driving | Yuze Jiang et.al. | 2407.08384 | null |
2024-07-11 | WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving | Jannik Zürn et.al. | 2407.08280 | link |
2024-07-11 | Explainability of Sub-Field Level Crop Yield Prediction using Remote Sensing | Hiba Najjar et.al. | 2407.08274 | null |
2024-07-11 | Efficient Reinforcement Learning On Passive RRAM Crossbar Array | Arjun Tyagi et.al. | 2407.08242 | null |
2024-07-11 | CoGS: Causality Constrained Counterfactual Explanations using goal-directed ASP | Sopam Dasgupta et.al. | 2407.08179 | null |
2024-07-10 | NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving | Donghyun Kim et.al. | 2407.08073 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | When to Accept Automated Predictions and When to Defer to Human Judgment? | Daniel Sikar et.al. | 2407.07821 | null |
2024-07-10 | The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others | Daniel Sikar et.al. | 2407.07818 | null |
2024-07-11 | Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard | Oguzhan Topsakal et.al. | 2407.07796 | link |
2024-07-10 | LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jörg Gamerdinger et.al. | 2407.07740 | null |
2024-07-10 | Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control | Elahe Delavari et.al. | 2407.07684 | null |
2024-07-10 | Why should we ever automate moral decision making? | Vincent Conitzer et.al. | 2407.07671 | null |
2024-07-10 | Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning | Dake Zhang et.al. | 2407.07631 | null |
2024-07-10 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-10 | Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles | Dongfang Guo et.al. | 2407.07510 | null |
2024-07-10 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
2024-07-10 | CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias | Jiacheng Shen et.al. | 2407.07454 | link |
2024-07-10 | Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Bhagyashree Puranik et.al. | 2407.07350 | link |
2024-07-11 | FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification | Doanh C. Bui et.al. | 2407.07340 | link |
2024-07-10 | Event-Aided Time-to-Collision Estimation for Autonomous Driving | Jinghang Li et.al. | 2407.07324 | null |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | The mouth speaks as much as the eyes: Free-ranging dogs depend on inner facial features for human recognition | Rohan Sarkar et.al. | 2407.07192 | null |
2024-07-09 | Can Learned Optimization Make Reinforcement Learning Less Difficult? | Alexander David Goldie et.al. | 2407.07082 | link |
2024-07-09 | Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | Haicheng Liao et.al. | 2407.07020 | null |
2024-07-09 | End-To-End Causal Effect Estimation from Unstructured Natural Language Data | Nikita Dhawan et.al. | 2407.07018 | null |
2024-07-09 | Explainable AI for Enhancing Efficiency of DL-based Channel Estimation | Abdul Karim Gizzini et.al. | 2407.07009 | null |
2024-07-09 | Learning to Complement and to Defer to Multiple Users | Zheng Zhang et.al. | 2407.07003 | link |
2024-07-09 | Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge | Sriram Yenamandra et.al. | 2407.06939 | null |
2024-07-09 | Efficiency of the convex hull of the columns of certain triple perturbed consistent matrices | Susana Furtado et.al. | 2407.06878 | null |
2024-07-08 | A Mamba-based Siamese Network for Remote Sensing Change Detection | Jay N. Paranjape et.al. | 2407.06839 | link |
2024-07-09 | MDP Geometry, Normalization and Value Free Solvers | Arsenii Mustafin et.al. | 2407.06712 | null |
2024-07-09 | Integrating Clinical Knowledge into Concept Bottleneck Models | Winnie Pang et.al. | 2407.06600 | link |
2024-07-10 | FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Yangyang Yu et.al. | 2407.06567 | null |
2024-07-09 | Exploring the Causality of End-to-End Autonomous Driving | Jiankun Li et.al. | 2407.06546 | link |
2024-07-09 | Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications | Maoxin Ji et.al. | 2407.06518 | link |
2024-07-09 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-09 | Economic span selection of bridge based on deep reinforcement learning | Leye Zhang et.al. | 2407.06507 | link |
2024-07-09 | Not all explicit cues help communicate: Pedestrians’ perceptions, fixations, and decisions toward automated vehicles with varied appearance | Wei Lyu et.al. | 2407.06505 | null |
2024-07-10 | Optimal Decision Making Through Scenario Simulations Using Large Language Models | Sumedh Rasal et.al. | 2407.06486 | null |
2024-07-10 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Jianuo Huang et.al. | 2407.06317 | null |
2024-07-10 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-08 | Real Space Imaging of Field-Driven Decision-Making in Nanomagnetic Galton Boards | Hanu Arava et.al. | 2407.06130 | null |
2024-07-08 | Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning | Yadong Zhang et.al. | 2407.06112 | null |
2024-07-08 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
2024-07-08 | How to Add Baskets to an Ongoing Basket Trial with Information Borrowing | Libby Daniells et.al. | 2407.06069 | link |
2024-07-08 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Sarah Elmahdy et.al. | 2407.06016 | null |
2024-07-08 | Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation Experiments | Luka Kovačević et.al. | 2407.06015 | link |
2024-07-08 | Towards A Comprehensive Visual Saliency Explanation Framework for AI-based Face Recognition Systems | Yuhang Lu et.al. | 2407.05983 | null |
2024-07-08 | Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding | Aaron Lohner et.al. | 2407.05910 | null |
2024-07-08 | Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation | Jiaqi Chen et.al. | 2407.05890 | null |
2024-07-08 | Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition | Yaozong Gan et.al. | 2407.05814 | null |
2024-07-08 | MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles | Sushil Sharma et.al. | 2407.05811 | null |
2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | null |
2024-07-08 | Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports | Yutong Zhang et.al. | 2407.05758 | null |
2024-07-08 | BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Yumeng Zhang et.al. | 2407.05679 | link |
2024-07-08 | MSTF: Multiscale Transformer for Incomplete Trajectory Prediction | Zhanwen Liu et.al. | 2407.05671 | null |
2024-07-08 | Explainable Image Recognition via Enhanced Slot-attention Based Classifier | Bowen Wang et.al. | 2407.05616 | null |
2024-07-08 | GenFollower: Enhancing Car-Following Prediction with Large Language Models | Xianda Chen et.al. | 2407.05611 | null |
2024-07-08 | Cost-Efficient Computation Offloading in SAGIN: A Deep Reinforcement Learning and Perception-Aided Approach | Yulan Gao et.al. | 2407.05571 | null |
2024-07-05 | DCZNMaker: A Web-based Application for Multi-Attribute Utilities Analysis | Adrienne Kline et.al. | 2407.04655 | null |
2024-07-05 | Multiple stage stochastic linear programming with multiple objectives: flexible decision making | Andreas H. Hamel et.al. | 2407.04602 | null |
2024-07-05 | Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | Shumaila Javaid et.al. | 2407.04581 | null |
2024-07-05 | Graph Reinforcement Learning in Power Grids: A Survey | Mohamed Hassouna et.al. | 2407.04522 | null |
2024-07-05 | Leveraging Graph Structures to Detect Hallucinations in Large Language Models | Noa Nonkes et.al. | 2407.04485 | link |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games | Nathan Herr et.al. | 2407.04467 | null |
2024-07-05 | Nash epidemics | Simon K. Schnyder et.al. | 2407.04366 | null |
2024-07-05 | AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Petr Anokhin et.al. | 2407.04363 | link |
2024-07-05 | Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing | Tong Wang et.al. | 2407.04359 | null |
2024-07-05 | MobileFlow: A Multimodal LLM For Mobile GUI Agent | Songqin Nong et.al. | 2407.04346 | null |
2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | null |
2024-07-05 | Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling | Jiawei Xu et.al. | 2407.04285 | null |
2024-07-05 | WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning | Yiheng Li et.al. | 2407.04281 | link |
2024-07-05 | Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey | Han Wang et.al. | 2407.04277 | null |
2024-07-04 | Quantifying Prediction Consistency Under Model Multiplicity in Tabular LLMs | Faisal Hamman et.al. | 2407.04173 | null |
2024-07-04 | ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Ahmed Masry et.al. | 2407.04172 | link |
2024-07-04 | Annotating Control-Flow Graphs for Formalized Test Coverage Criteria | Sean Kauffman et.al. | 2407.04144 | null |
2024-07-04 | Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving | Sergio. Martín Serrano et.al. | 2407.04070 | null |
2024-07-04 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | link |
2024-07-03 | Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks | Mintae Kim et.al. | 2407.03280 | null |
2024-07-03 | Streaming Large-Scale Electron Microscopy Data to a Supercomputing Facility | Samuel S. Welborn et.al. | 2407.03215 | null |
2024-07-03 | Tail calibration of probabilistic forecasts | Sam Allen et.al. | 2407.03167 | link |
2024-07-03 | xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN | Hakan Erdol et.al. | 2407.03068 | null |
2024-07-03 | Predictions and Decision Making for Resilient Intelligent Sustainable Energy Systems | Martin Braun et.al. | 2407.03021 | null |
2024-07-03 | VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu et.al. | 2407.03000 | null |
2024-07-04 | Timely Requesting for Time-Critical Content Users in Decentralized F-RANs | Xingran Chen et.al. | 2407.02930 | null |
2024-07-03 | Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving | Yipin Guo et.al. | 2407.02878 | null |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-03 | Optimization of End-to-End AoI in Edge-Enabled Vehicular Fog Systems: A Dueling-DQN Approach | Seifu Birhanu Tadele et.al. | 2407.02815 | null |
2024-07-03 | Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu et.al. | 2407.02797 | link |
2024-07-03 | DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing | Hoa T. Nguyen et.al. | 2407.02748 | null |
2024-07-04 | The path towards contact-based physical human-robot interaction | Mohammad Farajtabar et.al. | 2407.02664 | null |
2024-07-02 | ResearchBot: Bridging the Gap between Academic Research and Practical Programming Communities | Sahar Farzanehpour et.al. | 2407.02643 | null |
2024-07-02 | D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions | Hareem Nisar et.al. | 2407.02604 | null |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-07-02 | Diffusion Models for Tabular Data Imputation and Synthetic Data Generation | Mario Villaizán-Vallelado et.al. | 2407.02549 | null |
2024-07-02 | AXIAL: Attention-based eXplainability for Interpretable Alzheimer’s Localized Diagnosis using 2D CNNs on 3D MRI brain scans | Gabriele Lozupone et.al. | 2407.02418 | link |
2024-07-02 | Multilingual Trolley Problems for Language Models | Zhijing Jin et.al. | 2407.02273 | link |
2024-07-02 | Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots | JiaQi Luo et.al. | 2407.02197 | null |
2024-07-02 | I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry | Wenlu Yu et.al. | 2407.02190 | link |
2024-07-02 | Distributional Regression U-Nets for the Postprocessing of Precipitation Ensemble Forecasts | Romain Pic et.al. | 2407.02125 | link |
2024-07-02 | Automated Knowledge Graph Learning in Industrial Processes | Lolitta Ammann et.al. | 2407.02106 | null |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-07-02 | LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection | Yansong Gong et.al. | 2407.02061 | null |
2024-07-02 | Revolutionising Role-Playing Games with ChatGPT | Rita Stampfl et.al. | 2407.02048 | null |
2024-07-03 | ViG-Bias: Visually Grounded Bias Discovery and Mitigation | Badr-Eddine Marani et.al. | 2407.01996 | null |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | Jianan Zhang et.al. | 2407.01956 | null |
2024-07-02 | CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications | Yupeng Cao et.al. | 2407.01953 | null |
2024-07-02 | LDP: A Local Diffusion Planner for Efficient Robot Navigation and Collision Avoidance | Wenhao Yu et.al. | 2407.01950 | null |
2024-07-02 | Probabilistic 3D Correspondence Prediction from Sparse Unsegmented Images | Krithika Iyer et.al. | 2407.01931 | null |
2024-07-02 | Securing Distributed Network Digital Twin Systems Against Model Poisoning Attacks | Zifan Zhang et.al. | 2407.01917 | null |
2024-07-02 | Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | Fanzeng Xia et.al. | 2407.01887 | null |
2024-07-01 | An Efficient and Sybil Attack Resistant Voting Mechanism | Jeremias Lenzi et.al. | 2407.01844 | null |
2024-07-01 | Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY) | Amirhossein Parsi et.al. | 2407.01769 | null |
2024-07-01 | Predicting Trust Dynamics with Dynamic SEM in Human-AI Cooperation | Sota Kaneko et.al. | 2407.01752 | null |
2024-06-28 | Futility analyses for the MCP-Mod methodology based on longitudinal models | Björn Bornkamp et.al. | 2406.19965 | null |
2024-06-28 | Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems | Fabian Kreß et.al. | 2406.19913 | null |
2024-06-28 | Evaluating potential landing sites for the Artemis III mission using a multi-criteria decision making approach | Eloy Peña-Asensio et.al. | 2406.19863 | null |
2024-06-28 | Operator World Models for Reinforcement Learning | Pietro Novelli et.al. | 2406.19861 | link |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | LCSim: A Large-Scale Controllable Traffic Simulator | Yuheng Zhang et.al. | 2406.19781 | link |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction | Akash Awasthi et.al. | 2406.19686 | null |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-07-02 | Practical Power System Inertia Monitoring Based on Pumped Storage Hydropower Operation Signature | Hongyu Li et.al. | 2406.19627 | null |
2024-06-28 | Multimodal Data Integration for Precision Oncology: Challenges and Future Directions | Huajun Zhou et.al. | 2406.19611 | null |
2024-06-27 | Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and cooper applications | Yoav Nahshon et.al. | 2406.19509 | null |
2024-06-27 | Multi-agent Cooperative Games Using Belief Map Assisted Training | Qinwei Huang et.al. | 2406.19477 | link |
2024-06-27 | TTP-Based Cyber Resilience Index: A Probabilistic Quantitative Approach to Measure Defence Effectiveness Against Cyber Attacks | Lampis Alevizos et.al. | 2406.19374 | null |
2024-06-27 | The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui et.al. | 2406.19307 | null |
2024-06-28 | FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Shubhankar Singh et.al. | 2406.19237 | null |
2024-06-27 | Think Step by Step: Chain-of-Gesture Prompting for Error Detection in Robotic Surgical Videos | Zhimin Shao et.al. | 2406.19217 | null |
2024-06-27 | CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen et.al. | 2406.19131 | link |
2024-06-27 | Evidential Concept Embedding Models: Towards Reliable Concept Explanations for Skin Disease Diagnosis | Yibo Gao et.al. | 2406.19130 | link |
2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | null |
2024-06-27 | Fine-tuned network relies on generic representation to solve unseen cognitive task | Dongyan Lin et.al. | 2406.18926 | null |
2024-06-27 | The Rise of Artificial Intelligence in Educational Measurement: Opportunities and Ethical Challenges | Okan Bulut et.al. | 2406.18900 | null |
2024-06-27 | Sequential three-way group decision-making for double hierarchy hesitant fuzzy linguistic term set | Nanfang Luo et.al. | 2406.18884 | null |
2024-06-27 | From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions | Trenton Chang et.al. | 2406.18865 | link |
2024-06-27 | Predicting the duration of traffic incidents for Sydney greater metropolitan area using machine learning methods | Artur Grigorev et.al. | 2406.18861 | link |
2024-06-28 | The Impact of Feature Representation on the Accuracy of Photonic Neural Networks | Mauricio Gomes de Queiroz et.al. | 2406.18757 | link |
2024-06-26 | Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Emanuel Figetakis et.al. | 2406.18741 | null |
2024-06-26 | Petal-X: Human-Centered Visual Explanations to Improve Cardiovascular Risk Communication | Diego Rojo et.al. | 2406.18690 | null |
2024-06-26 | A Zero Auxiliary Knowledge Membership Inference Attack on Aggregate Location Data | Vincent Guan et.al. | 2406.18671 | null |
2024-06-26 | Mental Modeling of Reinforcement Learning Agents by Language Models | Wenhao Lu et.al. | 2406.18505 | null |
2024-06-26 | Complexity Aversion | Yuan Gu et.al. | 2406.18463 | null |
2024-06-27 | XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Hao Li et.al. | 2406.18360 | null |
2024-06-26 | Kolmogorov-Arnold Graph Neural Networks | Gianluca De Carlo et.al. | 2406.18354 | null |
2024-06-26 | Octo-planner: On-device Language Model for Planner-Action Agents | Wei Chen et.al. | 2406.18082 | null |
2024-06-26 | On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations | Yaqian Hao et.al. | 2406.18065 | null |
2024-06-26 | Multi-step Knowledge Retrieval and Inference over Unstructured Data | Aditya Kalyanpur et.al. | 2406.17987 | null |
2024-06-25 | Emerging AI-based weather prediction models as downscaling tools | Nikolay Koldunov et.al. | 2406.17977 | null |
2024-06-25 | Unbiasing on the Fly: Explanation-Guided Human Oversight of Machine Learning System Decisions | Hussaini Mamman et.al. | 2406.17906 | null |
2024-06-25 | Analysis of the Causes of Car Accidents in the United States of America in 2023: Gauge People Understanding of Data Visualisation | Hamoud Alhazmi et.al. | 2406.17872 | link |
2024-06-25 | End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation | Mingzhe Guo et.al. | 2406.17680 | null |
2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | link |
2024-06-25 | Querying Labeled Time Series Data with Scenario Programs | Devan Shanker et.al. | 2406.17627 | null |
2024-06-26 | Enhancing Explainability of Knowledge Learning Paths: Causal Knowledge Networks | Yuang Wei et.al. | 2406.17518 | null |
2024-06-25 | Robust Pareto Design of GaN HEMTs for Millimeter-Wave Applications | Rafael Perez Martinez et.al. | 2406.17337 | null |
2024-06-25 | Task Adaptation in Industrial Human-Robot Interaction: Leveraging Riemannian Motion Policies | Mike Allenspach et.al. | 2406.17333 | null |
2024-06-25 | The State-Action-Reward-State-Action Algorithm in Spatial Prisoner’s Dilemma Game | Lanyu Yang et.al. | 2406.17326 | null |
2024-06-25 | Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He et.al. | 2406.17274 | link |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Large Language Models are Interpretable Learners | Ruochen Wang et.al. | 2406.17224 | link |
2024-06-25 | VR-based Blockchain-enabled Data Visualization Framework For Manufacturing Industry | Nitol Saha et.al. | 2406.17207 | null |
2024-06-25 | Model Checking of vGOAL | Yi Yang et.al. | 2406.17206 | null |
2024-06-24 | Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors | Vikas Yadav et.al. | 2406.17163 | null |
2024-06-24 | Integrating Generative AI with Network Digital Twins for Enhanced Network Operations | Kassi Muhammad et.al. | 2406.17112 | null |
2024-06-24 | Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making | Vivek Myers et.al. | 2406.17098 | link |
2024-06-24 | Boosting Bitcoin Minute Trend Prediction Using the Separation Index | Zeinab Shahsafdari et.al. | 2406.17083 | null |
2024-06-24 | Large Language Models Assume People are More Rational than We Really are | Ryan Liu et.al. | 2406.17055 | link |
2024-06-26 | Fair game: Urban free-ranging dogs balance resource use and risk aversion at seasonal fairs | Sourabh Biswas et.al. | 2406.17004 | null |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-06-24 | ShanghaiTech Mapping Robot is All You Need: Robot System for Collecting Universal Ground Vehicle Datasets | Bowen Xu et.al. | 2406.16713 | null |
2024-06-24 | Hacking a surrogate model approach to XAI | Alexander Wilhelm et.al. | 2406.16626 | null |
2024-06-24 | QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds | Ye Wang et.al. | 2406.16578 | null |
2024-06-24 | Differentiable Distributionally Robust Optimization Layers | Xutao Ma et.al. | 2406.16571 | link |
2024-06-24 | Conditional Bayesian Quadrature | Zonghao Chen et.al. | 2406.16530 | link |
2024-06-24 | UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin et.al. | 2406.16382 | null |
2024-06-24 | What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation | Michal Golovanevsky et.al. | 2406.16320 | link |
2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link |
2024-06-23 | Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA | Zehuan Zhang et.al. | 2406.16198 | link |
2024-06-23 | Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Kshitij Bhatta et.al. | 2406.16191 | null |
2024-06-23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et.al. | 2406.16072 | link |
2024-06-23 | Entropy-driven decision-making dynamics sheds light on the emergence of the “paradox of choice” | Manish Gupta et.al. | 2406.16051 | null |
2024-06-23 | Imperfect-Recall Games: Equilibrium Concepts and Their Complexity | Emanuel Tewolde et.al. | 2406.15970 | null |
2024-06-22 | LLM-Powered Explanations: Unraveling Recommendations Through Subgraph Reasoning | Guangsi Shi et.al. | 2406.15859 | null |
2024-06-22 | Learning Abstract World Model for Value-preserving Planning with Options | Rafael Rodriguez-Sanchez et.al. | 2406.15850 | null |
2024-06-22 | CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal et.al. | 2406.15823 | null |
2024-06-22 | Privacy Implications of Explainable AI in Data-Driven Systems | Fatima Ezzeddine et.al. | 2406.15789 | null |
2024-06-22 | ISS-Scenario: Scenario-based Testing in CARLA | Renjue Li et.al. | 2406.15777 | link |
2024-06-21 | PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images | Parastoo Sotoudeh Sharifi et.al. | 2406.15685 | link |
2024-06-21 | NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Daniel Dauner et.al. | 2406.15349 | link |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-21 | Multimodal Deformable Image Registration for Long-COVID Analysis Based on Progressive Alignment and Multi-perspective Loss | Jiahua Li et.al. | 2406.15172 | null |
2024-06-21 | A Unified Framework for Input Feature Attribution Analysis | Jingyi Sun et.al. | 2406.15085 | null |
2024-06-21 | KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning | Jiahan Chen et.al. | 2406.15073 | null |
2024-06-21 | Colorful Priority $k$ -Supplier | Chandra Chekuri et.al. | 2406.14984 | null |
2024-06-21 | Autonomous Decision Making for Air Taxi Networks | Alex Vesel et.al. | 2406.14832 | link |
2024-06-20 | ImageFlowNet: Forecasting Multiscale Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images | Chen Liu et.al. | 2406.14794 | link |
2024-06-20 | Active Learning for Fair and Stable Online Allocations | Riddhiman Bhattacharya et.al. | 2406.14784 | null |
2024-06-20 | Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach | Mehran Berahman et.al. | 2406.14766 | null |
2024-06-20 | Risk thresholds for frontier AI | Leonie Koessler et.al. | 2406.14713 | null |
2024-06-20 | Preferential Multi-Objective Bayesian Optimization | Raul Astudillo et.al. | 2406.14699 | null |
2024-06-20 | Advantage Alignment Algorithms | Juan Agustin Duque et.al. | 2406.14662 | null |
2024-06-20 | ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights | Gabriel Sarch et.al. | 2406.14596 | null |
2024-06-20 | Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Hao et.al. | 2406.14593 | link |
2024-06-21 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Chuqiao Zong et.al. | 2406.14537 | link |
2024-06-20 | Energy Mapping of Existing Building Stock in Cambridge using Energy Performance Certificates and Thermal Infrared Imagery | Yinglong He et.al. | 2406.14520 | null |
2024-06-20 | FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding | Mingkun Wang et.al. | 2406.14422 | null |
2024-06-20 | PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions | Sihan Ma et.al. | 2406.14367 | null |
2024-06-20 | iWISDM: Assessing instruction following in multimodal models at scale | Xiaoxuan Lei et.al. | 2406.14343 | link |
2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | null |
2024-06-20 | Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers | Harald Semmelrock et.al. | 2406.14325 | null |
2024-06-21 | E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion | Ke Wang et.al. | 2406.14250 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-21 | REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability | Shuang Ao et.al. | 2406.14214 | link |
2024-06-20 | Tractable Equilibrium Computation in Markov Games through Risk Aversion | Eric Mazumdar et.al. | 2406.14156 | null |
2024-06-20 | Self-Attention in Transformer Networks Explains Monkeys’ Gaze Pattern in Pac-Man Game | Zhongqiao Lin et.al. | 2406.14100 | null |
2024-06-20 | GTP-UDrive: Unified Game-Theoretic Trajectory Planner and Decision-Maker for Autonomous Driving in Mixed Traffic Environments | Nouhed Naidja et.al. | 2406.14077 | null |
2024-06-20 | Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Xinbo Zhao et.al. | 2406.14054 | null |
2024-06-20 | MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models | Zhongshen Zeng et.al. | 2406.13975 | null |
2024-06-20 | CityBench: Evaluating the Capabilities of Large Language Model as World Model | Jie Feng et.al. | 2406.13945 | link |
2024-06-20 | A Decision-Making GPT Model Augmented with Entropy Regularization for Autonomous Vehicles | Jiaqi Liu et.al. | 2406.13908 | null |
2024-06-20 | The Use of Multimodal Large Language Models to Detect Objects from Thermal Images: Transportation Applications | Huthaifa I. Ashqar et.al. | 2406.13898 | null |
2024-06-19 | Combining Combined Forecasts: a Network Approach | Marcos R. Fernandes et.al. | 2406.13749 | null |
2024-06-18 | Scalable Rule Lists Learning with Sampling | Leonardo Pellegrina et.al. | 2406.12803 | link |
2024-06-18 | Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly | Siddhant Shete et.al. | 2406.12698 | null |
2024-06-18 | Investigating the Role of Explainability and AI Literacy in User Compliance | Niklas Kühl et.al. | 2406.12660 | null |
2024-06-18 | Ask-before-Plan: Proactive Language Agents for Real-World Planning | Xuan Zhang et.al. | 2406.12639 | link |
2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | link |
2024-06-18 | PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers | Myeonghwa Lee et.al. | 2406.12430 | link |
2024-06-18 | Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models | David Bergström et.al. | 2406.12423 | null |
2024-06-18 | UAV-based Intelligent Information Systems on Winter Road Safety for Autonomous Vehicles | Siva Ariram et.al. | 2406.12370 | null |
2024-06-18 | A framework for developing a knowledge management platform | Marie Lisandra Zepeda Mendoza et.al. | 2406.12313 | null |
2024-06-19 | Is Your HD Map Constructor Reliable under Sensor Corruptions? | Xiaoshuai Hao et.al. | 2406.12214 | null |
2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | null |
2024-06-18 | Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno et.al. | 2406.12165 | link |
2024-06-17 | Efficient Sequential Decision Making with Large Language Models | Dingyang Chen et.al. | 2406.12125 | null |
2024-06-19 | Computing in the Life Sciences: From Early Algorithms to Modern AI | Samuel A. Donkor et.al. | 2406.12108 | link |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Grade Score: Quantifying LLM Performance in Option Selection | Dmitri Iourovitski et.al. | 2406.12043 | link |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | Online Pareto-Optimal Decision-Making for Complex Tasks using Active Inference | Peter Amorese et.al. | 2406.11984 | null |
2024-06-17 | Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2406.11941 | null |
2024-06-17 | Optimal Transport-Assisted Risk-Sensitive Q-Learning | Zahra Shahrooei et.al. | 2406.11774 | null |
2024-06-18 | CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning | Huaiguang Cai et.al. | 2406.11730 | null |
2024-06-17 | A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving | Yang Lou et.al. | 2406.11707 | null |
2024-06-17 | Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs | Min Hua et.al. | 2406.11653 | null |
2024-06-17 | Statistical Evolution of ODI Cricket: Analyzing Performance Trends and Effect Sizes | Pratik Mullick et.al. | 2406.11652 | null |
2024-06-17 | GRID-FAST: A Grid-based Intersection Detection for Fast Semantic Topometric Mapping | Scott Fredriksson et.al. | 2406.11635 | null |
2024-06-17 | Multistability of Small Zero-One Reaction Networks | Yue Jiao et.al. | 2406.11586 | link |
2024-06-17 | Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Vaneet Aggarwal et.al. | 2406.11481 | null |
2024-06-17 | Calibrating Where It Matters: Constrained Temperature Scaling | Stephen McKenna et.al. | 2406.11456 | null |
2024-06-17 | Can AI with High Reasoning Ability Replicate Human-like Decision Making in Economic Experiments? | Ayato Kitadai et.al. | 2406.11426 | null |
2024-06-17 | Predictive Probabilities Made Simple: A Fast and Accurate Method for Clinical Trial Decision Making | Joe Marion et.al. | 2406.11406 | null |
2024-06-17 | Uncertainties in ROC (Receiver Operating Characteristic) Curves Derived from Counting Data | M. P. Fewell et.al. | 2406.11396 | null |
2024-06-17 | Unveiling Assumptions: Exploring the Decisions of AI Chatbots and Human Testers | Francisco Gomes de Oliveira Neto et.al. | 2406.11339 | null |
2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | link |
2024-06-17 | Development of an Adaptive Multi-Domain Artificial Intelligence System Built using Machine Learning and Expert Systems Technologies | Jeremy Straub et.al. | 2406.11272 | null |
2024-06-17 | Learning Iterative Reasoning through Energy Diffusion | Yilun Du et.al. | 2406.11179 | null |
2024-06-17 | Unanimity of two selves in decision making | Pierre Bardier et.al. | 2406.11166 | null |
2024-06-17 | Model Adaptation for Time Constrained Embodied Control | Jaehyun Song et.al. | 2406.11128 | null |
2024-06-16 | Not All Bias is Bad: Balancing Rational Deviations and Cognitive Biases in Large Language Model Reasoning | Liman Wang et.al. | 2406.10999 | link |
2024-06-18 | City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization | Zihao Jiao et.al. | 2406.10958 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report | Zhongyu Yang et.al. | 2406.10125 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | Global Crop-Specific Fertilization Dataset from 1961-2019 | Fernando Coello et.al. | 2406.10001 | link |
2024-06-14 | SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Aldi Piroli et.al. | 2406.09945 | null |
2024-06-14 | CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions | Mingyu Derek Ma et.al. | 2406.09923 | link |
2024-06-14 | Globally Optimal GNSS Multi-Antenna Lever Arm Calibration | Thomas Wodtko et.al. | 2406.09866 | null |
2024-06-14 | LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Grigor Bezirganyan et.al. | 2406.09864 | link |
2024-06-14 | Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments | Zhenrui Yue et.al. | 2406.09815 | null |
2024-06-14 | A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion | Kailai Sun et.al. | 2406.09792 | link |
2024-06-14 | Road to Serenity: Individual Variations in the Efficacy of Unobtrusive Respiratory Guidance for Driving Stress Regulation | A. J. Bequet et.al. | 2406.09777 | null |
2024-06-14 | Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology | Haowei Yang et.al. | 2406.09773 | null |
2024-06-14 | Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Xiaojun Bi et.al. | 2406.09755 | null |
2024-06-14 | MoME: Mixture of Multimodal Experts for Cancer Survival Prediction | Conghao Xiong et.al. | 2406.09696 | link |
2024-06-13 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-13 | Towards Domain Adaptive Neural Contextual Bandits | Ziyan Wang et.al. | 2406.09564 | null |
2024-06-13 | Finite-Agent Stochastic Differential Games on Large Graphs: I. The Linear-Quadratic Case | Ruimeng Hu et.al. | 2406.09523 | null |
2024-06-13 | CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making | Zibin Dong et.al. | 2406.09509 | link |
2024-06-13 | Fair Data Generation via Score-based Diffusion Model | Yujie Lin et.al. | 2406.09495 | null |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | Active Inference Meeting Energy-Efficient Control of Parallel and Identical Machines | Yavar Taheri Yeganeh et.al. | 2406.09322 | link |
2024-06-13 | A tutorial on fairness in machine learning in healthcare | Jianhui Gao et.al. | 2406.09307 | null |
2024-06-13 | General Bayesian Predictive Synthesis | Masahiro Kato et.al. | 2406.09254 | null |
2024-06-13 | Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Kaavya Rekanar et.al. | 2406.09203 | null |
2024-06-13 | Auto-Vocabulary Segmentation for LiDAR Points | Weijie Wei et.al. | 2406.09126 | null |
2024-06-13 | Beyond Recommendations: From Backward to Forward AI Support of Pilots’ Decision-Making Process | Zelun Tony Zhang et.al. | 2406.08959 | null |
2024-06-13 | Beyond the Calibration Point: Mechanism Comparison in Differential Privacy | Georgios Kaissis et.al. | 2406.08918 | null |
2024-06-13 | CIMRL: Combining IMitiation and Reinforcement Learning for Safe Autonomous Driving | Jonathan Booher et.al. | 2406.08878 | null |
2024-06-13 | Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization | Sumin Zhang et.al. | 2406.08855 | null |
2024-06-13 | Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Georg Goldenits et.al. | 2406.08854 | null |
2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | link |
2024-06-13 | Interpretable Temporal Class Activation Representation for Audio Spoofing Detection | Menglu Li et.al. | 2406.08825 | link |
2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | link |
2024-06-13 | Mathematical models for off-ball scoring prediction in basketball | Rikako Kono et.al. | 2406.08749 | link |
2024-06-13 | UruBots Autonomous Cars Team One Description Paper for FIRA 2024 | Pablo Moraes et.al. | 2406.08745 | null |
2024-06-12 | Defining a Reference Architecture for Edge Systems in Highly-Uncertain Environments | Kevin Pitstick et.al. | 2406.08583 | null |
2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481 | link |
2024-06-12 | PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations | Daniel Coelho et.al. | 2406.08421 | link |
2024-06-12 | LaneCPP: Continuous 3D Lane Detection using Physical Priors | Maximilian Pittner et.al. | 2406.08381 | null |
2024-06-12 | Utilizing Navigation Path to Generate Target Point for Enhanced End-to-End Autonomous Driving Planning | Yuanhua Shen et.al. | 2406.08349 | null |
2024-06-12 | Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework | Ruibo Tu et.al. | 2406.08311 | link |
2024-06-12 | The Importance of Positional Encoding Initialization in Transformers for Relational Reasoning | Takuya Ito et.al. | 2406.08272 | null |
2024-06-12 | Valeo4Cast: A Modular Approach to End-to-End Forecasting | Yihong Xu et.al. | 2406.08113 | link |
2024-06-12 | Conference Proceedings of The European DAO Workshop 2024 | Florian Spychiger et.al. | 2406.08110 | null |
2024-06-13 | CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems | Qianli Wang et.al. | 2406.08101 | link |
2024-06-12 | LVBench: An Extreme Long Video Understanding Benchmark | Weihan Wang et.al. | 2406.08035 | link |
2024-06-12 | Deep reinforcement learning with positional context for intraday trading | Sven Goluža et.al. | 2406.08013 | null |
2024-06-12 | Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning | Yizhe Huang et.al. | 2406.08002 | null |
2024-06-12 | Asymptotically Optimal Regret for Black-Box Predict-then-Optimize | Samuel Tan et.al. | 2406.07866 | null |
2024-06-12 | Are Objective Explanatory Evaluation metrics Trustworthy? An Adversarial Analysis | Prithwijit Chowdhury et.al. | 2406.07820 | null |
2024-06-11 | “It answers questions that I didn’t know I had”: Ph.D. Students’ Evaluation of an Information Sharing Knowledge Graph | Stanislava Gardasevic et.al. | 2406.07730 | null |
2024-06-11 | Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions | Leonardo Cotta et.al. | 2406.07685 | null |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482 | link |
2024-06-11 | Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling | Denis Blessing et.al. | 2406.07423 | link |
2024-06-11 | Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy | Xiaohan Huang et.al. | 2406.07404 | null |
2024-06-11 | Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Di Zhang et.al. | 2406.07394 | link |
2024-06-11 | World Models with Hints of Large Language Models for Goal Achieving | Zeyuan Liu et.al. | 2406.07381 | null |
2024-06-11 | EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning | Yijun Hao et.al. | 2406.07342 | null |
2024-06-11 | Capacity Credit Evaluation of Generalized Energy Storage Considering Endogenous Uncertainty | Ning Qi et.al. | 2406.07338 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296 | link |
2024-06-11 | Optimal policy design for decision problems under social influence | Valentina Breschi et.al. | 2406.07282 | null |
2024-06-11 | Towards Human-AI Collaboration in Healthcare: Guided Deferral Systems with Large Language Models | Joshua Strong et.al. | 2406.07212 | null |
2024-06-11 | Bilevel optimization with sustainability perspective: a survey on applications | Giulia Caselli et.al. | 2406.07184 | null |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
2024-06-11 | SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures | Christina Giannoula et.al. | 2406.06900 | null |
2024-06-10 | Satisficing Exploration in Bandit Optimization | Qing Feng et.al. | 2406.06802 | null |
2024-06-10 | An Elliptic Kernel Unsupervised Autoencoder-Graph Convolutional Network Ensemble Model for Hyperspectral Unmixing | Estefania Alfaro-Mejia et.al. | 2406.06742 | null |
2024-06-10 | Long-Term Fairness Inquiries and Pursuits in Machine Learning: A Survey of Notions, Methods, and Challenges | Usman Gohar et.al. | 2406.06736 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation | Mohidul Haque Mridul et.al. | 2406.06500 | null |
2024-06-10 | Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang et.al. | 2406.06485 | null |
2024-06-10 | Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain | Brian Hu et.al. | 2406.06435 | link |
2024-06-10 | Hybrid Video Anomaly Detection for Anomalous Scenarios in Autonomous Driving | Daniel Bogdoll et.al. | 2406.06423 | null |
2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | null |
2024-06-10 | DualAD: Disentangling the Dynamic and Static World for End-to-End Driving | Simon Doll et.al. | 2406.06264 | null |
2024-06-10 | Data Augmentation in Earth Observation: A Diffusion Model Approach | Tiago Sousa et.al. | 2406.06218 | null |
2024-06-10 | Federated Machine Reasoning for Resource Provisioning in 6G O-RAN | Swastika Roy et.al. | 2406.06128 | null |
2024-06-10 | Sim-To-Real Transfer for Visual Reinforcement Learning of Deformable Object Manipulation for Robot-Assisted Surgery | Paul Maria Scheikl et.al. | 2406.06092 | null |
2024-06-10 | Algorithms for Multi-Criteria Decision-Making and Efficiency Analysis Problems | Fuh-Hwa Franklin Liu et.al. | 2406.06090 | null |
2024-06-10 | Text Analysis of ETDs in ProQuest Dissertations and Theses (PQDT) Global (2016-2018) | Manika Lamba et.al. | 2406.06076 | null |
2024-06-10 | Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review | Hafez Ghaemi et.al. | 2406.06041 | null |
2024-06-10 | Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context | Jingru Jia et.al. | 2406.05972 | null |
2024-06-09 | Hello Again! LLM-powered Personalized Agent for Long-term Dialogue | Hao Li et.al. | 2406.05925 | link |
2024-06-09 | Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Zhiyuan Cheng et.al. | 2406.05857 | link |
2024-06-09 | BOSC: A toolbox for aerial imagery mapping | Ricard Durall et.al. | 2406.05833 | link |
2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | null |
2024-06-09 | Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study | Babak Javadi et.al. | 2406.05803 | null |
2024-06-09 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800 | null |
2024-06-09 | Numerical solution of a PDE arising from prediction with expert advice | Jeff Calder et.al. | 2406.05754 | link |
2024-06-07 | Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning | Subhojyoti Mukherjee et.al. | 2406.05064 | null |
2024-06-07 | Digital Twins of the EM Environment: Benchmark for Ray Launching Models | Michele Zhu et.al. | 2406.05042 | link |
2024-06-07 | Online Frequency Scheduling by Learning Parallel Actions | Anastasios Giovanidis et.al. | 2406.05041 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | Beyond Data, Towards Sustainability: A Sydney Case Study on Urban Digital Twins | Ammar Sohail et.al. | 2406.04902 | null |
2024-06-07 | Dynamic prediction of death risk given a renewal hospitalization process | Telmo J. Pérez-Izquierdo et.al. | 2406.04849 | link |
2024-06-07 | Fragile Model Watermarking: A Comprehensive Survey of Evolution, Characteristics, and Classification | Zhenzhe Gao et.al. | 2406.04809 | null |
2024-06-07 | Predictive Dynamic Fusion | Bing Cao et.al. | 2406.04802 | link |
2024-06-07 | SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals | Ruihan Yang et.al. | 2406.04784 | null |
2024-06-07 | EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V | Qianmin Du et.al. | 2406.04705 | null |
2024-06-06 | Tangent differential privacy | Lexing Ying et.al. | 2406.04535 | null |
2024-06-06 | Step Out and Seek Around: On Warm-Start Training with Incremental Data | Maying Shen et.al. | 2406.04484 | null |
2024-06-06 | Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Yuan Sun et.al. | 2406.04481 | null |
2024-06-06 | Everywhere & Nowhere: Envisioning a Computing Continuum for Science | Manish Parashar et.al. | 2406.04480 | null |
2024-06-06 | MoralBench: Moral Evaluation of LLMs | Jianchao Ji et.al. | 2406.04428 | link |
2024-06-06 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-06-06 | Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks | Tristan Cinquin et.al. | 2406.04317 | null |
2024-06-06 | Do Language Models Understand Morality? Towards a Robust Detection of Moral Content | Luana Bulla et.al. | 2406.04143 | link |
2024-06-06 | Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster | Agostina Calabrese et.al. | 2406.04106 | link |
2024-06-06 | Leveraging automatic strategy discovery to teach people how to select better projects | Lovis Heindrich et.al. | 2406.04082 | link |
2024-06-06 | A Road-Map for Transferring Software Engineering methods for Model-Based Early V&V of Behaviour to Systems Engineering | Johan Cederbladh et.al. | 2406.04037 | null |
2024-06-06 | Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents | Yoann Poupart et.al. | 2406.04028 | link |
2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | link |
2024-06-06 | Memorization in deep learning: A survey | Jiaheng Wei et.al. | 2406.03880 | null |
2024-06-06 | Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Xiaosong Jia et.al. | 2406.03877 | link |
2024-06-06 | Small area estimation with generalized random forests: Estimating poverty rates in Mexico | Nicolas Frink et.al. | 2406.03861 | null |
2024-06-06 | Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As | Eden Avnat et.al. | 2406.03855 | null |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-06 | Views about ChatGPT: Are human decision making and human learning necessary? | Eiji Yamamura et.al. | 2406.03823 | null |
2024-06-06 | Bayesian generalized method of moments applied to pseudo-observations in survival analysis | Léa Orsini et.al. | 2406.03821 | link |
2024-06-06 | POAM: Probabilistic Online Attentive Mapping for Efficient Robotic Information Gathering | Weizhe Chen et.al. | 2406.03669 | link |
2024-06-05 | Ensembling Portfolio Strategies for Long-Term Investments: A Distribution-Free Preference Framework for Decision-Making and Algorithms | Duy Khanh Lam et.al. | 2406.03652 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611 | link |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-05 | Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts | Dominik Scheuble et.al. | 2406.03461 | null |
2024-06-05 | RemixTape: Enriching Narratives about Metrics with Semantic Alignment and Contextual Recommendation | Matthew Brehmer et.al. | 2406.03415 | null |
2024-06-05 | What Matters in Hierarchical Search for Combinatorial Reasoning Problems? | Michał Zawalski et.al. | 2406.03361 | link |
2024-06-05 | The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games | Mikhail Mozikov et.al. | 2406.03299 | null |
2024-06-05 | Prompt-based Visual Alignment for Zero-shot Policy Transfer | Haihan Gao et.al. | 2406.03250 | null |
2024-06-05 | Challenges and Considerations in the Evaluation of Bayesian Causal Discovery | Amir Mohammad Karimi Mamaghan et.al. | 2406.03209 | null |
2024-06-05 | Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Qutub Syed et.al. | 2406.03188 | null |
2024-06-05 | Missci: Reconstructing Fallacies in Misrepresented Science | Max Glockner et.al. | 2406.03181 | link |
2024-06-06 | Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation | Marvin Schmitt et.al. | 2406.03154 | null |
2024-06-05 | Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Eliraz Orfaig et.al. | 2406.03129 | null |
2024-06-05 | Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Han Li et.al. | 2406.03105 | link |
2024-06-05 | Task-Oriented Wireless Communications for Collaborative Perception in Intelligent Unmanned Systems | Sheng Zhou et.al. | 2406.03086 | null |
2024-06-05 | “Give Me an Example Like This”: Episodic Active Reinforcement Learning from Demonstrations | Muhan Hou et.al. | 2406.03069 | null |
2024-06-05 | Efficient Exploration of the Rashomon Set of Rule Set Models | Martino Ciaperoni et.al. | 2406.03059 | null |
2024-06-05 | Correlation of Software-in-the-Loop Simulation with Physical Testing for Autonomous Driving | Zhennan Fei et.al. | 2406.03040 | null |
2024-06-05 | Analyzing the Influence of Training Samples on Explanations | André Artelt et.al. | 2406.03012 | null |
2024-06-05 | Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models | Sheng-Lun Wei et.al. | 2406.03009 | null |
2024-06-05 | DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Yidong Huang et.al. | 2406.03008 | link |
2024-06-05 | Simplification of Risk Averse POMDPs with Performance Guarantees | Yaacov Pariente et.al. | 2406.03000 | null |
2024-06-04 | Enhancing predictive imaging biomarker discovery through treatment effect analysis | Shuhan Xiao et.al. | 2406.02534 | null |
2024-06-04 | How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? | Tianchi Liu et.al. | 2406.02483 | null |
2024-06-04 | A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies | Md Mirajul Islam et.al. | 2406.02450 | null |
2024-06-04 | Out-of-Distribution Runtime Adaptation with Conformalized Neural Network Ensembles | Polo Contreras et.al. | 2406.02436 | null |
2024-06-04 | Decoupling of neural network calibration measures | Dominik Werner Wolf et.al. | 2406.02411 | null |
2024-06-04 | XRec: Large Language Models for Explainable Recommendation | Qiyao Ma et.al. | 2406.02377 | link |
2024-06-04 | Label-wise Aleatoric and Epistemic Uncertainty Quantification | Yusuf Sale et.al. | 2406.02354 | link |
2024-06-04 | Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation | Ruijing Cui et.al. | 2406.02310 | null |
2024-06-04 | Enabling Decision-Making with the Modified Causal Forest: Policy Trees for Treatment Assignment | Hugo Bodory et.al. | 2406.02241 | null |
2024-06-04 | Towards an Extensible Model-Based Digital Twin Framework for Space Launch Vehicles | Ran Wei et.al. | 2406.02222 | null |
2024-06-04 | Rectifying Reinforcement Learning for Reward Matching | Haoran He et.al. | 2406.02213 | null |
2024-06-04 | Radar Spectra-Language Model for Automotive Scene Parsing | Mariia Pushkareva et.al. | 2406.02158 | null |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-04 | Why Would You Suggest That? Human Trust in Language Model Responses | Manasi Sharma et.al. | 2406.02018 | null |
2024-06-04 | Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning | Jiahang Cao et.al. | 2406.02013 | link |
2024-06-05 | Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models | Samuel M. Bateman et.al. | 2406.01961 | null |
2024-06-04 | Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning | Ricardo B. Grando et.al. | 2406.01952 | null |
2024-06-04 | Orthogonal Causal Calibration | Justin Whitehouse et.al. | 2406.01933 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-04 | Large Language Model-Enabled Multi-Agent Manufacturing Systems | Jonghan Lim et.al. | 2406.01893 | null |
2024-05-31 | Designing for Fairness in Human-Robot Interactions | Houston Claure et.al. | 2405.21044 | null |
2024-05-31 | G-Transformer for Conditional Average Potential Outcome Estimation over Time | Konstantin Hess et.al. | 2405.21012 | link |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | null |
2024-05-31 | Goal-Oriented Sensor Reporting Scheduling for Non-linear Dynamic System Monitoring | Prasoon Raghuwanshi et.al. | 2405.20983 | null |
2024-05-31 | Unravelling the Use of Digital Twins to Assist Decision- and Policy-Making in Smart Cities | Lucy Temple et.al. | 2405.20916 | null |
2024-05-31 | Pursuing Overall Welfare in Federated Learning through Sequential Decision Making | Seok-Ju Hahn et.al. | 2405.20821 | link |
2024-05-31 | ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments | Sören Schleibaum et.al. | 2405.20705 | link |
2024-05-31 | A flexible numerical tool for large dynamic DC networks | Erwin Luesink et.al. | 2405.20704 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | link |
2024-05-31 | In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | Sili Huang et.al. | 2405.20692 | link |
2024-05-31 | Searching for internal symbols underlying deep learning | Jung H. Lee et.al. | 2405.20605 | null |
2024-05-31 | Class-Based Time Series Data Augmentation to Mitigate Extreme Class Imbalance for Solar Flare Prediction | Junzhi Wen et.al. | 2405.20590 | null |
2024-05-30 | Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | Davide Corsi et.al. | 2405.20534 | link |
2024-05-30 | Probabilities of Causation for Continuous and Vector Variables | Yuta Kawakami et.al. | 2405.20487 | null |
2024-05-30 | Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning | Dimitris Bertsimas et.al. | 2405.20486 | null |
2024-05-30 | Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibrium | Rohit Konda et.al. | 2405.20426 | null |
2024-05-30 | Learning 3D Robotics Perception using Inductive Priors | Muhammad Zubair Irshad et.al. | 2405.20364 | null |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-30 | Low-rank and sparse approximations for contact mechanics | Kiran Sagar Kollepara et.al. | 2405.20211 | null |
2024-05-31 | Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations | Zilin Ma et.al. | 2405.20195 | null |
2024-05-30 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion | Angel Villar-Corrales et.al. | 2405.19921 | link |
2024-05-30 | Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | Hengkai Tan et.al. | 2405.19885 | null |
2024-05-30 | From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | Jianliang He et.al. | 2405.19883 | null |
2024-05-30 | Developing a Comprehensive Measurement Tool for Assessing the Rate of BIM Adoption in the Construction Industry | Mohammed Abdulsalam Alsofiani et.al. | 2405.19755 | null |
2024-05-30 | GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis | Boming Zhao et.al. | 2405.19745 | null |
2024-05-30 | Learning Task-relevant Sequence Representations via Intrinsic Dynamics Characteristics in Reinforcement Learning | Dayang Liang et.al. | 2405.19736 | link |
2024-05-30 | Generalized Bayesian Nash Equilibrium with Continuous Type and Action Spaces | Yuan Tao et.al. | 2405.19721 | null |
2024-05-31 | Autonomous Driving with Spiking Neural Networks | Rui-Jie Zhu et.al. | 2405.19687 | link |
2024-05-30 | Texture-guided Coding for Deep Features | Lei Xiong et.al. | 2405.19669 | null |
2024-05-30 | Reconciling Model Multiplicity for Downstream Decision Making | Ally Yalei Du et.al. | 2405.19667 | null |
2024-05-31 | SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | Wenchao Sun et.al. | 2405.19620 | link |
2024-05-29 | Distributed Online Planning for Min-Max Problems in Networked Markov Games | Alexandros E. Tzikas et.al. | 2405.19570 | link |
2024-05-29 | Participation in the age of foundation models | Harini Suresh et.al. | 2405.19479 | null |
2024-05-29 | Posterior Sampling via Autoregressive Generation | Kelly W Zhang et.al. | 2405.19466 | null |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice | Jian-Qiao Zhu et.al. | 2405.19313 | null |
2024-05-29 | Real-Time Environment Condition Classification for Autonomous Vehicles | Marco Introvigne et.al. | 2405.19305 | link |
2024-05-29 | Towards Next-Generation Urban Decision Support Systems through AI-Powered Generation of Scientific Ontology using Large Language Models – A Case in Optimizing Intermodal Freight Transportation | Jose Tupayachi et.al. | 2405.19255 | null |
2024-05-29 | Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning | Hanye Zhao et.al. | 2405.19189 | link |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Learning Interpretable Scheduling Algorithms for Data Processing Clusters | Zhibo Hu et.al. | 2405.19131 | null |
2024-05-29 | Early Detection of Critical Urban Events using Mobile Phone Network Data | Pierre Lemaire et.al. | 2405.19125 | link |
2024-05-29 | Can Graph Learning Improve Task Planning? | Xixi Wu et.al. | 2405.19119 | link |
2024-05-29 | Quantum Optimal Control of Squeezing in Cavity Optomechanics | Anton Halaski et.al. | 2405.19070 | null |
2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | link |
2024-05-29 | Distributed Management of Fluctuating Energy Resources in Dynamic Networked Systems | Xiaotong Cheng et.al. | 2405.19015 | null |
2024-05-29 | Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning | Zijiang Yan et.al. | 2405.18984 | null |
2024-05-29 | DecomCAM: Advancing Beyond Saliency Maps through Decomposition and Integration | Yuguang Yang et.al. | 2405.18882 | link |
2024-05-29 | On Fairness Concerns in the Blockchain Ecosystem | Johnnatan Messias Peixoto Afonso et.al. | 2405.18876 | null |
2024-05-29 | SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | Yiming Cui et.al. | 2405.18857 | null |
2024-05-29 | LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping | Nikhil Gosala et.al. | 2405.18852 | null |
2024-05-29 | SFANet: Spatial-Frequency Attention Network for Weather Forecasting | Jiaze Wang et.al. | 2405.18849 | null |
2024-05-29 | FDQN: A Flexible Deep Q-Network Framework for Game Automation | Prabhath Reddy Gujavarthy et.al. | 2405.18761 | link |
2024-05-29 | Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation | Jiawei Fu et.al. | 2405.18757 | null |
2024-05-28 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning | Somnath Kumar et.al. | 2405.18358 | null |
2024-05-28 | Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal et.al. | 2405.18348 | null |
2024-05-28 | Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Zhi Zheng et.al. | 2405.18209 | link |
2024-05-28 | LLM experiments with simulation: Large Language Model Multi-Agent System for Process Simulation Parametrization in Digital Twins | Yuchen Xia et.al. | 2405.18092 | link |
2024-05-28 | Towards Dialogues for Joint Human-AI Reasoning and Value Alignment | Elfia Bezou-Vrakatseli et.al. | 2405.18073 | null |
2024-05-28 | MULi-Ev: Maintaining Unperturbed LiDAR-Event Calibration | Mathieu Cocheteux et.al. | 2405.18021 | null |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection | Zhengji Li et.al. | 2405.17905 | null |
2024-05-28 | Data-Driven Predictive Control and MPC: Do we achieve optimality? | Akhil S Anand et.al. | 2405.17892 | null |
2024-05-28 | Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree | Lang Feng et.al. | 2405.17879 | link |
2024-05-28 | Ai.llude: Encouraging Rewriting AI-Generated Text to Support Creative Expression | David Zhou et.al. | 2405.17843 | null |
2024-05-28 | LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding | Yutong Wang et.al. | 2405.17794 | link |
2024-05-28 | Online Analytic Exemplar-Free Continual Learning with Large Models for Imbalanced Autonomous Driving Task | Huiping Zhuang et.al. | 2405.17779 | link |
2024-05-27 | OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | Allen Nie et.al. | 2405.17708 | null |
2024-05-27 | Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments | Saeedeh Ghanadbashi et.al. | 2405.17691 | null |
2024-05-27 | Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach | Shisheng Hu et.al. | 2405.17664 | null |
2024-05-27 | Robust Perception and Navigation of Autonomous Surface Vehicles in Challenging Environments | Mingi Jeong et.al. | 2405.17657 | null |
2024-05-27 | The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model | Geraldo Xexéo et.al. | 2405.17637 | null |
2024-05-27 | GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang et.al. | 2405.17429 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | Zhuoling Li et.al. | 2405.17424 | null |
2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | link |
2024-05-27 | MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Hao Dong et.al. | 2405.17419 | link |
2024-05-27 | Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Shenyuan Gao et.al. | 2405.17398 | link |
2024-05-27 | BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Zikang Zhou et.al. | 2405.17372 | null |
2024-05-27 | Rethinking Transformers in Solving POMDPs | Chenhao Lu et.al. | 2405.17358 | link |
2024-05-27 | Exploring and steering the moral compass of Large Language Models | Alejandro Tlaie et.al. | 2405.17345 | link |
2024-05-27 | Leveraging Offline Data in Linear Latent Bandits | Chinmaya Kausik et.al. | 2405.17324 | null |
2024-05-27 | Towards Accurate Ego-lane Identification with Early Time Series Classification | Yuchuan Jin et.al. | 2405.17270 | null |
2024-05-27 | “Pass the butter”: A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT | Haohua Que et.al. | 2405.17250 | null |
2024-05-27 | InsigHTable: Insight-driven Hierarchical Table Visualization with Reinforcement Learning | Guozheng Li et.al. | 2405.17229 | null |
2024-05-27 | Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools | Daniel Buschek et.al. | 2405.17217 | null |
2024-05-27 | CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control | Jingqing Ruan et.al. | 2405.17152 | link |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-27 | A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions | Nicole Neis et.al. | 2405.17080 | null |
2024-05-27 | Efficient mid-term forecasting of hourly electricity load using generalized additive models | Monika Zimmermann et.al. | 2405.17070 | null |
2024-05-27 | BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation | Chengxing Jia et.al. | 2405.17039 | null |
2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | null |
2024-05-24 | Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development | Pranab Sahoo et.al. | 2405.15766 | link |
2024-05-24 | An Adaptive Framework for Manipulator Skill Reproduction in Dynamic Environments | Ryan Donald et.al. | 2405.15711 | link |
2024-05-24 | SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction | Wei Wu et.al. | 2405.15677 | link |
2024-05-24 | Serving economic prosperity: economic impact assessments (EIA) on Earth observation-based services and tools by SERVIR | Reetwika Basu et.al. | 2405.15672 | null |
2024-05-24 | Predictive Uncertainty Quantification with Missing Covariates | Margaux Zaffran et.al. | 2405.15641 | null |
2024-05-24 | Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated Learning | Dario Fenoglio et.al. | 2405.15632 | link |
2024-05-24 | Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment | Hao Sun et.al. | 2405.15624 | null |
2024-05-24 | Online Changepoint Detection via Dynamic Mode Decomposition | Victor K. Khamesi et.al. | 2405.15576 | null |
2024-05-24 | Transformer-XL for Long Sequence Tasks in Robotic Learning from Demonstration | Gao Tianci et.al. | 2405.15562 | null |
2024-05-24 | Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making | Drago Plecko et.al. | 2405.15446 | null |
2024-05-24 | Decentralized Virtual Research Environment: Empowering Peer-to-Peer Trustworthy Data Sharing and Collaboration | Yuandou Wang et.al. | 2405.15392 | null |
2024-05-24 | Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate | Fan-Ming Luo et.al. | 2405.15384 | link |
2024-05-24 | Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection | Jun Liu et.al. | 2405.15370 | null |
2024-05-24 | Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Jianbiao Mei et.al. | 2405.15324 | link |
2024-05-24 | Trajectory-Based Multi-Objective Hyperparameter Optimization for Model Retraining | Wenyu Wang et.al. | 2405.15303 | null |
2024-05-24 | Learning Invariant Causal Mechanism from Vision-Language Models | Zeen Song et.al. | 2405.15289 | null |
2024-05-24 | 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Boyi Sun et.al. | 2405.15286 | link |
2024-05-24 | Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | Yuhang Liu et.al. | 2405.15274 | null |
2024-05-24 | iVideoGPT: Interactive VideoGPTs are Scalable World Models | Jialong Wu et.al. | 2405.15223 | link |
2024-05-24 | Computational analysis on a linkage between generalized logit dynamic and discounted mean field game | Hidekazu Yoshioka et.al. | 2405.15180 | null |
2024-05-23 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | Local Causal Discovery for Structural Evidence of Direct Discrimination | Jacqueline Maasch et.al. | 2405.14848 | null |
2024-05-23 | As an AI Language Model, “Yes I Would Recommend Calling the Police’’: Norm Inconsistency in LLM Decision-Making | Shomik Jain et.al. | 2405.14812 | null |
2024-05-23 | DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation | Jinxin Liu et.al. | 2405.14790 | link |
2024-05-23 | FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Hongyang Yang et.al. | 2405.14767 | link |
2024-05-23 | TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes | Yanping Fu et.al. | 2405.14747 | null |
2024-05-23 | Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View | Xuan Liu et.al. | 2405.14744 | null |
2024-05-23 | Iterative Causal Segmentation: Filling the Gap between Market Segmentation and Marketing Strategy | Kaihua Ding et.al. | 2405.14743 | null |
2024-05-23 | A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results | Karima Makhlouf et.al. | 2405.14725 | null |
2024-05-23 | Learning-Based Intermittent CSI Estimation with Adaptive Intervals in Integrated Sensing and Communication Systems | Jie Chen et.al. | 2405.14724 | null |
2024-05-23 | Decision-Focused Forecasting: Decision Losses for Multistage Optimisation | Egon Peršak et.al. | 2405.14719 | link |
2024-05-23 | CityGPT: Towards Urban IoT Learning, Analysis and Interaction with Multi-Agent System | Qinghua Guan et.al. | 2405.14691 | null |
2024-05-23 | PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services | Zheming Yang et.al. | 2405.14636 | null |
2024-05-23 | SE3D: A Framework For Saliency Method Evaluation In 3D Imaging | Mariusz Wiśniewski et.al. | 2405.14584 | link |
2024-05-23 | Explainable automatic industrial carbon footprint estimation from bank transaction classification using natural language processing | Jaime González-González et.al. | 2405.14505 | null |
2024-05-23 | Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment | Muhammad Sohail Danish et.al. | 2405.14497 | link |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-05-23 | Adaptive sampling with PIXL on the Mars Perseverance rover | Peter R. Lawson et.al. | 2405.14471 | null |
2024-05-23 | LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | Michelle Halbheer et.al. | 2405.14438 | link |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-05-21 | Strategic Deployment of Honeypots in Blockchain-based IoT Systems | Daniel Commey et.al. | 2405.12951 | null |
2024-05-21 | Hybrid PDE-ODE Models for Efficient Simulation of Infection Spread in Epidemiology | Kristina Maier et.al. | 2405.12938 | null |
2024-05-21 | Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel et.al. | 2405.12933 | null |
2024-05-21 | The implications of state aggregation in deteriorating Markov Decision Processes with optimal threshold policies | Madeleine Pollack et.al. | 2405.12912 | null |
2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | null |
2024-05-21 | SmartFlow: Robotic Process Automation using LLMs | Arushi Jain et.al. | 2405.12842 | null |
2024-05-21 | Consumer lying in online reviews: recent evidence | Shawn Berry et.al. | 2405.12743 | null |
2024-05-21 | Multimodal video analysis for crowd anomaly detection using open access tourism cameras | Alejandro Dionis-Ros et.al. | 2405.12708 | null |
2024-05-21 | A Multimodal Learning-based Approach for Autonomous Landing of UAV | Francisco Neves et.al. | 2405.12681 | null |
2024-05-21 | Towards an AI/ML-defined Radio for Wi-Fi: Overview, Challenges, and Roadmap | Boris Bellalta et.al. | 2405.12675 | null |
2024-05-21 | TempoScale: A Cloud Workloads Prediction Approach Integrating Short-Term and Long-Term Information | Linfeng Wen et.al. | 2405.12635 | link |
2024-05-21 | Asymptotic Properties of Matthews Correlation Coefficient | Yuki Itaya et.al. | 2405.12622 | link |
2024-05-21 | Efficient modeling of sub-kilometer surface wind with Gaussian processes and neural networks | Francesco Zanetta et.al. | 2405.12614 | null |
2024-05-21 | Ergodic Unobservable MDPs: Decidability of Approximation | Krishnendu Chatterjee et.al. | 2405.12583 | null |
2024-05-21 | Active Object Detection with Knowledge Aggregation and Distillation from Large Models | Dejie Yang et.al. | 2405.12509 | link |
2024-05-21 | CLRKDNet: Speeding up Lane Detection with Knowledge Distillation | Weiqing Qi et.al. | 2405.12503 | link |
2024-05-21 | GASE: Graph Attention Sampling with Edges Fusion for Solving Vehicle Routing Problems | Zhenwei Wang et.al. | 2405.12475 | null |
2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | null |
2024-05-20 | A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback | Kihyun Kim et.al. | 2405.12421 | null |
2024-05-20 | Conformal Counterfactual Inference under Hidden Confounding | Zonghao Chen et.al. | 2405.12387 | null |
2024-05-20 | Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | Sebastian Bruch et.al. | 2405.12207 | null |
2024-05-20 | Robust VAR Capability Curve of DER with Uncertain Renewable Generation | Aditya Shankar Kar et.al. | 2405.12184 | null |
2024-05-20 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-20 | PATE: Proximity-Aware Time series anomaly Evaluation | Ramin Ghorbani et.al. | 2405.12096 | link |
2024-05-20 | Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | Yang Dai et.al. | 2405.12094 | link |
2024-05-20 | Safe by Design Autonomous Driving Systems | Marius Bozga et.al. | 2405.11995 | null |
2024-05-20 | Tutorial on Silicon Photonics Integrated Platform Fiber Edge Coupling | Sergey S. Avdeev et.al. | 2405.11980 | null |
2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
2024-05-20 | Social norm dynamics in a behavioral epidemic model on multiplex networks | Christos Charalambous et.al. | 2405.11887 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-05-20 | Efficient Multi-agent Reinforcement Learning by Planning | Qihan Liu et.al. | 2405.11778 | link |
2024-05-20 | Configurable Mirror Descent: Towards a Unification of Decision Making | Pengdeng Li et.al. | 2405.11746 | link |
2024-05-20 | Estimating optimal tailored active surveillance strategy under interval censoring | Muxuan Liang et.al. | 2405.11720 | null |
2024-05-20 | QComp: A QSAR-Based Data Completion Framework for Drug Discovery | Bingjia Yang et.al. | 2405.11703 | link |
2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | link |
2024-05-21 | Interpretable Machine Learning Enhances Disease Prognosis: Applications on COVID-19 and Onward | Jinzhi Shen et.al. | 2405.11672 | null |
2024-05-19 | Auto-Platoon : Freight by example | Tharun V. Puthanveettil et.al. | 2405.11659 | link |
2024-05-19 | URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images | Zoey Chen et.al. | 2405.11656 | null |
2024-05-19 | Movie Revenue Prediction using Machine Learning Models | Vikranth Udandarao et.al. | 2405.11651 | link |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-17 | Strategic control for a Boltzmann like decision-making model | Luis Guillermo Venegas-Pineda et.al. | 2405.10915 | null |
2024-05-17 | Contestable AI needs Computational Argumentation | Francesco Leofante et.al. | 2405.10729 | null |
2024-05-17 | Challenging the Human-in-the-loop in Algorithmic Decision-making | Sebastian Tschiatschek et.al. | 2405.10706 | null |
2024-05-17 | Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation | Yannis Spyridis et.al. | 2405.10702 | null |
2024-05-17 | Pragmatic Communication for Remote Control of Finite-State Markov Processes | Pietro Talli et.al. | 2405.10672 | null |
2024-05-17 | GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision | Xin Tan et.al. | 2405.10591 | null |
2024-05-17 | Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track | Xiaoshuai Hao et.al. | 2405.10567 | null |
2024-05-17 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-17 | Guidelines for evaluation of complex multi agent test scenarios | Ana Isabel Garcia Guerra et.al. | 2405.10526 | null |
2024-05-16 | Tell me more: Intent Fulfilment Framework for Enhancing User Experiences in Conversational XAI | Anjana Wijekoon et.al. | 2405.10446 | null |
2024-05-16 | Monitizer: Automating Design and Evaluation of Neural Network Monitors | Muqsit Azeem et.al. | 2405.10350 | null |
2024-05-16 | Stochastic Q-learning for Large Discrete Action Spaces | Fares Fourati et.al. | 2405.10310 | null |
2024-05-16 | Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Yu Gui et.al. | 2405.10301 | link |
2024-05-17 | Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Yuexiang Zhai et.al. | 2405.10292 | null |
2024-05-16 | Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention | Tobias Demmler et.al. | 2405.10134 | null |
2024-05-16 | Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review | Xinyu Zhang et.al. | 2405.10132 | null |
2024-05-16 | When Large Language Model Meets Optimization | Sen Huang et.al. | 2405.10098 | null |
2024-05-16 | Optimizing Search and Rescue UAV Connectivity in Challenging Terrain through Multi Q-Learning | Mohammed M. H. Qazzaz et.al. | 2405.10042 | null |
2024-05-16 | $Δ\text{-}{\rm OPE}$ : Off-Policy Estimation with Pairs of Policies | Olivier Jeunen et.al. | 2405.10024 | link |
2024-05-16 | Solving the enigma: Deriving optimal explanations of deep networks | Michail Mamalakis et.al. | 2405.10008 | null |
2024-05-16 | A Unified Deep Transfer Learning Model for Accurate IoT Localization in Diverse Environments | Abdullahi Isa Ahmed et.al. | 2405.09960 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | null |
2024-05-16 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | null |
2024-05-16 | Collision Avoidance Metric for 3D Camera Evaluation | Vage Taamazyan et.al. | 2405.09755 | link |
2024-05-15 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Guo Yachan et.al. | 2405.09682 | null |
2024-05-15 | Challenges and opportunities for digital twins in precision medicine: a complex systems perspective | Manlio De Domenico et.al. | 2405.09649 | null |
2024-05-15 | DemOpts: Fairness corrections in COVID-19 case prediction models | Naman Awasthi et.al. | 2405.09483 | null |
2024-05-15 | Facilitating Opinion Diversity through Hybrid NLP Approaches | Michiel van der Meer et.al. | 2405.09439 | null |
2024-05-15 | The Unfairness of $\varepsilon$ -Fairness | Tolulope Fadina et.al. | 2405.09360 | null |
2024-05-15 | Multi-Source Conformal Inference Under Distribution Shift | Yi Liu et.al. | 2405.09331 | link |
2024-05-15 | Reinforcement Learning-Based Framework for the Intelligent Adaptation of User Interfaces | Daniel Gaspar-Figueiredo et.al. | 2405.09255 | null |
2024-05-15 | CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | Dechen Gao et.al. | 2405.09111 | link |
2024-05-15 | Explainable AI for Ship Collision Avoidance: Decoding Decision-Making Processes and Behavioral Intentions | Hitoshi Yoshioka et.al. | 2405.09081 | null |
2024-05-15 | Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving | Ross Greer et.al. | 2405.09049 | null |
2024-05-15 | Deep Learning in Earthquake Engineering: A Comprehensive Review | Yazhou Xie et.al. | 2405.09021 | null |
2024-05-14 | Contextual Emotion Recognition using Large Vision Language Models | Yasaman Etesam et.al. | 2405.08992 | null |
2024-05-14 | Bird’s-Eye View to Street-View: A Survey | Khawlah Bajbaa et.al. | 2405.08961 | null |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-14 | The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition | Lingdong Kong et.al. | 2405.08816 | null |
2024-05-14 | Ambiguous Annotations: When is a Pedestrian not a Pedestrian? | Luisa Schwirten et.al. | 2405.08794 | null |
2024-05-14 | Beyond the Black Box: Do More Complex Models Provide Superior XAI Explanations? | Mateusz Cedro et.al. | 2405.08658 | null |
2024-05-14 | vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement | Yiwen Zhu et.al. | 2405.08638 | null |
2024-05-15 | Learning Decision Policies with Instrumental Variables through Double Machine Learning | Daqian Shao et.al. | 2405.08498 | link |
2024-05-14 | Work-in-Progress: Crash Course: Can (Under Attack) Autonomous Driving Beat Human Drivers? | Francesco Marchiori et.al. | 2405.08466 | null |
2024-05-14 | Large-Scale Metric Computation in Online Controlled Experiment Platform | Tao Xiong et.al. | 2405.08411 | null |
2024-05-14 | Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach | Yaju Liu et.al. | 2405.08328 | null |
2024-05-14 | Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments | Ke Liu et.al. | 2405.08298 | null |
2024-05-14 | Airport Delay Prediction with Temporal Fusion Transformers | Ke Liu et.al. | 2405.08293 | null |
2024-05-14 | VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons | Zhen Chen et.al. | 2405.08272 | null |
2024-05-13 | Factors Shaping Financial Success: A Deep Dive into Influencing Variables | Michael Zhou et.al. | 2405.08233 | null |
2024-05-13 | Community detection in bipartite signed networks is highly dependent on parameter choice | Elena Candellone et.al. | 2405.08203 | link |
2024-05-13 | Optimizing Task Scheduling in Heterogeneous Computing Environments: A Comparative Analysis of CPU, GPU, and ASIC Platforms Using E2C Simulator | Ali Mohammadjafari et.al. | 2405.08187 | null |
2024-05-13 | Do Bayesian imaging methods report trustworthy probabilities? | David Y. W. Thong et.al. | 2405.08179 | null |
2024-05-13 | Equivariant Deep Learning of Mixed-Integer Optimal Control Solutions for Vehicle Decision Making and Motion Planning | Rudolf Reiter et.al. | 2405.08122 | null |
2024-05-13 | SPIN: Simultaneous Perception, Interaction and Navigation | Shagun Uppal et.al. | 2405.07991 | null |
2024-05-13 | A Generalist Learner for Multifaceted Medical Image Interpretation | Hong-Yu Zhou et.al. | 2405.07988 | null |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
2024-05-13 | Fast Computation of Superquantile-Constrained Optimization Through Implicit Scenario Reduction | Jake Roth et.al. | 2405.07965 | link |
2024-05-13 | AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | Samuel Schmidgall et.al. | 2405.07960 | null |
2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | null |
2024-05-13 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al. | 2405.07865 | link |
2024-05-13 | Collective Decision-Making on Task Allocation Feasibility | Samratul Fuady et.al. | 2405.07799 | null |
2024-05-13 | Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | Silvia Tulli et.al. | 2405.07773 | null |
2024-05-13 | Waste Factor and Waste Figure: A Unified Theory for Modeling and Analyzing Wasted Power in Radio Access Networks for Improved Sustainability | Theodore S. Rappaport et.al. | 2405.07710 | null |
2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | null |
2024-05-13 | Evaluating the Explainable AI Method Grad-CAM for Breath Classification on Newborn Time Series Data | Camelia Oprea et.al. | 2405.07590 | null |
2024-05-13 | MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving | Yiqun Duan et.al. | 2405.07573 | null |
2024-05-13 | Safety-Aware Human-Lead Vehicle Platooning by Proactively Reacting to Uncertain Human Behaving | Jia Hu et.al. | 2405.07556 | null |
2024-05-13 | Prompt-based Code Completion via Multi-Retrieval Augmented Generation | Hanzhuo Tan et.al. | 2405.07530 | null |
2024-05-13 | Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation | Aaditya Prasad et.al. | 2405.07503 | null |
2024-05-12 | CaFA: Global Weather Forecasting with Factorized Attention on Sphere | Zijie Li et.al. | 2405.07395 | link |
2024-05-12 | Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Fatema Tuj Johora Faria et.al. | 2405.07338 | link |
2024-05-12 | Computational analysis of US Congressional speeches reveals a shift from evidence to intuition | Segun Taofeek Aroyehun et.al. | 2405.07323 | null |
2024-05-12 | Enhancing Decision-Making in Optimization through LLM-Assisted Inference: A Neural Networks Perspective | Gaurav Singh et.al. | 2405.07212 | null |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Hierarchical Learned Risk-Aware Planning Framework for Human Driving Modeling | Nathan Ludlow et.al. | 2405.06578 | null |
2024-05-10 | Good Things Come in Trees: Emotion and Context Aware Behaviour Trees for Ethical Robotic Decision-Making | Paige Tuttösí et.al. | 2405.06543 | null |
2024-05-10 | Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks | Haifa Alrdahi et.al. | 2405.06499 | null |
2024-05-10 | Autonomous Driving with a Deep Dual-Model Solution for Steering and Braking Control | Ana Petra Jukić et.al. | 2405.06473 | null |
2024-05-10 | Residual-based Attention Physics-informed Neural Networks for Efficient Spatio-Temporal Lifetime Assessment of Transformers Operated in Renewable Power Plants | Ibai Ramirez et.al. | 2405.06443 | null |
2024-05-10 | Building Trust in AI-Driven Decision Making for Cyber-Physical Systems (CPS): A Comprehensive Review | Rahul Umesh Mhapsekar et.al. | 2405.06347 | null |
2024-05-10 | FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based Optimization | Zhiyuan Ning et.al. | 2405.06312 | link |
2024-05-10 | Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach | Amira Guesmi et.al. | 2405.06278 | null |
2024-05-10 | XAI4LLM. Let Machine Learning Models and LLMs Collaborate for Enhanced In-Context Learning in Healthcare | Fatemeh Nazary et.al. | 2405.06270 | null |
2024-05-10 | Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | Yunqian Fan et.al. | 2405.06264 | null |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | A Survey on Visualization Approaches in Political Science for Social and Political Factors: Progress to Date and Future Opportunities | Dongyun Han et.al. | 2405.05947 | null |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-05-09 | Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection | Atefeh Mahdavi et.al. | 2405.05836 | null |
2024-05-09 | Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning | Artem Lykov et.al. | 2405.05824 | link |
2024-05-09 | Optimal Baseline Corrections for Off-Policy Contextual Bandits | Shashank Gupta et.al. | 2405.05736 | link |
2024-05-09 | TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy | Meixu Chen et.al. | 2405.05674 | null |
2024-05-09 | Emerging Optimization Problems for Distribution in Same-day Delivery | Yuanyuan Li et.al. | 2405.05620 | null |
2024-05-09 | Towards Robust Physical-world Backdoor Attacks on Lane Detection | Xinwei Zhang et.al. | 2405.05553 | null |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-09 | Multi-Scale Dilated Convolution Network for Long-Term Time Series Forecasting | Feifei Li et.al. | 2405.05499 | null |
2024-05-09 | Advancing Head and Neck Cancer Survival Prediction via Multi-Label Learning and Deep Model Interpretation | Meixu Chen et.al. | 2405.05488 | null |
2024-05-09 | Design of Targeted Community-Based Resource Allocation in the Presence of Vaccine Hesitancy via a Data-Driven Compartmental Stochastic Optimization Model | Hieu Bui et.al. | 2405.05487 | null |
2024-05-09 | Topological bifurcations in a mean-field game | Ali Akbar Rezaei Lori et.al. | 2405.05473 | null |
2024-05-08 | Mitigating Exaggerated Safety in Large Language Models | Ruchi Bhalani et.al. | 2405.05418 | null |
2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
2024-05-08 | Clustering Retail Products Based on Customer Behaviour | Vladimír Holý et.al. | 2405.05218 | null |
2024-05-08 | A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective | Huaiyuan Xu et.al. | 2405.05173 | link |
2024-05-08 | DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds | Zeyu Han et.al. | 2405.05131 | null |
2024-05-08 | Novel Actor-Critic Algorithm for Robust Decision Making of CAV under Delays and Loss of V2X Data | Zine el abidine Kherroubi et.al. | 2405.05072 | null |
2024-05-08 | Designing Skill-Compatible AI: Methodologies and Frameworks in Chess | Karim Hamade et.al. | 2405.05066 | link |
2024-05-08 | Impact of Tone-Aware Explanations in Recommender Systems | Ayano Okoso et.al. | 2405.05061 | null |
2024-05-08 | Quantum Circuit Ansatz: Abstraction and Reuse of Quantum Algorithm Design | Xiaoyu Guo et.al. | 2405.05021 | null |
2024-05-08 | Overcoming Anchoring Bias: The Potential of AI and XAI-based Decision Support | Felix Haag et.al. | 2405.04972 | null |
2024-05-08 | Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models | Zhengxing Lan et.al. | 2405.04909 | null |
2024-05-07 | Enhancing Organizational Performance: Harnessing AI and NLP for User Feedback Analysis in Product Development | Tian Tian et.al. | 2405.04692 | null |
2024-05-07 | ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | Albert Bou et.al. | 2405.04657 | link |
2024-05-07 | New allometric models for the USA create a step-change in forest carbon estimation, modeling, and mapping | Lucas K. Johnson et.al. | 2405.04507 | null |
2024-05-07 | TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters | Jonathan Wilder Lavington et.al. | 2405.04491 | null |
2024-05-07 | POV Learning: Individual Alignment of Multimodal Models using Human Perception | Simon Werner et.al. | 2405.04443 | null |
2024-05-07 | Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement Learning | Paola Soto et.al. | 2405.04441 | null |
2024-05-07 | Designing the Network Intelligence Stratum for 6G Networks | Paola Soto et.al. | 2405.04432 | null |
2024-05-07 | Mathematical Modeling of $^{18}$F-Fluoromisonidazole ($^{18}$ F-FMISO) Radiopharmaceutical Transport in Vascularized Solid Tumors | Mohammad Amin Abazari et.al. | 2405.04418 | null |
2024-05-09 | Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty Estimation | Pei Liu et.al. | 2405.04405 | link |
2024-05-07 | Efficient Online Set-valued Classification with Bandit Feedback | Zhou Wang et.al. | 2405.04393 | null |
2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous Driving | Wei-Bin Kou et.al. | 2405.04146 | null |
2024-05-07 | Policy Learning with a Language Bottleneck | Megha Srivastava et.al. | 2405.04118 | link |
2024-05-07 | ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios | Dingrui Wang et.al. | 2405.04100 | null |
2024-05-07 | Counterfactual and Semifactual Explanations in Abstract Argumentation: Formal Foundations, Complexity and Computation | Gianvincenzo Alfano et.al. | 2405.04081 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | Uncovering implementable dormant pruning decisions from three different stakeholder perspectives | Deanna Flynn et.al. | 2405.04030 | null |
2024-05-07 | Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI | Rikathi Pal et.al. | 2405.04023 | null |
2024-05-07 | Certified Policy Verification and Synthesis for MDPs under Distributional Reach-avoidance Properties | S. Akshay et.al. | 2405.04015 | null |
2024-05-07 | Deep Event-based Object Detection in Autonomous Driving: A Survey | Bingquan Zhou et.al. | 2405.03995 | null |
2024-05-07 | Unified End-to-End V2X Cooperative Autonomous Driving | Zhiwei Li et.al. | 2405.03971 | null |
2024-05-06 | Anti-Heroes: An Ethics-focused Method for Responsible Designer Intentions | Shikha Mehta et.al. | 2405.03674 | null |
2024-05-06 | RoboCar: A Rapidly Deployable Open-Source Platform for Autonomous Driving Research | Mehdi Testouri et.al. | 2405.03572 | link |
2024-05-06 | Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | Zheng Zhu et.al. | 2405.03520 | link |
2024-05-06 | Uncertainty of Supply Chains: Risk and Ambiguity | d’Artis Kancs et.al. | 2405.03451 | null |
2024-05-06 | The high dimensional psychological profile and cultural bias of ChatGPT | Hang Yuan et.al. | 2405.03387 | null |
2024-05-06 | Enhancing Q-Learning with Large Language Model Heuristics | Xiefeng Wu et.al. | 2405.03341 | null |
2024-05-06 | Functional Equivalence with NARS | Robert Johansson et.al. | 2405.03340 | null |
2024-05-06 | Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review | Harry Robertshaw et.al. | 2405.03305 | null |
2024-05-06 | End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability | Hinrikus Wolf et.al. | 2405.03262 | null |
2024-05-06 | Anchored Answers: Unravelling Positional Bias in GPT-2’s Multiple-Choice Questions | Ruizhe Li et.al. | 2405.03205 | link |
2024-05-05 | High Order Reasoning for Time Critical Recommendation in Evidence-based Medicine | Manjiang Yu et.al. | 2405.03010 | null |
2024-05-05 | MERIT: Multi-view Evidential learning for Reliable and Interpretable liver fibrosis sTaging | Yuanye Liu et.al. | 2405.02918 | null |
2024-05-05 | SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection | Kassaw Abraham Mulat et.al. | 2405.02906 | null |
2024-05-05 | Region-specific Risk Quantification for Interpretable Prognosis of COVID-19 | Zhusi Zhong et.al. | 2405.02815 | link |
2024-05-04 | Sub-goal Distillation: A Method to Improve Small Language Agents | Maryam Hashemzadeh et.al. | 2405.02749 | link |
2024-05-04 | Grouping predictors via network-wide metrics | Brandon Woosuk Park et.al. | 2405.02715 | null |
2024-05-04 | Ambush strategy enhances organisms’ performance in rock-paper-scissors games | R. Barbalho et.al. | 2405.02674 | null |
2024-05-04 | Interpretable Multi-View Clustering | Mudi Jiang et.al. | 2405.02644 | null |
2024-05-04 | Accelerating Autonomy: Insights from Pro Racers in the Era of Autonomous Racing - An Expert Interview Study | Frederik Werner et.al. | 2405.02620 | link |
2024-05-04 | MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning | Joshua Chesser et.al. | 2405.02605 | null |
2024-05-03 | Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs | Elika Bozorgi et.al. | 2405.02240 | null |
2024-05-03 | Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes | Sang Bin Moon et.al. | 2405.02188 | null |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-03 | Multi-Objective Recommendation via Multivariate Policy Learning | Olivier Jeunen et.al. | 2405.02141 | link |
2024-05-03 | Learning from Evolution: Improving Collective Decision-Making Mechanisms using Insights from Evolutionary Robotics | Tanja Katharina Kaiser et.al. | 2405.02133 | null |
2024-05-03 | Argumentative Large Language Models for Explainable and Contestable Decision-Making | Gabriel Freedman et.al. | 2405.02079 | null |
2024-05-03 | Sampling to Achieve the Goal: An Age-aware Remote Markov Decision Process | Aimin Li et.al. | 2405.02042 | link |
2024-05-03 | Obstacle Avoidance of Autonomous Vehicles: An LPVMPC with Scheduling Trust Region | Maryam Nezami et.al. | 2405.02030 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | M ${^2}$ Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Yingshuang Zou et.al. | 2405.02004 | null |
2024-05-03 | Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | Patrick Saux et.al. | 2405.01994 | null |
2024-05-03 | Conformal Prediction for Natural Language Processing: A Survey | Margarida M. Campos et.al. | 2405.01976 | null |
2024-05-03 | Unleashing the Power of AI: Transforming Marketing Decision-Making in Heavy Machinery with Machine Learning, Radar Chart Simulation, and Markov Chain Analysis | Tian Tian et.al. | 2405.01913 | null |
2024-05-03 | Transforming Investment Strategies and Strategic Decision-Making: Unveiling a Novel Methodology for Enhanced Performance and Risk Management in Financial Markets | Tian Tian et.al. | 2405.01892 | null |
2024-05-03 | Explainable Risk Classification in Financial Reports | Xue Wen Tan et.al. | 2405.01881 | null |
2024-05-03 | SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning | Qian Long et.al. | 2405.01839 | null |
2024-05-03 | Non-linear Welfare-Aware Strategic Learning | Tian Xie et.al. | 2405.01810 | link |
2024-05-03 | Algorithmic Decision-Making under Agents with Persistent Improvement | Tian Xie et.al. | 2405.01807 | link |
2024-05-02 | Large Language Models for UAVs: Current State and Pathways to the Future | Shumaila Javaid et.al. | 2405.01745 | null |
2024-05-02 | Explainability Guided Adversarial Evasion Attacks on Malware Detectors | Kshitiz Aryal et.al. | 2405.01728 | null |
2024-05-02 | Multi-Space Alignments Towards Universal LiDAR Segmentation | Youquan Liu et.al. | 2405.01538 | link |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-05-02 | Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models | Raymond Fok et.al. | 2405.01501 | null |
2024-05-02 | A Basic Overview of Various Stochastic Approaches to Financial Modeling With Examples | Aashrit Cunchala et.al. | 2405.01397 | null |
2024-05-02 | An Advanced Framework for Ultra-Realistic Simulation and Digital Twinning for Autonomous Vehicles | Yuankai He et.al. | 2405.01328 | null |
2024-05-02 | MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2405.01266 | null |
2024-05-02 | Causal Influence in Federated Edge Inference | Mert Kayaalp et.al. | 2405.01260 | null |
2024-05-02 | A Survey on Semantic Communication Networks: Architecture, Security, and Privacy | Shaolong Guo et.al. | 2405.01221 | null |
2024-05-02 | How A/B testing changes the dynamics of information spreading on a social network | Matteo Ottaviani et.al. | 2405.01165 | null |
2024-05-02 | Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection | Ahmad Khalil et.al. | 2405.01108 | link |
2024-05-02 | Poisoning Attacks on Federated Learning for Autonomous Driving | Sonakshi Garg et.al. | 2405.01073 | null |
2024-05-02 | Rare Collision Risk Estimation of Autonomous Vehicles with Multi-Agent Situation Awareness | Mahdieh Zaker et.al. | 2405.01011 | null |
2024-05-02 | Generative manufacturing systems using diffusion models and ChatGPT | Xingyu Li et.al. | 2405.00958 | null |
2024-05-02 | Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | Guojun Xiong et.al. | 2405.00950 | null |
2024-05-01 | DiL-NeRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media | Gregorios Katsios et.al. | 2405.00821 | link |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-03 | New Trends on the Systems Approach to Modeling SARS-CoV-2 Pandemics in a Globally Connected Planet | Giulia Bertaglia et.al. | 2405.00541 | null |
2024-05-01 | Design Implications for a Social and Collaborative Understanding of online Information Assessment Practices, Challenges and Heuristics | Vasilis Vlachokyriakos et.al. | 2405.00519 | null |
2024-05-01 | GAD-Generative Learning for HD Map-Free Autonomous Driving | Weijian Sun et.al. | 2405.00515 | link |
2024-05-01 | On the Relevance of Byzantine Robust Optimization Against Data Poisoning | Sadegh Farhadkhani et.al. | 2405.00491 | null |
2024-05-01 | RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models | Mohamed Manzour Hussien et.al. | 2405.00449 | null |
2024-05-01 | Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration | Masanari Kimura et.al. | 2405.00442 | null |
2024-05-01 | UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | Yucheng Shi et.al. | 2405.00410 | link |
2024-05-01 | Dual-Role AoI-based Incentive Mechanism for HD map Crowdsourcing | Wentao Ye et.al. | 2405.00353 | null |
2024-05-01 | A Self-explaining Neural Architecture for Generalizable Concept Learning | Sanchit Sinha et.al. | 2405.00349 | null |
2024-05-01 | Finding the white male: The prevalence and consequences of algorithmic gender and race bias in political Google searches | Tobias Rohrbach et.al. | 2405.00335 | null |
2024-05-01 | Reevaluating coexistence and stability in ecosystem networks to address ecological transients: methods and implications | Sarah A. Vollert et.al. | 2405.00333 | null |
2024-05-01 | Enhance Planning with Physics-informed Safety Controllor for End-to-end Autonomous Driving | Hang Zhou et.al. | 2405.00316 | null |
2024-05-01 | Social Life Simulation for Non-Cognitive Skills Learning | Zihan Yan et.al. | 2405.00273 | null |
2024-04-30 | SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations | Narayanan Elavathur Ranganatha et.al. | 2405.00250 | link |
2024-04-30 | Guiding Attention in End-to-End Driving Models | Diego Porres et.al. | 2405.00242 | link |
2024-04-30 | STT: Stateful Tracking with Transformers for Autonomous Driving | Longlong Jing et.al. | 2405.00236 | null |
2024-04-30 | Comparing Motion Distortion Between Vehicle Field Deployments | Nicolas Samson et.al. | 2405.00189 | null |
2024-04-30 | Heart Rate and Body Temperature Relationship in Children Admitted to PICU – A Machine Learning Approach | Emilie Lu et.al. | 2405.00180 | null |
2024-04-30 | Analyzing Transport Policies in Developing Countries with ABM | Kathleen Salazar-Serna et.al. | 2404.19745 | link |
2024-04-30 | Collaborative Control Method of Transit Signal Priority Based on Cooperative Game and Reinforcement Learning | Hao Qin et.al. | 2404.19683 | null |
2024-04-30 | The Drawback of Insight: Detailed Explanations Can Reduce Agreement with XAI | Sabid Bin Habib Pias et.al. | 2404.19629 | null |
2024-04-30 | Enhancing Deep Learning Model Explainability in Brain Tumor Datasets using Post-Heuristic Approaches | Konstantinos Pasvantis et.al. | 2404.19568 | null |
2024-04-30 | Choosing a consultant in a dynamic investment problem | Yuval Cornfeld et.al. | 2404.19507 | null |
2024-04-30 | The harms of class imbalance corrections for machine learning based prediction models: a simulation study | Alex Carriero et.al. | 2404.19494 | link |
2024-04-30 | Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making | Lei Zhuang et.al. | 2404.19403 | null |
2024-04-30 | Online Electricity Purchase for Data Center with Dynamic Virtual Battery from Flexibility Aggregation | Kekun Gao et.al. | 2404.19387 | null |
2024-04-30 | Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection | Zhanwei Zhang et.al. | 2404.19384 | null |
2024-04-30 | SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs | Zhigang Sun et.al. | 2404.19379 | link |
2024-04-30 | Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs | Soham Mitra et.al. | 2404.19341 | link |
2024-04-30 | G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction | Zhanwei Zhang et.al. | 2404.19330 | link |
2024-04-30 | Bias Mitigation via Compensation: A Reinforcement Learning Perspective | Nandhini Swaminathan et.al. | 2404.19256 | null |
2024-04-29 | Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Javier Antoran et.al. | 2404.19157 | null |
2024-04-29 | Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP | Sanjana Gautam et.al. | 2404.19071 | null |
2024-04-29 | Synthesizing the Born rule with reinforcement learning | Rodrigo S. Piera et.al. | 2404.19011 | null |
2024-04-29 | Detecting critical treatment effect bias in small subgroups | Piersilvio De Bartolomeis et.al. | 2404.18905 | link |
2024-04-29 | Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models | Xingyuan Zhang et.al. | 2404.18896 | link |
2024-04-29 | PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control | Jasper Hoffmann et.al. | 2404.18863 | null |
2024-04-29 | Safe Reach Set Computation via Neural Barrier Certificates | Alessandro Abate et.al. | 2404.18813 | null |
2024-04-29 | Three-state Opinion Dynamics for Financial Markets on Complex Networks | Bernardo J. Zubillaga et.al. | 2404.18709 | null |
2024-04-29 | Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots | Xi Xin et.al. | 2404.18702 | null |
2024-04-29 | Work Smarter…Not Harder: Efficient Minimization of Dependency Length in SOV Languages | Sidharth Ranjan et.al. | 2404.18684 | null |
2024-04-29 | LLMClean: Context-Aware Tabular Data Cleaning via LLM-Generated OFDs | Fabian Biester et.al. | 2404.18681 | null |
2024-04-29 | Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning | Annie Hu et.al. | 2404.18670 | null |
2024-04-29 | Uncertainty-boosted Robust Video Activity Anticipation | Zhaobo Qi et.al. | 2404.18648 | link |
2024-04-29 | CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Yunshuang Yuan et.al. | 2404.18617 | link |
2024-04-29 | Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing | Stefano Carlo Lambertenghi et.al. | 2404.18577 | link |
2024-04-29 | Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification | Ruben Grewal et.al. | 2404.18573 | link |
2024-04-29 | IncidentResponseGPT: Generating Traffic Incident Response Plans with Generative Artificial Intelligence | Artur Grigorev et.al. | 2404.18550 | null |
2024-04-29 | Reduced-Rank Multi-objective Policy Learning and Optimization | Ezinne Nwankwo et.al. | 2404.18490 | null |
2024-04-29 | MRIC: Model-Based Reinforcement-Imitation Learning with Mixture-of-Codebooks for Autonomous Driving Simulation | Baotian He et.al. | 2404.18464 | null |
2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
2024-04-28 | Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ) | Malur Narayan et.al. | 2404.18276 | null |
2024-04-28 | A General Causal Inference Framework for Cross-Sectional Observational Data | Yonghe Zhao et.al. | 2404.18197 | null |
2024-04-28 | Application and practice of AI technology in quantitative investment | Shuochen Bi et.al. | 2404.18184 | null |
2024-04-26 | The Role of Marketing in Public Policy Decision Making: The Case of Fuel Subsidy Removal in Nigeria | Salome O. Ighomereho et.al. | 2404.17551 | null |
2024-04-26 | CoCar NextGen: a Multi-Purpose Platform for Connected Autonomous Driving Research | Marc Heinrich et.al. | 2404.17550 | null |
2024-04-26 | A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment | Haicheng Liao et.al. | 2404.17520 | null |
2024-04-26 | Q-Learning to navigate turbulence without a map | Marco Rando et.al. | 2404.17495 | null |
2024-04-26 | Causally Abstracted Multi-armed Bandits | Fabio Massimo Zennaro et.al. | 2404.17493 | link |
2024-04-26 | A multi-agent model of hierarchical decision dynamics | Paul Kinsler et.al. | 2404.17477 | null |
2024-04-26 | Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection | Moussa Kassem Sbeyti et.al. | 2404.17427 | link |
2024-04-26 | Assessing the Potential of AI for Spatially Sensitive Nature-Related Financial Risks | Steven Reece et.al. | 2404.17369 | null |
2024-04-26 | On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System | Mohamed Roshdi et.al. | 2404.17350 | null |
2024-04-26 | Scene-Extrapolation: Generating Interactive Traffic Scenarios | Maximilian Zipfl et.al. | 2404.17224 | null |
2024-04-26 | Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving | C. Gong et.al. | 2404.17198 | null |
2024-04-26 | Online $\mathrm{L}^{\natural}$ -Convex Minimization | Ken Yokoyama et.al. | 2404.17158 | null |
2024-04-26 | On the Federated Learning Framework for Cooperative Perception | Zhenrong Zhang et.al. | 2404.17147 | null |
2024-04-25 | Defect Localization Using Region of Interest and Histogram-Based Enhancement Approaches in 3D-Printing | Md Manjurul Ahsan et.al. | 2404.17015 | null |
2024-04-25 | Evolve Cost-aware Acquisition Functions Using Large Language Models | Yiming Yao et.al. | 2404.16906 | link |
2024-04-25 | Harnessing Inferior Solutions For Superior Outcomes: Obtaining Robust Solutions From Quantum Algorithms | Pascal Halffmann et.al. | 2404.16784 | null |
2024-04-25 | SHINE: Social Homology Identification for Navigation in Crowded Environments | Diego Martinez-Baselga et.al. | 2404.16705 | null |
2024-04-25 | Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents | Giorgio Piatti et.al. | 2404.16698 | link |
2024-04-25 | Benchmarking Mobile Device Control Agents across Diverse Configurations | Juyong Lee et.al. | 2404.16660 | null |
2024-04-25 | T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients | Evandro S. Ortigossa et.al. | 2404.16495 | null |
2024-04-25 | CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations | Chen Wei et.al. | 2404.16482 | link |
2024-04-25 | DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference | Zhihao Shuai et.al. | 2404.16474 | null |
2024-04-25 | Label-Free Topic-Focused Summarization Using Query Augmentation | Wenchuan Mu et.al. | 2404.16411 | null |
2024-04-25 | ReZero: Boosting MCTS-based Algorithms by Just-in-Time and Speedy Reanalyze | Chunyu Xuan et.al. | 2404.16364 | link |
2024-04-25 | Unraveling cell-cell communication with NicheNet by inferring active ligands from transcriptomics data | Chananchida Sang-aram et.al. | 2404.16358 | null |
2024-04-25 | Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu et.al. | 2404.16356 | null |
2024-04-25 | Style Adaptation for Domain-adaptive Semantic Segmentation | Ting Li et.al. | 2404.16301 | null |
2024-04-25 | A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Yifan Zhao et.al. | 2404.16266 | link |
2024-04-24 | A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges | Melih Yazgan et.al. | 2404.16139 | null |
2024-04-24 | Cantor: Inspiring Multimodal Chain-of-Thought of MLLM | Timin Gao et.al. | 2404.16033 | null |
2024-04-24 | Learning Car-Following Behaviors Using Bayesian Matrix Normal Mixture Regression | Chengyuan Zhang et.al. | 2404.16023 | null |
2024-04-24 | Explainable AI models for predicting liquefaction-induced lateral spreading | Cheng-Hsi Hsiao et.al. | 2404.15959 | link |
2024-04-24 | Rechargeable UAV Trajectory Optimization for Real-Time Persistent Data Collection of Large-Scale Sensor Networks | Rui Wang et.al. | 2404.15761 | null |
2024-04-24 | Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning | Zuheng Kang et.al. | 2404.15704 | null |
2024-04-24 | Delay-Aware Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control with Model-based Stability Enhancement | Jiaqi Liu et.al. | 2404.15696 | null |
2024-04-24 | Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data | Aliaksei Vertsel et.al. | 2404.15604 | null |
2024-04-23 | CASPR: Automated Evaluation Metric for Contrastive Summarization | Nirupan Ananthamurugan et.al. | 2404.15565 | link |
2024-04-23 | Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction | Shili Sheng et.al. | 2404.15557 | null |
2024-04-23 | BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis | Shuhang Lin et.al. | 2404.15532 | link |
2024-04-23 | Deep Models for Multi-View 3D Object Recognition: A Review | Mona Alzahrani et.al. | 2404.15224 | null |
2024-04-23 | Evaluating Physician-AI Interaction for Cancer Management: Paving the Path towards Precision Oncology | Zeshan Hussain et.al. | 2404.15187 | null |
2024-04-23 | Bias patterns in the application of LLMs for clinical decision support: A comprehensive study | Raphael Poulain et.al. | 2404.15149 | link |
2024-04-23 | Using ARIMA to Predict the Expansion of Subscriber Data Consumption | Mike Wa Nkongolo et.al. | 2404.15095 | null |
2024-04-23 | Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It | Yuta Saito et.al. | 2404.15084 | null |
2024-04-23 | A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI | Seliem El-Sayed et.al. | 2404.15058 | null |
2024-04-23 | Conformal Predictive Systems Under Covariate Shift | Jef Jonkers et.al. | 2404.15018 | link |
2024-04-23 | OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang et.al. | 2404.15014 | null |
2024-04-23 | Vision Beyond Boundaries: An Initial Design Space of Domain-specific Large Vision Models in Human-robot Interaction | Yuchong Zhang et.al. | 2404.14965 | null |
2024-04-23 | Enhancing High-Speed Cruising Performance of Autonomous Vehicles through Integrated Deep Reinforcement Learning Framework | Jinhao Liang et.al. | 2404.14713 | null |
2024-04-23 | LaneCorrect: Self-supervised Lane Detection | Ming Nie et.al. | 2404.14671 | null |
2024-04-23 | Illuminating the Unseen: A Framework for Designing and Mitigating Context-induced Harms in Behavioral Sensing | Han Zhang et.al. | 2404.14665 | null |
2024-04-23 | AI Procurement Checklists: Revisiting Implementation in the Age of AI Governance | Tom Zick et.al. | 2404.14660 | null |
2024-04-23 | Uncertainty Quantification on Graph Learning: A Survey | Chao Chen et.al. | 2404.14642 | null |
2024-04-23 | Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Matthew Colwell et.al. | 2404.14635 | null |
2024-04-22 | A general framework for supporting economic feasibility of generator and storage energy systems through capacity and dispatch optimization | Saeed Azad et.al. | 2404.14583 | link |
2024-04-22 | Designing forecasting software for forecast users: Empowering non-experts to create and understand their own forecasts | Richard Stromer et.al. | 2404.14575 | null |
2024-04-22 | A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part I-Past and Future | Ke Li et.al. | 2404.14571 | null |
2024-04-22 | Exploring Algorithmic Explainability: Generating Explainable AI Insights for Personalized Clinical Decision Support Focused on Cannabis Intoxication in Young Adults | Tongze Zhang et.al. | 2404.14563 | null |
2024-04-22 | Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning | Zifan Zhang et.al. | 2404.14497 | null |
2024-04-22 | Analysing the interaction of expansion decisions by end customers and grid development in the context of a municipal energy system | Paul Maximilian Röhrig et.al. | 2404.14371 | null |
2024-04-22 | PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving | Jie Cheng et.al. | 2404.14327 | null |
2024-04-22 | Localization Based on MIMO Backscattering from Retro-Directive Antenna Arrays | Marina Lotti et.al. | 2404.14206 | null |
2024-04-22 | Unlawful Proxy Discrimination: A Framework for Challenging Inherently Discriminatory Algorithms | Hilde Weerts et.al. | 2404.14050 | null |
2024-04-22 | PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer | Rui She et.al. | 2404.14034 | null |
2024-04-22 | Collaborative Perception Datasets in Autonomous Driving: A Survey | Melih Yazgan et.al. | 2404.14022 | null |
2024-04-22 | Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation | Liwen Wang et.al. | 2404.13945 | null |
2024-04-22 | Open Datasets for Satellite Radio Resource Control | Husnain Shahid et.al. | 2404.13920 | null |
2024-04-22 | Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals | Qingyang Wu et.al. | 2404.13885 | null |
2024-04-22 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-21 | Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving | Shuyao Shi et.al. | 2404.13786 | null |
2024-04-21 | A Practical Multilevel Governance Framework for Autonomous and Intelligent Systems | Lukas D. Pöhler et.al. | 2404.13719 | null |
2024-04-21 | In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review | Lequn Chen et.al. | 2404.13673 | null |
2024-04-21 | Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments | Zirui Wang et.al. | 2404.13600 | null |
2024-04-20 | FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving | Ganesh Sistu et.al. | 2404.13443 | null |
2024-04-20 | Distribution Network Restoration: Resource Scheduling Considering Coupled Transportation-Power Networks | Harshal D. Kaushik et.al. | 2404.13422 | null |
2024-04-20 | On Modeling Multi-Criteria Decision Making with Uncertain Information using Probabilistic Rules | Shengxin Hong et.al. | 2404.13419 | null |
2024-04-20 | Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction | Quancheng Du et.al. | 2404.13378 | null |
2024-04-20 | Beyond Collaborative Filtering: A Relook at Task Formulation in Recommender Systems | Aixin Sun et.al. | 2404.13375 | null |
2024-04-20 | On Risk-Sensitive Decision Making Under Uncertainty | Chung-Han Hsieh et.al. | 2404.13371 | null |
2024-04-19 | Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction | Paulo Henrique dos Santos et.al. | 2404.13002 | null |
2024-04-19 | Private Agent-Based Modeling | Ayush Chopra et.al. | 2404.12983 | null |
2024-04-19 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | link |
2024-04-19 | Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Ross Greer et.al. | 2404.12856 | link |
2024-04-19 | Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet | Gazi Hasin Ishrak et.al. | 2404.12841 | null |
2024-04-19 | Open Datasets for AI-Enabled Radio Resource Control in Non-Terrestrial Networks | Husnain Shahid et.al. | 2404.12813 | null |
2024-04-19 | Algorithmic Changes Are Not Enough: Evaluating the Removal of Race Adjustment from the eGFR Equation | Marika M. Cusick et.al. | 2404.12812 | link |
2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
2024-04-19 | Camera Agnostic Two-Head Network for Ego-Lane Inference | Chaehyeon Song et.al. | 2404.12770 | null |
2024-04-19 | Immersive Analysis: Enhancing Material Inspection of X-Ray Computed Tomography Datasets in Augmented Reality | Alexander Gall et.al. | 2404.12751 | null |
2024-04-19 | Demonstration of quantum projective simulation on a single-photon-based quantum computer | Giacomo Franceschetto et.al. | 2404.12729 | null |
2024-04-19 | A Containerized Microservice Architecture for a ROS 2 Autonomous Driving Software: An End-to-End Latency Evaluation | Tobias Betz et.al. | 2404.12683 | null |
2024-04-19 | Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework | Sheng Wang et.al. | 2404.12624 | null |
2024-04-19 | Deep Reinforcement Learning-aided Transmission Design for Energy-efficient Link Optimization in Vehicular Communications | Zhengpeng Wang et.al. | 2404.12595 | null |
2024-04-19 | Multi-Objective Offloading Optimization in MEC and Vehicular-Fog Systems: A Distributed-TD3 Approach | Frezer Guteta Wakgra et.al. | 2404.12584 | null |
2024-04-19 | Just Like Me: The Role of Opinions and Personal Experiences in The Perception of Explanations in Subjective Decision-Making | Sharon Ferguson et.al. | 2404.12558 | null |
2024-04-19 | Variance-informed Rounding Uncertainty Analysis for Floating-point Statistical Models | Sahil Bhola et.al. | 2404.12556 | null |
2024-04-18 | State Discretization for Continuous-State MDPs in Infectious Disease Control | Suyanpeng Zhang et.al. | 2404.12540 | null |
2024-04-18 | TrACT: A Training Dynamics Aware Contrastive Learning Framework for Long-tail Trajectory Prediction | Junrui Zhang et.al. | 2404.12538 | null |
2024-04-18 | RoboDreamer: Learning Compositional World Models for Robot Imagination | Siyuan Zhou et.al. | 2404.12377 | null |
2024-04-18 | MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale | Xiaotang Gai et.al. | 2404.12372 | null |
2024-04-18 | Decision making in stochastic extensive form I: Stochastic decision forests | E. Emanuel Rapsch et.al. | 2404.12332 | null |
2024-04-18 | Reducing Bias in Pre-trained Models by Tuning while Penalizing Change | Niklas Penzel et.al. | 2404.12292 | null |
2024-04-18 | An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles | Jilan Samiuddin et.al. | 2404.12256 | null |
2024-04-18 | Privacy-Preserving UCB Decision Process Verification via zk-SNARKs | Xikun Jiang et.al. | 2404.12186 | null |
2024-04-18 | Stability Certificates for Receding Horizon Games | Sophie Hall et.al. | 2404.12165 | null |
2024-04-18 | The Neutrality Fallacy: When Algorithmic Fairness Interventions are (Not) Positive Action | Hilde Weerts et.al. | 2404.12143 | null |
2024-04-18 | Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing? | Rui Xu et.al. | 2404.12138 | null |
2024-04-18 | mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture | Wei Zhang et.al. | 2404.12135 | link |
2024-04-18 | Intelligence Education made in Europe | Lars Berger et.al. | 2404.12125 | null |
2024-04-18 | Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams | Pivithuru Thejan Amarasinghe et.al. | 2404.12076 | null |
2024-04-18 | emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information | Jimenez Eladio et.al. | 2404.12050 | null |
2024-04-18 | Cost and CO2 emissions co-optimisation of green hydrogen production in a grid-connected renewable energy system | Sleiman Farah et.al. | 2404.11995 | null |
2024-04-18 | S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles | Xiao Wang et.al. | 2404.11946 | null |
2024-04-18 | Toward Short-Term Glucose Prediction Solely Based on CGM Time Series | Ming Cheng et.al. | 2404.11924 | null |
2024-04-18 | JointPPO: Diving Deeper into the Effectiveness of PPO in Multi-Agent Reinforcement Learning | Chenxing Liu et.al. | 2404.11831 | null |
2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
2024-04-17 | Multimodal 3D Object Detection on Unseen Domains | Deepti Hegde et.al. | 2404.11764 | null |
2024-04-17 | Language Models Still Struggle to Zero-shot Reason about Time Series | Mike A. Merrill et.al. | 2404.11757 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-17 | Explainable Artificial Intelligence Techniques for Accurate Fault Detection and Diagnosis: A Review | Ahmed Maged et.al. | 2404.11597 | null |
2024-04-17 | Open-Ended Wargames with Large Language Models | Daniel P. Hogan et.al. | 2404.11446 | link |
2024-04-17 | Explainable Lung Disease Classification from Chest X-Ray Images Utilizing Deep Learning and XAI | Tanzina Taher Ifty et.al. | 2404.11428 | null |
2024-04-18 | SERENE: A Collusion Resilient Replication-based Verification Framework | Amir Esmaeili et.al. | 2404.11410 | null |
2024-04-17 | Pharmacokinetic Measurements in Dose Finding Model Guided by Escalation with Overdose Control | Arnab Kumar Maity et.al. | 2404.11406 | null |
2024-04-17 | Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness | Hangtao Zhang et.al. | 2404.11357 | null |
2024-04-17 | The dynamics of diversity on corporate boards | Matthias Raddant et.al. | 2404.11334 | null |
2024-04-17 | Towards Human Awareness in Robot Task Planning with Large Language Models | Yuchen Liu et.al. | 2404.11267 | null |
2024-04-17 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-17 | D-Aug: Enhancing Data Augmentation for Dynamic LiDAR Scenes | Jiaxing Zhao et.al. | 2404.11127 | null |
2024-04-17 | Reuse out-of-year data to enhance land cover mappingvia feature disentanglement and contrastive learning | Cassio F. Dantas et.al. | 2404.11114 | null |
2024-04-17 | Recommender Systems in Financial Trading: Using machine-based conviction analysis in an explainable AI investment framework | Alicia Vidler et.al. | 2404.11080 | null |
2024-04-17 | Do you need a DAO? | Henrik Axelsen et.al. | 2404.11076 | null |
2024-04-17 | Sky-GVIO: an enhanced GNSS/INS/Vision navigation with FCN-based sky-segmentation in urban canyon | Jingrong Wang et.al. | 2404.11070 | link |
2024-04-17 | Periodicity in New York State COVID-19 Hospitalizations Leveraged from the Variable Bandpass Periodic Block Bootstrap | Asmaa Ahmad et.al. | 2404.11006 | null |
2024-04-17 | How to deal with glare for improved perception of Autonomous Vehicles | Muhammad Z. Alam et.al. | 2404.10992 | null |
2024-04-17 | Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Wei Duan et.al. | 2404.10976 | link |
2024-04-17 | Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models | Jan-Philipp Fränken et.al. | 2404.10975 | link |
2024-04-16 | Human-Algorithm Collaborative Bayesian Optimization for Engineering Systems | Tom Savage et.al. | 2404.10949 | link |
2024-04-16 | N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2404.10740 | link |
2024-04-16 | PD-Insighter: A Visual Analytics System to Monitor Daily Actions for Parkinson’s Disease Treatment | Jade Kandel et.al. | 2404.10661 | null |
2024-04-16 | Towards free-response paradigm: a theory on decision-making in spiking neural networks | Zhichao Zhu et.al. | 2404.10599 | null |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-16 | PAKT: Perspectivized Argumentation Knowledge Graph and Tool for Deliberation Analysis (with Supplementary Materials) | Moritz Plenz et.al. | 2404.10570 | null |
2024-04-16 | Quantum Mechanics of Human Perception, Behaviour and Decision-Making: A Do-It-Yourself Model Kit for Modelling Optical Illusions and Opinion Formation in Social Networks | Ivan S. Maksymov et.al. | 2404.10554 | link |
2024-04-16 | Warm-Start Variational Quantum Policy Iteration | Nico Meyer et.al. | 2404.10546 | link |
2024-04-16 | LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System | Shijing Hu et.al. | 2404.10498 | null |
2024-04-16 | Would You Trust an AI Doctor? Building Reliable Medical Predictions with Kernel Dropout Uncertainty | Ubaid Azam et.al. | 2404.10483 | null |
2024-04-16 | AudioProtoPNet: An interpretable deep learning model for bird sound classification | René Heinrich et.al. | 2404.10420 | null |
2024-04-16 | Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery | Payal Varshney et.al. | 2404.10356 | null |
2024-04-16 | Application of Deep Learning Methods to Processing of Noisy Medical Video Data | Danil Afonchikov et.al. | 2404.10319 | null |
2024-04-16 | NeuroMorphix: A Novel Brain MRI Asymmetry-specific Feature Construction Approach For Seizure Recurrence Prediction | Soumen Ghosh et.al. | 2404.10290 | null |
2024-04-16 | PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network | Yuning Wang et.al. | 2404.10263 | null |
2024-04-16 | Rethinking Software Engineering in the Foundation Model Era: From Task-Driven AI Copilots to Goal-Driven AI Pair Programmers | Ahmed E. Hassan et.al. | 2404.10225 | null |
2024-04-16 | The Impact of Machine Learning on Society: An Analysis of Current Trends and Future Implications | Md Kamrul Hossain Siam et.al. | 2404.10204 | null |
2024-04-15 | Online Estimation via Offline Estimation: An Information-Theoretic Framework | Dylan J. Foster et.al. | 2404.10122 | null |
2024-04-15 | Explainable Light-Weight Deep Learning Pipeline for Improved Drought Stres | Aswini Kumar Patra et.al. | 2404.10073 | null |
2024-04-15 | Evaluating the Explainability of Attributes and Prototypes for a Medical Classification Model | Luisa Gallée et.al. | 2404.09917 | null |
2024-04-15 | Flow-Based Synthesis of Reactive Tests for Discrete Decision-Making Systems with Temporal Logic Specifications | Josefine B. Graebener et.al. | 2404.09888 | null |
2024-04-15 | Effective Reinforcement Learning Based on Structural Information Principles | Xianghua Zeng et.al. | 2404.09760 | link |
2024-04-15 | Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows | Georg Rabenstein et.al. | 2404.09657 | null |
2024-04-15 | SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction | Pin Tang et.al. | 2404.09502 | null |
2024-04-15 | Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Genjia Liu et.al. | 2404.09496 | link |
2024-04-15 | VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Bonan Ding et.al. | 2404.09431 | null |
2024-04-14 | SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation while Maintaining Stereo Constraint | Vasudha Venkatesan et.al. | 2404.09277 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-14 | A Reinforcement Learning Based Backfilling Strategy for HPC Batch Jobs | Elliot Kolker-Hicks et.al. | 2404.09264 | null |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-14 | Evaluating the efficacy of haptic feedback, 360° treadmill-integrated Virtual Reality framework and longitudinal training on decision-making performance in a complex search-and-shoot simulation | Akash K Rao et.al. | 2404.09147 | null |
2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | null |
2024-04-13 | Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Jia Gu et.al. | 2404.09043 | null |
2024-04-13 | Intention-Aware Control Based on Belief-Space Specifications and Stochastic Expansion | Zengjie Zhang et.al. | 2404.09037 | link |
2024-04-13 | An Agent-Based Model of Elephant Crop Raid Dynamics in the Periyar-Agasthyamalai Complex, India | Purathekandy Anjali et.al. | 2404.09024 | link |
2024-04-13 | Incremental Residual Concept Bottleneck Models | Chenming Shang et.al. | 2404.08978 | link |
2024-04-13 | MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes | Bor-Shiun Wang et.al. | 2404.08968 | link |
2024-04-13 | Understanding Multimodal Deep Neural Networks: A Concept Selection View | Chenming Shang et.al. | 2404.08964 | null |
2024-04-13 | Voting Participation and Engagement in Blockchain-Based Fan Tokens | Lennart Ante et.al. | 2404.08906 | null |
2024-04-12 | WROOM: An Autonomous Driving Approach for Off-Road Navigation | Dvij Kalaria et.al. | 2404.08855 | link |
2024-04-12 | A Typology of Decision-Making Tasks for Visualization | Camelia D. Brumar et.al. | 2404.08812 | null |
2024-04-12 | Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation | Brinnae Bent et.al. | 2404.08799 | link |
2024-04-12 | FusionPortableV2: A Unified Multi-Sensor Dataset for Generalized SLAM Across Diverse Platforms and Scalable Environments | Hexiang Wei et.al. | 2404.08563 | null |
2024-04-12 | Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery | Shiva Aryal et.al. | 2404.08511 | null |
2024-04-12 | Prescribing Optimal Health-Aware Operation for Urban Air Mobility with Deep Reinforcement Learning | Mina Montazeri et.al. | 2404.08497 | null |
2024-04-12 | Maturity of Vehicle Digital Twins: From Monitoring to Enabling Autonomous Driving | Robert Klar et.al. | 2404.08438 | null |
2024-04-12 | SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies | Maeghal Jain et.al. | 2404.08423 | null |
2024-04-12 | Collective Bayesian Decision-Making in a Swarm of Miniaturized Robots for Surface Inspection | Thiemen Siemensma et.al. | 2404.08390 | null |
2024-04-12 | Uncertainty Aware Tropical Cyclone Wind Speed Estimation from Satellite Data | Nils Lehmann et.al. | 2404.08325 | link |
2024-04-12 | Transfer Learning Study of Motion Transformer-based Trajectory Predictions | Lars Ullrich et.al. | 2404.08271 | null |
2024-04-12 | Enhancing Fairness and Performance in Machine Learning Models: A Multi-Task Learning Approach with Monte-Carlo Dropout and Pareto Optimality | Khadija Zanna et.al. | 2404.08230 | null |
2024-04-11 | Real-Time Detection and Analysis of Vehicles and Pedestrians using Deep Learning | Md Nahid Sadik et.al. | 2404.08081 | null |
2024-04-11 | VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning | Ming Cheng et.al. | 2404.08021 | null |
2024-04-11 | The Power of Properties: Uncovering the Influential Factors in Emotion Classification | Tim Büchner et.al. | 2404.07867 | null |
2024-04-11 | Sparse Laneformer | Ji Liu et.al. | 2404.07821 | null |
2024-04-12 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | Enhancing Valuation of Variable Annuities in Lévy Models with Stochastic Interest Rate | Ludovic Goudenège et.al. | 2404.07658 | null |
2024-04-11 | Homography Guided Temporal Fusion for Road Line and Marking Segmentation | Shan Wang et.al. | 2404.07626 | link |
2024-04-11 | International environmental treaties: An honest or a misguided effort | Reza Hafezi et.al. | 2404.07574 | null |
2024-04-11 | Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Marcel Hallgarten et.al. | 2404.07569 | link |
2024-04-11 | PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds | Weisheng Xu et.al. | 2404.07495 | link |
2024-04-11 | WESE: Weak Exploration to Strong Exploitation for LLM Agents | Xu Huang et.al. | 2404.07456 | null |
2024-04-11 | Data-Driven Portfolio Management for Motion Pictures Industry: A New Data-Driven Optimization Methodology Using a Large Language Model as the Expert | Mohammad Alipour-Vaezi et.al. | 2404.07434 | null |
2024-04-11 | Diversity’s Double-Edged Sword: Analyzing Race’s Effect on Remote Pair Programming Interactions | Shandler A. Mason et.al. | 2404.07427 | null |
2024-04-10 | Structured Reinforcement Learning for Media Streaming at the Wireless Edge | Archana Bura et.al. | 2404.07315 | null |
2024-04-10 | Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Vahid Balazadeh et.al. | 2404.07266 | link |
2024-04-10 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Machine learning-based similarity measure to forecast M&A from patent data | Giambattista Albora et.al. | 2404.07179 | link |
2024-04-10 | Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection | Linas Nasvytis et.al. | 2404.07099 | link |
2024-04-10 | Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Valentyn Boreiko et.al. | 2404.07045 | null |
2024-04-10 | LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models | Igor Tufanov et.al. | 2404.07004 | null |
2024-04-10 | Multi-Agent Soft Actor-Critic with Global Loss for Autonomous Mobility-on-Demand Fleet Control | Zeno Woywood et.al. | 2404.06975 | link |
2024-04-10 | A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks | Athanasios Karapantelakis et.al. | 2404.06946 | null |
2024-04-10 | SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving | Diankun Zhang et.al. | 2404.06892 | null |
2024-04-10 | RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds | Remco Royen et.al. | 2404.06863 | null |
2024-04-10 | Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks | Fulong Ma et.al. | 2404.06860 | null |
2024-04-10 | Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data | Aakash Kumar et.al. | 2404.06715 | null |
2024-04-09 | SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation | Waqwoya Abebe et.al. | 2404.06638 | link |
2024-04-09 | RoadBEV: Road Surface Reconstruction in Bird’s Eye View | Tong Zhao et.al. | 2404.06605 | link |
2024-04-09 | Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective | Victor-Alexandru Darvariu et.al. | 2404.06492 | null |
2024-04-09 | Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Valdecy Pereira et.al. | 2404.06370 | link |
2024-04-11 | HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Xiaolong Tang et.al. | 2404.06351 | link |
2024-04-09 | AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning | Senkang Hu et.al. | 2404.06345 | null |
2024-04-09 | Label-Efficient 3D Object Detection For Road-Side Units | Minh-Quan Dao et.al. | 2404.06256 | null |
2024-04-09 | Towards Autonomous Driving with Small-Scale Cars: A Survey of Recent Development | Dianzhao Li et.al. | 2404.06229 | null |
2024-04-09 | Intelligence and Motion Models of Continuum Robots: an Overview | Oxana Shamilyan et.al. | 2404.06171 | null |
2024-04-09 | Distributed Artificial Intelligence as a Means to Achieve Self-X-Functions for Increasing Resilience: the First Steps | Oxana Shamilyan et.al. | 2404.06159 | null |
2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
2024-04-09 | Passive None-line-of-sight imaging with arbitrary scene condition and detection pattern in small amount of prior data | Yunting Gui et.al. | 2404.06015 | null |
2024-04-09 | Feel-Good Thompson Sampling for Contextual Dueling Bandits | Xuheng Li et.al. | 2404.06013 | null |
2024-04-09 | Concept-Attention Whitening for Interpretable Skin Lesion Diagnosis | Junlin Hou et.al. | 2404.05997 | null |
2024-04-09 | Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Lillian Muyama et.al. | 2404.05913 | link |
2024-04-08 | ClusterRadar: an Interactive Web-Tool for the Multi-Method Exploration of Spatial Clusters Over Time | Lee Mason et.al. | 2404.05897 | link |
2024-04-08 | Model Predictive Control based Energy Management System for Home Energy Resiliency | Ninad Gaikwad et.al. | 2404.05873 | null |
2024-04-08 | Approaching Emergent Risks: An Exploratory Study into Artificial Intelligence Risk Management within Financial Organisations | Finlay McGee et.al. | 2404.05847 | null |
2024-04-08 | Attention-Driven Multi-Agent Reinforcement Learning: Enhancing Decisions with Expertise-Informed Tasks | Andre R Kuroswiski et.al. | 2404.05840 | null |
2024-04-09 | Dynamic Backtracking in GFlowNets: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms | Shuai Guo et.al. | 2404.05576 | null |
2024-04-08 | Evaluating Interventional Reasoning Capabilities of Large Language Models | Tejas Kasetty et.al. | 2404.05545 | null |
2024-04-08 | Decisioning Workshop 2023 | Mario Lezoche et.al. | 2404.05495 | null |
2024-04-08 | What Are the Odds? Improving the foundations of Statistical Model Checking | Tobias Meggendorfer et.al. | 2404.05424 | null |
2024-04-08 | Residual Chain Prediction for Autonomous Driving Path Planning | Liguo Zhou et.al. | 2404.05423 | null |
2024-04-08 | Logic-dependent emergence of multistability, hysteresis, and biphasic dynamics in a minimal positive feedback network with an autoloop | Akriti Srivastava et.al. | 2404.05379 | null |
2024-04-08 | A Max-Min-Max Algorithm for Large-Scale Robust Optimization | Kai Tu et.al. | 2404.05377 | null |
2024-04-08 | Human Detection from 4D Radar Data in Low-Visibility Field Conditions | Mikael Skog et.al. | 2404.05307 | null |
2024-04-08 | Detecting Every Object from Events | Haitian Zhang et.al. | 2404.05285 | link |
2024-04-08 | MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Xiahan Chen et.al. | 2404.05280 | null |
2024-04-08 | Fair Machine Guidance to Enhance Fair Decision Making in Biased People | Mingzhe Yang et.al. | 2404.05228 | null |
2024-04-08 | Maximally Forward-Looking Core Inflation | Philippe Goulet Coulombe et.al. | 2404.05209 | null |
2024-04-08 | GloSoFarID: Global multispectral dataset for Solar Farm IDentification in satellite imagery | Zhiyuan Yang et.al. | 2404.05180 | link |
2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159 | null |
2024-04-08 | UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather | Haimei Zhao et.al. | 2404.05145 | null |
2024-04-08 | Enhancing Clinical Efficiency through LLM: Discharge Note Generation for Cardiac Patients | HyoJe Jung et.al. | 2404.05144 | null |
2024-04-08 | Better Monocular 3D Detectors with LiDAR from the Past | Yurong You et.al. | 2404.05139 | link |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-07 | Dir-SPGLM: A Bayesian semiparametric GLM with data-driven reference distribution | Entejar Alam et.al. | 2404.05060 | null |
2024-04-07 | Percentile Criterion Optimization in Offline Reinforcement Learning | Elita A. Lobo et.al. | 2404.05055 | link |
2024-04-05 | Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology | Gaith Rjoub et.al. | 2404.04205 | null |
2024-04-05 | Exploring Probabilistic Models for Semi-supervised Learning | Jianfeng Wang et.al. | 2404.04199 | null |
2024-04-05 | You Can Use But Cannot Recognize: Preserving Visual Privacy in Deep Neural Networks | Qiushi Li et.al. | 2404.04098 | null |
2024-04-05 | The forgotten pillar of sustainability: development of the S-assessment tool to evaluate Organizational Social Sustainability | Alessandro Annarelli et.al. | 2404.04077 | null |
2024-04-05 | Bidirectional Human Interactive AI Framework for Social Robot Navigation | Tuba Girgin et.al. | 2404.04069 | null |
2024-04-05 | Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems | Apoorva Nalini Pradeep Kumar et.al. | 2404.03995 | null |
2024-04-05 | Modulation of metastable ensemble dynamics explains optimal coding at moderate arousal in auditory cortex | Lia Papadopoulos et.al. | 2404.03902 | null |
2024-04-05 | Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable AI | Maryam Ahmed et.al. | 2404.03892 | null |
2024-04-05 | Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration | Xudong Guo et.al. | 2404.03869 | null |
2024-04-05 | Scaling Motion Forecasting Models with Ensemble Distillation | Scott Ettinger et.al. | 2404.03843 | null |
2024-04-04 | An ExplainableFair Framework for Prediction of Substance Use Disorder Treatment Completion | Mary M. Lucas et.al. | 2404.03833 | null |
2024-04-04 | Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation | Elham Amin Mansour et.al. | 2404.03799 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-04 | AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent | Hanyu Lai et.al. | 2404.03648 | link |
2024-04-04 | Is CLIP the main roadblock for fine-grained open-world perception? | Lorenzo Bianchi et.al. | 2404.03539 | link |
2024-04-04 | Integrating Generative AI into Financial Market Prediction for Improved Decision Making | Chang Che et.al. | 2404.03523 | null |
2024-04-04 | Materials for High Temperature Digital Electronics | Dhiren K. Pradhan et.al. | 2404.03510 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-04 | Knowledge Distillation-Based Model Extraction Attack using Private Counterfactual Explanations | Fatima Ezzeddine et.al. | 2404.03348 | link |
2024-04-04 | Learning to Bid in Forward Electricity Markets Using a No-Regret Algorithm | Arega Getaneh Abate et.al. | 2404.03314 | null |
2024-04-04 | Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks | Xingran Chen et.al. | 2404.03227 | null |
2024-04-04 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
2024-04-04 | The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models | Noah Y. Siegel et.al. | 2404.03189 | null |
2024-04-03 | Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Navid Mahdian et.al. | 2404.03110 | link |
2024-04-03 | Composite Bayesian Optimization In Function Spaces Using NEON – Neural Epistemic Operator Networks | Leonardo Ferreira Guilhoto et.al. | 2404.03099 | null |
2024-04-03 | Data-Driven Goal Recognition Design for General Behavioral Agents | Robert Kasumba et.al. | 2404.03054 | null |
2024-04-03 | When Digital Twin Meets Generative AI: Intelligent Closed-Loop Network Management | Xinyu Huang et.al. | 2404.03025 | null |
2024-04-03 | Tricks from the Trade for Large-Scale Markdown Pricing: Heuristic Cut Generation for Lagrangian Decomposition | Robert Streeck et.al. | 2404.02996 | null |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | IEEE VIS Workshop on Visualization for Climate Action and Sustainability | Benjamin Bach et.al. | 2404.02743 | null |
2024-04-03 | Unsupervised Learning of Effective Actions in Robotics | Marko Zaric et.al. | 2404.02728 | link |
2024-04-03 | Towards detecting unanticipated bias in Large Language Models | Anna Kruspe et.al. | 2404.02650 | null |
2024-04-03 | On the Importance of Uncertainty in Decision-Making with Large Language Models | Nicolò Felicioni et.al. | 2404.02649 | null |
2024-04-03 | One Stack to Rule them All: To Drive Automated Vehicles, and Reach for the 4th level | Sven Ochs et.al. | 2404.02645 | null |
2024-04-04 | Vestibular schwannoma growth prediction from longitudinal MRI by time conditioned neural fields | Yunjie Chen et.al. | 2404.02614 | link |
2024-04-03 | Incremental Learning with Concept Drift Detection and Prototype-based Embeddings for Graph Stream Classification | Kleanthis Malialis et.al. | 2404.02572 | null |
2024-04-03 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
2024-04-03 | Task Agnostic Architecture for Algorithm Induction via Implicit Composition | Sahil J. Sindhi et.al. | 2404.02450 | null |
2024-04-03 | From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives | Shuxian Fan et.al. | 2404.02438 | null |
2024-04-03 | AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset | Dongsu Lee et.al. | 2404.02429 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410 | null |
2024-04-04 | CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation | Townim Faisal Chowdhury et.al. | 2404.02388 | link |
2024-04-02 | Attribution Regularization for Multimodal Paradigms | Sahiti Yerramilli et.al. | 2404.02359 | null |
2024-04-02 | From Delays to Densities: Exploring Data Uncertainty through Speech, Text, and Visualization | Chase Stokes et.al. | 2404.02317 | null |
2024-04-02 | OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment | Youshaa Murhij et.al. | 2404.02263 | link |
2024-04-02 | OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang et.al. | 2404.02227 | link |
2024-04-02 | FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning | Joel Niklaus et.al. | 2404.02127 | link |
2024-04-02 | Risk-Aware Real-Time Task Allocation for Stochastic Multi-Agent Systems under STL Specifications | Maico H. W. Engelaar et.al. | 2404.02111 | null |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-02 | A Survey on Large Language Model-Based Game Agents | Sihao Hu et.al. | 2404.02039 | link |
2024-04-02 | Enhancing Portfolio Optimization with Transformer-GAN Integration: A Novel Approach in the Black-Litterman Framework | Enmin Zhu et.al. | 2404.02029 | null |
2024-04-02 | DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning | Mengfei Du et.al. | 2404.01994 | link |
2024-04-02 | Heuristic Optimization of Amplifier Reconfiguration Process for Autonomous Driving Optical Networks | Qizhi Qiu et.al. | 2404.01949 | null |
2024-04-02 | Improving Bird’s Eye View Semantic Segmentation by Task Decomposition | Tianhao Zhao et.al. | 2404.01925 | null |
2024-04-02 | A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution | Bowen Ding et.al. | 2404.01921 | link |
2024-04-02 | Neuromorphic Split Computing with Wake-Up Radios: Architecture and Design via Digital Twinning | Jiechen Chen et.al. | 2404.01815 | null |
2024-04-02 | Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs | Ioanna Souvatzoglou et.al. | 2404.01757 | null |
2024-04-02 | Safe Interval RRT* for Scalable Multi-Robot Path Planning in Continuous Space | Joonyeol Sim et.al. | 2404.01752 | link |
2024-04-02 | Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder | Anass Bairouk et.al. | 2404.01750 | null |
2024-04-02 | Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation | Piyush Gupta et.al. | 2404.01746 | null |
2024-04-02 | Boosting Visual Recognition for Autonomous Driving in Real-world Degradations with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments | Duy-Tho Le et.al. | 2404.01686 | null |
2024-04-02 | Collaborative Optimization of Wireless Communication and Computing Resource Allocation based on Multi-Agent Federated Weighting Deep Reinforcement Learning | Junjie Wu et.al. | 2404.01638 | null |
2024-04-02 | Voice EHR: Introducing Multimodal Audio Data for Health | James Anibal et.al. | 2404.01620 | null |
2024-04-02 | Haina Storage: A Decentralized Secure Storage Framework Based on Improved Blockchain Structure | Zijian Zhou et.al. | 2404.01606 | link |
2024-04-02 | Language Model Guided Interpretable Video Action Reasoning | Ning Wang et.al. | 2404.01591 | null |
2024-03-29 | Localising the Seizure Onset Zone from Single-Pulse Electrical Stimulation Responses with a Transformer | Jamie Norris et.al. | 2403.20324 | link |
2024-03-29 | Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain | Burcu Sayin et.al. | 2403.20288 | link |
2024-03-29 | Optimal Policy Learning with Observational Data in Multi-Action Scenarios: Estimation, Risk Preference, and Potential Failures | Giovanni Cerulli et.al. | 2403.20250 | null |
2024-03-29 | A simple EEG-based decision tool for neonatal therapeutic hypothermia in hypoxic-ischemic encephalopathy | Marc Fiammante et.al. | 2403.20239 | null |
2024-03-29 | Enhancing Lithological Mapping with Spatially Constrained Bayesian Network (SCB-Net): An Approach for Field Data-Constrained Predictions with Uncertainty Evaluation | Victor Silva dos Santos et.al. | 2403.20195 | link |
2024-03-29 | Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning | Duzhen Zhang et.al. | 2403.20163 | null |
2024-03-29 | Conformal Prediction for Stochastic Decision-Making of PV Power in Electricity Markets | Yvet Renkema et.al. | 2403.20149 | null |
2024-03-29 | Application of Machine Learning Algorithms in Classifying Postoperative Success in Metabolic Bariatric Surgery: A Comprehensive Study | José Alberto Benítez-Andrades et.al. | 2403.20124 | null |
2024-03-29 | LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving | Pranjal Paul et.al. | 2403.20116 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-29 | Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Toshihiro Ota et.al. | 2403.19925 | link |
2024-03-29 | PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Ruining Yang et.al. | 2403.19893 | null |
2024-03-28 | Optimal regimes with limited resources | Aaron L. Sarvet et.al. | 2403.19842 | null |
2024-03-28 | Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Akshay Gopalkrishnan et.al. | 2403.19838 | link |
2024-03-28 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation | Qitian Ma et.al. | 2403.19826 | null |
2024-03-28 | A Digital Twin for Geological Carbon Storage with Controlled Injectivity | Abhinav Prakash Gahlot et.al. | 2403.19819 | null |
2024-03-28 | Human-compatible driving partners through data-regularized self-play reinforcement learning | Daphne Cornelisse et.al. | 2403.19648 | link |
2024-03-28 | In the driver’s mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles | Samir H. A. Mohammad et.al. | 2403.19637 | null |
2024-03-28 | Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction | Drew T. Nguyen et.al. | 2403.19605 | link |
2024-03-28 | Behavior Trees in Industrial Applications: A Case Study in Underground Explosive Charging | Mattias Hallen et.al. | 2403.19602 | null |
2024-03-28 | Swarm Characteristics Classification Using Neural Networks | Donald W. Peltier III et.al. | 2403.19572 | link |
2024-03-28 | Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization | Simon Idoko et.al. | 2403.19461 | link |
2024-03-28 | Transparent and Clinically Interpretable AI for Lung Cancer Detection in Chest X-Rays | Amy Rafferty et.al. | 2403.19444 | null |
2024-03-28 | SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control | Binyuan Huang et.al. | 2403.19438 | null |
2024-03-28 | Learning a Formally Verified Control Barrier Function in Stochastic Environment | Manan Tayal et.al. | 2403.19332 | link |
2024-03-28 | A Machine Learning Approach for Crop Yield and Disease Prediction Integrating Soil Nutrition and Weather Factors | Forkan Uddin Ahmed et.al. | 2403.19273 | null |
2024-03-28 | Evaluating Fair Feature Selection in Machine Learning for Healthcare | Md Rahat Shahriar Zawad et.al. | 2403.19165 | null |
2024-03-28 | Gamu Blue: A Practical Tool for Game Theory Security Equilibria | Ameer Taweel et.al. | 2403.19130 | link |
2024-03-28 | CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao et.al. | 2403.19104 | null |
2024-03-28 | GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving | Yunpeng Zhang et.al. | 2403.19098 | link |
2024-03-27 | GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning | Hsin-Jung Yang et.al. | 2403.19062 | null |
2024-03-27 | Ensuring Safe Autonomy: Navigating the Future of Autonomous Vehicles | Patrick Wolf et.al. | 2403.19006 | null |
2024-03-27 | LORD: Large Models based Opposite Reward Design for Autonomous Driving | Xin Ye et.al. | 2403.18965 | null |
2024-03-27 | 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation | Ehsan Latif et.al. | 2403.18778 | null |
2024-03-27 | Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Xintong Wang et.al. | 2403.18715 | link |
2024-03-27 | Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks | Levent Ögretmen et.al. | 2403.18643 | link |
2024-03-27 | Modeling Sustainable City Trips: Integrating CO2 Emissions, Popularity, and Seasonality into Tourism Recommender Systems | Ashmi Banerjee et.al. | 2403.18604 | null |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-27 | Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks | Tian Ye et.al. | 2403.18318 | null |
2024-03-27 | Manipulating Neural Path Planners via Slight Perturbations | Zikang Xiong et.al. | 2403.18256 | null |
2024-03-27 | From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Ergon Cugler de Moraes Silva et.al. | 2403.18219 | link |
2024-03-27 | Preference-Based Planning in Stochastic Environments: From Partially-Ordered Temporal Goals to Most Preferred Policies | Hazhar Rahmani et.al. | 2403.18212 | null |
2024-03-27 | Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Xuemin Hu et.al. | 2403.18209 | null |
2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | null |
2024-03-27 | Integrating urban digital twins with cloud-based geospatial dashboards for coastal resilience planning: A case study in Florida | Changjie Chen et.al. | 2403.18188 | null |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-27 | Empowering Data Mesh with Federated Learning | Haoyuan Li et.al. | 2403.17878 | link |
2024-03-26 | Counterfactual Fairness through Transforming Data Orthogonal to Bias | Shuyi Chen et.al. | 2403.17852 | null |
2024-03-26 | Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving | Axel Brunnbauer et.al. | 2403.17805 | link |
2024-03-27 | Query Refinement for Diverse Top- $k$ Selection | Felix S. Campbell et.al. | 2403.17786 | null |
2024-03-26 | LiDAR-Based Crop Row Detection Algorithm for Over-Canopy Autonomous Navigation in Agriculture Fields | Ruiji Liu et.al. | 2403.17774 | link |
2024-03-26 | Optimization-based Prompt Injection Attack to LLM-as-a-Judge | Jiawen Shi et.al. | 2403.17710 | link |
2024-03-26 | Healthcare Data Governance, Privacy, and Security – A Conceptual Framework | Amen Faridoon et.al. | 2403.17648 | null |
2024-03-27 | Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering | Pascal Tilli et.al. | 2403.17647 | link |
2024-03-26 | Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems | Siyu Wang et.al. | 2403.17634 | null |
2024-03-26 | UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps | Maciej K Wozniak et.al. | 2403.17633 | link |
2024-03-26 | Quadratic speed-ups in quantum kernelized binary classification | Jungyun Lee et.al. | 2403.17453 | null |
2024-03-26 | Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion | Kazi Shahriar Sanjid et.al. | 2403.17432 | null |
2024-03-26 | A Survey on Resource Management in Joint Communication and Computing-Embedded SAGIN | Qian Chen et.al. | 2403.17400 | null |
2024-03-26 | AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving | Mingfu Liang et.al. | 2403.17373 | null |
2024-03-26 | Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent | Paula Stocco et.al. | 2403.17358 | link |
2024-03-26 | Deep Support Vectors | Junhoo Lee et.al. | 2403.17329 | null |
2024-03-27 | Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng et.al. | 2403.17301 | link |
2024-03-25 | Review Ecosystems to access Educational XR Experiences: a Scoping Review | Shaun Bangay et.al. | 2403.17243 | null |
2024-03-25 | SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving | Yiming Xie et.al. | 2403.17094 | null |
2024-03-25 | Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making | Shuai Ma et.al. | 2403.16812 | null |
2024-03-25 | An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems | Hanqing Yang et.al. | 2403.16809 | link |
2024-03-25 | A Blotto Game Approach to Ride-hailing Markets with Electric Vehicles | Marko Maljkovic et.al. | 2403.16755 | null |
2024-03-25 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | Instantaneous Visual Analysis of Blood Flow in Stenoses Using Morphological Similarity | Pepe Eulzer et.al. | 2403.16653 | null |
2024-03-25 | Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts | Rabindra Lamsal et.al. | 2403.16614 | null |
2024-03-25 | ROXIE: Defining a Robotic eXplanation and Interpretability Engine | Francisco J. Rodríguez-Lera et.al. | 2403.16606 | null |
2024-03-25 | Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art | Neeloy Chakraborty et.al. | 2403.16527 | null |
2024-03-25 | Harnessing the power of LLMs for normative reasoning in MASs | Bastin Tony Roy Savarimuthu et.al. | 2403.16524 | null |
2024-03-25 | Learning To Guide Human Decision Makers With Vision-Language Models | Debodeep Banerjee et.al. | 2403.16501 | null |
2024-03-25 | RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection | Zhiwei Lin et.al. | 2403.16440 | link |
2024-03-25 | An image-computable model of speeded decision-making | Paul I. Jaffe et.al. | 2403.16382 | link |
2024-03-25 | ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving | Yinke Dong et.al. | 2403.16374 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-25 | MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Yasamin Medghalchi et.al. | 2403.16335 | link |
2024-03-24 | Social Deliberation vs. Social Contracts in Self-Governing Voluntary Organisations | Matthew Scott et.al. | 2403.16329 | null |
2024-03-24 | MRSch: Multi-Resource Scheduling for HPC | Boyang Li et.al. | 2403.16298 | link |
2024-03-24 | Engineering Safety Requirements for Autonomous Driving with Large Language Models | Ali Nouri et.al. | 2403.16289 | null |
2024-03-24 | Sample Empirical Likelihood Methods for Causal Inference | Jingyue Huang et.al. | 2403.16283 | null |
2024-03-24 | The Evolution of Football Betting- A Machine Learning Approach to Match Outcome Forecasting and Bookmaker Odds Estimation | Purnachandra Mandadapu et.al. | 2403.16282 | null |
2024-03-24 | Interference Management for Integrated Sensing and Communication Systems: A Survey | Yangyang Niu et.al. | 2403.16189 | null |
2024-03-24 | Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation | Manisha Natarajan et.al. | 2403.16178 | link |
2024-03-24 | Self-Supervised Multi-Frame Neural Scene Flow | Dongrui Liu et.al. | 2403.16116 | null |
2024-03-22 | Can large language models explore in-context? | Akshay Krishnamurthy et.al. | 2403.15371 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-03-22 | CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Nicolas Baumann et.al. | 2403.15313 | link |
2024-03-22 | Measuring Gender and Racial Biases in Large Language Models | Jiafu An et.al. | 2403.15281 | null |
2024-03-22 | IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin et.al. | 2403.15241 | link |
2024-03-22 | Robust optimization for adversarial learning with finite sample complexity guarantees | André Bertolace et.al. | 2403.15207 | null |
2024-03-22 | An Agent-Centric Perspective on Norm Enforcement and Sanctions | Elena Yan et.al. | 2403.15128 | link |
2024-03-22 | Learning from Visual Demonstrations through Differentiable Nonlinear MPC for Personalized Autonomous Driving | Flavia Sofia Acerbo et.al. | 2403.15102 | null |
2024-03-22 | End-to-End Mineral Exploration with Artificial Intelligence and Ambient Noise Tomography | Jack Muir et.al. | 2403.15095 | null |
2024-03-22 | Robust Conformal Prediction under Distribution Shift via Physics-Informed Structural Causal Model | Rui Xu et.al. | 2403.15025 | null |
2024-03-22 | Extracting Human Attention through Crowdsourced Patch Labeling | Minsuk Chang et.al. | 2403.15013 | null |
2024-03-22 | Tri-Perspective View Decomposition for Geometry-Aware Depth Completion | Zhiqiang Yan et.al. | 2403.15008 | null |
2024-03-22 | Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline | Shuhao Li et.al. | 2403.14941 | link |
2024-03-22 | A Stochastic Model-Based Control Methodology for Glycemic Management in the Intensive Care Unit | Melike Sirlanci et.al. | 2403.14934 | null |
2024-03-21 | Establishing a leader in a pairwise comparisons method | Jacek Szybowski et.al. | 2403.14885 | null |
2024-03-21 | Consensus formation in quality-sensitive interdependent agent systems | David March-Pons et.al. | 2403.14856 | null |
2024-03-21 | ReAct Meets ActRe: Autonomous Annotations of Agent Trajectories for Contrastive Self-Training | Zonghan Yang et.al. | 2403.14589 | null |
2024-03-21 | Physics-Based Causal Reasoning for Safe & Robust Next-Best Action Selection in Robot Manipulation Tasks | Ricardo Cannizzaro et.al. | 2403.14488 | null |
2024-03-21 | The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs) | Joschka Haltaufderheide et.al. | 2403.14473 | null |
2024-03-21 | SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Lizhe Liu et.al. | 2403.14366 | null |
2024-03-21 | Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives | Jiaxin Liu et.al. | 2403.14341 | null |
2024-03-21 | Investigating the validity of structure learning algorithms in identifying risk factors for intervention in patients with diabetes | Sheresh Zahoor et.al. | 2403.14327 | null |
2024-03-21 | UAV-Assisted Maritime Search and Rescue: A Holistic Approach | Martin Messmer et.al. | 2403.14281 | null |
2024-03-21 | Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation | Minqin Zhu et.al. | 2403.14232 | link |
2024-03-21 | MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation | Longzheng Wang et.al. | 2403.14171 | link |
2024-03-21 | Hypothesis-Driven Deep Learning for Out of Distribution Detection | Yasith Jayawardana et.al. | 2403.14058 | null |
2024-03-20 | Spatial Fairness: The Case for its Importance, Limitations of Existing Work, and Guidelines for Future Research | Nripsuta Ani Saxena et.al. | 2403.14040 | null |
2024-03-20 | Pricing-driven Development and Operation of SaaS : Challenges and Opportunities | Alejandro García-Fernández et.al. | 2403.14007 | null |
2024-03-20 | “This is not a data problem”: Algorithms and Power in Public Higher Education in Canada | Kelly McConvey et.al. | 2403.13969 | null |
2024-03-20 | Sequential Modeling of Complex Marine Navigation: Case Study on a Passenger Vessel (Student Abstract) | Yimeng Fan et.al. | 2403.13909 | link |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts | Guangzeng Han et.al. | 2403.13786 | link |
2024-03-20 | Towards Principled Representation Learning from Videos for Reinforcement Learning | Dipendra Misra et.al. | 2403.13765 | link |
2024-03-20 | Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Luca Giamattei et.al. | 2403.13729 | null |
2024-03-20 | Multimodal Variational Autoencoder for Low-cost Cardiac Hemodynamics Instability Detection | Mohammod N. I. Suvon et.al. | 2403.13658 | link |
2024-03-21 | Adversarial Attacks and Defenses in Automated Control Systems: A Comprehensive Benchmark | Vitaliy Pozdnyakov et.al. | 2403.13502 | link |
2024-03-20 | Uncertainty quantification for data-driven weather models | Christopher Bülte et.al. | 2403.13458 | link |
2024-03-20 | IndiTag: An Online Media Bias Analysis and Annotation System Using Fine-Grained Bias Indicators | Luyang Lin et.al. | 2403.13446 | link |
2024-03-21 | AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving | Xiaosong Jia et.al. | 2403.13331 | null |
2024-03-20 | AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting | Mengyu Yang et.al. | 2403.13282 | null |
2024-03-20 | Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations | Kewei Wang et.al. | 2403.13261 | link |
2024-03-20 | A Rule-Compliance Path Planner for Lane-Merge Scenarios Based on Responsibility-Sensitive Safety | Pengfei Lin et.al. | 2403.13251 | null |
2024-03-20 | Diffusion Model for Data-Driven Black-Box Optimization | Zihao Li et.al. | 2403.13219 | null |
2024-03-19 | Fast Value Tracking for Deep Reinforcement Learning | Frank Shih et.al. | 2403.13178 | null |
2024-03-19 | Interspecific dispersal constraints suppress pattern formation in metacommunities | Patrick Lawton et.al. | 2403.13098 | null |
2024-03-19 | Yell At Your Robot: Improving On-the-Fly from Language Corrections | Lucy Xiaoyang Shi et.al. | 2403.12910 | null |
2024-03-19 | Tighter Confidence Bounds for Sequential Kernel Regression | Hamish Flynn et.al. | 2403.12732 | null |
2024-03-19 | Deciphering AutoML Ensembles: cattleia’s Assistance in Decision-Making | Anna Kozak et.al. | 2403.12664 | null |
2024-03-19 | A Practical Guide to Statistical Distances for Evaluating Generative Models in Science | Sebastian Bischoff et.al. | 2403.12636 | link |
2024-03-19 | M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Dongyang Xu et.al. | 2403.12552 | null |
2024-03-19 | Embodied LLM Agents Learn to Cooperate in Organized Teams | Xudong Guo et.al. | 2403.12482 | link |
2024-03-19 | INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations | Lirui Luo et.al. | 2403.12451 | link |
2024-03-19 | On Predictive planning and counterfactual learning in active inference | Aswin Paul et.al. | 2403.12417 | link |
2024-03-19 | Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Kuang-Da Wang et.al. | 2403.12406 | link |
2024-03-19 | Hierarchical Digital Twin for Efficient 6G Network Orchestration via Adaptive Attribute Selection and Scalable Network Modeling | Pengyi Jia et.al. | 2403.12398 | null |
2024-03-18 | The Best of Many Robustness Criteria in Decision Making: Formulation and Application to Robust Pricing | Jerry Anunrojwong et.al. | 2403.12260 | null |
2024-03-18 | Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving | Shahin Atakishiyev et.al. | 2403.12176 | null |
2024-03-18 | HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Ce Zhang et.al. | 2403.12033 | link |
2024-03-18 | Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Da-Wei Zhou et.al. | 2403.12030 | link |
2024-03-18 | From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models | Kung-Hsiang Huang et.al. | 2403.12027 | link |
2024-03-18 | Supervised Fine-Tuning as Inverse Reinforcement Learning | Hao Sun et.al. | 2403.12017 | null |
2024-03-18 | Proposal of a general framework to categorize continuous predictor variables | Irantzu Barrio et.al. | 2403.11983 | null |
2024-03-18 | Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Christian Schlauch et.al. | 2403.11966 | null |
2024-03-18 | Probabilistic Calibration by Design for Neural Network Regression | Victor Dheur et.al. | 2403.11964 | link |
2024-03-18 | AI-Assisted Cervical Cancer Screening | Kanchan Poudel et.al. | 2403.11936 | null |
2024-03-18 | BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Jonas Schramm et.al. | 2403.11761 | link |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | Sensitivity Assessment of Multi-Criteria Decision-Making Methods in Chemical Engineering Optimization Applications | Seyed Reza Nabavi et.al. | 2403.11569 | null |
2024-03-18 | OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System | Chih-Chung Hsu et.al. | 2403.11536 | null |
2024-03-18 | State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Yuto Tanimoto et.al. | 2403.11520 | link |
2024-03-18 | SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications | Amira Guesmi et.al. | 2403.11515 | null |
2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | null |
2024-03-18 | LLM Guided Evolution - The Automation of Models Advancing Models | Clint Morris et.al. | 2403.11446 | link |
2024-03-18 | Demystifying Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making | Hanxi Wan et.al. | 2403.11432 | null |
2024-03-17 | Driving Style Alignment for LLM-powered Driver Agent | Ruoxuan Yang et.al. | 2403.11368 | link |
2024-03-17 | Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving | Matt Schmittle et.al. | 2403.11298 | null |
2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297 | null |
2024-03-17 | Barely Random Algorithms for Metrical Task Systems | Romain Cosson et.al. | 2403.11267 | null |
2024-03-17 | A learning-based solution approach to the application placement problem in mobile edge computing under uncertainty | Taha-Hossein Hejazi et.al. | 2403.11259 | null |
2024-03-17 | Learning-Based Pricing and Matching for Two-Sided Queues | Zixian Yang et.al. | 2403.11093 | null |
2024-03-17 | Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping | Haoxi Zhang et.al. | 2403.11073 | null |
2024-03-17 | Large Language Models Powered Context-aware Motion Prediction | Xiaoji Zheng et.al. | 2403.11057 | link |
2024-03-17 | JustQ: Automated Deployment of Fair and Accurate Quantum Neural Networks | Ruhan Wang et.al. | 2403.11048 | null |
2024-03-17 | From Pixels to Predictions: Spectrogram and Vision Transformer for Better Time Series Forecasting | Zhen Zeng et.al. | 2403.11047 | null |
2024-03-16 | Advancing multivariate time series similarity assessment: an integrated computational approach | Franck Tonle et.al. | 2403.11044 | null |
2024-03-15 | Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst? | Bruno de Melo et.al. | 2403.10482 | null |
2024-03-15 | Gradient based Feature Attribution in Explainable AI: A Technical Review | Yongjie Wang et.al. | 2403.10415 | null |
2024-03-15 | Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search | Hongyuan Yu et.al. | 2403.10413 | link |
2024-03-15 | Evaluating Perceptual Distances by Fitting Binomial Distributions to Two-Alternative Forced Choice Data | Alexander Hepburn et.al. | 2403.10390 | null |
2024-03-15 | Regret Minimization via Saddle Point Optimization | Johannes Kirschner et.al. | 2403.10379 | null |
2024-03-15 | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang et.al. | 2403.10353 | link |
2024-03-15 | Interactive Trimming against Evasive Online Data Manipulation Attacks: A Game-Theoretic Approach | Yue Fu et.al. | 2403.10313 | null |
2024-03-15 | Designing User-Centered Simulations of Leadership Situations for Cave Automatic Virtual Environments: Development and Usability Study | Francesco Vona et.al. | 2403.10312 | null |
2024-03-15 | A Multi-constraint and Multi-objective Allocation Model for Emergency Rescue in IoT Environment | Xinrun Xu et.al. | 2403.10299 | null |
2024-03-15 | The long-term and disparate impact of job loss on individual mobility behaviour | Simone Centellegher et.al. | 2403.10276 | null |
2024-03-15 | Interpretable Machine Learning for Survival Analysis | Sophie Hanna Langbein et.al. | 2403.10250 | link |
2024-03-15 | CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning | Yukun Li et.al. | 2403.10245 | link |
2024-03-15 | Explainability through uncertainty: Trustworthy decision-making with neural networks | Arthur Thuy et.al. | 2403.10168 | null |
2024-03-15 | RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception | Ruiyang Hao et.al. | 2403.10145 | link |
2024-03-15 | Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Hang Zhang et.al. | 2403.10107 | null |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | link |
2024-03-15 | Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation | Jingyi Xu et.al. | 2403.10001 | link |
2024-03-15 | Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries | Swetha Ganesh et.al. | 2403.09940 | null |
2024-03-14 | Reality Bites: Assessing the Realism of Driving Scenarios with Large Language Models | Jiahui Wu et.al. | 2403.09906 | link |
2024-03-14 | Robust Subgraph Learning by Monitoring Early Training Representations | Sepideh Neshatfar et.al. | 2403.09901 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Scalable Autonomous Drone Flight in the Forest with Visual-Inertial SLAM and Dense Submaps Built without LiDAR | Sebastián Barbas Laina et.al. | 2403.09596 | null |
2024-03-14 | Iterative Forgetting: Online Data Stream Regression Using Database-Inspired Adaptive Granulation | Niket Kathiriya et.al. | 2403.09588 | null |
2024-03-14 | Are you a robot? Detecting Autonomous Vehicles from Behavior Analysis | Fabio Maresca et.al. | 2403.09571 | null |
2024-03-14 | Characterization of Polarimetric Properties in Various Brain Tumor Types Using Wide-Field Imaging Mueller Polarimetry | Romane Gros et.al. | 2403.09561 | null |
2024-03-14 | “Are You Really Sure?” Understanding the Effects of Human Self-Confidence Calibration in AI-Assisted Decision Making | Shuai Ma et.al. | 2403.09552 | null |
2024-03-14 | On STPA for Distributed Development of Safe Autonomous Driving: An Interview Study | Ali Nouri et.al. | 2403.09509 | null |
2024-03-14 | An Industrial Experience Report about Challenges from Continuous Monitoring, Improvement, and Deployment for Autonomous Driving Features | Ali Nouri et.al. | 2403.09474 | null |
2024-03-14 | Exploring the Interplay of Intrinsic Fluctuation and Complexity in Intracellular Calcium Dynamics | Athokpam Langlen Chanu et.al. | 2403.09386 | null |
2024-03-14 | EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection | Jiaqing Zhang et.al. | 2403.09323 | link |
2024-03-14 | Generating Feasible and Plausible Counterfactual Explanations for Outcome Prediction of Business Processes | Alexander Stevens et.al. | 2403.09232 | link |
2024-03-14 | Unlocking the Potential of Open Government Data: Exploring the Strategic, Technical, and Application Perspectives of High-Value Datasets Opening in Taiwan | Hsien-Lee Tseng et.al. | 2403.09216 | null |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents | Yao Fu et.al. | 2403.08978 | null |
2024-03-13 | Managing Distributional Ambiguity in Stochastic Optimization through a Statistical Upper Bound Framework | Shixin Liu et.al. | 2403.08966 | null |
2024-03-13 | Language-based game theory in the age of artificial intelligence | Valerio Capraro et.al. | 2403.08944 | null |
2024-03-13 | FogGuard: guarding YOLO against fog using perceptual loss | Soheil Gharatappeh et.al. | 2403.08939 | link |
2024-03-13 | CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Chenbin Pan et.al. | 2403.08919 | null |
2024-03-13 | A Framework for Strategic Discovery of Credible Neural Network Surrogate Models under Uncertainty | Pratyush Kumar Singh et.al. | 2403.08901 | null |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
2024-03-13 | Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework | Jingling Li et.al. | 2403.08743 | null |
2024-03-13 | Optimal sub-Gaussian variance proxy for truncated Gaussian and exponential random variables | Mathias Barreto et.al. | 2403.08628 | null |
2024-03-13 | Towards a Privacy and Security-Aware Framework for Ethical AI: Guiding the Development and Assessment of AI Systems | Daria Korobenko et.al. | 2403.08624 | null |
2024-03-13 | Pig aggression classification using CNN, Transformers and Recurrent Networks | Junior Silva Souza et.al. | 2403.08528 | null |
2024-03-13 | IAMCV Multi-Scenario Vehicle Interaction Dataset | Novel Certad et.al. | 2403.08455 | null |
2024-03-13 | DeepCSHAP: Utilizing Shapley Values to Explain Deep Complex-Valued Neural Networks | Florian Eilers et.al. | 2403.08428 | null |
2024-03-13 | Causal Graph Neural Networks for Wildfire Danger Prediction | Shan Zhao et.al. | 2403.08414 | null |
2024-03-13 | LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Maonan Wang et.al. | 2403.08337 | link |
2024-03-13 | Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural Networks | Dhruv Toshniwal et.al. | 2403.08283 | null |
2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | null |
2024-03-13 | Can Large Language Models Identify Authorship? | Baixiang Huang et.al. | 2403.08213 | link |
2024-03-14 | Data Monetization Pathways and Complex Dynamic Game Equilibrium Analysis in the Energy Industry | Zongxian Wang et.al. | 2403.08082 | null |
2024-03-12 | What would Plato say? Concepts and notions from Greek philosophy applied to gamification mechanics for a meaningful and ethical gamification | Kostas Karpouzis et.al. | 2403.08041 | null |
2024-03-12 | A Review of Cybersecurity Incidents in the Food and Agriculture Sector | Ajay Kulkarni et.al. | 2403.08036 | null |
2024-03-12 | Supervised Time Series Classification for Anomaly Detection in Subsea Engineering | Ergys Çokaj et.al. | 2403.08013 | null |
2024-03-12 | When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis | Sahar Moradizeyveh et.al. | 2403.07834 | null |
2024-03-12 | FairRR: Pre-Processing for Group Fairness through Randomized Response | Xianli Zeng et.al. | 2403.07780 | link |
2024-03-12 | Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Carlos Jose Xavier Cruz et.al. | 2403.07769 | link |
2024-03-12 | Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception | Philipp Wolters et.al. | 2403.07746 | link |
2024-03-12 | Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs | Neel Kanwal et.al. | 2403.07743 | link |
2024-03-12 | DSEG-LIME - Improving Image Explanation by Hierarchical Data-Driven Segmentation | Patrick Knab et.al. | 2403.07733 | link |
2024-03-12 | A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Quoc-Vinh Lai-Dang et.al. | 2403.07542 | null |
2024-03-12 | Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | JunDa Cheng et.al. | 2403.07535 | link |
2024-03-12 | Spatiotemporal Representation Learning for Short and Long Medical Image Time Series | Chengzhi Shen et.al. | 2403.07513 | link |
2024-03-12 | Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Dipesh Tamboli et.al. | 2403.07309 | link |
2024-03-12 | Improved Algebraic Inverter Modelling for Four-Wire Power Flow Optimization | Rahmat Heidari et.al. | 2403.07285 | null |
2024-03-12 | Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Alexander Timans et.al. | 2403.07263 | link |
2024-03-12 | Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving | Adam Villaflor et.al. | 2403.07232 | null |
2024-03-11 | Bigraph Matching Weighted with Learnt Incentive Function for Multi-Robot Task Allocation | Steve Paul et.al. | 2403.07131 | null |
2024-03-11 | RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning | Raphael Trumpp et.al. | 2403.07129 | link |
2024-03-11 | Better than classical? The subtle art of benchmarking quantum machine learning models | Joseph Bowles et.al. | 2403.07059 | link |
2024-03-11 | Numerical simulation of individual coil placement – A proof-of-concept study for the prediction of recurrence after aneurysm coiling | Julian Schwarting et.al. | 2403.06889 | null |
2024-03-11 | Model Predictive Control Strategies for Electric Endurance Race Cars Accounting for Competitors Interactions | Jorn van Kampen et.al. | 2403.06885 | null |
2024-03-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | From Factor Models to Deep Learning: Machine Learning in Reshaping Empirical Asset Pricing | Junyi Ye et.al. | 2403.06779 | null |
2024-03-11 | Real-Time Multimodal Cognitive Assistant for Emergency Medical Services | Keshara Weerasinghe et.al. | 2403.06734 | link |
2024-03-11 | PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification | Mert Gulsen et.al. | 2403.06698 | link |
2024-03-11 | Maxitive functions with respect to general orders | M. Kupper et.al. | 2403.06613 | null |
2024-03-11 | Tactical Decision Making for Autonomous Trucks by Deep Reinforcement Learning with Total Cost of Operation Based Reward | Deepthi Pathare et.al. | 2403.06524 | null |
2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
2024-03-11 | CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Junda Wu et.al. | 2403.06447 | null |
2024-03-10 | LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Yun-Ang Wu et.al. | 2403.06230 | null |
2024-03-10 | IDEAS: Information-Driven EV Admission in Charging Station Considering User Impatience to Improve QoS and Station Utilization | Animesh Chattopadhyay et.al. | 2403.06223 | null |
2024-03-10 | TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision | Ruiwen Zhou et.al. | 2403.06221 | link |
2024-03-10 | On depth prediction for autonomous driving using self-supervised learning | Houssem Boulahbal et.al. | 2403.06194 | null |
2024-03-10 | Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving | Zhili Chen et.al. | 2403.06166 | null |
2024-03-10 | Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue | Jian Wang et.al. | 2403.06063 | link |
2024-03-09 | CarbonNet: How Computer Vision Plays a Role in Climate Change? Application: Learning Geomechanics from Subsurface Geometry of CCS to Mitigate Global Warming | Wei Chen et.al. | 2403.06025 | null |
2024-03-09 | End-to-end solution for linked open data query logs analytics | Dihia Lanasri et.al. | 2403.06016 | null |
2024-03-09 | Deep learning for multi-label classification of coral conditions in the Indo-Pacific via underwater photogrammetry | Xinlei Shao et.al. | 2403.05930 | link |
2024-03-09 | Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Zana Buçinca et.al. | 2403.05911 | null |
2024-03-08 | JointMotion: Joint Self-supervision for Joint Motion Prediction | Royden Wagner et.al. | 2403.05489 | link |
2024-03-08 | OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2403.05329 | null |
2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Jinyang Li et.al. | 2403.05307 | link |
2024-03-08 | Engineering consensus in static networks with unknown disruptors | Agathe Bouis et.al. | 2403.05272 | null |
2024-03-08 | Developing Federated Time-to-Event Scores Using Heterogeneous Real-World Survival Data | Siqi Li et.al. | 2403.05229 | link |
2024-03-08 | Interactive Perception for Deformable Object Manipulation | Zehang Weng et.al. | 2403.05177 | null |
2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
2024-03-08 | LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves | Jiayan Cao et.al. | 2403.05155 | null |
2024-03-08 | Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Ceyao Zhang et.al. | 2403.05149 | null |
2024-03-08 | DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception | Xiang Huang et.al. | 2403.05050 | null |
2024-03-08 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-07 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-07 | A Survey on Human-AI Teaming with Large Pre-Trained Models | Vanshika Vats et.al. | 2403.04931 | null |
2024-03-07 | Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values | Meng Qi et.al. | 2403.04753 | null |
2024-03-07 | A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures | Kensuke Nakamura et.al. | 2403.04745 | null |
2024-03-07 | Literature Review of Current Sustainability Assessment Frameworks and Approaches for Organizations | Sarah Farahdel et.al. | 2403.04717 | null |
2024-03-07 | End-to-end Conditional Robust Optimization | Abhilash Chenreddy et.al. | 2403.04670 | null |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | Cooperative Bayesian Optimization for Imperfect Agents | Ali Khoshvishkaie et.al. | 2403.04442 | null |
2024-03-07 | iTRPL: An Intelligent and Trusted RPL Protocol based on Multi-Agent Reinforcement Learning | Debasmita Dey et.al. | 2403.04416 | null |
2024-03-07 | Conjugate operators for transparent, explorable research outputs | Joseph Bond et.al. | 2403.04403 | null |
2024-03-07 | LitSim: Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-07 | Generalizing Cooperative Eco-driving via Multi-residual Task Learning | Vindula Jayawardana et.al. | 2403.04232 | null |
2024-03-07 | Incremental Bayesian Learning for Fail-Operational Control in Autonomous Driving | Lei Zheng et.al. | 2403.04143 | null |
2024-03-06 | Hitchhiker’s guide to cancer-associated lymphoid aggregates in histology images: manual and deep learning-based quantification approaches | Karina Silina et.al. | 2403.04142 | null |
2024-03-07 | Towards learning-based planning:The nuPlan benchmark for real-world autonomous driving | Napat Karnchanachari et.al. | 2403.04133 | null |
2024-03-07 | An Explainable AI Framework for Artificial Intelligence of Medical Things | Al Amin et.al. | 2403.04130 | null |
2024-03-06 | Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving | Riccardo Pieroni et.al. | 2403.04112 | null |
2024-03-06 | Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology | Omar S. M. El Nahhas et.al. | 2403.03891 | link |
2024-03-06 | Confidence-Aware Decision-Making and Control for Tool Selection | Ajith Anil Meera et.al. | 2403.03808 | null |
2024-03-06 | 3D Object Visibility Prediction in Autonomous Driving | Chuanyu Luo et.al. | 2403.03681 | null |
2024-03-06 | Learning Adversarial MDPs with Stochastic Hard Constraints | Francesco Emanuele Stradi et.al. | 2403.03672 | null |
2024-03-06 | Development and evaluation of Artificial Intelligence techniques for IoT data quality assessment and curation | Laura Martín et.al. | 2403.03661 | null |
2024-03-06 | A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation | Di Zhang et.al. | 2403.03643 | null |
2024-03-06 | Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving | He Li et.al. | 2403.03541 | null |
2024-03-06 | Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling | Nicholas Kunz et.al. | 2403.03489 | link |
2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
2024-03-06 | Human vs. Machine: Language Models and Wargames | Max Lamparth et.al. | 2403.03407 | link |
2024-03-05 | RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging | Jordan Poots et.al. | 2403.03359 | null |
2024-03-05 | Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement | Rafaela Martelo et.al. | 2403.03188 | link |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | link |
2024-03-05 | Deep-Learned Compression for Radio-Frequency Signal Classification | Armani Rodriguez et.al. | 2403.03150 | null |
2024-03-05 | Language Guided Exploration for RL Agents in Text Environments | Hitesh Golchha et.al. | 2403.03141 | null |
2024-03-05 | MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding | Chun-Peng Chang et.al. | 2403.03077 | link |
2024-03-05 | SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents | Zhitao He et.al. | 2403.02959 | link |
2024-03-05 | XAI-Based Detection of Adversarial Attacks on Deepfake Detectors | Ben Pinhasov et.al. | 2403.02955 | link |
2024-03-05 | User-Driven Adaptation: Tailoring Autonomous Driving Systems with Dynamic Preferences | Mingyue Zhang et.al. | 2403.02928 | null |
2024-03-05 | Risk-Constrained Community Battery Utilisation Optimisation for Electric Vehicle Charging with Photovoltaic Resources | Khalil Gholami et.al. | 2403.02927 | null |
2024-03-05 | Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization | Yuan Lin et.al. | 2403.02882 | null |
2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
2024-03-05 | HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative | Cong Ma et.al. | 2403.02640 | null |
2024-03-05 | World Models for Autonomous Driving: An Initial Survey | Yanchen Guan et.al. | 2403.02622 | null |
2024-03-05 | Deep Cooperation in ISAC System: Resource, Node and Infrastructure Perspectives | Zhiqing Wei et.al. | 2403.02565 | null |
2024-03-04 | MORBDD: Multiobjective Restricted Binary Decision Diagrams by Learning to Sparsify | Rahul Patel et.al. | 2403.02482 | null |
2024-03-04 | The Ink Splotch Effect: A Case Study on ChatGPT as a Co-Creative Game Designer | Asad Anjum et.al. | 2403.02454 | null |
2024-03-04 | Uncertainty-Aware Prediction and Application in Planning for Autonomous Driving: Definitions, Methods, and Comparison | Wenbo Shao et.al. | 2403.02297 | null |
2024-03-04 | Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling | Pål V. Johnsen et.al. | 2403.02150 | link |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-03 | Optimization decision model of vegetable stock and pricing based on TCN-Attention and genetic algorithm | Linhan Xia et.al. | 2403.01367 | null |
2024-03-02 | Summary Paper: Use Case on Building Collaborative Safe Autonomous Systems-A Robotdog for Guiding Visually Impaired People | Aman Malhotra et.al. | 2403.01286 | null |
2024-03-02 | Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey | Hamza Kheddar et.al. | 2403.01255 | null |
2024-03-02 | AcME-AD: Accelerated Model Explanations for Anomaly Detection | Valentina Zaccaria et.al. | 2403.01245 | null |
2024-03-02 | On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Kaituo Feng et.al. | 2403.01238 | link |
2024-03-02 | Results and Lessons Learned from Autonomous Driving Transportation Services in Airfield, Crowded Indoor, and Urban Environments | Doosan Baek et.al. | 2403.01233 | null |
2024-03-02 | Control of cascading failures using protective measures | Davood Fazli et.al. | 2403.01205 | null |
2024-03-01 | On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Awni Altabaa et.al. | 2403.00993 | null |
2024-03-01 | Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Ratio Analysis and Best-of-Both-Worlds | Shinji Ito et.al. | 2403.00715 | null |
2024-03-01 | Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents | Dominik Jeurissen et.al. | 2403.00690 | link |
2024-03-01 | Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change | Ruichen Xu et.al. | 2403.00446 | null |
2024-03-01 | MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes | Xiaqiang Tang et.al. | 2403.00353 | null |
2024-03-01 | Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode | Jinyang Jiang et.al. | 2403.00318 | null |
2024-03-01 | Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale | Emile Anand et.al. | 2403.00222 | null |
2024-02-29 | Identification of important nodes in the information propagation network based on the artificial intelligence method | Bin Yuan et.al. | 2403.00190 | null |
2024-02-29 | Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems | Zijie Huang et.al. | 2403.00178 | null |
2024-02-29 | Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence | Marios Constantinides et.al. | 2403.00148 | null |
2024-02-29 | ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Yifei Zhou et.al. | 2402.19446 | link |
2024-02-29 | Genie: Smart ROS-based Caching for Connected Autonomous Robots | Zexin Li et.al. | 2402.19410 | null |
2024-02-29 | Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction | Wenbo Shao et.al. | 2402.19385 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | DISCERN: Designing Decision Support Interfaces to Investigate the Complexities of Workplace Social Decision-Making With Line Managers | Pranav Khadpe et.al. | 2402.19318 | null |
2024-02-29 | T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition | Zhiyuan Yang et.al. | 2402.19264 | null |
2024-02-29 | A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Haicheng Liao et.al. | 2402.19251 | link |
2024-02-29 | Prediction of vaccination coverage level in the heterogeneous mixing population | Fan Bai et.al. | 2402.19190 | null |
2024-02-29 | MemoNav: Working Memory Model for Visual Navigation | Hongxin Li et.al. | 2402.19161 | link |
2024-02-29 | ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration | Angelo Caregnato-Neto et.al. | 2402.19128 | null |
2024-02-29 | CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Domenique Zipperling et.al. | 2402.19105 | link |
2024-02-29 | GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction | Ching-Lin Lee et.al. | 2402.19002 | null |
2024-02-29 | Applications of 0-1 Neural Networks in Prescription and Prediction | Vrishabh Patil et.al. | 2402.18851 | null |
2024-02-29 | A simple model of global cascades on random hypergraphs | Lei Chen et.al. | 2402.18850 | null |
2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
2024-02-29 | On the Decision-Making Abilities in Role-Playing using Large Language Models | Chenglei Shen et.al. | 2402.18807 | null |
2024-02-29 | Conjectural Online Learning with First-order Beliefs in Asymmetric Information Stochastic Games | Tao Li et.al. | 2402.18781 | null |
2024-02-29 | The Situate AI Guidebook: Co-Designing a Toolkit to Support Multi-Stakeholder Early-stage Deliberations Around Public Sector AI Proposals | Anna Kawakami et.al. | 2402.18774 | null |
2024-02-28 | A revision on Multi-Criteria Decision Making methods for Multi-UAV Mission Planning Support | Cristian Ramirez-Atencia et.al. | 2402.18743 | null |
2024-03-01 | RORA: Robust Free-Text Rationale Evaluation | Zhengping Jiang et.al. | 2402.18678 | link |
2024-02-28 | Approaching Human-Level Forecasting with Language Models | Danny Halawi et.al. | 2402.18563 | null |
2024-02-28 | Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture | Vaishali Swaminathan et.al. | 2402.18553 | null |
2024-02-28 | FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist | Wentao Zhang et.al. | 2402.18485 | null |
2024-02-28 | Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing | Mingfei Cheng et.al. | 2402.18393 | null |
2024-02-28 | Unveiling the Potential of Robustness in Evaluating Causal Inference Models | Yiyan Huang et.al. | 2402.18392 | link |
2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Jiacheng Lin et.al. | 2402.18302 | link |
2024-02-28 | PiShield: A NeSy Framework for Learning with Requirements | Mihaela Cătălina Stoian et.al. | 2402.18285 | link |
2024-02-28 | EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods | Huiyuan Xiong et.al. | 2402.18278 | null |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-02-28 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | Jian Liu et.al. | 2402.18140 | null |
2024-02-28 | DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Jianxiong Li et.al. | 2402.18137 | link |
2024-02-28 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-02-27 | ICAT: An Indoor Connected and Autonomous Testbed for Vehicle Computing | Zhaofeng Tian et.al. | 2402.17933 | null |
2024-02-27 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-02-27 | Public Goods Games in Disease Evolution and Spread | Christo Morison et.al. | 2402.17842 | null |
2024-02-27 | Personalizing Smart Home Privacy Protection With Individuals’ Regulatory Focus: Would You Preserve or Enhance Your Information Privacy? | Reza Ghaiumy Anaraky et.al. | 2402.17838 | null |
2024-02-27 | Federated Learning for Estimating Heterogeneous Treatment Effects | Disha Makhija et.al. | 2402.17705 | null |
2024-02-27 | Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-minimum Phase Systems | Fatemeh Tavakkoli et.al. | 2402.17703 | null |
2024-02-27 | Autonomous Vehicles: Evolution of Artificial Intelligence and Learning Algorithms | Sneha Sudhir Shetiya et.al. | 2402.17690 | null |
2024-02-27 | QoS prediction in radio vehicular environments via prior user information | Noor Ul Ain et.al. | 2402.17689 | null |
2024-02-27 | Multi-Agent Deep Reinforcement Learning for Distributed Satellite Routing | Federico Lozano-Cuadra et.al. | 2402.17666 | null |
2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
2024-02-27 | Comparison of the Effects of Interaction with Intentional Agent and Artificial Intelligence using fNIRS | Mohammad Ghalavand et.al. | 2402.17650 | null |
2024-02-27 | Chronicles of CI/CD: A Deep Dive into its Usage Over Time | Hugo da Gião et.al. | 2402.17588 | null |
2024-02-27 | An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains | George Eskandar et.al. | 2402.17562 | null |
2024-02-27 | Emergency Caching: Coded Caching-based Reliable Map Transmission in Emergency Networks | Zeyu Tian et.al. | 2402.17550 | null |
2024-02-27 | Highway Discretionary Lane-change Decision and Control Using Model Predictive Control | Zishun Zheng et.al. | 2402.17524 | null |
2024-02-27 | Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Zihao Liu et.al. | 2402.17430 | link |
2024-02-27 | Determinants of LLM-assisted Decision-Making | Eva Eigner et.al. | 2402.17385 | null |
2024-02-27 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
2024-02-27 | VCD: Knowledge Base Guided Visual Commonsense Discovery in Images | Xiangqing Shen et.al. | 2402.17213 | null |
2024-02-27 | Benchmarking Data Science Agents | Yuge Zhang et.al. | 2402.17168 | link |
2024-02-27 | Video as the New Language for Real-World Decision Making | Sherry Yang et.al. | 2402.17139 | null |
2024-02-27 | Deep Reinforcement Learning (DRL)-based Methods for Serverless Stream Processing Engines: A Vision, Architectural Elements, and Future Directions | Maria R. Read et.al. | 2402.17117 | null |
2024-02-26 | Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test | Kathy Jang et.al. | 2402.17050 | null |
2024-02-26 | Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao et.al. | 2402.16973 | null |
2024-02-26 | Trajectory Prediction for Autonomous Driving Using a Transformer Network | Zhenning Li et.al. | 2402.16501 | null |
2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | Jin Ding et.al. | 2402.16479 | null |
2024-02-26 | Learning to Schedule Online Tasks with Bandit Feedback | Yongxin Xu et.al. | 2402.16463 | null |
2024-02-26 | Contingency Planning Using Bi-level Markov Decision Processes for Space Missions | Somrita Banerjee et.al. | 2402.16342 | link |
2024-02-26 | Achieving $\tilde{O}(1/ε)$ Sample Complexity for Constrained Markov Decision Process | Jiashuo Jiang et.al. | 2402.16324 | null |
2024-02-26 | From Large Language Models and Optimization to Decision Optimization CoPilot: A Research Manifesto | Segev Wasserkrug et.al. | 2402.16269 | null |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | link |
2024-02-25 | How Can LLM Guide RL? A Value-Based Approach | Shenao Zhang et.al. | 2402.16181 | link |
2024-02-25 | From Concept to Implementation: Streamlining Sensor and Actuator Selection for Collaborative Design and Engineering of Interactive Systems | İhsan Ozan Yıldırım et.al. | 2402.16084 | null |
2024-02-25 | EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings | Sunjun Kweon et.al. | 2402.16040 | link |
2024-02-25 | Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving | Hanyi Yu et.al. | 2402.16036 | null |
2024-02-24 | Predicting Outcomes in Video Games with Long Short Term Memory Networks | Kittimate Chulajata et.al. | 2402.15923 | link |
2024-02-24 | Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning | Lunet Yifru et.al. | 2402.15893 | null |
2024-02-24 | Statistical Games | Jozsef Konczer et.al. | 2402.15892 | null |
2024-02-24 | NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | Jiazhao Zhang et.al. | 2402.15852 | null |
2024-02-24 | Multiple Instance Learning for Glioma Diagnosis using Hematoxylin and Eosin Whole Slide Images: An Indian cohort Study | Ekansh Chauhan et.al. | 2402.15832 | null |
2024-02-24 | Reward Design for Justifiable Sequential Decision-Making | Aleksa Sukovic et.al. | 2402.15826 | link |
2024-02-24 | Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data | Yong Wang et.al. | 2402.15796 | null |
2024-02-24 | Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space | Yuan Lin et.al. | 2402.15790 | null |
2024-02-24 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | Lingji Chen et.al. | 2402.15756 | null |
2024-02-23 | The Sample Average Approximation Method for Solving Two-Stage Stochastic Programs with Endogenous Uncertainty | Maria Bazotte et.al. | 2402.15486 | link |
2024-02-23 | Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Yiting Wang et.al. | 2402.15469 | null |
2024-02-23 | Information-Theoretic Safe Bayesian Optimization | Alessandro G. Bottero et.al. | 2402.15347 | null |
2024-02-23 | EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Zhe Wang et.al. | 2402.15272 | link |
2024-02-23 | Multi-Agent Collaboration Framework for Recommender Systems | Zhefan Wang et.al. | 2402.15235 | link |
2024-02-23 | Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization | Homayoun Honari et.al. | 2402.15197 | null |
2024-02-23 | Multi-Armed Bandits with Abstention | Junwen Yang et.al. | 2402.15127 | null |
2024-02-23 | Large Multimodal Agents: A Survey | Junlin Xie et.al. | 2402.15116 | null |
2024-02-22 | Practice Makes Perfect: Planning to Learn Skill Parameter Policies | Nishanth Kumar et.al. | 2402.15025 | null |
2024-02-22 | On the Performance of Empirical Risk Minimization with Smoothed Data | Adam Block et.al. | 2402.14987 | null |
2024-02-22 | Unsupervised Domain Adaptation within Deep Foundation Latent Spaces | Dmitry Kangin et.al. | 2402.14976 | null |
2024-02-22 | Path Planning based on 2D Object Bounding-box | Yanliang Huang et.al. | 2402.14933 | null |
2024-02-22 | Autonomy Oriented Digital Twins for Real2Sim2Real Autoware Deployment | Chinmay Vilas Samak et.al. | 2402.14739 | link |
2024-02-22 | Doing AI: Algorithmic decision support as a human activity | Joachim Meyer et.al. | 2402.14674 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-22 | Reframing the Expected Free Energy: Four Formulations and a Unification | Théophile Champion et.al. | 2402.14460 | null |
2024-02-22 | Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems | Christina Schenk et.al. | 2402.14446 | null |
2024-02-22 | Algorithm-agnostic significance testing in supervised learning with multimodal data | Lucas Kook et.al. | 2402.14416 | link |
2024-02-22 | Human-machine social systems | Milena Tsvetkova et.al. | 2402.14410 | null |
2024-02-22 | RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation | Changsong Pang et.al. | 2402.14380 | link |
2024-02-22 | We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity | Miao Xin et.al. | 2402.14299 | null |
2024-02-22 | Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models | Jinyi Liu et.al. | 2402.14245 | null |
2024-02-22 | BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Catherine Weaver et.al. | 2402.14194 | null |
2024-02-22 | Parking of Connected Automated Vehicles: Vehicle Control, Parking Assignment, and Multi-agent Simulation | Xu Shen et.al. | 2402.14183 | null |
2024-02-21 | Blending Data-Driven Priors in Dynamic Games | Justin Lidard et.al. | 2402.14174 | null |
2024-02-21 | Unveiling Crowdfunding Futures: Analyzing Campaign Outcomes through Distributed Models and Big Data Perspectives | Giuseppe Pipitò et.al. | 2402.14111 | null |
2024-02-21 | Social Environment Design | Edwin Zhang et.al. | 2402.14090 | link |
2024-02-21 | Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Lucas Lehnert et.al. | 2402.14083 | link |
2024-02-21 | Efficient Normalized Conformal Prediction and Uncertainty Quantification for Anti-Cancer Drug Sensitivity Prediction with Deep Regression Forests | Daniel Nolte et.al. | 2402.14080 | null |
2024-02-21 | Information Elicitation in Agency Games | Serena Wang et.al. | 2402.14005 | null |
2024-02-21 | Generative Probabilistic Time Series Forecasting and Applications in Grid Operations | Xinyi Wang et.al. | 2402.13870 | null |
2024-02-21 | Voice-Driven Mortality Prediction in Hospitalized Heart Failure Patients: A Machine Learning Approach Enhanced with Diagnostic Biomarkers | Nihat Ahmadli et.al. | 2402.13812 | null |
2024-02-21 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | link |
2024-02-21 | SaGE: Evaluating Moral Consistency in Large Language Models | Vamshi Krishna Bonagiri et.al. | 2402.13709 | link |
2024-02-21 | Analyizing the Conjunction Fallacy as a Fact | Tomas Veloz et.al. | 2402.13615 | null |
2024-02-21 | Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving | Mehdi Azarafza et.al. | 2402.13602 | link |
2024-02-21 | Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating | Yifan Yanggong et.al. | 2402.13582 | null |
2024-02-21 | EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization | Zhendong Xiao et.al. | 2402.13537 | null |
2024-02-21 | Best of Many in Both Worlds: Online Resource Allocation with Predictions under Unknown Arrival Model | Lin An et.al. | 2402.13530 | null |
2024-02-21 | Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning | Liu Weiwei et.al. | 2402.13481 | null |
2024-02-21 | A rational logit dynamic for decision-making under uncertainty: well-posedness, vanishing-noise limit, and numerical approximation | Hidekazu Yoshioka et.al. | 2402.13453 | null |
2024-02-21 | A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making | Chitra Subramanian et.al. | 2402.13440 | null |
2024-02-20 | Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers | Joshua F. Cooper et.al. | 2402.13380 | null |
2024-02-20 | Referee-Meta-Learning for Fast Adaptation of Locational Fairness | Weiye Chen et.al. | 2402.13379 | null |
2024-02-20 | VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Shaoyu Chen et.al. | 2402.13243 | link |
2024-02-20 | Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies | Ammar N. Abbas et.al. | 2402.13219 | link |
2024-02-20 | Testing Calibration in Subquadratic Time | Lunjia Hu et.al. | 2402.13187 | link |
2024-02-21 | What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents | Mingyu Jin et.al. | 2402.13184 | link |
2024-02-20 | 3D high-resolution imaging algorithm using 1D MIMO array for autonomous driving application | Sen Yuan et.al. | 2402.13062 | null |
2024-02-20 | Random Graph Set and Evidence Pattern Reasoning Model | Tianxiang Zhan et.al. | 2402.13058 | null |
2024-02-20 | Align Your Intents: Offline Imitation Learning via Optimal Transport | Maksim Bobrin et.al. | 2402.13037 | null |
2024-02-20 | Solving the decision-making analysis differential equation using eye fixation data in Unity software with Hermite Long-Short-Term Memory | Kourosh Parand et.al. | 2402.13027 | null |
2024-02-20 | Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey | Anju Rani et.al. | 2402.12923 | null |
2024-02-20 | MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Tianyu Zheng et.al. | 2402.12845 | link |
2024-02-20 | Are Large Language Models Rational Investors? | Yuhang Zhou et.al. | 2402.12713 | null |
2024-02-20 | XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques | Yu Xiong et.al. | 2402.12685 | link |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-20 | Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles | Dong Hu et.al. | 2402.12666 | null |
2024-02-20 | Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Runlong Zhou et.al. | 2402.12621 | link |
2024-02-20 | A System Development Kit for Big Data Applications on FPGA-based Clusters: The EVEREST Approach | Christian Pilato et.al. | 2402.12612 | null |
2024-02-19 | Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? | Nishant Balepur et.al. | 2402.12483 | link |
2024-02-19 | Multi-View Conformal Learning for Heterogeneous Sensor Fusion | Enrique Garcia-Ceja et.al. | 2402.12307 | link |
2024-02-19 | UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Chang Won Lee et.al. | 2402.12303 | link |
2024-02-19 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242 | link |
2024-02-19 | Towards AI-Based Precision Oncology: A Machine Learning Framework for Personalized Counterfactual Treatment Suggestions based on Multi-Omics Data | Manuel Schürch et.al. | 2402.12190 | null |
2024-02-19 | Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations | Dinh An Ngo et.al. | 2402.12179 | null |
2024-02-19 | Modified RRT* for Path Planning in Autonomous Driving | Sugirtha T et.al. | 2402.12129 | null |
2024-02-19 | Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization | Abhishek Kuriyal et.al. | 2402.12098 | link |
2024-02-19 | All Language Models Large and Small | Zhixun Chen et.al. | 2402.12061 | null |
2024-02-19 | Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenge | Daniel Jakab et.al. | 2402.12041 | null |
2024-02-19 | Analyzing the Impact of Design Factors on Solar Module Thermomechanical Durability Using Interpretable Machine Learning Techniques | Xin Chen et.al. | 2402.11911 | link |
2024-02-19 | Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic | Jeremy J. Lin et.al. | 2402.11866 | null |
2024-02-19 | UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction | Yuan Yuan et.al. | 2402.11838 | link |
2024-02-19 | SDGE: Stereo Guided Depth Estimation for 360° Camera Sets | Jialei Xu et.al. | 2402.11791 | null |
2024-02-19 | Statistical Test for Generated Hypotheses by Diffusion Models | Teruyuki Katsuoka et.al. | 2402.11789 | null |
2024-02-19 | MM-SurvNet: Deep Learning-Based Survival Risk Stratification in Breast Cancer Through Multimodal Data Fusion | Raktim Kumar Mondol et.al. | 2402.11788 | null |
2024-02-18 | A Note on Bias to Complete | Jia Xu et.al. | 2402.11710 | null |
2024-02-18 | Challenging the Black Box: A Comprehensive Evaluation of Attribution Maps of CNN Applications in Agriculture and Forestry | Lars Nieradzik et.al. | 2402.11670 | null |
2024-02-18 | Dynamic planning in hierarchical active inference | Matteo Priorelli et.al. | 2402.11658 | link |
2024-02-18 | Self-evolving Autoencoder Embedded Q-Network | J. Senthilnath et.al. | 2402.11604 | null |
2024-02-16 | Agent-based Simulation Evaluation of CBD Tolling: A Case Study from New York City | Qingnan Liang et.al. | 2402.10834 | null |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models | Hariram Veeramani et.al. | 2402.10772 | null |
2024-02-16 | RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction | Jingyi Gu et.al. | 2402.10760 | null |
2024-02-16 | Cloud Kitchen: Using Planning-based Composite AI to Optimize Food Delivery Process | Slavomír Švancár et.al. | 2402.10725 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699 | null |
2024-02-16 | Network Formation and Dynamics Among Multi-LLMs | Marios Papachristou et.al. | 2402.10659 | link |
2024-02-16 | Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks | Niall Taylor et.al. | 2402.10597 | null |
2024-02-16 | Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2402.10580 | null |
2024-02-16 | A novel integrated industrial approach with cobots in the age of industry 4.0 through conversational interaction and computer vision | Andrea Pazienza et.al. | 2402.10553 | null |
2024-02-16 | Quantifying Individual Risk for Binary Outcome: Bounds and Inference | Peng Wu et.al. | 2402.10537 | null |
2024-02-16 | PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem | Ruijie Zheng et.al. | 2402.10450 | link |
2024-02-16 | Barrier-Enhanced Homotopic Parallel Trajectory Optimization for Safety-Critical Autonomous Driving | Lei Zheng et.al. | 2402.10441 | null |
2024-02-16 | Explaining generative diffusion models via visual analysis for interpretable decision-making process | Ji-Hoon Park et.al. | 2402.10404 | link |
2024-02-15 | Thompson Sampling in Partially Observable Contextual Bandits | Hongju Park et.al. | 2402.10289 | null |
2024-02-15 | InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization | Zhengyang Hu et.al. | 2402.10158 | null |
2024-02-15 | Mitigating subjectivity and bias in AI development indices: A robust approach to redefining country rankings | Betania Silva C Campello et.al. | 2402.10122 | link |
2024-02-15 | Neural Network Approaches for Parameterized Optimal Control | Deepanshu Verma et.al. | 2402.10033 | null |
2024-02-15 | Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Quentin Gallouédec et.al. | 2402.09844 | link |
2024-02-15 | Less is more: Ensemble Learning for Retinal Disease Recognition Under Limited Resources | Jiahao Wang et.al. | 2402.09747 | null |
2024-02-15 | Exploiting Alpha Transparency In Language And Vision-Based AI Systems | David Noever et.al. | 2402.09671 | null |
2024-02-15 | Practitioners’ Challenges and Perceptions of CI Build Failure Predictions at Atlassian | Yang Hong et.al. | 2402.09651 | null |
2024-02-14 | Probabilistic Reasoning in Generative Large Language Models | Aliakbar Nafar et.al. | 2402.09614 | link |
2024-02-14 | LogicPrpBank: A Corpus for Logical Implication and Equivalence | Zhexiong Liu et.al. | 2402.09609 | null |
2024-02-14 | Pulmonologists-Level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach | Ricco Noel Hansen Flyckt et.al. | 2402.09596 | null |
2024-02-14 | Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems | Liang Zhang et.al. | 2402.09584 | null |
2024-02-14 | Rationality Report Cards: Assessing the Economic Rationality of Large Language Models | Narun Raman et.al. | 2402.09552 | null |
2024-02-14 | Dataset Clustering for Improved Offline Policy Learning | Qiang Wang et.al. | 2402.09550 | link |
2024-02-14 | How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments? | Congcong Wen et.al. | 2402.09546 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-14 | Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning | Michael Lanier et.al. | 2402.09290 | null |
2024-02-14 | Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification | Agus Hartoyo et.al. | 2402.09281 | null |
2024-02-14 | Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms | Michael Shaham et.al. | 2402.09233 | null |
2024-02-14 | BiasEye: A Bias-Aware Real-time Interactive Material Screening System for Impartial Candidate Assessment | Qianyu Liu et.al. | 2402.09148 | null |
2024-02-14 | Selective decision making and collective behavior of fish by the motion of visual attention | Susumu Ito et.al. | 2402.09073 | null |
2024-02-14 | Cross-Temporal Forecast Reconciliation at Digital Platforms with Machine Learning | Jeroen Rombouts et.al. | 2402.09033 | null |
2024-02-14 | Learning-enabled Flexible Job-shop Scheduling for Scalable Smart Manufacturing | Sihoon Moon et.al. | 2402.08979 | null |
2024-02-14 | Second Order Methods for Bandit Optimization and Control | Arun Suggala et.al. | 2402.08929 | null |
2024-02-14 | Inference for an Algorithmic Fairness-Accuracy Frontier | Yiqi Liu et.al. | 2402.08879 | null |
2024-02-13 | Intelligent Agricultural Management Considering N $_2$ O Emission and Climate Variability with Uncertainties | Zhaoan Wang et.al. | 2402.08832 | null |
2024-02-13 | An Adaptive System Architecture for Multimodal Intelligent Transportation Systems | Muhammad Farooq et.al. | 2402.08817 | null |
2024-02-13 | CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources | Sikha Pentyala et.al. | 2402.08614 | null |
2024-02-13 | Vehicle Behavior Prediction by Episodic-Memory Implanted NDT | Peining Shen et.al. | 2402.08423 | link |
2024-02-13 | LLMs and the Human Condition | Peter Wallis et.al. | 2402.08403 | null |
2024-02-13 | Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution | Tailin Wu et.al. | 2402.08383 | link |
2024-02-13 | The Duet of Representations and How Explanations Exacerbate It | Charles Wan et.al. | 2402.08379 | null |
2024-02-13 | Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring | Taira Tsuchiya et.al. | 2402.08321 | null |
2024-02-13 | Zero Trust Score-based Network-level Access Control in Enterprise Networks | Leonard Bradatsch et.al. | 2402.08299 | null |
2024-02-13 | A survey of recent methods for addressing AI fairness and bias in biomedicine | Yifan Yang et.al. | 2402.08250 | null |
2024-02-13 | Causal Learning for Trustworthy Recommender Systems: A Survey | Jin Li et.al. | 2402.08241 | null |
2024-02-13 | MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain | Xiaohe Li et.al. | 2402.08221 | null |
2024-02-13 | Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | Mandar Pitale et.al. | 2402.08208 | null |
2024-02-13 | Group Decision-Making among Privacy-Aware Agents | Marios Papachristou et.al. | 2402.08156 | null |
2024-02-13 | CMA-R:Causal Mediation Analysis for Explaining Rumour Detection | Lin Tian et.al. | 2402.08155 | link |
2024-02-13 | Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Rushang Karia et.al. | 2402.08145 | null |
2024-02-13 | Average-Case Analysis of Iterative Voting | Joshua Kavner et.al. | 2402.08144 | null |
2024-02-12 | Addressing cognitive bias in medical language models | Samuel Schmidgall et.al. | 2402.08113 | link |
2024-02-12 | From Data to Decisions: The Transformational Power of Machine Learning in Business Recommendations | Kapilya Gangadharan et.al. | 2402.08109 | null |
2024-02-12 | Auditing Work: Exploring the New York City algorithmic bias audit regime | Lara Groves et.al. | 2402.08101 | null |
2024-02-12 | MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning | Ayesha Siddika Nipu et.al. | 2402.07890 | null |
2024-02-12 | Distributed Anomaly Detection in Modern Power Systems: A Penalty-based Mitigation Approach | Erfan Mehdipour Abadi et.al. | 2402.07884 | null |
2024-02-12 | Retrieval-Augmented Thought Process as Sequential Decision Making | Thomas Pouplin et.al. | 2402.07812 | null |
2024-02-12 | From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration | Agathe Fernandes Machado et.al. | 2402.07790 | link |
2024-02-12 | TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Hui Liu et.al. | 2402.07776 | link |
2024-02-12 | Towards Unified Alignment Between Agents, Humans, and Environment | Zonghan Yang et.al. | 2402.07744 | null |
2024-02-12 | Task-conditioned adaptation of visual features in multi-task policy learning | Pierre Marza et.al. | 2402.07739 | null |
2024-02-12 | Interaction-Based Driving Scenario Classification and Labeling | Cheng Chang et.al. | 2402.07720 | null |
2024-02-12 | Online Sequential Decision-Making with Unknown Delays | Ping Wu et.al. | 2402.07703 | null |
2024-02-12 | AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Tanmoy Dam et.al. | 2402.07680 | link |
2024-02-12 | DART: A Compact Platform For Autonomous Driving Research | Lorenzo Lyons et.al. | 2402.07602 | null |
2024-02-12 | Unveiling Group-Specific Distributed Concept Drift: A Fairness Imperative in Federated Learning | Teresa Salazar et.al. | 2402.07586 | link |
2024-02-12 | Topological Safeguard for Evasion Attack based on the Interpretability of Artificial Neural Network Behavior | Xabier Echeberria-Barrio et.al. | 2402.07480 | null |
2024-02-12 | Auxiliary Reward Generation with Transition Distance Representation Learning | Siyuan Li et.al. | 2402.07412 | null |
2024-02-12 | Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support | Igor Svoboda et.al. | 2402.07404 | null |
2024-02-12 | Replicability is Asymptotically Free in Multi-armed Bandits | Junpei Komiyama et.al. | 2402.07391 | null |
2024-02-12 | Re-DiffiNet: Modeling discrepancies in tumor segmentation using diffusion | Tianyi Ren et.al. | 2402.07354 | link |
2024-02-12 | Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Kwang-Sung Jun et.al. | 2402.07341 | link |
2024-02-11 | Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets | Ross Greer et.al. | 2402.07320 | null |
2024-02-11 | Self-Consistent Conformal Prediction | Lars van der Laan et.al. | 2402.07307 | link |
2024-02-09 | What is Hiding in Medicine’s Dark Matter? Learning with Missing Data in Medical Practices | Neslihan Suzen et.al. | 2402.06563 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | An Exercise in Tournament Design: When Some Matches Must Be Scheduled | Sushmita Gupta et.al. | 2402.06538 | null |
2024-02-09 | Scalable Interactive Machine Learning for Future Command and Control | Anna Madison et.al. | 2402.06501 | null |
2024-02-09 | CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention | Yifeng Bai et.al. | 2402.06423 | null |
2024-02-09 | FD-Vision Mamba for Endoscopic Exposure Correction | Zhuoran Zheng et.al. | 2402.06378 | null |
2024-02-09 | High-Precision Geosteering via Reinforcement Learning and Particle Filters | Ressi Bonti Muhammad et.al. | 2402.06377 | null |
2024-02-09 | AI, Meet Human: Learning Paradigms for Hybrid Decision Making Systems | Clara Punzi et.al. | 2402.06287 | null |
2024-02-09 | Premier-TACO: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Ruijie Zheng et.al. | 2402.06187 | link |
2024-02-09 | United We Fall: On the Nash Equilibria of Multiplex and Multilayer Network Games | Raman Ebrahimi et.al. | 2402.06108 | null |
2024-02-08 | Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making | Scotty Black et.al. | 2402.06075 | null |
2024-02-08 | Aggregation of pairwise comparison matrices: A clustering approach | Kolos Csaba Ágoston et.al. | 2402.06061 | null |
2024-02-08 | Impact on Public Health Decision Making by Utilizing Big Data Without Domain Knowledge | Miao Zhang et.al. | 2402.06059 | null |
2024-02-08 | Intelligent Mode-switching Framework for Teleoperation | Burak Kizilkaya et.al. | 2402.06047 | null |
2024-02-08 | Optimizing Predictive AI in Physical Design Flows with Mini Pixel Batch Gradient Descent | Haoyu Yang et.al. | 2402.06034 | null |
2024-02-08 | Game-theoretic Counterfactual Explanation for Graph Neural Networks | Chirag Chhablani et.al. | 2402.06030 | null |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-02-08 | Understanding Social Immunity in Ants: A Markovian Approach to Collective Cleaning Strategies | Isabella Bueno et.al. | 2402.05924 | null |
2024-02-08 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-08 | Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | Giorgio Angelotti et.al. | 2402.05703 | null |
2024-02-08 | Stochastic COLREGs Evaluation for Safe Navigation under Uncertainty | Peter Nicholas Hansen et.al. | 2402.05662 | null |
2024-02-08 | Optimizing Delegation in Collaborative Human-AI Hybrid Teams | Andrew Fuchs et.al. | 2402.05605 | null |
2024-02-08 | Form-From: A Design Space of Social Media Systems | Amy X. Zhang et.al. | 2402.05388 | null |
2024-02-08 | Are We Asking the Right Questions?: Designing for Community Stakeholders’ Interactions with AI in Policing | MD Romael Haque et.al. | 2402.05348 | null |
2024-02-07 | Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Yuan Tian et.al. | 2402.05306 | link |
2024-02-07 | Safe Human-UAS Collaboration Abstraction | Hossein Rastgoftar et.al. | 2402.05277 | null |
2024-02-07 | Exploring Hierarchical Classification Performance for Time Series Data: Dissimilarity Measures and Classifier Comparisons | Celal Alagoz et.al. | 2402.05275 | null |
2024-02-07 | Adaptive Hypergraph Network for Trust Prediction | Rongwei Xu et.al. | 2402.05154 | link |
2024-02-07 | FlowPG: Action-constrained Policy Gradient with Normalizing Flows | Janaka Chathuranga Brahmanage et.al. | 2402.05149 | link |
2024-02-07 | Tuning the feedback controller gains is a simple way to improve autonomous driving performance | Wenyu Liang et.al. | 2402.05064 | null |
2024-02-07 | Conformal Monte Carlo Meta-learners for Predictive Inference of Individual Treatment Effects | Jef Jonkers et.al. | 2402.04906 | link |
2024-02-07 | Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy | Ruichu Cai et.al. | 2402.04869 | null |
2024-02-07 | Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach | Yang Cao et.al. | 2402.04865 | null |
2024-02-07 | Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game | Philipp Sadler et.al. | 2402.04824 | null |
2024-02-07 | Investigating Driving Interactions: A Robust Multi-Agent Simulation Framework for Autonomous Vehicles | Marc Kaufeld et.al. | 2402.04720 | link |
2024-02-07 | Large Language Models As Faithful Explainers | Yu-Neng Chuang et.al. | 2402.04678 | null |