arXiv antilibrary
FAQ
Q: What is this page about?
A: This is a collection of papers that I have saved on arXiv (with a few ProjectEuclid and JMLR additions) since 2012.
Q: Did you read all of these papers?
A: No. This is an antilibrary. I save papers that I find interesting or potentially useful in the future. Until now, I did not have a good way to traverse them, so I created this page to make it easier to browse through them.
Q: Why are there so many papers?
A: I am interested in everything and saved papers liberally.
Q: Since you have seen it all, why are you not the most clever person?
A: I have scanned widely, but, sometimes, I did not necessarily even read beyond the (vaguely interesting) abstract.
Q: What does this mean in the age of AI?
A: I don't know.
Q: Is this up-to-date?
A: Mostly yes. More recent papers will be added periodically (perhaps bi-annually), as I won't be exporting my entire antilibrary every time I save a new paper.
Q: Did you spend an insane amount of time preparing this page?
A: No. I scan papers in certain categories for years daily with an RSS reader (Feedly). "Save for Later" consists of just clicking a button. For many years, I couldn't go back due to the volume but I kept saving. After years, I managed to export the .html automatically, I vibe-coded a short Python script to clean it up. Papers older than 07/2013 were fetched separately (as I was using Google Reader which shut down in 2013).
Enjoy browsing.
Saved in 2025
- An interpretation of the Brownian bridge as a physics-informed prior for the Poisson equation
- NRGPT: An Energy-based Alternative for GPT
- A Unification of Discrete, Gaussian, and Simplicial Diffusion
- Wasserstein error bounds for aggregations of continuous-time Markov chains
- Machine Learning and Control: Foundations, Advances, and Perspectives
- Generative design of stabilizing controllers with diffusion models: the Youla approach
- On The Hidden Biases of Flow Matching Samplers
- Riemannian Stochastic Interpolants for Amorphous Particle Systems
- Efficient Monte-Carlo sampling of metastable systems using non-local collective variable updates
- Muon is Provably Faster with Momentum Variance Reduction
- Continuized Nesterov Acceleration for Non-Convex Optimization
- Learning without training: The implicit dynamics of in-context learning
- Curved representational Bregman divergences and their applications
- DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization
- Sample-Efficient Optimization over Generative Priors via Coarse Learnability
- Variational Continual Test-Time Adaptation
- SoFlow: Solution Flow Models for One-Step Generative Modeling
- Corrective Diffusion Language Models
- The Universal Property of Measure-Theoretic Probability
- Quantitative estimates: How well does the discrete Fourier transform approximate the Fourier transform on $\mathbb{R}$
- Scale-invariant Attention
- Convergence of Noise-Free Sampling Algorithms with Regularized Wasserstein Proximals
- Error Bounds and Optimal Schedules for Masked Diffusions with Factorized Approximations
- Natural Variational Annealing for Multimodal Optimization
- Bregman proximal gradient method for linear optimization under entropic constraints
- Historical Information Accelerates Decentralized Optimization: A Proximal Bundle Method
- Tractable Model for Tunable Non-Markovian Dynamics
- Code, Data and Media Associated with this Article
- Effective sample size approximations as entropy measures
- Viability Theory in the $1$-Wasserstein Space
- On the Design of One-step Diffusion via Shortcutting Flow Paths
- Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful
- Mathematics and Coding are Universal AI Benchmarks
- Geometric calculations on probability manifolds from reciprocal relations in Master equations
- Gradient descent avoids strict saddles with a simple line-search method too
- Two-scale integrators with high accuracy and long-time conservations for the nonlinear Klein-Gordon equation in the nonrelativistic limit regime
- Operator Splitting Methods for Numerical Solutions of Ordinary Differential Equations
- Polynomial Log-Marginals and Tweedie's Formula : When Is Bayes Possible?
- Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
- On fractal minimizers and potentials of occupation measures
- Multiple Scale Methods For Optimization Of Discretized Continuous Functions
- A preconditioned second-order convex splitting algorithm with extrapolation
- DAMA: A Unified Accelerated Approach for Decentralized Nonconvex Minimax Optimization-Part I: Algorithm Development and Results
- q-Analogue of Hamiltonian Monte Carlo method
- RiemannFormer: A Framework for Attention in Curved Spaces
- Learning to Integrate Diffusion ODEs by Averaging the Derivatives
- Universality of high-dimensional scaling limits of stochastic gradient descent
- Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences
- Flow-matching Operators for Residual-Augmented Probabilistic Learning of Partial Differential Equations
- Iterative Sampling Methods for Sinkhorn Distributionally Robust Optimization
- Unified Control for Inference-Time Guidance of Denoising Diffusion Models
- Evolving Deep Learning Optimizers
- Image Diffusion Preview with Consistency Solver
- Scalable Formal Verification via Autoencoder Latent Space Abstraction
- B\'ezierFlow: B\'ezier Stochastic Interpolant Schedulers for Few-Step Generation
- On the continuity of flows
- Exploring the Design Space of Transition Matching
- On a $T_1$ Transport inequality for the adapted Wasserstein distance
- Imprecise Markov Semigroups and their Ergodicity
- Random Reshuffling for Stochastic Gradient Langevin Dynamics
- Newton Methods for Mean Field Games: A Numerical Study
- Temporal parallelisation of continuous-time maximum-a-posteriori trajectory estimation
- Efficient Generation of Smooth Paths with Curvature Guarantees by Mollification
- Generative Stochastic Optimal Transport: Guided Harmonic Path-Integral Diffusion
- Adaptive Path Integral Diffusion: AdaPID
- Residual subspace evolution strategies for nonlinear inverse problems
- Nonstationary Distribution Estimation via Wasserstein Probability Flows
- Augmenting Iterative Trajectory for Bilevel Optimization: Methodology, Analysis and Extensions
- A gradient flow on control space with rough initial condition
- Collective Annealing by Switching Temperatures: a Boltzmann-type description
- Linear convergence of relocated fixed-point iterations
- Next-Generation Iterative Algorithms for Large-Scale Min-Max Optimization: Design and Analysis
- Fractional Calculus in Optimal Control and Game Theory: Theory, Numerics, and Applications -- A Survey
- How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
- Stochastics of shapes and Kunita flows
- Small-Gain Nash: Certified Contraction to Nash Equilibria in Differentiable Games
- Gamma-Convergence of Convex Functions, Conjugates, and Subdifferentials
- Entropic Optimal Transport Problem with Convex Functional Cost
- Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration
- PDF Download from null
- Distributional Shrinkage I: Universal Denoisers in Multi-Dimensions
- Trustworthy scientific inference with generative models
- Equivariant Test-Time Training with Operator Sketching for Imaging Inverse Problems
- Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning
- An efficient probabilistic hardware architecture for diffusion-like models
- An Eulerian Perspective on Straight-Line Sampling
- If generative AI is the answer, what is the question?
- Symmetry in Neural Network Parameter Spaces
- On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
- Nonasymptotic CLT and Error Bounds for Two-Time-Scale Stochastic Approximation
- An Elementary Proof of the Near Optimality of LogSumExp Smoothing
- On Learning-Curve Monotonicity for Maximum Likelihood Estimators
- Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants
- Scaling Behavior of Discrete Diffusion Language Models
- The Localization Method for High-Dimensional Inequalities
- Metric-driven numerical methods
- Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration
- Variational structures for the Fokker--Planck equation with general Dirichlet boundary conditions
- A Smooth Approximation Framework for Weakly Convex Optimization
- Extending Douglas-Rachford Splitting for Convex Optimization
- Moreau envelope and proximal-point methods under the lens of high-order regularization
- On the Convergence Analysis of an Inexact Preconditioned Stochastic Model-Based Algorithm
- Primal-dual splitting for structured composite monotone inclusions with or without cocoercivity
- Workflow is All You Need: Escaping the "Statistical Smoothing Trap" via High-Entropy Information Foraging and Adversarial Pacing
- FALCON: Few-step Accurate Likelihoods for Continuous Flows
- Learning Unmasking Policies for Diffusion Language Models
- Open interacting particle systems and Ising measures
- Disordered Gibbs measures and Gaussian conditioning
- Understanding temperature tuning in energy-based models
- A Taxonomy of Numerical Differentiation Methods
- Analysis of splitting schemes for stochastic evolution equations with non-Lipschitz nonlinearities driven by fractional noise
- Spectral Analysis of Diffusion Models with Application to Schedule Design
- Statistical Properties of Rectified Flow
- Estimation of Stochastic Optimal Transport Maps
- Distributional Shrinkage II: Optimal Transport Denoisers with Higher-Order Scores
- A Minimalist Optimizer Design for LLM Pretraining
- New Results on the Polyak Stepsize: Tight Convergence Analysis and Universal Function Classes
- The tangent space to the Wasserstein space: parallel transport and other applications
- A Benamou-Brenier Proximal Splitting Method for Constrained Unbalanced Optimal Transport
- Astral Space: Convex Analysis at Infinity
- Curse of Slicing: Why Sliced Mutual Information is a Deceptive Measure of Statistical Dependence
- Numerical integrators for confined Langevin dynamics
- On the convergence of adaptive approximations for stochastic differential equations
- Solving the Poisson equation using coupled Markov chains
- Provable Diffusion Posterior Sampling for Bayesian Inversion
- A Unifying Framework for Global Optimization: From Theory to Formalization
- Sliced Wasserstein distance between probability measures on Hilbert spaces
- On Solving Minimization and Min-Max Problems by First-Order Methods with Relative Error in Gradients
- Robust equilibria in continuous games: From strategic to dynamic robustness
- Worst-case generation via minimax optimization in Wasserstein space
- Life as a Categorical Information-Handling System: An Evolutionary Information-Theoretic Model of the Holobiont
- Generalized Probabilistic Approximate Optimization Algorithm
- FlowLPS: Langevin-Proximal Sampling for Flow-based Inverse Problem Solvers
- Small-Gain Nash: Certified Contraction to Nash Equilibria in Differentiable Games
- Pathway to $O(\sqrt{d})$ Complexity bound under Wasserstein metric of flow-based models
- How to simulate L\'evy flights in a steep potential: An explicit splitting numerical scheme
- The Optimal Approximation Factor in Density Estimation
- Long memory score-driven models as approximations for rough Ornstein-Uhlenbeck processes
- Non-asymptotic convergence bounds of modified EM schemes for non-dissipative SDEs
- Scale-free weak output synchronization of multi-agent systems with adaptive protocols
- A Variable Smoothing for Weakly Convex Composite Minimization with Manifold Constraint
- A Particle Algorithm for Mean-Field Variational Inference
- Optimizing Optimizers for Fast Gradient-Based Learning
- Latent Nonlinear Denoising Score Matching for Enhanced Learning of Structured Distributions
- A Latent Variable Framework for Scaling Laws in Large Language Models
- From Tail Universality to Bernstein-von Mises: A Unified Statistical Theory of Semi-Implicit Variational Inference
- Inverse problems for infinite-dimensional transport PDEs on Wasserstein space
- Optimal and Diffusion Transports in Machine Learning
- Optimal Preconditioning is a Geodesically Convex Optimization Problem
- A Survey on Diffusion Language Models
- Do you precondition on the left or on the right?
- Operator learning meets inverse problems: A probabilistic perspective
- Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms
- Theoretical guarantees for lifted samplers
- Diffusion Models are Molecular Dynamics Simulators
- Optimal Control of McKean--Vlasov Branching Diffusion Processes
- Convergence of Reflected Langevin Diffusion for Constrained Sampling
- Quantitative rigidity of the Wasserstein contraction under convolution
- Towards Continuous-Time Approximations for Stochastic Gradient Descent without Replacement
- Convergence rate of empirical measures in the subspace robust Wasserstein distance
- Global Regularity Estimates for Optimal Transport via Entropic Regularisation
- Learning to Solve Constrained Bilevel Control Co-Design Problems
- Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction
- Control Consistency Losses for Diffusion Bridges
- Towards a unified framework for guided diffusion models
- Deep Unfolding: Recent Developments, Theory, and Design Guidelines
- Optimal Transportation and Alignment Between Gaussian Measures
- Computing Equilibrium Points of Electrostatic Potentials
- Simpson variational integrator for nonlinear systems: a tutorial on the Lagrange top
- Symplectic methods for stochastic Hamiltonian systems: asymptotic error distributions and Hamiltonian-specific analysis
- When Heating Isn't Cooling in Reverse: Nos\'e-Hoover Thermostat Fluctuations from Equilibrium Symmetry to Nonequilibrium Asymmetry
- Gradient-free optimization via integration
- Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients
- Iterative Tilting for Diffusion Fine-Tuning
- Convergence of a class of gradient-free optimisation schemes when the objective function is noisy, irregular, or both
- Multiscale guidance of protein structure prediction with heterogeneous cryo-EM data
- Connecting Neural Models Latent Geometries with Relative Geodesic Representations
- In Search of Adam's Secret Sauce
- Testing Noise Assumptions of Learning Algorithms
- Formalization of Brownian motion in Lean
- SimpleFold: Folding Proteins is Simpler than You Think
- Error analysis of an acceleration corrected diffusion approximation of Langevin dynamics with background flow
- Phase Transitions as Emergent Geometric Phenomena: A Deterministic Entropy Evolution Law
- Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis
- Non-Asymptotic Convergence of Discrete Diffusion Models: Masked and Random Walk dynamics
- Efficient and Programmable Exploration of Synthesizable Chemical Space
- The Mean-Field Dynamics of Transformers
- Can a Higher Order Markov Chain Be Treated as a First Order Markov Chain?
- Convergence of Reflected Langevin Diffusion for Constrained Sampling
- A brief introduction to matrix hydrodynamics
- Error analysis of an acceleration corrected diffusion approximation of Langevin dynamics with background flow
- Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models
- A Sampling-Based Domain Generalization Study with Diffusion Generative Models
- On the Condition Number Dependency in Bilevel Optimization
- Test-time scaling of diffusions with flow maps
- Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
- Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium
- Wasserstein-{\L}ojasiewicz inequalities and asymptotics of McKean-Vlasov equation
- Covering-Space Normalizing Flows: Approximating Pushforwards on Lens Spaces
- Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms
- A boosted second-order convex splitting algorithm based on gradient flows
- Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity
- Convergence of Ray- and Pixel-Driven Discretization Frameworks in the Strong Operator Topology
- Generalization of Silver Stepsize Schedule to Stochastic Optimization
- Bayesian tit-for-tat fosters cooperation in evolutionary stochastic games
- Adam Simplified: Bias Correction Debunked
- Beyond Expectation: Concentration Inequalities for Randomized Iterative Methods
- Searching Latent Program Spaces
- Optimal transport maps, majorization, and log-subharmonic measures
- Darboux Transformation of Diffusion Processes
- Structured Continuity Equations in Fibred Wasserstein Spaces
- Formalization of Brownian motion in Lean
- Propagation of chaos in Fisher information
- Modified Equations for Stochastic Optimization
- Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
- Modified Equations for Stochastic Optimization
- Terminal Velocity Matching
- Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds
- Extending Douglas-Rachford Splitting for Convex Optimization
- Diffusion Models are Molecular Dynamics Simulators
- Solving a Research Problem in Mathematical Statistics with AI Assistance
- Time-uniform concentration bounds for iterative algorithms
- A Diffusion Model to Shrink Proteins While Maintaining Their Function
- Posterior Collapse as a Phase Transition in Variational Autoencoders
- Is Grokking a Computational Glass Relaxation?
- Equivariant Deep Equilibrium Models for Imaging Inverse Problems
- An operator splitting analysis of Wasserstein--Fisher--Rao gradient flows
- Analog Physical Systems Can Exhibit Double Descent
- Importance-Weighted Non-IID Sampling for Flow Matching Models
- CDLM: Consistency Diffusion Language Models For Faster Sampling
- On the Lipschitz properties of transportation along heat flows
- The Brownian transport map
- Machine learning on manifolds for inverse scattering: Lipschitz stability analysis
- When do World Models Successfully Learn Dynamical Systems?
- Ergodic control of McKean-Vlasov systems on the Wasserstein space
- Consensus Planning with Primal, Dual, and Proximal Agents
- Moduli space of optimization algorithms
- Iterative improvement of free energy landscape reconstructions with optimal protocols derived from differentiable simulations
- Thermodynamics + Natural Selection = Bayesian Inference
- Divide, Interact, Sample: The Two-System Paradigm
- Gradient flow for deep equilibrium single-index models
- Iterating marginalized Bayes maps for likelihood maximization with application to nonlinear panel models
- Evolution Strategies at the Hyperscale
- discretize_distributions: Efficient Quantization of Gaussian Mixtures with Guarantees in Wasserstein Distance
- Time dependent loss reweighting for flow matching and diffusion models is theoretically justified
- Global Convergence of Four-Layer Matrix Factorization under Random Initialization
- MAP Estimation with Denoisers: Convergence Rates and Guarantees
- Energy-based generator matching: A neural sampler for general state space
- QUASAR: An Evolutionary Algorithm to Accelerate High-Dimensional Optimization
- On the Convex Interpolation for Linear Operators
- Implicit Bias of the JKO Scheme
- Complex variational autoencoders admit K\"ahler structure
- Bringing Federated Learning to Space
- On the randomized Euler scheme for SDEs with integral-form drift
- Particle Monte Carlo methods for Lattice Field Theory
- The Sequential Nature of Science: Quantifying Learning from a Sequence of Studies
- Learning few-step posterior samplers by unfolding and distillation of diffusion models
- Improved Sample Complexity Bounds for Diffusion Model Training
- Generalized differentiation in Wasserstein space and application to multiagent control problem
- Wright--Fisher kernels: from linear to non-linear dynamics, ergodicity and McKean--Vlasov scaling limits
- An introduction to Coupling
- Time-causal and time-recursive wavelets
- Convex relaxation approaches for high dimensional optimal transport
- Gradient Flows of Potential Energies in the Geometry of Sinkhorn Divergences
- Energy Guided Geometric Flow Matching
- Functional Mean Flow in Hilbert Space
- Training Instabilities Induce Flatness Bias in Gradient Descent
- Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design
- Diffusion Models: A Mathematical Introduction
- Compact Schemes for $A^+B$, $A^+AB$ and $AA^+B$
- PID-controlled Langevin Dynamics for Faster Sampling of Generative Models
- Using Linearized Optimal Transport to Predict the Evolution of Stochastic Particle Systems
- On the Information Processing of One-Dimensional Wasserstein Distances with Finite Samples
- Convergence rate of randomized midpoint Langevin Monte Carlo
- Non-asymptotic Analysis of Poisson randomized midpoint Langevin Monte Carlo
- Bregman geometry-aware split Gibbs sampling for Bayesian Poisson inverse problems
- A Symplectic Analysis of Alternating Mirror Descent
- The iterates of FISTA convergence even under inexact computations
- On the Time Derivative of the KL Divergence for a Generalized Langevin Annealing Scheme
- Guaranteeing Higher Order Convergence Rates for Accelerated Wasserstein Gradient Flow Schemes
- Neural Local Wasserstein Regression
- Non-Convex Global Optimization as an Optimal Stabilization Problem: Dynamical Properties
- On the Moreau envelope properties of weakly convex functions
- (Adaptive) Scaled gradient methods beyond locally Holder smoothness: Lyapunov analysis, convergence rate and complexity
- Diffusion annealed Langevin dynamics: a theoretical study
- Computing Wasserstein Barycenters through Gradient Flows
- Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance
- Theory and computation for structured variational inference
- Bridging Constraints and Stochasticity: A Fully First-Order Method for Stochastic Bilevel Optimization with Linear Constraints
- Edit Flows: Flow Matching with Edit Operations
- Branching Flows: Discrete, Continuous, and Manifold Flow Matching with Splits and Deletions
- Controllable protein design through Feynman-Kac steering
- A kernel method for the learning of Wasserstein geometric flows
- Functional Adjoint Sampler: Scalable Sampling on Infinite Dimensional Spaces
- Proximal Oracles for Optimization and Sampling
- Gradient descent with adaptive stepsize converges (nearly) linearly under fourth-order growth
- Optimal Inference Schedules for Masked Diffusion Models
- The Evolving Nature of Latent Spaces: From GANs to Diffusion
- On the Convergence of Muon and Beyond
- The case for and against fixed step-size: Stochastic approximation algorithms in optimization and machine learning
- On the Kantorovich contraction of Markov semigroups
- Parallel Sampling via Autospeculation
- Schedulers for Schedule-free: Theoretically inspired hyperparameters
- Long-time behavior for discretization schemes of Fokker-Planck equations via couplings
- Fast Direct Solvers
- Uniqueness of DRS as the 2 Operator Resolvent-Splitting and Impossibility of 3 Operator Resolvent-Splitting
- A solvable model of learning generative diffusion: theory and insights
- Contact Wasserstein Geodesics for Non-Conservative Schrodinger Bridges
- Test-Time Iterative Error Correction for Efficient Diffusion Models
- Unveiling the Training Dynamics of ReLU Networks through a Linear Lens
- Effective Test-Time Scaling of Discrete Diffusion through Iterative Refinement
- A kernel method for the learning of Wasserstein geometric flows
- Adam symmetry theorem: characterization of the convergence of the stochastic Adam optimizer
- A PDE Perspective on Generative Diffusion Models
- A Course in Interacting Particle Systems
- Coarse-graining nonequilibrium diffusions with Markov chains
- Convexity and strict convexity for compositional neural networks in high-dimensional optimal control
- An efficient proximal algorithm for squared L1 over L2 regularized sparse recovery
- Piecewise deterministic sampling with splitting schemes
- Diffusion LLMs are Natural Adversaries for any LLM
- When Models Don't Collapse: On the Consistency of Iterative MLE
- Differentiable Generalized Sliced Wasserstein Plans
- Split Gibbs Discrete Diffusion Posterior Sampling
- Stability of the Kim--Milman flow map
- Fractional Diffusion Bridge Models
- Tracking solutions of time-varying variational inequalities
- Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models
- Forgetting is Everywhere
- Rates of Convergence of Maximum Smoothed Log-Likelihood Estimators for Semi-Parametric Multivariate Mixtures
- Mathematics with large language models as provers and verifiers
- Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables
- Measure-Theoretic Time-Delay Embedding
- How Memory in Optimization Algorithms Implicitly Modifies the Loss
- Understanding Adam Requires Better Rotation Dependent Assumptions
- ODE approximation for the Adam algorithm: General and overparametrized setting
- Learning Paths for Dynamic Measure Transport: A Control Perspective
- Functional central limit theorem for Euler--Maruyama scheme with decreasing step sizes
- Mean square error analysis of stochastic gradient and variance-reduced sampling algorithms
- Relative entropy estimate and geometric ergodicity for implicit Langevin Monte Carlo
- Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space
- Statistical Properties of Rectified Flow
- Training Optimal Large Diffusion Language Models
- On scalable and efficient training of diffusion samplers
- Nonlinear forward-backward-half forward splitting with momentum for monotone inclusions
- Diffusion Language Models are Super Data Learners
- FP-AbDiff: Improving Score-based Antibody Design by Capturing Nonequilibrium Dynamics through the Underlying Fokker-Planck Equation
- The Curved Spacetime of Transformer Architectures
- Uniform-in-time propagation of chaos for mean field Langevin dynamics
- Reversibility, covariance and coarse-graining for Langevin dynamics: On the choice of multiplicative noise
- Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold
- A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation
- A Review of Bilevel Optimization: Methods, Emerging Applications, and Recent Advancements
- Proximal gradient descent on the smoothed duality gap to solve saddle point problems
- Min-Max Optimization Is Strictly Easier Than Variational Inequalities
- The Evolution of Life as a Categorical Information-Handling Process
- Shifted Composition IV: Toward Ballistic Acceleration for Log-Concave Sampling
- Nonasymptotic Convergence Rates for Plug-and-Play Methods With MMSE Denoisers
- Diffusion LLMs are Natural Adversaries for any LLM
- Wasserstein Convergence of Critically Damped Langevin Diffusions
- Convergence of Random Batch Method with replacement for interacting particle systems
- The heat flow on glued spaces with varying dimension
- Piecewise deterministic sampling with splitting schemes
- Sharp inequalities between Zolotarev and Wasserstein distances in $\mathrm{P}_2(\mathbb{R}^d)$
- Concentration inequalities for strong laws and laws of the iterated logarithm
- Trade Execution Flow as the Underlying Source of Market Dynamics
- Particle system approximation of Nash equilibria in large games
- Accelerating Diffusion LLMs via Adaptive Parallel Decoding
- Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees
- Minimax-Optimal Two-Sample Test with Sliced Wasserstein
- Rivers under Noise
- Gradient flow structure for some nonlocal diffusion equations
- Gradient Flows as Optimal Controlled Evolutions: From Rn to Wasserstein product spaces
- A geometric framework for momentum-based optimizers for low-rank training
- Is Grokking a Computational Glass Relaxation?
- Curly Flow Matching for Learning Non-gradient Field Dynamics
- Posterior Sampling by Combining Diffusion Models with Annealed Langevin Dynamics
- Likely Interpolants of Generative Models
- Differentiable Programming for Differential Equations: A Review
- Tuning-Free Sampling via Optimization on the Space of Probability Measures
- Large-Time Analysis of the Langevin Dynamics for Energies Fulfilling Polyak-{\L}ojasiewicz Conditions
- Hidden convexity property of a speed planning problem
- Star Quasiconvexity: an Unified Approach for Linear Convergence of First-Order Methods Beyond Convexity
- Learning Firmly Nonexpansive Operators
- Provable Scaling Laws for the Test-Time Compute of Large Language Models
- Score-based constrained generative modeling via Langevin diffusions with boundary conditions
- Strong and weak quantitative estimates in slow-fast diffusions using filtering techniques
- Equations driven by fast-oscillating functions of an It\^o diffusion process
- Averaging principle for slow-fast systems of PDEs with rough drivers
- Nonlinear forward-backward-half forward splitting with momentum for monotone inclusions
- Interpolating between Optimal Transport and KL regularized Optimal Transport using R\'enyi Divergences
- On the Contractivity of Stochastic Interpolation Flow
- On the Anisotropy of Score-Based Generative Models
- Model-free filtering in high dimensions via projection and score-based diffusions
- Proximal Hamiltonian Monte Carlo
- Strong and weak quantitative estimates in slow-fast diffusions using filtering techniques
- Thermodynamic Cost of Random-Time Protocols
- Anytime-valid, Bayes-assisted, Prediction-Powered Inference
- Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
- Scaling can lead to compositional generalization
- Learning normalized image densities via dual score matching
- Structured Linear CDEs: Maximally Expressive and Parallel-in-Time Sequence Models
- Large Language Bayes
- Fisher meets Feynman: score-based variational inference with a product of experts
- Generalised Flow Maps for Few-Step Generative Modelling on Riemannian Manifolds
- Averaging principle for jump processes depending on fast ergodic dynamics
- Bean: A Language for Backward Error Analysis
- A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization
- From Information to Generative Exponent: Learning Rate Induces Phase Transitions in SGD
- Fisher meets Feynman: score-based variational inference with a product of experts
- A Geometric Analysis of PCA
- Exponential Convergence Guarantees for Iterative Markovian Fitting
- A Brenier Theorem on $(P_2 (...P_2(H)...), W_2 )$ and Applications to Adapted Transport
- Smoothing inequalities for transport metrics in compact spaces
- Convergence of Stochastic Gradient Langevin Dynamics in the Lazy Training Regime
- Near Optimality of Discrete-Time Approximations for Controlled McKean-Vlasov Diffusions and Interacting Particle Systems
- Near optimal sample complexity for matrix and tensor normal models via geodesic convexity
- On the Emergence of Linear Analogies in Word Embeddings
- Sampling from multi-modal distributions with polynomial query complexity in fixed dimension via reverse diffusion
- Sign-In to the Lottery: Reparameterizing Sparse Training From Scratch
- Beyond the Ideal: Analyzing the Inexact Muon Update
- Beneath the kinetic interpretation of noise
- Topics in Probability, Parametric Estimation and Stochastic Calculus
- On the Structure of Stationary Solutions to McKean-Vlasov Equations with Applications to Noisy Transformers
- Unfolding Generative Flows with Koopman Operators: Fast and Interpretable Sampling
- Heavy-Ball Momentum Method in Continuous Time and Discretization Error Analysis
- Position: Many generalization measures for deep learning are fragile
- Smoothed Distance Kernels for MMDs and Applications in Wasserstein Gradient Flows
- Well-posedness and propagation of chaos for McKean-Vlasov stochastic variational inequalities
- Sharp comparisons between sliced and standard $1$-Wasserstein distances
- A Nonparametric Bayesian Solution of the Empirical Stochastic Inverse Problem
- Particle system approximation of Nash equilibria in large games
- Exponential stability of finite-$N$ consensus-based optimization
- REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
- The Spacetime of Diffusion Models: An Information Geometry Perspective
- A Frequentist Statistical Introduction to Variational Inference, Autoencoders, and Diffusion Models
- Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
- Learning Boltzmann Generators via Constrained Mass Transport
- Latent Discrete Diffusion Models
- Demystifying Transition Matching: When and Why It Can Beat Flow Matching
- Equations driven by fast-oscillating functions of an It\^o diffusion process
- Variable-preconditioned transformed primal-dual method for generalized Wasserstein Gradient Flows
- Designing trajectories in the Earth-Moon system: a Levenberg-Marquardt approach
- Progressive Tempering Sampler with Diffusion
- Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
- Non-asymptotic error bounds for probability flow ODEs under weak log-concavity
- Inference-Time Compute Scaling For Flow Matching
- Score-based deterministic density sampling
- Free energy Wasserstein gradient flow and their particle counterparts: toy model, (degenerate) PL inequalities and exit times
- Free energy Wasserstein gradient flow and their particle counterparts: toy model, (degenerate) PL inequalities and exit times
- A unified framework for divergences, free energies, and Fokker-Planck equations
- Geometric Convergence Analysis of Variational Inference via Bregman Divergences
- Second order explicit stabilized multirate method for stiff differential equations with error control
- Efficient Single-Loop Stochastic Algorithms for Nonconvex-Concave Minimax Optimization
- Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
- Path Gradients after Flow Matching
- Near-Optimality of Contrastive Divergence Algorithms
- A coupling-based approach to f-divergences diagnostics for Markov chain Monte Carlo
- Non-convex entropic mean-field optimization via Best Response flow
- A Formalization of the Ionescu-Tulcea Theorem in Mathlib
- An Introduction to Sliced Optimal Transport
- Revisit First-order Methods for Geodesically Convex Optimization
- Cumulants, Moments and Selection: The Connection Between Evolution and Statistics
- Non-coupling from the past
- Flows and Diffusions on the Neural Manifold
- A Geometric Approach to Optimal Experimental Design
- Geometric optics approximation sampling: near-field case
- Enhancing Diffusion-Based Sampling with Molecular Collective Variables
- Y-shaped Generative Flows
- Sculpting Latent Spaces With MMD: Disentanglement With Programmable Priors
- Multi Timescale Stochastic Approximation: Stability and Convergence
- Contraction and entropy production in continuous-time Sinkhorn dynamics
- Forward Euler for Wasserstein Gradient Flows: Breakdown and Regularization
- Stochastic optimal transport for the Langevin dynamics and its zero--mass limit
- Convergence, design and training of continuous-time dropout as a random batch method
- Unstable optimal transport maps
- Approximation theory for 1-Lipschitz ResNets
- Reinforced sequential Monte Carlo for amortised sampling
- An Eulerian Perspective on Straight-Line Sampling
- Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
- The Algorithmic Regulator
- On Convolutions, Intrinsic Dimension, and Diffusion Models
- Mixing Times and Privacy Analysis for the Projected Langevin Algorithm under a Modulus of Continuity
- Measure estimation on a manifold explored by a diffusion process
- An Exploration of Non-Euclidean Gradient Descent: Muon and its Many Variants
- A mathematical theory for understanding when abstract representations emerge in neural networks
- A Statistical Physics perspective on fairness in shared expenses: The bar bill analogy
- A framework for the use of generative modelling in non-equilibrium statistical mechanics
- Deep Neural Networks Inspired by Differential Equations
- Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations
- Analysis of the Geometric Heat Flow Equation: Computing Geodesics in Real-Time with Convergence Guarantees
- Concentration Inequalities and UQ Bounds for Hypocoercive MCMC Samplers
- Fast Wasserstein rates for estimating probability distributions of probabilistic graphical models
- Generalized Probabilistic Approximate Optimization Algorithm
- Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives
- On the Interpolation Effect of Score Smoothing in Diffusion Models
- Conditional Flow Matching for Bayesian Posterior Inference
- Mirror Flow Matching with Heavy-Tailed Priors for Generative Modeling on Convex Domains
- Geodesic Calculus on Latent Spaces
- Convergence of optimizers implies eigenvalues filtering at equilibrium
- Learning Regularizers: Learning Optimizers that can Regularize
- On the Interpolation Effect of Score Smoothing in Diffusion Models
- Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives
- Optimal Stopping in Latent Diffusion Models
- Nested superposition principle for random measures and the geometry of the Wasserstein on Wasserstein space
- Markets for Models
- Rethinking Losses for Diffusion Bridge Samplers
- The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
- Ergodicity and error estimate of laws for a random splitting Langevin Monte Carlo
- Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design
- Diffusion at Absolute Zero: Langevin Sampling using Successive Moreau Envelopes [journal paper]
- An Inertial Langevin Algorithm
- An Inertial Langevin Algorithm
- Maximum Ideal Likelihood Estimation: A Unified Inference Framework for Latent Variable Models
- Data as Commodity: a Game-Theoretic Principle for Information Pricing
- Measures of Dependence based on Wasserstein distances
- Expected Free Energy-based Planning as Variational Inference
- Bilevel optimization for learning hyperparameters: Application to solving PDEs and inverse problems with Gaussian processes
- A Probabilistic Basis for Low-Rank Matrix Learning
- Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
- Carr\'e du champ flow matching: better quality-generalisation tradeoff in generative models
- Markov kernels in Mathlib's probability library
- Averaging principle for slow-fast fractional stochastic differential equations
- Attention as an Adaptive Filter
- How to build a consistency model: Learning flow maps via self-distillation
- Optimization via a Control-Centric Framework
- Evolution of social behaviors in noisy environments
- Curiosity-Driven Co-Development of Action and Language in Robots Through Self-Exploration
- Perspectives on Stochastic Localization
- Categorical Invariants of Learning Dynamics
- Wave-PDE Nets: Trainable Wave-Equation Layers as an Alternative to Attention
- How to Set $\beta_1, \beta_2$ in Adam: An Online Learning Perspective
- On amortizing convex conjugates for optimal transport
- Analysis of kinetic Langevin Monte Carlo under the stochastic exponential Euler discretization from underdamped all the way to overdamped
- Error estimates for deterministic empirical approximations of probability measures
- Dale meets Langevin: A Multiplicative Denoising Diffusion Model
- Effective continuous equations for adaptive SGD: a stochastic analysis view
- When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis
- Averaging principles and central limit theorems for multiscale McKean-Vlasov stochastic systems
- A coarse-graining theory for elliptic operators and homogenization in high contrast
- Understanding Transformer Architecture through Continuous Dynamics: A Partial Differential Equation Perspective
- When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis
- Consistency Models as Plug-and-Play Priors for Inverse Problems
- A multiscale analysis of mean-field transformers in the moderate interaction regime
- Electric Currents for Discrete Data Generation
- Two-Scale Latent Dynamics for Recurrent-Depth Transformers
- Localized Diffusion Models
- Concentration and moment inequalities for sums of independent heavy-tailed random matrices
- Zero variance self-normalized importance sampling via estimating equations
- A Bayesian Characterization of Ensemble Kalman Updates
- Poisson Midpoint Method for Log Concave Sampling: Beyond the Strong Error Lower Bounds
- A Physics-Inspired Optimizer: Velocity Regularized Adam
- Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models
- Error Feedback for Muon and Friends
- The Transformer Cookbook
- Flow Autoencoders are Effective Protein Tokenizers
- Learning Energy-based Variational Latent Prior for VAEs
- Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive
- Fisher information and trajectorial interpretation to the It\^o--Langevin relative entropy dissipation
- Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity
- Combining complex Langevin dynamics with score-based and energy-based diffusion models
- Modeling Others' Minds as Code
- Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models
- Sampling projections in the uniform norm
- Convex Order and Arbitrage
- Drop-Muon: Update Less, Converge Faster
- Zero variance self-normalized importance sampling via estimating equations
- Spectral gap of Metropolis-within-Gibbs under log-concavity
- Sharpness of Minima in Deep Matrix Factorization: Exact Expressions
- One-shot Conditional Sampling: MMD meets Nearest Neighbors
- When Langevin Monte Carlo Meets Randomization: Non-asymptotic Error Bounds beyond Log-Concavity and Gradient Lipschitzness
- Efficiently Escaping Saddle Points for Policy Optimization
- Sharpness of Minima in Deep Matrix Factorization: Exact Expressions
- A Single-Loop Gradient Algorithm for Pessimistic Bilevel Optimization via Smooth Approximation
- Non-Expansive Mappings in Two-Time-Scale Stochastic Approximation: Finite-Time Analysis
- Local SGD and Federated Averaging Through the Lens of Time Complexity
- Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
- Discretization Error of Fourier Neural Operators
- Mechanisms of Projective Composition of Diffusion Models
- Error Analysis of Discrete Flow with Generator Matching
- Effective continuous equations for adaptive SGD: a stochastic analysis view
- A Theoretical Analysis of Discrete Flow Matching Generative Models
- Transport Based Mean Flows for Generative Modeling
- Generation Properties of Stochastic Interpolation under Finite Training Set
- On the Complexity Theory of Masked Discrete Diffusion: From $\mathrm{poly}(1/\epsilon)$ to Nearly $\epsilon$-Free
- DistillKac: Few-Step Image Generation via Damped Wave Equations
- Are Hallucinations Bad Estimations?
- Effective continuous equations for adaptive SGD: a stochastic analysis view
- Slicing Wasserstein Over Wasserstein Via Functional Optimal Transport
- The Aldous$\unicode{x2013}$Hoover Theorem in Categorical Probability
- Fast Estimation of Wasserstein Distances via Regression on Sliced Wasserstein Distances
- Scaling Laws are Redundancy Laws
- Theoretical Bounds for Stable In-Context Learning
- Latent Twins
- Understanding Optimization in Deep Learning with Central Flows
- GradNetOT: Learning Optimal Transport Maps with GradNets
- Anchored Langevin Algorithms
- Singular jump processes as generalized gradient flows
- Propagation of Chaos in One-hidden-layer Neural Networks beyond Logarithmic Time
- Efficient Sliced Wasserstein Distance Computation via Adaptive Bayesian Optimization
- On the inadequacy of nudging data assimilation algorithms for non-dissipative systems
- Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization
- Compositional System Dynamics: The Higher Mathematics Underlying System Dynamics Diagrams & Practice
- On the Dynamics of Acceleration in First order Gradient Methods
- Kernel Variational Inference Flow for Nonlinear Filtering Problem
- Convergence, Duality and Well-Posedness in Convex Bilevel Optimization
- Sampling-Based Zero-Order Optimization Algorithms
- Understanding two-scale criteria for Poincar\'{e} and log-Sobolev inequalities in the Euclidean case through \Phi-entropies
- Absorb and Converge: Provable Convergence Guarantee for Absorbing Discrete Diffusion Models
- Averaging principle for two time-scale stochastic differential equations with fast component in noncompact space
- Audio Super-Resolution with Latent Bridge Models
- Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees
- Parallel Simulation for Log-concave Sampling and Score-based Diffusion Models
- Monte Carlo on a single sample
- A Computational Method for the Inverse Robin Problem with Convergence Rate
- Measure-to-measure interpolation using Transformers
- Vibrational Control of Complex Networks
- Global Optimization via Softmin Energy Minimization
- Hessian-guided Perturbed Wasserstein Gradient Flows for Escaping Saddle Points
- A Unified Theory of Exact Inference and Learning in Exponential Family Latent Variable Models
- Gradient-Free Sequential Bayesian Experimental Design via Interacting Particle Systems
- Optimal Experimental Design of a Moving Sensor for Linear Bayesian Inverse Problems
- Lindblad evolution as gradient flow
- Entropic balance with feedback control: information equalities and tight inequalities
- Gradient-Free Sequential Bayesian Experimental Design via Interacting Particle Systems
- Mixing properties of some Markov chains models in random environments
- Transient regime of piecewise deterministic Monte Carlo algorithms
- Statistical Methods in Generative AI
- Pre-training under infinite compute
- Dynamics from iterated averaging
- The Ensemble Kalman Update is an Empirical Matheron Update
- Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions
- Hamiltonian Descent Algorithms for Optimization: Accelerated Rates via Randomized Integration Time
- LiMuon: Light and Fast Muon Optimizer for Large Models
- Should We Relax Stability in Matching Markets?
- The Psychogenic Machine: Simulating AI Psychosis, Delusion Reinforcement and Harm Enablement in Large Language Models
- Nash Equilibria in Games with Playerwise Concave Coupling Constraints: Existence and Computation
- A Universal Banach--Bregman Framework for Stochastic Iterations: Unifying Stochastic Mirror Descent, Learning and LLM Training
- Masked Diffusion Models as Energy Minimization
- Stability of universal properties against perturbations of the Markov Chain Monte Carlo algorithm
- Concave tents: a new tool for constructing concave reformulations of a large class of nonconvex optimization problems
- On the Moreau envelope properties of weakly convex functions
- Absolute continuity, supports and idempotent splitting in categorical probability
- The Diffusion Duality
- A Monte Carlo Approach for Nonsmooth Convex Optimization via Proximal Splitting Algorithms
- Sliding motions on systems with non-Euclidean state spaces: A differential-geometric perspective
- Forward Euler for Wasserstein Gradient Flows: Breakdown and Regularization
- Learning Concave Bid Shading Strategies in Online Auctions via Measure-valued Proximal Optimization
- Contractive kinetic Langevin samplers beyond global Lipschitz continuity
- How to simulate L\'evy flights in a steep potential: An explicit splitting numerical scheme
- Distance Between Stochastic Linear Systems
- Global Optimization Algorithm through High-Resolution Sampling
- Lyapunov stability of the Euler method
- From PowerSGD to PowerSGD+: Low-Rank Gradient Compression for Distributed Optimization with Convergence Guarantees
- Compositionality in algorithms for smoothing
- Symplectic techniques for stochastic differential equations on reductive Lie groups with applications to Langevin diffusions
- Can one condition a killed random walk to survive?
- Entropy and Learning of Lipschitz Functions under Log-Concave Measures
- Input-Time Scaling
- The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design
- Repulsive Monte Carlo on the sphere for the sliced Wasserstein distance
- From the Gradient-Step Denoiser to the Proximal Denoiser and their associated convergent Plug-and-Play algorithms
- Multiscaling in Wasserstein Spaces
- Exact worst-case convergence rates for Douglas--Rachford and Davis--Yin splitting methods
- Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration
- Constrained Variational Inference via Safe Particle Flow
- Guided filtering and smoothing for infinite-dimensional diffusions
- Euler-type methods for Levy-driven McKean-Vlasov SDEs with super-linear coefficients: mean-square error analysis
- A hierarchical entropy method for the delocalization of bias in high-dimensional Langevin Monte Carlo
- Divide, Interact, Sample: The Two-System Paradigm
- Worst-case convergence analysis of relatively inexact gradient descent on smooth convex functions
- Convexity of Optimization Curves: Local Sharp Thresholds, Robustness Impossibility, and New Counterexamples
- Bregman Douglas-Rachford Splitting Method
- A transport approach to the cutoff phenomenon
- Nesterov acceleration for strongly convex-strongly concave bilinear saddle point problems: discrete and continuous-time approaches
- Finding a Multiple Follower Stackelberg Equilibrium: A Fully First-Order Method
- Matrix Completion in Group Testing: Bounds and Simulations
- Triplication: an important component of the modern scientific method
- Resolvent Compositions for Positive Linear Operators
- Flow-based generative models as iterative algorithms in probability space
- A relaxed version of Ryu's three-operator splitting method for structured nonconvex optimization
- Cryo-EM as a Stochastic Inverse Problem
- On the convergence rate of the Douglas-Rachford splitting algorithm
- Stochastic processes with multiple temporal scales: timescale separation and information
- On the quadratic barycentric transport problem
- Fantastic Pretraining Optimizers and Where to Find Them
- Probabilistic operator learning: generative modeling and uncertainty quantification for foundation models of differential equations
- Stability Analysis for Stochastic Hybrid Inclusions
- Euler-type approximation for the invariant measure: An abstract framework
- Improved sampling algorithms and Poincar\'e inequalities for non-log-concave distributions
- Straighter Flow Matching via a Diffusion-Based Coupling Prior
- Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology
- Energy-Weighted Flow Matching: Unlocking Continuous Normalizing Flows for Efficient and Scalable Boltzmann Sampling
- Attention as an Adaptive Filter
- An explicit splitting SAV scheme for the kinetic Langevin dynamics
- A dynamic view of some anomalous phenomena in SGD
- An overview of Koopman-based control: From error bounds to closed-loop guarantees
- The Information Dynamics of Generative Diffusion
- Scale-Adaptive Generative Flows for Multiscale Scientific Data
- Mathematical research with GPT-5: a Malliavin-Stein experiment
- AdaGrad Meets Muon: Adaptive Stepsizes for Orthogonal Updates
- From Image Denoisers to Regularizing Imaging Inverse Problems: An Overview
- Faster Gradient Methods for Highly-smooth Stochastic Bilevel Optimization
- On the Wasserstein median of probability measures
- A Markov Categorical Framework for Language Modeling
- Strong averaging principle for nonautonomous multi-scale SPDEs with fully local monotone and almost periodic coefficients
- Wasserstein Mirror Gradient Flow as the limit of the Sinkhorn Algorithm
- Fokker-Planck equation for stochastic heat equations
- Sampling, Diffusions, and Stochastic Localization
- Distribution estimation via Flow Matching with Lipschitz guarantees
- Entry Barriers in Content Markets
- Lipschitz-Guided Design of Interpolation Schedules in Generative Models
- Regime-Switching Langevin Monte Carlo Algorithms
- Differentiable Expectation-Maximisation and Applications to Gaussian Mixture Model Optimal Transport
- Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling
- Fantastic Pretraining Optimizers and Where to Find Them
- Bouncy particle sampler with infinite exchanging parallel tempering
- Feynman-Kac-Flow: Inference Steering of Conditional Flow Matching to an Energy-Tilted Posterior
- Composable Uncertainty in Symmetric Monoidal Categories for Design Problems (Extended Version)
- Continuously Tempered Diffusion Samplers
- A-FloPS: Accelerating Diffusion Sampling with Adaptive Flow Path Sampler
- Lipschitz-Guided Design of Interpolation Schedules in Generative Models
- Noise equals control
- Preconditioned Regularized Wasserstein Proximal Sampling
- On the Global Optimality of Linear Policies for Sinkhorn Distributionally Robust Linear Quadratic Control
- Optimal control of SDEs with merely measurable drift: an HJB approach
- Boundary Stabilization of a Bending and Twisting Wing by Linear Quadratic Gaussian Theory
- Bayesian Double Descent
- Constrained Optimization From a Control Perspective via Feedback Linearization
- Modern aspects of Markov chains: entropy, curvature and the cutoff phenomenon
- Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance
- Provable Benefits of In-Tool Learning for Large Language Models
- On the convergence of adaptive approximations for stochastic differential equations
- Understanding Incremental Learning with Closed-form Solution to Gradient Flow on Overparamerterized Matrix Factorization
- Operator learning meets inverse problems: A probabilistic perspective
- An introduction to large deviations with applications in physics
- Formal equivalence between global optimization consistency and random search
- A first-order condition for discrete-time distribution steering
- Revisit Stochastic Gradient Descent for Strongly Convex Objectives: Tight Uniform-in-Time Bounds
- Analysis and Synthesis Denoisers for Forward-Backward Plug-and-Play Algorithms
- Beyond Blur: A Fluid Perspective on Generative Diffusion Models
- Understanding Learning Dynamics Through Structured Representations
- Field Matching: an Electrostatic Paradigm to Generate and Transfer Data
- Occupied Processes: Going with the Flow
- Numerical Integration of stochastic differential equations: The Heun Algorithm Revisited and It\^o-Stratonovich Calculus
- Adaptive control mechanisms in gradient descent algorithms
- The Geometry of Constrained Optimization: Constrained Gradient Flows via Reparameterization: A-Stable Implicit Schemes, KKT from Stationarity, and Geometry-Respecting Algorithms
- Norm-Constrained Flows and Sign-Based Optimization: Theory and Algorithms
- On the Edge of Memorization in Diffusion Models
- Particle exchange Monte Carlo methods for eigenfunction and related nonlinear problems
- Introduction to Regularization and Learning Methods for Inverse Problems
- From Classical Probabilistic Latent Variable Models to Modern Generative AI: A Unified Perspective
- Finite Sample Bounds for Sequential Monte Carlo and Adaptive Path Selection Using the $L_2$ Norm
- Deep Learning for Markov Chains: Lyapunov Functions, Poisson's Equation, and Stationary Distributions
- Threshold Diffusions
- Strong averaging principle for nonautonomous multi-scale SPDEs with fully local monotone and almost periodic coefficients
- Existence of nonnegative mild solutions of stochastic evolution inclusions via weak topology
- Zeroth-Order Non-smooth Non-convex Optimization via Gaussian Smoothing
- Non-Euclidean Monotone Operator Theory and Applications
- Provable Mixed-Noise Learning with Flow-Matching
- The Statistical Fairness-Accuracy Frontier
- Generative diffusion posterior sampling for informative likelihoods
- Source-Guided Flow Matching
- Mirror, Mirror of the Flow: How Does Regularization Shape Implicit Bias?
- On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
- Underdamped Langevin MCMC with third order convergence
- Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
- Interpretability of linear regression models of glassy dynamics
- Multitask Learning with Stochastic Interpolants
- From Points to Spheres: A Geometric Reinterpretation of Variational Autoencoders
- Mean-Field Langevin Diffusions with Density-dependent Temperature
- Mirror Descent for Stochastic Control Problems with Measure-valued Controls
- Analysis of mean field games via Fokker-Planck-Kolmogorov equations: existence of equilibria
- Flow Matching at Scale: A Machine Learning Framework for Efficient Large-Size Sampling of Many-Body Systems
- Poisson Midpoint Method for Log Concave Sampling: Beyond the Strong Error Lower Bounds
- Singular Perturbations of Hamilton-Jacobi Equations in the Wasserstein Space
- Thermodynamic Cost of Random-Time Protocols
- High to low temperature: $O(N)$ model at large $N$
- Joint Learning of Energy-based Models and their Partition Function
- Joint Problems in Learning Multiple Dynamical Systems
- Langevin Monte-Carlo Provably Learns Depth Two Neural Nets at Any Size and Data
- Convergence of kinetic Langevin samplers for non-convex potentials
- Stability and performance guarantees for misspecified multivariate score-driven filters
- Learning from Samples: Inverse Problems over measures via Sharpened Fenchel-Young Losses
- How can we measure the information created by natural selection?
- An introduction to large deviations with applications in physics
- Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
- An Introduction to Sliced Optimal Transport
- Anderson Acceleration For Perturbed Newton Methods
- Constructive approximate transport maps with normalizing flows
- A class of generalized Nesterov's accelerated gradient method from dynamical perspective
- Can a One-Point Feedback Zeroth-order Algorithm Achieve Linear Dimension Dependent Sample Complexity?
- When can in-context learning generalize out of task distribution?
- A Kac system interacting with two heat reservoirs
- Zeroth-Order Non-smooth Non-convex Optimization via Gaussian Smoothing
- Robust Convolution Neural ODEs via Contractivity-promoting regularization
- MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control
- Learning Classifiers That Induce Markets
- Rollout Roulette: A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods
- Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-Tailed
- A Survey on Diffusion Language Models
- Energy-Based Models for Predicting Mutational Effects on Proteins
- Variance-Reduced Fast Operator Splitting Methods for Generalized Equations
- Solving multiscale dynamical systems by deep learning
- Optimal Transport on Lie Group Orbits
- The Market Effects of Algorithms
- A microscopically reversible kinetic theory of flocking
- Law of the Iterated Logarithm for Markov Semigroups with Exponential Mixing in the Wasserstein Distance
- On Experiments
- On Irreversibility and Stochastic Systems: Part One
- Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
- Finite-Time Convergence Analysis of ODE-based Generative Models for Stochastic Interpolants
- Kinetic Optimal Transport (OTIKIN) -- Part 1: Second-Order Discrepancies Between Probability Measures
- Propagation of weak log-concavity along generalised heat flows via Hamilton-Jacobi equations
- When and how can inexact generative models still sample from the data manifold?
- Gaussian Approximation for Two-Timescale Linear Stochastic Approximation
- What is emergence, after all?
- Optimization of a Nonlinear Acoustics -- Structure Interaction Model
- Time Scaling Makes Accelerated Gradient Flow and Proximal Method Faster in Multiobjective Optimization
- The Ensemble Kalman Update is an Empirical Matheron Update
- Reverse Diffusion Sequential Monte Carlo Samplers
- Improving sampling by modifying the effective diffusion
- A Linear Differential Inclusion for Contraction Analysis to Known Trajectories
- Proximal optimal transport divergences
- A solvable generative model with a linear, one-step denoiser
- Almost-Surely Convergent Randomly Activated Monotone Operator Splitting Methods
- Bernoulli-LoRA: A Theoretical Framework for Randomized Low-Rank Adaptation
- Potential Score Matching: Debiasing Molecular Structure Sampling with Potential Energy Guidance
- Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance
- On sliced Cram\'er metrics
- Polynomial complexity sampling from multimodal distributions using Sequential Monte Carlo
- A First-order Generative Bilevel Optimization Framework for Diffusion Models
- Stochastic thermodynamics of computation
- Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Categorical probability spaces, ergodic decompositions, and transitions to equilibrium
- Optimality of empirical measures as quantizers
- Stochastic Optimal Control via Measure Relaxations
- Better Embeddings with Coupled Adam
- Matrix Decomposition and Applications
- FMPlug: Plug-In Foundation Flow-Matching Priors for Inverse Problems
- PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective
- Single-loop methods for bilevel parameter learning in inverse imaging
- A Novel Sliced Fused Gromov-Wasserstein Distance
- Sliced Optimal Transport Plans
- Agentic Information Theory: Ergodicity and Intrinsic Semantics of Information Processes
- Fast Sampling of Protein Conformational Dynamics
- Categorical Lyapunov Theory I: Stability of Flows
- Generalized Optimal Transport
- Variational Inference Optimized Using the Curved Geometry of Coupled Free Energy
- Categorical Schr\"odinger Bridge Matching
- The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
- Can sparse autoencoders make sense of gene expression latent variable models?
- Testing the spin-bath view of self-attention: A Hamiltonian analysis of GPT-2 Transformer
- A global Lipschitz stability perspective for understanding approximate approaches in Bayesian sequential learning
- State evolution beyond first-order methods I: Rigorous predictions and finite-sample guarantees
- Mean-Field Langevin Diffusions with Density-dependent Temperature
- Numerical Design of Optimized First-Order Algorithms
- A general perspective on CBO methods with stochastic rate of information
- From Conditional to Unconditional Independence: Testing Conditional Independence via Transport Maps
- A Markov Categorical Framework for Language Modeling
- Generalized Yosida Approximation and Multi-Valued Stochastic Evolution Inclusions
- On a $T_1$ Transport inequality for the adapted Wasserstein distance
- An inexact alternating projection method with application to matrix completion
- Stability of Wasserstein projections in convex order via metric extrapolation
- Relaxed and inertial nonlinear Forward-Backward algorithm
- Jacobi Hamiltonian Integrators
- Zeroth-order log-concave sampling
- The Role of the Time-Dependent Hessian in High-Dimensional Optimization
- On the Lipschitz Constant of Deep Networks and Double Descent
- Probabilistic Graphical Models: A Concise Tutorial
- Time Deep Gradient Flow Method for pricing American options
- Physical models realizing the transformer architecture of large language models
- Pre-Training LLMs on a budget: A comparison of three optimizers
- Rapid Mixing at the Uniqueness Threshold
- Protein folding classes -- High-dimensional geometry of amino acid composition space revisited
- Efficient design of rna sequences with desired properties, structure, and motifs using a grammar variational autoencoder
- Bridged Posterior: Optimization, Profile Likelihood and a New Approach to Generalized Bayes
- A Distributional View of High Dimensional Optimization
- ReDi: Rectified Discrete Flow
- The Intrinsic Riemannian Proximal Gradient Method for Convex Optimization
- Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
- Deep RL Dual Sourcing Inventory Management with Supply and Capacity Risk Awareness
- Telegrapher's Generative Model via Kac Flows
- Concentration inequalities for log-concave sequences
- Global Regularity Estimates for Optimal Transport via Entropic Regularisation
- MAP Estimation with Denoisers: Convergence Rates and Guarantees
- On an Abstraction of Lyapunov and Lagrange Stability
- Sampling Decisions
- Harnessing higher-dimensional fluctuations in an information engine
- Sharper Exponential Convergence Rates for Sinkhorn's Algorithm in Continuous Settings
- Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models
- On Accelerated Mixing of the No-U-turn Sampler
- Analysis of Langevin midpoint methods using an anticipative Girsanov theorem
- On the Effectiveness of the z-Transform Method in Quadratic Optimization
- Geometric Stability Analysis for Differential Inclusions Governed by Maximally Monotone Operators
- On the Statistical Properties of Generative Adversarial Models for Low Intrinsic Data Dimension
- A Survey of Deep Learning for Geometry Problem Solving
- Linearization-Based Feedback Stabilization of McKean-Vlasov PDEs
- Acceleration methods for fixed point iterations
- Model averaging in the space of probability distributions
- A Complete Loss Landscape Analysis of Regularized Deep Matrix Factorization
- A Unified View on Learning Unnormalized Distributions via Noise-Contrastive Estimation
- A General Framework for Inference-time Scaling and Steering of Diffusion Models
- BiLO: Bilevel Local Operator Learning for PDE Inverse Problems. Part I: PDE-Constrained Optimization
- Geometry, Computation, and Optimality in Stochastic Optimization
- Convergence of drift-diffusion PDEs arising as Wasserstein gradient flows of convex functions
- From Kinetic Theory to AI: a Rediscovery of High-Dimensional Divergences and Their Properties
- Improved sampling algorithms and Poincar\'e inequalities for non-log-concave distributions
- Frank-Wolfe Recursions for the Emergency Response Problem on Measure Spaces
- CoVAE: Consistency Training of Variational Autoencoders
- Learning Diffusion Models with Flexible Representation Guidance
- Theory-Informed Improvements to Classifier-Free Guidance for Discrete Diffusion Models
- Beyond Scores: Proximal Diffusion Models
- Revisiting Convergence: Shuffling Complexity Beyond Lipschitz Smoothness
- Convergence of non-reversible Markov processes via lifting and flow Poincar\'e inequality
- L\'evy Langevin Monte Carlo for sampling from heavy-tailed target distributions
- Convergence Rate of the Solution of Multi-marginal Schrodinger Bridge Problem with Marginal Constraints from SDEs
- Gromov-Wasserstein Barycenters: The Analysis Problem
- Large Deviations for Empirical Measures of Self-Interacting Markov Chains
- Solving Monge problem by Hilbert space embeddings of probability measures
- On the Uniform Convergence of Subdifferentials in Stochastic Optimization and Learning
- From Controllability to Information: A Unified Analysis via Gramian, Minimum Energy, Fisher Information Matrix and Entropy
- Amortized Posterior Sampling with Diffusion Prior Distillation
- Open Materials Generation with Stochastic Interpolants
- Field Matching: an Electrostatic Paradigm to Generate and Transfer Data
- Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets
- Review, Remask, Refine (R3): Process-Guided Block Diffusion for Text Generation
- Unraveling the Potential of Diffusion Models in Small Molecule Generation
- Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling
- Splitting Regularized Wasserstein Proximal Algorithms for Nonsmooth Sampling Problems
- Local Flow Matching Generative Models
- Bayesian Double Descent
- Learning to control non-equilibrium dynamics using local imperfect gradients
- A statistical physics framework for optimal learning
- Pullback Flow Matching on Data Manifolds
- Guided filtering and smoothing for infinite-dimensional diffusions
- Parameter estimation in interacting particle systems on dynamic random networks
- Optimal closed-loop control of active particles and a minimal information engine
- Multilevel Bregman Proximal Gradient Descent
- Proximal Oracles for Optimization and Sampling
- Random Walks with Tweedie: A Unified View of Score-Based Diffusion Models
- Navigating Sparse Molecular Data with Stein Diffusion Guidance
- Simple Convergence Proof of Adam From a Sign-like Descent Perspective
- Iterative Importance Fine-tuning of Diffusion Models
- A Malliavin calculus approach to score functions in diffusion generative models
- Nonstationary Distribution Estimation via Wasserstein Probability Flows
- Computer-aided analyses of stochastic first-order methods, via interpolation conditions for stochastic optimization
- Why is it easier to predict the epidemic curve than to reconstruct the underlying contact network?
- The surrogate Gibbs-posterior of a corrected stochastic MALA: Towards uncertainty quantification for neural networks
- Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo
- Kinetic Langevin Diffusion for Crystalline Materials Generation
- A note on the unique properties of the Kullback--Leibler divergence for sampling via gradient flows
- Implicit Regularisation in Diffusion Models: An Algorithm-Dependent Generalisation Analysis
- The Difference between the Left and Right Invariant Extended Kalman Filter
- On the Dynamics of Control
- On the Mathematical Impossibility of Safe Universal Approximators
- Nested importance sampling for Bayesian inference: error bounds and the role of dimension
- On Global and Local Convergence of Iterative Linear Quadratic Optimization Algorithms for Discrete Time Nonlinear Control
- Iterative Linear Quadratic Optimization for Nonlinear Control: Differentiable Programming Algorithmic Templates
- Exploring the Design Space of Diffusion Bridge Models
- Learning few-step posterior samplers by unfolding and distillation of diffusion models
- Guided Generation for Developable Antibodies
- Affine Gateaux Differentials and the von Mises Statistical Calculus
- Analysis of Muon's Convergence and Critical Batch Size
- Efficiently Vectorized MCMC on Modern Accelerators
- Entropic optimal transport beyond product reference couplings: the Gaussian case on Euclidean space
- Asymptotic convexity of wide and shallow neural networks
- Disintegration theorem for multifunctions, with applications to empirical Wasserstein distances and average-case statistical bounds
- A first-order method for nonconvex-nonconcave minimax problems under a local Kurdyka-\L{}ojasiewicz condition
- Optimization, Isoperimetric Inequalities, and Sampling via Lyapunov Potentials
- Persistence Paradox in Dynamic Science
- Geometric Gaussian Approximations of Probability Distributions
- Navigating with Annealing Guidance Scale in Diffusion Space
- Neural Langevin Machine: a local asymmetric learning rule can be creative
- SGD with Adaptive Preconditioning: Unified Analysis and Momentum Acceleration
- Transition Matching: Scalable and Flexible Generative Modeling
- Riemannian-Geometric Fingerprints of Generative Models
- Concentration inequalities for random dynamical systems
- A convex lifting approach for the Calder\'on problem
- Fragile, Robust, and Antifragile: A Perspective from Parameter Responses in Reinforcement Learning Under Stress
- Experimenting, Fast and Slow: Bayesian Optimization of Long-term Outcomes with Online Experiments
- On the Convergence of Min-Max Langevin Dynamics and Algorithm
- Optimization, Isoperimetric Inequalities, and Sampling via Lyapunov Potentials
- Annealed Leap-Point Sampler for Multimodal Target Distributions
- Breaking a Logarithmic Barrier in the Stopping Time Convergence Rate of Stochastic First-order Methods
- Shifted Composition IV: Underdamped Langevin and Numerical Discretizations with Partial Acceleration
- Adjoint Schr\"odinger Bridge Sampler
- Mixing Time of the Proximal Sampler in Relative Fisher Information via Strong Data Processing Inequality
- Are Convex Optimization Curves Convex?
- Projected gradient descent accumulates at Bouligand stationary points
- Adjoint Schr\"odinger Bridge Sampler
- Swapping objectives accelerates Davis-Yin splitting
- Optimized methods for composite optimization: a reduction perspective
- Sociophysics models inspired by the Ising model
- Local convergence rates for Wasserstein gradient flows and McKean-Vlasov equations with multiple stationary solutions
- A gradient flow that is none: Heat flow with Wentzell boundary condition
- Mixing Time Bounds for the Gibbs Sampler under Isoperimetry
- Average-case complexity in statistical inference: A puzzle-driven research seminar
- Critically-Damped Higher-Order Langevin Dynamics
- Ecosystems as adaptive living circuits
- Toward a Unified Theory of Gradient Descent under Generalized Smoothness
- Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling
- Gaussian Invariant Markov Chain Monte Carlo
- On Convolutions, Intrinsic Dimension, and Diffusion Models
- Non-equilibrium Annealed Adjoint Sampler
- Telegrapher's Generative Model via Kac Flows
- Gradient-Free Sequential Bayesian Experimental Design via Interacting Particle Systems
- Building Population-Informed Priors for Bayesian Inference Using Data-Consistent Stochastic Inversion
- On gradient descent-ascent flows in metric spaces
- The Elements of Differentiable Programming
- Information-Theoretic Proofs for Diffusion Sampling
- Statistical Inference for Optimal Transport Maps: Recent Advances and Perspectives
- Interpolating between Optimal Transport and KL regularized Optimal Transport using R\'enyi Divergences
- Leveraging neural network interatomic potentials for a foundation model of chemistry
- Geometric Contact Flows: Contactomorphisms for Dynamics and Control
- Simulation-Free Differential Dynamics through Neural Conservation Laws
- Optimization-Induced Dynamics of Lipschitz Continuity in Neural Networks
- Non-equilibrium Annealed Adjoint Sampler
- A geometric framework for momentum-based optimizers for low-rank training
- Origins of Creativity in Attention-Based Diffusion Models
- Stabilizing Temporal Difference Learning via Implicit Stochastic Recursion
- Averaging principles for time-inhomogeneous multi-scale SDEs with partially dissipative coefficients
- A Linear Parameter-Varying Framework for the Analysis of Time-Varying Optimization Algorithms
- Projected Normal Distribution: Moment Approximations and Generalizations
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting
- Solving Zero-Sum Convex Markov Games
- Deep generative models as the probability transformation functions
- A Minimalist Optimizer Design for LLM Pretraining
- Energy-Based Transfer for Reinforcement Learning
- Progressive Inference-Time Annealing of Diffusion Models for Sampling from Boltzmann Densities
- Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation
- Accelerating Proximal Gradient Descent via Silver Stepsizes
- Finite Dimensional Projections of HJB Equations in the Wasserstein Space
- Evolution of Measures in Nonsmooth Dynamical Systems: Formalisms and Computation
- The LQR-Schr\"odinger Bridge
- Generative thermodynamic computing
- Lectures on Statistical Mechanics
- Stability of universal properties against perturbations of the Markov Chain Monte Carlo algorithm
- Stochastic heat flow is a black noise
- Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models
- Diffusion-Based Hypothesis Testing and Change-Point Detection
- Sampling conditioned diffusions via Pathspace Projected Monte Carlo
- Transport maps as flows of control-affine systems
- Moment Constrained Optimal Transport for Control Applications
- Online Feedback Optimization for Monotone Systems without Timescale Separation
- On the Convergence Rates of Iterative Regularization Algorithms for Composite Bi-Level Optimization
- Simultaneous sampling of multiple transition channels using adaptive paths of collective variables
- Local minima of the empirical risk in high dimension: General theorems and convex examples
- Simulating Diffusion Bridges with Score Matching
- Categorical Schr\"odinger Bridge Matching
- Learning Algorithms in the Limit
- Beyond Propagation of Chaos: A Stochastic Algorithm for Mean Field Optimization
- Coarse graining of stochastic differential equations: averaging and projection method
- Inexact JKO and proximal-gradient algorithms in the Wasserstein space
- Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size
- Proximal Operators of Sorted Nonconvex Penalties
- Revisiting Shooting Point Monte Carlo Methods for Transition Path Sampling
- Benchmarks for protocol control in nonequilibrium statistical mechanics
- Generative thermodynamic computing
- Posterior contraction rates of computational methods for Bayesian data assimilation
- Convergence Analysis of the Wasserstein Proximal Algorithm beyond Geodesic Convexity
- Universal Approximation of Operators with Transformers and Neural Integral Operators
- The Milieu, Science & Logic of Feedback Control
- Upper and lower bounds for local Lipschitz stability of Bayesian posteriors
- Computational lower bounds in latent models: clustering, sparse-clustering, biclustering
- Absolutely Continuous Curves of Stochastic Processes
- Stochastic intrinsic gradient flows on the Wasserstein space
- Restarted contractive operators to learn at equilibrium
- Adaptive Acceleration Without Strong Convexity Priors Or Restarts
- Foundations of ecological and evolutionary change
- Joint Learning of Energy-based Models and their Partition Function
- Lyapunov analysis for FISTA under strong convexity
- Heavy-ball dynamics with Hessian-driven damping for non-convex optimization under the {\L}ojasiewicz condition
- Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles
- Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models
- Constrained Denoising, Empirical Bayes, and Optimal Transport
- Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models
- Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
- Chip Placement with Diffusion Models
- Scaling Laws in Linear Regression: Compute, Parameters, and Data
- A Simple Analysis of Discretization Error in Diffusion Models
- On a structure preserving closure of Langevin dynamics
- Solving Convex-Concave Problems with $\tilde{\mathcal{O}}(\epsilon^{-4/7})$ Second-Order Oracle Complexity
- Optimal hedging of an informed broker facing many traders
- The global convergence time of stochastic gradient descent in non-convex landscapes: Sharp estimates via large deviations
- Generalized Interpolating Discrete Diffusion
- Improving the Diffusability of Autoencoders
- Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling
- Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels
- On Fitting Flow Models with Large Sinkhorn Couplings
- Path Integral Optimiser: Global Optimisation via Neural Schr\"odinger-F\"ollmer Diffusion
- GeoClip: Geometry-Aware Clipping for Differentially Private SGD
- Stability of Mean-Field Variational Inference
- On the Instability of Nesterov's ODE under Non-Conservative Vector Fields
- Theoretical smoothing frameworks for nonsmooth simple bilevel problems
- Bregman level proximal subdifferentials and new characterizations of Bregman proximal operators
- Computational bottlenecks for denoising diffusions
- How to explain grokking
- MCMC-Correction of Score-Based Diffusion Models for Model Composition
- On the Importance of Gaussianizing Representations
- Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective
- Long run convergence of discrete-time interacting particle systems of the McKean-Vlasov type
- Computational bottlenecks for denoising diffusions
- RNE: a plug-and-play framework for diffusion density estimation and inference-time control
- Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows
- Computable Bounds on Convergence of Markov Chains in Wasserstein Distance via Contractive Drift
- Zeroth-Order Optimization Finds Flat Minima
- UniSim: A Unified Simulator for Time-Coarsened Dynamics of Biomolecules
- Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models
- Stein Variational Evolution Strategies
- On the Wasserstein Geodesic Principal Component Analysis of probability measures
- Optical Physics-Based Generative Models
- Second Order Ensemble Langevin Method for Sampling and Inverse Problems
- Kinetics: Rethinking Test-Time Scaling Laws
- Progressive Tempering Sampler with Diffusion
- Gradient flow in parameter space is equivalent to linear interpolation in output space
- An SDE Perspective on Stochastic Inertial Gradient Dynamics with Time-Dependent Viscosity and Geometric Damping
- Direct-search methods in the year 2025: Theoretical guarantees and algorithmic paradigms
- Curse of Slicing: Why Sliced Mutual Information is a Deceptive Measure of Statistical Dependence
- Strong and weak convergence rates for fully coupled multiscale stochastic differential equations driven by $\alpha$-stable processes
- Global Optimization Algorithm through High-Resolution Sampling
- On scalable and efficient training of diffusion samplers
- Flow map matching with stochastic interpolants: A mathematical framework for consistency models
- An Introduction to Flow Matching and Diffusion Models
- How to build your latent Markov model -- the role of time and space
- Latent Stochastic Interpolants
- Constrained Sliced Wasserstein Embedding
- Optimal control of the Poisson equation with transport regularization: Properties of optimal transport plans and transport map
- Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization
- Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts
- Learning of Population Dynamics: Inverse Optimization Meets JKO Scheme
- Protocol Models: Scaling Decentralized Training with Communication-Efficient Model Parallelism
- Quantization-based Bounds on the Wasserstein Metric
- Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis
- Adapted Wasserstein distance between the laws of SDEs
- Phase transition for Minesweeper
- Markovian projections for functionals of It\^o semimartingales with jumps
- The Fastest Known First-Order Method for Minimizing Twice Continuously Differentiable Smooth Strongly Convex Functions
- Gradient-Free Score-Based Sampling Methods with Ensembles
- Machine-Learned Sampling of Conditioned Path Measures
- A geometric perspective of state estimation using Kalman filters
- Preconditioned primal-dual dynamics in convex optimization: non-ergodic convergence rates
- On the natural domain of Bregman operators
- Folded State Dynamics -- A Geometric and Deterministic Origin of Irreversibility
- Nonlinear Drift in Feynman-Kac Theory: Preserving Early Probabilistic Insights
- Near-Optimal Nonconvex-Strongly-Convex Bilevel Optimization with Fully First-Order Oracles
- The Gaussian Mixing Mechanism: Renyi Differential Privacy via Gaussian Sketches
- Rigorous enclosure of Lyapunov exponents of stochastic flows
- Provably convergent stochastic fixed-point algorithm for free-support Wasserstein barycenter of continuous non-parametric measures
- The Rich and the Simple: On the Implicit Bias of Adam and SGD
- Non-convex entropic mean-field optimization via Best Response flow
- Understanding Mode Connectivity via Parameter Space Symmetry
- Upper and lower bounds for local Lipschitz stability of Bayesian posteriors
- A gradient flow perspective on McKean-Vlasov equations in econophysics
- Optimal Protocols for Continual Learning via Statistical Physics and Control Theory
- Diffusive noise controls early stages of genetic demixing
- Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
- On the Convergence Analysis of Muon
- Latent Representations for Control Design with Provable Stability and Safety Guarantees
- Inexact JKO and proximal-gradient algorithms in the Wasserstein space
- Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent
- Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
- Non-Asymptotic Analysis of (Sticky) Track-and-Stop
- Differentiable Generalized Sliced Wasserstein Plans
- In Search of Adam's Secret Sauce
- On creating convexity in high dimensions
- Adaptive tail index estimation: minimal assumptions and non-asymptotic guarantees
- Are Statistical Methods Obsolete in the Era of Deep Learning?
- On the performance of machine-learning assisted Monte Carlo in sampling from simple statistical physics models
- On Averaging and Extrapolation for Gradient Descent
- A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective
- Conditional Diffusion Models with Classifier-Free Gibbs-like Guidance
- Dual Ascent Diffusion for Inverse Problems
- Computing Optimal Transport Plans via Min-Max Gradient Flows
- On the Almost Sure Convergence of the Stochastic Three Points Algorithm
- Learning Mixtures of Experts with EM: A Mirror Descent Perspective
- Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games
- When Models Don't Collapse: On the Consistency of Iterative MLE
- Energy-based generator matching: A neural sampler for general state space
- On scalable and efficient training of diffusion samplers
- Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
- How to build a consistency model: Learning flow maps via self-distillation
- Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models
- Weighted quantization using MMD: From mean field to mean shift via gradient flows
- Homogenisation for Maxwell and Friends
- The velocity jump Langevin process and its splitting scheme: long time convergence and numerical accuracy
- A Langevin sampling algorithm inspired by the Adam optimizer
- On theoretical guarantees and a blessing of dimensionality for nonconvex sampling
- On the Relation between Rectified Flows and Optimal Transport
- Backpropagation-Free Metropolis-Adjusted Langevin Algorithm
- Discrete Neural Flow Samplers with Locally Equivariant Transformer
- Spacetime Geometry of Denoising in Diffusion Models
- A new class of finite difference methods: The zigzag schemes
- Importance Corrected Neural JKO Sampling
- Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions
- New Tight Bounds for SGD without Variance Assumption: A Computer-Aided Lyapunov Analysis
- Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training
- Sampling from Conditional Distributions of Simplified Vines
- ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
- Sampled-data Systems: Stability, Contractivity and Single-iteration Suboptimal MPC
- Split-as-a-Pro: behavioral control via operator splitting and alternating projections
- Hamiltonian Theory and Computation of Optimal Probability Density Control in High Dimensions
- From Score Matching to Diffusion: A Fine-Grained Error Analysis in the Gaussian Setting
- Lindblad evolution as gradient flow
- Power of Generalized Smoothness in Stochastic Convex Optimization: First- and Zero-Order Algorithms
- SpectraLDS: Provable Distillation for Linear Dynamical Systems
- Algorithmic Collusion by Large Language Models
- Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading
- Educational programs and crime: a compartmental model approach
- To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization
- Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
- PO-Flow: Flow-based Generative Models for Sampling Potential Outcomes and Counterfactuals
- An Operator Splitting View of Federated Learning
- Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
- Dimension-adapted Momentum Outscales SGD
- Smooth transport map via diffusion process
- An interacting particle consensus method for constrained global optimization
- Continuous-time iterative linear-quadratic regulator
- Neural Entropy
- Subspace Langevin Monte Carlo
- Harnessing the Universal Geometry of Embeddings
- Mirror Bridges Between Probability Measures
- An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates
- From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave sampling
- Selective Code Generation for Functional Guarantees
- Latent Flow Transformer
- Fractional interacting particle system: drift parameter estimation via Malliavin calculus
- Nesterov Acceleration for Ensemble Kalman Inversion and Variants
- Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
- Bayesian Inverse Problems Meet Flow Matching: Efficient and Flexible Inference via Transformers
- Greed is Good: A Unifying Perspective on Guided Generation
- Grokking at the Edge of Numerical Stability
- Wasserstein Flow Matching: Generative modeling over families of distributions
- On Probabilistic Pullback Metrics for Latent Hyperbolic Manifolds
- A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD
- Group Symmetry Enables Faster Optimization in Inverse Problems
- Self-interacting approximation to McKean-Vlasov long-time limit: a Markov chain Monte Carlo method
- Black-box unadjusted Hamiltonian Monte Carlo
- A parameterized Wasserstein Hamiltonian flow approach for solving the Schr\"odinger equation
- EconoJax: A Fast & Scalable Economic Simulation in Jax
- Nesterov Acceleration for Ensemble Kalman Inversion and Variants
- A Symplectic Analysis of Alternating Mirror Descent
- Accelerated Markov Chain Monte Carlo Algorithms on Discrete States
- Hamiltonian Descent Algorithms for Optimization: Accelerated Rates via Randomized Integration Time
- Proximal optimal transport divergences
- A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation
- On the Nonconvexity of Push-Forward Constraints and Its Consequences in Machine Learning
- An Exponential Averaging Process with Strong Convergence Properties
- EXAdam: The Power of Adaptive Cross-Moments
- Variational structure of Fokker-Planck equations with variable mobility
- Laplace Meets Moreau: Smooth Approximation to Infimal Convolutions Using Laplace's Method
- Path Gradients after Flow Matching
- Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling
- Nesterov acceleration in benignly non-convex landscapes
- Free-energy estimates from nonequilibrium trajectories under varying-temperature protocols
- Analytic theory of dropout regularization
- Discrete distributions are learnable from metastable samples
- Uniform-in-time propagation of chaos for Consensus-Based Optimization
- Hamiltonian replica exchange augmented with diffusion-based generative models and importance sampling to assess biomolecular conformational basins and barriers
- Unified Continuous Generative Models
- Diffusion Processes on $p$-Wasserstein Space over Banach Space
- Stochastic moments dynamics: a flexible finite-dimensional random perturbation of Wasserstein gradient descent
- Learning from Samples: Inverse Problems over measures via Sharpened Fenchel-Young Losses
- Langevin Diffusion Approximation to Same Marginal Schr\"{o}dinger Bridge
- A stochastic gradient method for trilevel optimization
- Sparsity for dynamic inverse problems on Wasserstein curves with bounded variation
- Strong and weak quantitative estimates in slow-fast diffusions using filtering techniques
- Iterative Orthogonalization Scaling Laws
- Guide your favorite protein sequence generative model
- Particle Gibbs without the Gibbs bit
- Categorical and geometric methods in statistical, manifold, and machine learning
- Gradient flow structure for some nonlocal diffusion equations
- Geometric Foundation of Nonequilibrium Transport: A Minkowski Embedding of Markov Dynamics
- Linear Analysis of Stochastic Verlet-Type Integrators for Langevin Equations
- Accelerated Gradient Methods Through Variable and Operator Splitting
- Lifting couplings in Wasserstein spaces
- Mitigating mode collapse in normalizing flows by annealing with an adaptive schedule: Application to parameter estimation
- Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces
- Metric extrapolation in the Wasserstein space
- Fluctuation without dissipation: Microcanonical Langevin Monte Carlo
- Which exceptional low-dimensional projections of a Gaussian point cloud can be found in polynomial time?
- Entropic Time Schedulers for Generative Diffusion Models
- Faster logconcave sampling from a cold start in high dimension
- A dynamic view of the double descent
- Mirror Mean-Field Langevin Dynamics
- Entropy-Guided Sampling of Flat Modes in Discrete Spaces
- Minimax entropy: The statistical physics of optimal models
- A non-asymptotic approach to stochastic differential games with many players under semi-monotonicity
- The dynamical law behind eye movements: distinguishing between L\'evy and intermittent strategies
- A Provably Convergent Plug-and-Play Framework for Stochastic Bilevel Optimization
- A Bayesian approach to inverse problems in spaces of measures
- On the Importance of Gaussianizing Representations
- A Theory of "Likes"
- Climate Science and Control Engineering: Insights, Parallels, and Connections
- Wellposedness and averaging principle for conditional distribution dependent SDEs driven by standard Brownian motions and fractional Brownian motions
- Extended convexity and smoothness and their applications in deep learning
- Geometry and Duality of Alternating Markov Chains
- Large deviation-based tuning schemes for Metropolis-Hastings algorithms
- Sharp higher order convergence rates for the Adam optimizer
- Inverse Problems Over Probability Measure Space
- A reverse isoperimetric inequality for the Cheeger constant under width constraint
- Negative Imaginary Neural ODEs: Learning to Control Mechanical Systems with Stability Guarantees
- A higher-order Otto calculus approach to the Gaussian completely monotone conjecture
- Mixing of metastable diffusion processes with Gibbs invariant distribution
- Entropic continuity bounds for conditional covariances with applications to Schr\" odinger and Sinkhorn bridges
- Sampling and estimation on manifolds using the Langevin diffusion
- A Langevin sampling algorithm inspired by the Adam optimizer
- Flow Matching Ergodic Coverage
- Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional
- A Geometric Framework for Stochastic Iterations
- Averaging principle for slow-fast systems of rough differential equations via controlled paths
- Heat kernels, intrinsic contractivity and ergodicity of discrete-time Markov chains killed by potentials
- Kalman-Langevin dynamics : exponential convergence, particle approximation and numerical approximation
- Score-Based Deterministic Density Sampling
- Analysis of Multiple-try Metropolis via Poincar\'e inequalities
- On the equivalence of a Hessian-free inequality and Lipschitz continuous Hessian
- Nonasymptotic CLT and Error Bounds for Two-Time-Scale Stochastic Approximation
- Likelihood-Free Variational Autoencoders
- Embedding Empirical Distributions for Computing Optimal Transport Maps
- Mean convergence rates for Gaussian-smoothed Wasserstein distances and classical Wasserstein distances
- Applied Antifragility in Natural Systems: Evolutionary Antifragility
- Computing Optimal Transport Plans via Min-Max Gradient Flows
- Solving Inverse Problems in Protein Space Using Diffusion-Based Priors
- Exact Sampling of Gibbs Measures with Estimated Losses
- Compositionality in algorithms for smoothing
- Transport f divergences
- On the Guidance of Flow Matching
- Muon Optimizer Accelerates Grokking
- An interacting particle consensus method for constrained global optimization
- Recovering Nesterov accelerated dynamics from Heavy Ball dynamics via time rescaling
- Markov Kernels, Distances and Optimal Control: A Parable of Linear Quadratic Non-Gaussian Distribution Steering
- First-Order Methods for Linearly Constrained Bilevel Optimization
- A Bayesian Interpretation of the Internal Model Principle
- Generative Learning of Densities on Manifolds
- A direct proof of a unified law of robustness for Bregman divergence losses
- Optimal Scheduling of Dynamic Transport
- First and Second Order Approximations to Stochastic Gradient Descent Methods with Momentum Terms
- Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
- Entropic Time Schedulers for Generative Diffusion Models
- Convergence of the fully discrete JKO scheme
- Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness
- Efficient Primal-dual Forward-backward Splitting Method for Wasserstein-like Gradient Flows with General Nonlinear Mobilities
- Integral control of the proximal gradient method for unbiased sparse optimization
- Introduction to Langevin Stochastic Processes
- FEAT: Free energy Estimators with Adaptive Transport
- Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
- Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
- Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
- On the Contractivity of Stochastic Interpolation Flow
- Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling
- Non-Reversible Langevin Algorithms for Constrained Sampling
- Hessian stability and convergence rates for entropic and Sinkhorn potentials via semiconcavity
- Off-the-grid regularisation for Poisson inverse problems
- Adaptive teachers for amortized samplers
- Towards Weaker Variance Assumptions for Stochastic Optimization
- No-Regret Generative Modeling via Parabolic Monge-Amp\`ere PDE
- Score Matching Diffusion Based Feedback Control and Planning of Nonlinear Systems
- Improving the evaluation of samplers on multi-modal targets
- $\alpha$-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models
- A User's Guide to Sampling Strategies for Sliced Optimal Transport
- Convergence of the denoising diffusion probabilistic models for general noise schedules
- Exact inequalities and optimal recovery by inaccurate information
- Agentic Workflows for Economic Research: Design and Implementation
- Ising 100: review of solutions
- Entropically Driven Agents
- Global Regularity Estimates for Optimal Transport via Entropic Regularisation
- Measure Theory of Conditionally Independent Random Function Evaluation
- Proofs as Explanations: Short Certificates for Reliable Predictions
- Optimal Transport-Based Generative Models for Bayesian Posterior Sampling
- Controlled stochastic processes for simulated annealing
- On iteratively regularized first-order methods for simple bilevel optimization
- Microfoundation Inference for Strategic Prediction
- Wasserstein Gradient Flows for Moreau Envelopes of f-Divergences in Reproducing Kernel Hilbert Spaces
- Bregman-Wasserstein divergence: geometry and applications
- Slicing the Gaussian Mixture Wasserstein Distance
- Smoothed Distance Kernels for MMDs and Applications in Wasserstein Gradient Flows
- Bridging the Theoretical Gap in Randomized Smoothing
- Sampling from mixture distributions based on regime-switching diffusions
- Error estimate for regularized optimal transport problems via Bregman divergence
- Language Models Are Implicitly Continuous
- Gaussian Mixture Flow Matching Models
- On the Convergence Rate of Sinkhorn's Algorithm
- Transport information Bregman divergences
- Stability of optimal transport maps on Riemannian manifolds
- A New Approach to Controlling Linear Dynamical Systems
- Gradient-free stochastic optimization for additive models
- Sampling with time-changed Markov processes
- Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures
- Randomised Splitting Methods and Stochastic Gradient Descent
- DDPM Score Matching and Distribution Learning
- Information Geometry of Exponentiated Gradient: Convergence beyond L-Smoothness
- Accelerating Particle-based Energetic Variational Inference
- Conditioning Diffusions Using Malliavin Calculus
- Averaging principle for rough slow-fast systems of level 3
- Stochastic Optimization with Optimal Importance Sampling
- Beyond Smoothness and Convexity: Optimization via sampling
- Geometric Reasoning in the Embedding Space
- Diffusion at Absolute Zero: Langevin Sampling Using Successive Moreau Envelopes [conference paper]
- Error Analysis of Sampling Algorithms for Approximating Stochastic Optimal Control
- A Geometric Framework for Stochastic Iterations
- Beyond Gaussian Assumptions: A Nonlinear Generalization of Linear Inverse Modeling
- Convergence of Ray- and Pixel-Driven Discretization Frameworks in the Strong Operator Topology
- A Unified Approach to Analysis and Design of Denoising Markov Models
- Are Convex Optimization Curves Convex?
- Acceleration via Perturbations on Low-resolution Ordinary Differential Equations
- Deep Generative Models: Complexity, Dimensionality, and Approximation
- Gradient Descent for Convex and Smooth Noisy Optimization
- Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization
- Remarks on the Polyak-Lojasiewicz inequality and the convergence of gradient systems
- Steering Large Agent Populations using Mean-Field Schrodinger Bridges with Gaussian Mixture Models
- Polynomial Inequalities and Optimal Stability of Numerical Integrators
- Decentralized Bilevel Optimization: A Perspective from Transient Iteration Complexity
- Accelerated Stein Variational Gradient Flow
- Single-loop Projection-free and Projected Gradient-based Algorithms for Nonconvex-concave Saddle Point Problems with Bilevel Structure
- Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size
- On the convergence of the Euler-Maruyama scheme for McKean-Vlasov SDEs
- Open-loop control design for contraction in affine nonlinear systems
- Manifold learning in Wasserstein space
- A Tutorial on Multi-time Scale Optimization Models and Algorithms
- Asymptotics of the quantization problem on metric measure spaces
- Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise
- Solving Schr\"{o}dinger bridge problem via continuous normalizing flow
- Homotopy Methods for Convex Optimization
- On the Computational Power of Particle Methods
- Synthesis and Analysis of Data as Probability Measures with Entropy-Regularized Optimal Transport
- Entropy annealing for policy mirror descent in continuous time and space
- Generalization of the Gibbs algorithm with high probability at low temperatures
- Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers
- Wasserstein bounds for non-linear Gaussian filters
- Empirical Measures and Strong Laws of Large Numbers in Categorical Probability
- A uniform rate of convergence for the entropic potentials in the quadratic Euclidean setting
- Turnpike in optimal control and beyond: a survey
- Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
- Learning Straight Flows by Learning Curved Interpolants
- Global Regularity Estimates for Optimal Transport via Entropic Regularisation
- Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
- Stochastic Transport Maps in Diffusion Models and Sampling
- A four-operator splitting algorithm for nonconvex and nonsmooth optimization
- KL-geodesics flow matching with a novel sampling scheme
- Lean Formalization of Generalization Error Bound by Rademacher Complexity
- Flow to Learn: Flow Matching on Neural Network Parameters
- Factorizations of relative entropy using stochastic localization
- Stochastic neighborhood embedding and the gradient flow of relative entropy
- An Overview of Low-Rank Structures in the Training and Adaptation of Large Models
- Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis
- High-Order and Energy-Stable Implicit-Explicit Relaxation Runge-Kutta Schemes for Gradient Flows
- Non-Bayesian Learning in Misspecified Models
- AutoBayes: A Compositional Framework for Generalized Variational Inference
- Schr\"odinger Bridges for Systems of Interacting Particles
- Semi-Implicit Functional Gradient Flow for Efficient Sampling
- Advances in Protein Representation Learning: Methods, Applications, and Future Directions
- Contraction Theory for Nonlinear Stability Analysis and Learning-based Control: A Tutorial Overview
- Semi-Implicit Functional Gradient Flow for Efficient Sampling
- Statistical exploration of the Manifold Hypothesis
- Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization
- Stability of Schr\"odinger bridges and Sinkhorn semigroups for log-concave models
- Efficient Bayesian Computation Using Plug-and-Play Priors for Poisson Inverse Problems
- Statistical accuracy of the ensemble Kalman filter in the near-linear setting
- Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization
- The global convergence time of stochastic gradient descent in non-convex landscapes: Sharp estimates via large deviations
- Stability of Schr\"odinger bridges and Sinkhorn semigroups for log-concave models
- An introduction to large deviations with applications in physics
- From Denoising Score Matching to Langevin Sampling: A Fine-Grained Error Analysis in the Gaussian Setting
- Revisiting Strong Duality, Hidden Convexity, and Gradient Dominance in the Linear Quadratic Regulator
- Understanding Flatness in Generative Models: Its Role and Benefits
- Stochastic Primal-Dual Three Operator Splitting Algorithm with Extension to Equivariant Regularization-by-Denoising
- Beyond Propagation of Chaos: A Stochastic Algorithm for Mean Field Optimization
- SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models
- Explicit numerical approximations for McKean-Vlasov stochastic differential equations in finite and infinite time
- Strong convergence of multiscale truncated Euler-Maruyama method for super-linear slow-fast stochastic differential equations
- How compositional generalization and creativity improve as diffusion models are trained
- Regularization Using Synthetic Data in High-Dimensional Models
- Memorization and Regularization in Generative Diffusion Models
- On The Convergence of Euler Discretization of Finite-Time Convergent Gradient Flows
- Sampling Decisions
- Topology meets Machine Learning: An Introduction using the Euler Characteristic Transform
- Control, Optimal Transport and Neural Differential Equations in Supervised Learning
- A Unified Model for High-Resolution ODEs: New Insights on Accelerated Methods
- On The Convergence of Euler Discretization of Finite-Time Convergent Gradient Flows
- Fast alignment of heterogeneous images in sliced Wasserstein distance
- Theoretical Convergence Guarantees for Variational Autoencoders
- Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity
- Informed Correctors for Discrete Diffusion Models
- Zero-shot Imputation with Foundation Inference Models for Dynamical Systems
- Constrained Optimization From a Control Perspective via Feedback Linearization
- From Denoising Score Matching to Langevin Sampling: A Fine-Grained Error Analysis in the Gaussian Setting
- InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences
- Numerical and statistical analysis of NeuralODE with Runge-Kutta time integration
- The Problem of the Priors, or Posteriors?
- Online estimation of the inverse of the Hessian for stochastic optimization with application to universal stochastic Newton algorithms
- Quadratically Regularized Optimal Transport: Existence and Multiplicity of Potentials
- SGD with memory: fundamental properties and stochastic acceleration
- Multi-Iteration Stochastic Optimizers
- Scaffold with Stochastic Gradients: New Analysis with Linear Speed-Up
- On Solving Minimization and Min-Max Problems by First-Order Methods with Relative Error in Gradients
- Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
- KL-geodesics flow matching with a novel sampling scheme
- RNACG: A Universal RNA Sequence Conditional Generation model based on Flow-Matching
- Convergence of the Chambolle-Pock Algorithm in the Absence of Monotonicity
- Self-interacting processes via Doob conditioning
- Schr\"odinger Bridges for Systems of Interacting Particles
- Thermodynamic inference in molecular motors: a Martingale approach
- Diffusion Models as Cartoonists: The Curious Case of High Density Regions
- Inductive Moment Matching
- Representation Theorems for Convex Expectations and Semigroups on Path Space
- Score matching for bridges without learning time-reversals
- Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures
- Are Convex Optimization Curves Convex?
- Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Learnable Linear Extrapolation
- The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation
- Optimal Riemannian metric for Poincar\'e inequalities and how to ideally precondition Langevin dynamics
- Langevin Monte-Carlo Provably Learns Depth Two Neural Nets at Any Size and Data
- Connections between sequential Bayesian inference and evolutionary dynamics
- A Geometric Framework for Understanding Memorization in Generative Models
- Bayesian Experimental Design via Contrastive Diffusions
- Score matching for bridges without learning time-reversals
- Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
- Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
- On the Generalization Properties of Diffusion Models
- State-space systems as dynamic generative models
- Minimax Optimality of the Probability Flow ODE for Diffusion Models
- One-step Diffusion Models with $f$-Divergence Distribution Matching
- SGD with memory: fundamental properties and stochastic acceleration
- Plug-and-Play Posterior Sampling under Mismatched Measurement and Prior Models
- Constrained Approximate Optimal Transport Maps
- Learning Energy-Based Models by Self-normalising the Likelihood
- Time-reversal solution of BSDEs in stochastic optimal control: a linear quadratic study
- Sampling the space of solutions of an artificial neural network
- Flow Matching for Discrete Systems: Efficient Free Energy Sampling Across Lattice Sizes and Temperatures
- Statistical and Geometrical properties of regularized Kernel Kullback-Leibler divergence
- Regularization by Texts for Latent Diffusion Inverse Solvers
- Computing high-dimensional optimal transport by flow neural networks
- Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise
- Chemistry-Inspired Diffusion with Non-Differentiable Guidance
- Structure Preserving Diffusion Models
- Rethinking Diffusion Model in High Dimension
- FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems
- Computational bottlenecks for denoising diffusions
- The Space Between: On Folding, Symmetries and Sampling
- Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models
- Diffusion Approximation for Slow-Fast SDEs with State-Dependent Switching
- Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
- Computational bottlenecks for denoising diffusions
- Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization
- Generative modelling with jump-diffusions
- The Parametric Complexity of Operator Learning
- Slow-fast systems with stochastic resetting
- Energy-Based Diffusion Language Models for Text Generation
- Riemannian Metric Learning: Closer to You than You Imagine
- SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups
- Implicit Diffusion: Efficient Optimization through Stochastic Sampling
- Tutorial on amortized optimization
- Generalized Interpolating Discrete Diffusion
- All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
- When Can You Get Away with Low Memory Adam?
- On the Convergence of Adam-Type Algorithm for Bilevel Optimization under Unbounded Smoothness
- Convergence of non-reversible Markov processes via lifting and flow Poincar{\'e} inequality
- R\'enyi Divergences in Central Limit Theorems: Old and New
- Learning truly monotone operators with applications to nonlinear inverse problems
- Quantitative Flow Approximation Properties of Narrow Neural ODEs
- Iterative Flow Matching -- Path Correction and Gradual Refinement for Enhanced Generative Modeling
- Guided smoothing and control for diffusion processes
- Accelerating optimization over the space of probability measures
- Sliced-Wasserstein Distances and Flows on Cartan-Hadamard Manifolds
- Fractional Sobolev paths on Wasserstein spaces and their energy-minimizing particle representations
- Introduction to Online Control
- Linear-quadratic optimal control for non-exchangeable mean-field SDEs and applications to systemic risk
- Amortized Probabilistic Conditioning for Optimization, Simulation and Inference
- Wasserstein metric, gradient flow structure and well-posedness of Fokker-Planck equation on locally finite graphs
- Flow-based Bayesian filtering for high-dimensional nonlinear stochastic dynamical systems
- From Learning to Optimize to Learning Optimization Algorithms
- Grams: Gradient Descent with Adaptive Momentum Scaling
- $\mu^2$-SGD: Stable Stochastic Optimization via a Double Momentum Mechanism
- Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems?
- Optimization, Isoperimetric Inequalities, and Sampling via Lyapunov Potentials
- Gradient-Free Generation for Hard-Constrained Systems
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
- Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts
- Intrinsic regularity in the discrete log-Sobolev inequality
- Partition function approach to non-Gaussian likelihoods: information theory and state variables for Bayesian inference
- Does SGD really happen in tiny subspaces?
- Topology of the simplest gene switch
- High-Resolution Image Synthesis via Next-Token Prediction
- PnP-Flow: Plug-and-Play Image Restoration with Flow Matching
- Optimal Protocols for Continual Learning via Statistical Physics and Control Theory
- LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
- Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
- Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean
- On the Asymptotic Mean Square Error Optimality of Diffusion Models
- Probing the Latent Hierarchical Structure of Data via Diffusion Models
- How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework
- Trajectory Inference with Smooth Schr\"odinger Bridges
- When Can You Get Away with Low Memory Adam?
- Split Gibbs Discrete Diffusion Posterior Sampling
- Underdamped Diffusion Bridges with Applications to Sampling
- End-To-End Learning of Gaussian Mixture Priors for Diffusion Sampler
- Contractive coupling rates and curvature lower bounds for Markov chains
- Learning Dynamics of Deep Linear Networks Beyond the Edge of Stability
- Numerical approximation of McKean-Vlasov SDEs via stochastic gradient descent
- The No-Underrun Sampler: A Locally-Adaptive, Gradient-Free MCMC Method
- Iterative Flow Matching -- Path Correction and Gradual Refinement for Enhanced Generative Modeling
- Provable Acceleration for Diffusion Models under Minimal Assumptions
- (Mis)Fitting: A Survey of Scaling Laws
- Constrained Generative Modeling with Manually Bridged Diffusion Models
- On the Interpolation Effect of Score Smoothing
- Large deviations for Independent Metropolis Hastings and Metropolis-adjusted Langevin algorithm
- Exponential convergence of general iterative proportional fitting procedures
- Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
- Concentration of Measure for Distributions Generated via Diffusion Models
- Near-Optimal Approximations for Bayesian Inference in Function Space
- A Concise Lyapunov Analysis of Nesterov's Accelerated Gradient Method
- Linear multistep methods with repeated global Richardson extrapolation
- The Stability and Accuracy of The Adams-Bashforth-type Integrator
- Nonlinear Assimilation via Score-based Sequential Langevin Sampling
- On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions
- Flow-based linear embedding for Bayesian filtering of nonlinear stochastic dynamical systems
- Splitting Regularized Wasserstein Proximal Algorithms for Nonsmooth Sampling Problems
- One-step Diffusion Models with $f$-Divergence Distribution Matching
- Feedback Schr\"odinger Bridge Matching
- Antifragility and response to damage in the synchronization of oscillators on networks
- Kinetic Optimal Transport (OTIKIN) -- Part 1: Second-Order Discrepancies Between Probability Measures
- Categorical Lyapunov Theory I: Stability of Flows
- Categorical algebra of conditional probability
- Local geometry of high-dimensional mixture models: Effective spectral theory and dynamical transitions
- Jeffrey's update rule as a minimizer of Kullback-Leibler divergence
- Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay
- MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers
- High-dimensional manifold of solutions in neural networks: insights from statistical physics
- Interleaved Gibbs Diffusion for Constrained Generation
- Value Gradient Sampler: Sampling as Sequential Decision Making
- Poincare Inequality for Local Log-Polyak-\L ojasiewicz Measures: Non-asymptotic Analysis in Low-temperature Regime
- Stochastic Inertial Dynamics Via Time Scaling and Averaging
- Energy-Based Diffusion Language Models for Text Generation
- Improving the Diffusability of Autoencoders
- A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms
- A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms
- Sampling from the Continuous Random Energy Model in Total Variation Distance
- A Novel Unified Parametric Assumption for Nonconvex Optimization
- Gradient Equilibrium in Online Learning: Theory and Applications
- Derivative-Free Optimization via Finite Difference Approximation: An Experimental Study
- Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo
- Stability Bounds for Smooth Optimal Transport Maps and their Statistical Implications
- Separation of time scales in weakly interacting diffusions
- The gene's eye-view of quantitative genetics
- Anytime Solvers for Variational Inequalities: the (Recursive) Safe Monotone Flows
- Towards a Mechanistic Explanation of Diffusion Model Generalization
- Uniform-in-time bounds for a stochastic hybrid system with fast periodic sampling and small white-noise
- Fast Inexact Bilevel Optimization for Analytical Deep Image Priors
- On the unconventional Hug integrator
- Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions
- LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won't Fail)
- A Bregman firmly nonexpansive proximal operator for baryconvex optimization
- Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions
- Nonasymptotic CLT and Error Bounds for Two-Time-Scale Stochastic Approximation
- On creating convexity in high dimensions
- Rigorous lower bound of the dynamical critical exponent of the Ising model
- @Hesamation Same approach works for diffusion models too: arxiv.org/abs/2401.08741
- Partial Gromov-Wasserstein Metric
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Scaling Law for Stochastic Gradient Descent in Quadratically Parameterized Linear Regression
- Finite-Time Analysis of Discrete-Time Stochastic Interpolants
- A First-order Generative Bilevel Optimization Framework for Diffusion Models
- Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling
- Deep Generative Models with Hard Linear Equality Constraints
- A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods
- The Role of Randomness in Stability
- An introduction to Malliavin calculus
- Fast Convergence of $\Phi$-Divergence Along the Unadjusted Langevin Algorithm and Proximal Sampler
- Understanding Classifier-Free Guidance: High-Dimensional Theory and Non-Linear Generalizations
- Concentration Inequalities for the Stochastic Optimization of Unbounded Objectives with Application to Denoising Score Matching
- On the Convergence of Min-Max Langevin Dynamics and Algorithm
- Coupled Wasserstein Gradient Flows for Min-Max and Cooperative Games
- Convergence analysis for a variant of manifold proximal point algorithm based on Kurdyka-{\L}ojasiewicz property
- Operator convexity along lines, self-concordance, and sandwiched R\'enyi entropies
- Gradient Flows and the Curvature of Theory Space
- Neural Flow Samplers with Shortcut Models
- Poincar\'e Inequality for Local Log-Polyak-Lojasiewicz Measures : Non-asymptotic Analysis in Low-temperature Regime
- Optimality in importance sampling: a gentle survey
- Riemannian Proximal Sampler for High-accuracy Sampling on Manifolds
- Flowing Through Layers: A Continuous Dynamical Systems Perspective on Transformers
- UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
- Rate of convergence of the smoothed empirical Wasserstein distance
- On the query complexity of sampling from non-log-concave distributions
- Mixing Time of the Proximal Sampler in Relative Fisher Information via Strong Data Processing Inequality
- Random Variables aren't Random
- Properties of Wasserstein Gradient Flows for the Sliced-Wasserstein Distance
- Two-Point Deterministic Equivalence for Stochastic Gradient Dynamics in Linear Models
- Mechanisms of Projective Composition of Diffusion Models
- Optimizing the diffusion coefficient of overdamped Langevin dynamics
- Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond
- Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration
- Analysis of Diffusion Models for Manifold Data
- Iterative Importance Fine-tuning of Diffusion Models
- Importance Sampling via Score-based Generative Models
- On the Convergence of Min-Max Langevin Dynamics and Algorithm
- Lovely little pedagogical note; well worth a look. arxiv.org/abs/2502.02305 'Information-Theoretic Proofs for Diffusion Sampling' - Galen Reeves, Henry D. Pfister
- On the sequential convergence of Lloyd's algorithms
- MPAX: Mathematical Programming in JAX
- Variational Control for Guidance in Diffusion Models
- A Bayesian perspective on single-shot laser characterization
- The Ensemble Kalman Update is an Empirical Matheron Update
- Theoretical Guarantees for Low-Rank Compression of Deep Neural Networks
- Latent Space Energy-based Neural ODEs
- Diffusion Bridge Implicit Models
- A Mixture-Based Framework for Guiding Diffusion Models
- The Performance Of The Unadjusted Langevin Algorithm Without Smoothness Assumptions
- Data denoising with self consistency, variance maximization, and the Kantorovich dominance
- Variations on the Expectation Due to Changes in the Probability Measure
- Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions
- A theoretical framework for overfitting in energy-based modeling
- Temperature-Annealed Boltzmann Generators
- Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models
- On the Guidance of Flow Matching
- Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD Variants
- Field Matching: an Electrostatic Paradigm to Generate and Transfer Data
- How Memory in Optimization Algorithms Implicitly Modifies the Loss
- Rethinking Timesteps Samplers and Prediction Types
- Learning with Differentially Private (Sliced) Wasserstein Gradients
- SDE Matching: Scalable and Simulation-Free Training of Latent Stochastic Differential Equations
- Information-Theoretic Proofs for Diffusion Sampling
- A User Guide to Sampling Strategies for Sliced Optimal Transport
- The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
- Approximate Slow Manifolds in the Fokker-Planck Equation
- Functional role of synchronization: A mean-field control perspective
- Uniform-in-time weak propagation of chaos for consensus-based optimization
- Equilibrium Moment Analysis of It\^o SDEs
- On Probabilistic Pullback Metrics on Latent Hyperbolic Manifolds
- Sampling in High-Dimensions using Stochastic Interpolants and Forward-Backward Stochastic Differential Equations
- Re-examining Double Descent and Scaling Laws under Norm-based Capacity via Deterministic Equivalence
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Diffusion at Absolute Zero: Langevin Sampling Using Successive Moreau Envelopes
- Doubly Adaptive Importance Sampling
- Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances
- Hutchinson's Estimator is Bad at Kronecker-Trace-Estimation
- Wellposedness, exponential ergodicity and numerical approximation of fully super-linear McKean--Vlasov SDEs and associated particle systems
- Linearization Turns Neural Operators into Function-Valued Gaussian Processes
- The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training
- Lightspeed Geometric Dataset Distance via Sliced Optimal Transport
- Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models
- Fisher information dissipation for time-inhomogeneous stochastic differential equations
- A Unified Perspective on the Dynamics of Deep Transformers
- Global Optimization Algorithm through High-Resolution Sampling
- Joint Learning of Energy-based Models and their Partition Function
- Convergence rates for an Adaptive Biasing Potential scheme from a Wasserstein optimization perspective
- A Proximal Operator for Inducing 2:4-Sparsity
- Joint Learning of Energy-based Models and their Partition Function
- Exploring Non-Convex Discrete Energy Landscapes: A Langevin-Like Sampler with Replica Exchange
- Hellinger-Kantorovich Gradient Flows: Global Exponential Decay of Entropy Functionals
- Safe Gradient Flow for Bilevel Optimization
- Predictive variational inference: Learn the predictively optimal posterior distribution
- Generative diffusion models from a PDE perspective
- Variational Schr\"odinger Momentum Diffusion
- Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
- Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives
- Safe Gradient Flow for Bilevel Optimization
- Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization
- Closed-Form Diffusion Models
- Matrix Calculus (for Machine Learning and Beyond)
- A mirror descent approach to maximum likelihood estimation in latent variable models
- Fast convex optimization via closed-loop time scaling of gradient dynamics
- Memorization and Regularization in Generative Diffusion Models
- Bean: A Language for Backward Error Analysis
- Hidden Markov Models and the Bayes Filter in Categorical Probability
- Diffusive transport on the real line: semi-contractive gradient flows and their discretization
- Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion
- The Pseudo-Dimension of Contracts
- The Wasserstein Space of Stochastic Processes in Continuous Time
- Variational Analysis of Proximal Compositions and Integral Proximal Mixtures
- Gradient correlation is a key ingredient to accelerate SGD with momentum
- Sticky-reflecting diffusion as a Wasserstein gradient flow
- On the Almost Sure Convergence of the Stochastic Three Points Algorithm
- Differentially Private Gradient Flow based on the Sliced Wasserstein Distance
- MirrorCBO: A consensus-based optimization method in the spirit of mirror descent
- Second-order flows for approaching stationary points of a class of non-convex energies via convex-splitting schemes
- Solving Non-Monotone Inclusions Using Monotonicity of Pairs of Operators
- A solvable generative model with a linear, one-step denoiser
- Mini-batch descent in semiflows
- Linearization of ergodic McKean SDEs and applications
- O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions
- Low-dimensional adaptation of diffusion models: Convergence in total variation
- On measures strongly log-concave on a subspace
- Approximating particle-based clustering dynamics by stochastic PDEs
- Global Regularity Estimates for Optimal Transport via Entropic Regularisation
- Large Deviations for Slow-Fast Mean-Field Diffusions
- Averaging principles and central limit theorems for multiscale McKean-Vlasov stochastic systems
- Wellposedness, exponential ergodicity and numerical approximation of fully super-linear McKean--Vlasov SDEs and associated particle systems
- Non-Expansive Mappings in Two-Time-Scale Stochastic Approximation: Finite-Time Analysis
- Uniform in time convergence of numerical schemes for stochastic differential equations via Strong Exponential stability: Euler methods, Split-Step and Tamed Schemes
- Lee and Seung (2000)'s Algorithms for Non-negative Matrix Factorization: A Supplementary Proof Guide
- Differentially Private Gradient Flow based on the Sliced Wasserstein Distance
- On the Convergence of the Gradient Descent Method with Stochastic Fixed-point Rounding Errors under the Polyak-Lojasiewicz Inequality
- Non-Reversible Langevin Algorithms for Constrained Sampling
- Quantitative Error Bounds for Scaling Limits of Stochastic Iterative Algorithms
- Stochastic Optimal Control via Local Occupation Measures
- Geometry-Preserving Encoder/Decoder in Latent Generative Models
- Optimal Execution among $N$ Traders with Transient Price Impact
- Nonsmooth Nonconvex-Nonconcave Minimax Optimization: Primal-Dual Balancing and Iteration Complexity Analysis
- A General Framework for Inference-time Scaling and Steering of Diffusion Models
- FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction
- Generative diffusion model with inverse renormalization group flows
- Rapid Bayesian Computation and Estimation for Neural Networks via Log-Concave Coupling
- Wasserstein Gradient Flows for Moreau Envelopes of f-Divergences in Reproducing Kernel Hilbert Spaces
- Generative Models with ELBOs Converging to Entropy Sums
- A General Framework for Inference-time Scaling and Steering of Diffusion Models
- Differentiability and overlap concentration in optimal Bayesian inference
- Particle Semi-Implicit Variational Inference
- Nesterov Acceleration for Ensemble Kalman Inversion and Variants
- On the Asymptotics of Importance Weighted Variational Inference
- Slicing of Radial Functions: a Dimension Walk in the Fourier Space
- Measure transfer via stochastic slicing and matching
- Flow matching for stochastic linear control systems
- Grand-Canonical Optimal Transport
- A Similarity Measure Between Functions with Applications to Statistical Learning and Optimization
- Proximal Flow Inspired Multi-Step Methods
- Control Strategies for Maintaining Transport Symmetries Far From Equilibrium
- On the Statistical Capacity of Deep Generative Models
- Concentration of Measure for Distributions Generated via Diffusion Models
- A General Framework for Inference-time Scaling and Steering of Diffusion Models
- Uniform large-scale $\varepsilon$-regularity for entropic optimal transport
- From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training
- Structure preservation via the Wasserstein distance
- Regret Analysis: a control perspective
- Control of Overpopulated Tails in Kinetic Epidemic Models
- Decentralized Diffusion Models
- Accelerated Diffusion Models via Speculative Sampling
- Stochastic Process Learning via Operator Flow Matching
- Sharp Quantitative Stability for the Pr\'ekopa-Leindler and Borell-Brascamp-Lieb Inequalities
- New probabilistic methods for physics
- Grokking at the Edge of Numerical Stability
- Kinetic theory of decentralized learning for smart active matter
- Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion
- A precise asymptotic analysis of learning diffusion models: theory and insights
- Smooth transport map via diffusion process
- Constrained Sampling with Primal-Dual Langevin Monte Carlo
- Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance
- MCMC Importance Sampling via Moreau-Yosida Envelopes
- From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering
- The Mean Value Theorem: Analytical Proof and Computational Approaches
- Can ChatGPT implement finite element models for geotechnical engineering applications?
- How to explain grokking
- Variational autoencoders with latent high-dimensional steady geometric flows for dynamics
- Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models
- High-accuracy sampling from constrained spaces with the Metropolis-adjusted Preconditioned Langevin Algorithm
- Algebraic Control: Complete Stable Inversion with Necessary and Sufficient Conditions
- Polynomial time sampling from log-smooth distributions in fixed dimension under semi-log-concavity of the forward diffusion with application to strongly dissipative distributions
- Optimizing Noise Schedules of Generative Models in High Dimensionss
- Score-Based Metropolis-Hastings Algorithms
- Poincare Inequality for Local Log-Polyak-Lojasiewicz Measures: Non-asymptotic Analysis in Low-temperature Regime
- Which constraints of a numerical problem cause ill-conditioning?
- Mimetic finite difference schemes for transport operators with divergence-free advective field and applications to plasma physics
- High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
- An analytic theory of creativity in convolutional diffusion models
- Fitting Dynamically Misspecified Models: An Optimal Transportation Approach
- Stability and convergence analysis of AdaGrad for non-convex optimization via novel stopping time-based techniques
- Tighter Learning Guarantees on Digital Computers via Concentration of Measure on Finite Spaces
Saved in 2024
- Convergence of the Min-Max Langevin Dynamics and Algorithm for Zero-Sum Games
- Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret
- EXAdam: The Power of Adaptive Cross-Moments
- Deep Kalman Filters Can Filter
- Stochastic Approximation with Two Time Scales: The General Case
- Gradient flow structure for some nonlocal diffusion equations
- A Particle Algorithm for Mean-Field Variational Inference
- Slow and fast dynamics in measure functional differential equations with state-dependent delays through averaging principles and applications to extremum seeking
- Global Search of Optimal Spacecraft Trajectories using Amortization and Deep Generative Models
- Optimal longevity of a dynasty
- Empirical likelihood for Fr\'echet means on open books
- High-accuracy sampling from constrained spaces with the Metropolis-adjusted Preconditioned Langevin Algorithm
- Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity
- Optimality of the Right-Invariant Prior
- Shifted Composition III: Local Error Framework for KL Divergence
- Go With the Flow: Fast Diffusion for Gaussian Mixture Models
- Foxtsage vs. Adam: Revolution or Evolution in Optimization?
- Sch\"odinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders
- Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence
- Shifted Composition III: Local Error Framework for KL Divergence
- Latent Schr{\"o}dinger Bridge Diffusion Model for Generative Learning
- Transport Quasi-Monte Carlo
- Anisotropic Proximal Point Algorithm
- The Unreasonable Effectiveness of Guidance for Diffusion Models
- Posterior Mean Matching: Generative Modeling through Online Bayesian Inference
- Learning sparsity-promoting regularizers for linear inverse problems
- On cutoff via rigidity for high dimensional curved diffusions
- Covariance-modulated optimal transport and gradient flows
- Covariance-modulated optimal transport and gradient flows
- Langevin dynamics for high-dimensional optimization: the case of multi-spiked tensor PCA
- Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning
- Log-concavity in one-dimensional Coulomb gases and related ensembles
- Langevin dynamics for high-dimensional optimization: the case of multi-spiked tensor PCA
- Posterior Projection for Inference in Constrained Spaces
- Sinkhorn Algorithm for Sequentially Composed Optimal Transports
- Lyapunov Analysis For Monotonically Forward-Backward Accelerated Algorithms
- Go With the Flow: Fast Diffusion for Gaussian Mixture Models
- Time-Reversible Bridges of Data with Machine Learning
- Computing Your Ideal Haircut Routine
- Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
- jinns: a JAX Library for Physics-Informed Neural Networks
- Preconditioned Subspace Langevin Monte Carlo
- Diffusion map particle systems for generative modeling
- The impact of AI on engineering design procedures for dynamical systems
- Controllability and Tracking of Ensembles: An Optimal Transport Theory Viewpoint
- Transport maps as flows of control-affine systems
- Variational f-divergence Minimization
- Statistical analysis of the drying pattern of coffee
- Proposing and solving olympiad geometry with guided tree search
- Wasserstein Bounds for generative diffusion models with Gaussian tail targets
- Exploring Diffusion and Flow Matching Under Generator Matching
- State-Space Systems as Dynamic Generative Models
- Small-time asymptotics for hypoelliptic diffusions
- Hypocoercivity meets lifts
- Dynamical Reversibility and A New Theory of Causal Emergence
- Averaging principles for time-inhomogeneous multi-scale SDEs via nonautonomous Poisson equations
- Liquidity Pools as Mean Field Games: A New Framework
- Training Free Guided Flow Matching with Optimal Control
- Entropy-Regularized Optimal Transport in Information Design
- A semiconcavity approach to stability of entropic plans and exponential convergence of Sinkhorn's algorithm
- Controlling the asymptotic bias of the unadjusted (Microcanonical) Hamiltonian and Langevin Monte Carlo
- Large Deviations and Metastability Analysis for Heavy-Tailed Dynamical Systems
- Mathematical description of continuous time and space replicator-mutator equations for quadratic fitness landscapes
- Score Change of Variables
- Nonlinear Bayesian Filtering with Natural Gradient Gaussian Approximation
- Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity
- Phase-aware Training Schedule Simplifies Learning in Flow-Based Generative Models
- Statistical Convergence Rates of Optimal Transport Map Estimation between General Distributions
- Analysis and Synthesis Denoisers for Forward-Backward Plug-and-Play Algorithms
- A tutorial on automatic differentiation with complex numbers
- Parallel simulation for sampling under isoperimetry and score-based diffusion models
- Diffusing Differentiable Representations
- Geometric properties of disintegration of measures
- Ballistic Convergence in Hit-and-Run Monte Carlo and a Coordinate-free Randomized Kaczmarz Algorithm
- Mathematical analysis of singularities in the diffusion model under the submanifold assumption
- Paired Wasserstein Autoencoders for Conditional Sampling
- Acceleration by Random Stepsizes: Hedging, Equalization, and the Arcsine Stepsize Schedule
- Improved Sample Complexity Bounds for Diffusion Model Training
- Enhancing Sample Generation of Diffusion Models using Noise Level Correction
- Flow Matching Guide and Code
- Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks
- Energy-based, geometric, and compositional formulation of fluid and plasma models
- Local Curvature Smoothing with Stein's Identity for Efficient Score Matching
- 2-Rectifications are Enough for Straight Flows: A Theoretical Insight into Wasserstein Convergence
- Old Optimizer, New Norm: An Anthology
- EM Distillation for One-step Diffusion Models
- The Score-Difference Flow for Implicit Generative Modeling
- APOLLO: SGD-like Memory, AdamW-level Performance
- Optimal transport maps, majorization, and log-subharmonic measures
- Perfect sampling from rapidly mixing Markov chains
- Conditions for uniform in time convergence: applications to averaging, numerical discretisations and mean-field systems
- Strong convergence of the Euler scheme for singular kinetic SDEs driven by $\alpha$-stable processes
- A Complexity-Based Theory of Compositionality
- A Noise is Worth Diffusion Guidance
- Understanding Memorization in Generative Models via Sharpness in Probability Landscapes
- Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective
- Path-Guided Particle-based Sampling
- Non-asymptotic entropic bounds for non-linear kinetic Langevin sampler with second-order splitting scheme
- Causal transport on path space
- Sinkhorn Algorithm for Sequentially Composed Optimal Transports
- Schrodinger Bridge over Averaged Systems
- Denoising: A Powerful Building-Block for Imaging, Inverse Problems, and Machine Learning
- Flow matching for stochastic linear control systems
- Beyond Monte Carlo: Harnessing Diffusion Models to Simulate Financial Market Dynamics
- Foundations of algorithmic thermodynamics
- Nonequilbrium physics of generative diffusion models
- CBX: Python and Julia packages for consensus-based interacting particle methods
- Scaling Law for Language Models Training Considering Batch Size
- An overview of diffusion models for generative artificial intelligence
- Stochastic sewing lemma on Wasserstein space
- On the Surprising Effectiveness of Spectrum Clipping in Learning Stable Linear Dynamics
- Optimal Particle-based Approximation of Discrete Distributions (OPAD)
- A Pontryagin Perspective on Reinforcement Learning
- Bias-inducing geometries: an exactly solvable data model with fairness implications
- An Operator Splitting View of Federated Learning
- Tractable Agreement Protocols
- Towards a Mechanistic Explanation of Diffusion Model Generalization
- Isoperimetric inequalities in high-dimensional convex sets
- Markov Equivalence and Consistency in Differentiable Structure Learning
- Diffusion State-Guided Projected Gradient for Inverse Problems
- From memorization to generalization: a theoretical framework for diffusion-based generative models
- Exponential speed up in Monte Carlo sampling through Radial Updates
- Fast convolution algorithm for state space models
- Federated Automatic Differentiation
- Annealing Flow Generative Model Towards Sampling High-Dimensional and Multi-Modal Distributions
- On Statistical Rates of Conditional Diffusion Transformers: Approximation, Estimation and Minimax Optimality
- Conditional Variable Flow Matching: Transforming Conditional Densities with Amortized Conditional Optimal Transport
- Improving the Convergence Rates of Forward Gradient Descent with Repeated Sampling
- Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration
- Statistical algorithms for low-frequency diffusion data: A PDE approach
- Controllability and Vector Potential
- Anytime Acceleration of Gradient Descent
- Avoiding Deadlocks Is Not Enough: Analysis and Resolution of Blocked Airplanes
- Logarithmic Sobolev inequalities for generalised Cauchy measures
- Lipschitz constant estimation for general neural network architectures using control tools
- A Theoretical Survey on Foundation Models
- Connections between sequential Bayesian inference and evolutionary dynamics
- Cautious Optimizers: Improving Training with One Line of Code
- Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise
- Stability properties of gradient flow dynamics for the symmetric low-rank matrix factorization problem
- Why you don't overfit, and don't need Bayes if you only train for one epoch
- Geometry and analytic properties of the sliced Wasserstein space
- Linear convergence of proximal descent schemes on the Wasserstein space
- Massive Particle Systems, Wasserstein Brownian Motions, and the Dean--Kawasaki Equation
- In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies
- The stabilizing role of multiplicative noise in non-confining potentials
- Partition function approach to non-Gaussian likelihoods: information theory and state variables for Bayesian inference
- Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow
- Schr\"odinger Bridge Problem for Jump Diffusions
- Long time behavior of killed Feynman-Kac semigroups with singular Schr{\"o}dinger potentials
- Computational and Experimental Exploration of Protein Fitness Landscapes: Navigating Smooth and Rugged Terrains
- Sampling and Integration of Logconcave Functions by Algorithmic Diffusion
- Derivatives of Stochastic Gradient Descent in parametric optimization
- Wavelet s-Wasserstein distances for 0 < s <= 1
- Ergodicity of Langevin Dynamics and its Discretizations for Non-smooth Potentials
- Optimal transport maps, majorization, and log-subharmonic measures
- Anisotropic Gaussian Smoothing for Gradient-based Optimization
- Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence
- Constrained Diffusion with Trust Sampling
- 4+3 Phases of Compute-Optimal Neural Scaling Laws
- Spectral gap bounds for reversible hybrid Gibbs chains
- Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence
- From Optimization to Sampling via Lyapunov Potentials
- Understanding Learning with Sliced-Wasserstein Requires Rethinking Informative Slices
- Parallelly Tempered Generative Adversarial Networks
- Unbiased Approximations for Stationary Distributions of McKean-Vlasov SDEs
- Efficient inference for differential equation models without numerical solvers
- PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium
- Geometric optics approximation sampling
- Hamiltonian Monte Carlo for efficient Gaussian sampling: long and random steps
- Scaling Law for Post-training after Model Pruning
- The Unreasonable Effectiveness of Guidance for Diffusion Models
- Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations
- Self-interacting CBO: Existence, uniqueness, and long-time convergence
- Smooth transport map via diffusion process
- Learning efficient and provably convergent splitting methods
- Golden Noise for Diffusion Models: A Learning Framework
- Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples
- How to implement the Bayes' formula in the age of ML?
- On importance sampling and independent Metropolis-Hastings with an unbounded weight function
- Parameter Inference via Differentiable Diffusion Bridge Importance Sampling
- Weak Poincar\'e Inequalities, Simulated Annealing, and Sampling from Spherical Spin Glasses
- Harmonic Path Integral Diffusion
- Searching Latent Program Spaces
- Stochastic Optimization under Hidden Convexity
- Gravity from entropy
- Event-horizon-scale Imaging of M87* under Different Assumptions via Deep Generative Image Priors
- Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors
- Conditioning non-linear and infinite-dimensional diffusion processes
- Accelerating optimization over the space of probability measures
- Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood
- Scaling Law Hypothesis for Multimodal Model
- Diffusion Models With Learned Adaptive Noise
- Conditional simulation via entropic optimal transport: Toward non-parametric estimation of conditional Brenier maps
- Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules
- Convergence Rate Analysis of LION
- On theoretical guarantees and a blessing of dimensionality for nonconvex sampling
- Localized KBO with genetic dynamics for multi-modal optimization
- Score-based generative diffusion with "active" correlated noise sources
- The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing
- Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
- Diffusion Sampling Correction via Approximately 10 Parameters
- Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data
- Kaczmarz Kac Walk
- JKO for Landau: a variational particle method for homogeneous Landau equation
- Learnability of high-dimensional targets by two-parameter models and gradient flow
- Optimal Flow Matching: Learning Straight Trajectories in Just One Step
- Nonlinear Fokker--Planck--Kolmogorov equations as gradient flows on the space of probability measures
- Control of probability flow in Markov chain Monte Carlo -- Nonreversibility and lifting
- Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent
- Stochastic Optimization Using Ricci Flow
- Kinetic Theory of Stellar Systems: A Tutorial
- Latent Diffusion Model for Conditional Reservoir Facies Generation
- GENOT: Entropic (Gromov) Wasserstein Flow Matching with Applications to Single-Cell Genomics
- Scaling Law Hypothesis for Multimodal Model
- Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent
- Scaling Laws for Pre-training Agents and World Models
- Scaling Laws for Precision
- Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors
- Sampling metastable systems using collective variables and Jarzynski-Crooks paths
- Measure-to-measure interpolation using Transformers
- Constrained Sampling with Primal-Dual Langevin Monte Carlo
- Inclusive KL Minimization: A Wasserstein-Fisher-Rao Gradient Flow Perspective
- A Three-Operator Splitting Scheme Derived from Three-Block ADMM
- Risk-sensitive control as inference with R\'enyi divergence
- Bayesian scaling laws for in-context learning
- Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure
- Constrained Synthesis with Projected Diffusion Models
- Learning Controlled Stochastic Differential Equations
- Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity
- Conditional Latent Space Molecular Scaffold Optimization for Accelerated Molecular Design
- Bridge-IF: Learning Inverse Protein Folding with Markov Bridges
- Finite-time thermodynamics: A journey beginning with optimizing heat engines
- DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models
- Contraction Theory for Nonlinear Stability Analysis and Learning-based Control: A Tutorial Overview
- Pathway-Guided Optimization of Deep Generative Molecular Design Models for Cancer Therapy
- Exponential convergence rates for momentum stochastic gradient descent in the overparametrized setting
- Denoising Fisher Training For Neural Implicit Samplers
- Analysis of Primal-Dual Langevin Algorithms
- Scaling Laws with Hidden Structure
- Diffusion Bridge Implicit Models
- ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate
- How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion
- MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence
- A Framework for Bilevel Optimization on Riemannian Manifolds
- eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling
- Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity
- Statistical guarantees for denoising reflected diffusion models
- Scaling Laws with Hidden Structure
- Provable optimal transport with transformers: The essence of depth and prompt engineering
- Modern, Efficient, and Differentiable Transport Equation Models using JAX: Applications to Population Balance Equations
- Wasserstein Flow Matching: Generative modeling over families of distributions
- Constrained Diffusion Implicit Models
- Constant Acceleration Flow
- DiffusionPDE: Generative PDE-Solving Under Partial Observation
- Why do we regularise in every iteration for imaging inverse problems?
- Fast Samplers for Inverse Problems in Iterative Refinement Models
- Categorical Flow Matching on Statistical Manifolds
- Conditional Generative Models are Sufficient to Sample from Any Causal Effect Estimand
- Nesterov acceleration despite very noisy gradients
- Constrained Sampling with Primal-Dual Langevin Monte Carlo
- Inclusive KL Minimization: A Wasserstein-Fisher-Rao Gradient Flow Perspective
- A Geometric Framework for Understanding Memorization in Generative Models
- Micro-macro Parareal, from ODEs to SDEs and back again
- Why do we regularise in every iteration for imaging inverse problems?
- On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs)
- Global Convergence in Training Large-Scale Transformers
- Understanding Optimization in Deep Learning with Central Flows
- Plug-and-play superiorization
- Non-Euclidean Monotone Operator Theory and Applications
- Fisher Flow Matching for Generative Modeling over Discrete Data
- Understanding Optimization in Deep Learning with Central Flows
- Bridging Geometric States via Geometric Diffusion Bridge
- CaAdam: Improving Adam optimizer using connection aware methods
- Learning Lipschitz Operators with respect to Gaussian Measures with Near-Optimal Sample Complexity
- Scaling Laws in Linear Regression: Compute, Parameters, and Data
- The Road Less Scheduled
- Universality of the $\pi^2/6$ Pathway in Avoiding Model Collapse
- Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components
- Functional Gradient Flows for Constrained Sampling
- ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation
- Compositional imprecise probability
- Provable acceleration for diffusion models under minimal assumptions
- Consistency Diffusion Bridge Models
- Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion
- Adam with model exponential moving average is effective for nonconvex optimization
- Provable acceleration for diffusion models under minimal assumptions
- Life at low Reynolds number isn't such a drag
- Constrained Optimization with Compressed Gradients: A Dynamical Systems Perspective
- On the potential benefits of entropic regularization for smoothing Wasserstein estimators
- Averaging principle for multiscale controlled jump diffusions and associated nonlocal HJB equations
- The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
- Diffusion Approximations for Thompson Sampling
- A Mathematical Analysis of Neural Operator Behaviors
- Energy-Based Diffusion Language Models for Text Generation
- Evaluating the Posterior Sampling Ability of Plug&Play Diffusion Methods in Sparse-View CT
- Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control
- E(3)-invaraint diffusion model for pocket-aware peptide generation
- Sharp spectral gap of adaptive Langevin dynamics
- Rivers under Noise
- EconoJax: A Fast & Scalable Economic Simulation in Jax
- Constrained Optimization with Compressed Gradients: A Dynamical Systems Perspective
- Long time behaviour of generalised gradient flows via occupational measures
- High-order Moreau envelope beyond convexity: An inexact two-level smoothing framework
- The Distance Between the Perturbation of a Convex Function and its $\Gamma$-regularization
- On Linear Convergence of PI Consensus Algorithm under the Restricted Secant Inequality
- Gradient-adjusted underdamped Langevin dynamics for sampling
- A new class of splitting methods that preserve ergodicity and exponential integrability for stochastic Langevin equation
- Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control
- Evaluating the design space of diffusion-based generative models
- Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity
- Stochastic Flow Matching for Resolving Small-Scale Physics
- Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization
- Schr\"{o}dinger Bridge with Quadratic State Cost is Exactly Solvable
- Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
- Kernel Approximation of Fisher-Rao Gradient Flows
- Generator Matching: Generative modeling with arbitrary Markov processes
- Hamiltonian Score Matching and Generative Flows
- Understanding Adam Requires Better Rotation Dependent Assumptions
- Privacy without Noisy Gradients: Slicing Mechanism for Generative Model Training
- Provable optimal transport with transformers: The essence of depth and prompt engineering
- Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
- A twenty-first century statistical physics of life
- Optimizing Economic Markets through Monte Carlo Simulations and Magnetism-Inspired Modeling
- Accelerated optimization algorithms and ordinary differential equations: the convex non Euclidean case
- Fixed-Point Automatic Differentiation of Forward--Backward Splitting Algorithms for Partly Smooth Functions
- Optimization with First Order Algorithms
- Variational Schr\"odinger Diffusion Models
- Learned Reference-based Diffusion Sampling for multi-modal distributions
- Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
- Scaling Law with Learning Rate Annealing
- Notes on the Mathematical Structure of GPT LLM Architectures
- Jump Restore Light Transport
- Gradients of Functions of Large Matrices
- Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
- Structured Diffusion Models with Mixture of Gaussians as Prior Distribution
- Conditional diffusions for neural posterior estimation
- Deterministic Fokker-Planck Transport -- With Applications to Sampling, Variational Inference, Kernel Mean Embeddings & Sequential Monte Carlo
- Diffusion Bridge Implicit Models
- Can we spot a fake?
- Saddlepoint Monte Carlo and its Application to Exact Ecological Inference
- Stochastic gradient descent in high dimensions for multi-spiked tensor PCA
- GeoLoRA: Geometric integration for parameter efficient fine-tuning
- Fast constrained sampling in pre-trained diffusion models
- Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality
- Training Free Guided Flow Matching with Optimal Control
- Transport map unadjusted Langevin algorithms: learning and discretizing perturbed samplers
- Error estimates between SGD with momentum and underdamped Langevin diffusion
- Stable generative modeling using Schr\"odinger bridges
- One-Step Diffusion Distillation through Score Implicit Matching
- Theoretical Convergence Guarantees for Variational Autoencoders
- A Simple Model of Inference Scaling Laws
- Automatic Differentiation of Optimization Algorithms with Time-Varying Updates
- Transformers are Efficient Compilers, Provably
- Truncated Consistency Models
- NETS: A Non-Equilibrium Transport Sampler
- Antifragility of stochastic transport on networks with damage
- Provable Acceleration of Nesterov's Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networks
- Wasserstein Gradient Flow over Variational Parameter Space for Variational Inference
- Concentration of the Langevin Algorithm's Stationary Distribution
- LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
- On the Relation Between Linear Diffusion and Power Iteration
- Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence
- Beyond Discretization: Learning the Optimal Solution Path
- Diffusion-PINN Sampler
- Multi-marginal Schr\"odinger Bridges with Iterative Reference Refinement
- Emergence in non-neural models: grokking modular arithmetic via average gradient outer product
- Learning diffusion at lightspeed
- Plug-and-Play Posterior Sampling under Mismatched Measurement and Prior Models
- Matrix normal distribution and elliptic distribution
- Feedback Schr{\"o}dinger Bridge Matching
- Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating the Accuracy-Robustness Tradeoff
- Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers
- Heavy-Tailed Diffusion Models
- A Mirror Descent Perspective of Smoothed Sign Descent
- Faster Diffusion Sampling with Randomized Midpoints: Sequential and Parallel
- Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
- Improved Convergence Rate for Diffusion Probabilistic Models
- Geometric Trajectory Diffusion Models
- Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers
- Global Optimization Algorithm through High-Resolution Sampling
- Training Neural Samplers with Reverse Diffusive KL Divergence
- Expected Sliced Transport Plans
- Geometry and analytic properties of the sliced Wasserstein space
- Bayesian Experimental Design via Contrastive Diffusions
- Differentiable Programming for Computational Plasma Physics
- A Hitchhiker's Guide to Scaling Law Estimation
- Latent Schr{\"o}dinger Bridge Diffusion Model for Generative Learning
- Comparison Theorems for the Mixing Times of Systematic and Random Scan Dynamics
- Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
- Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion
- Improving Consistency Models with Generator-Induced Flows
- Improving Probabilistic Diffusion Models With Optimal Covariance Matching
- Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective
- Geometry, Computation, and Optimality in Stochastic Optimization
- Provable Acceleration of Nesterov's Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networks
- Variational Diffusion Posterior Sampling with Midpoint Guidance
- Fast Convergence of $\Phi$-Divergence Along the Unadjusted Langevin Algorithm and Proximal Sampler
- High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
- Inverse Problems and Data Assimilation: A Machine Learning Approach
- Provable Convergence and Limitations of Geometric Tempering for Langevin Dynamics
- DFM: Interpolant-free Dual Flow Matching
- Convergence of the Adapted Smoothed Empirical Measures
- Adapted Wasserstein distance between the laws of SDEs
- Matrix denoising: Bayes-optimal estimators via low-degree polynomials
- Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling
- Linear Convergence of Diffusion Models Under the Manifold Hypothesis
- Gradient-adjusted underdamped Langevin dynamics for sampling
- The velocity jump Langevin process and its splitting scheme: long time convergence and numerical accuracy
- Gradient-adjusted underdamped Langevin dynamics for sampling
- Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows
- Consistency Models Made Easy
- Flow matching achieves almost minimax optimal convergence
- Linear Convergence of Diffusion Models Under the Manifold Hypothesis
- Losing dimensions: Geometric memorization in generative diffusion
- Scaling Laws for Predicting Downstream Performance in LLMs
- Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation
- Stochastic Optimal Control Matching
- Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
- Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
- A Practical Guide to Sample-based Statistical Distances for Evaluating Generative Models in Science
- Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity
- Simple ReFlow: Improved Techniques for Fast Flow Models
- Linear combinations of Gaussian latents in generative models: interpolation and beyond
- Control, Transport and Sampling: Towards Better Loss Design
- Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation
- Talagrand's mathematical journey to the Abel Prize 2024
- Improving the Training of Rectified Flows
- Gradient correlation is needed to accelerate SGD with momentum
- Towards a Theoretical Understanding of Memorization in Diffusion Models
- Posterior Sampling via Autoregressive Generation
- Through the Looking Glass: Mirror Schr\"odinger Bridges
- Backward Map for Filter Stability Analysis
- Independent projections of diffusions: Gradient flows for variational inference and optimal mean field approximations
- Diffusion Density Estimators
- A noise-corrected Langevin algorithm and sampling by half-denoising
- Ensemble Kalman Methods: A Mean Field Perspective
- On Interactions for Large Scale Interacting Systems
- On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent
- Grokking at the Edge of Linear Separability
- To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
- Stochastic Optimal Control for Diffusion Bridges in Function Spaces
- Generative Marginalization Models
- Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models
- Generative Flows on Synthetic Pathway for Drug Design
- The Optimization Landscape of SGD Across the Feature Learning Strength
- A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD
- SGD with memory: fundamental properties and stochastic acceleration
- On the SAGA algorithm with decreasing step
- Interpretation of generalized Langevin equations
- Bregman Proximal Method for Efficient Communications under Similarity
- How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework
- Large Language Models as Markov Chains
- Randomized Runge-Kutta-Nystr\"om Methods for Unadjusted Hamiltonian and Kinetic Langevin Monte Carlo
- Randomized Runge-Kutta-Nystr\"om Methods for Unadjusted Hamiltonian and Kinetic Langevin Monte Carlo
- Diffusion Models are Evolutionary Algorithms
- Posterior sampling via Langevin dynamics based on generative priors
- NETS: A Non-Equilibrium Transport Sampler
- Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting
- Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis
- Stochastic Sampling from Deterministic Flow Models
- Score-based pullback Riemannian geometry
- Maximum Ideal Likelihood Estimator: An New Estimation and Inference Framework for Latent Variable Models
- Improving sampling by modifying the effective diffusion
- Thermodynamic Bayesian Inference
- Parametrized Families of Resolvent Compositions
- Edge-preserving noise for diffusion models
- Flow Matching for Accelerated Simulation of Atomic Transport in Materials
- Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
- Robust Guided Diffusion for Offline Black-Box Optimization
- Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention
- Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
- Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
- A Taxonomy of Loss Functions for Stochastic Optimal Control
- Preconditioning for Accelerated Gradient Descent Optimization and Regularization
- A Survey on Diffusion Models for Inverse Problems
- Entropy contraction of the Gibbs sampler under log-concavity
- Stochastic Inverse Problem: stability, regularization and Wasserstein gradient flow
- A Unified Stability Theory for Classical and Monotone Markov Chains
- Bicausal optimal transport for SDEs with irregular coefficients
- Generative AI for fast and accurate Statistical Computation of Fluids
- Entropy, concentration, and learning: a statistical mechanics primer
- Density of states in neural networks: an in-depth exploration of learning in parameter space
- Concave tents: a new tool for constructing concave reformulations of a large class of nonconvex optimization problems
- Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions
- $O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions
- Loop-Diffusion: an equivariant diffusion model for designing and scoring protein loops
- Gradient-free Decoder Inversion in Latent Diffusion Models
- Bayesian Matrix Decomposition and Applications
- When Is Inductive Inference Possible?
- Optimal Protocols for Continual Learning via Statistical Physics and Control Theory
- Proximal Estimation and Inference
- Schr\"odinger bridge based deep conditional generative learning
- Uniform log-Sobolev inequalities for mean field particles beyond flat-convexity
- Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization
- Forward Primal-Dual Half-Forward Algorithm for Splitting Four Operators
- Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections
- Acceleration Methods
- Protein Conformation Generation via Force-Guided SE(3) Diffusion Models
- Optimal longevity of a dynasty
- Robust Estimation under the Wasserstein Distance
- Bayesian computation with generative diffusion models by Multilevel Monte Carlo
- A theory of generalised coordinates for stochastic differential equations
- Examples of slow convergence for adaptive regularization optimization methods are not isolated
- Stochastic interpolants with data-dependent couplings
- The discrete analogue of the Gaussian
- Convergence rate of random scan Coordinate Ascent Variational Inference under log-concavity
- A Contract Theory for Layered Control Architectures
- What does guidance do? A fine-grained analysis in a simple setting
- State space models, emergence, and ergodicity: How many parameters are needed for stable predictions?
- Provable In-Context Learning of Linear Systems and Linear Elliptic PDEs with Transformers
- JKO for Landau: a variational particle method for homogeneous Landau equation
- Inverse Problems with Diffusion Models: A MAP Estimation Perspective
- From exponential to finite/fixed-time stability: Applications to optimization
- On the Statistical Complexity of Sample Amplification
- Differential Inversion of the Implicit Euler Method: Symbolic Analysis
- An Approximation Theory Framework for Measure-Transport Sampling Algorithms
- Denoising diffusion models for high-resolution microscopy image restoration
- A Fisher-Rao gradient flow for entropic mean-field min-max games
- From exponential to finite/fixed-time stability: Applications to optimization
- Proximal Gradient Dynamics: Monotonicity, Exponential Convergence, and Applications
- A Dynamical System View of Langevin-Based Non-Convex Sampling
- BM$^2$: Coupled Schr\"{o}dinger Bridge Matching
- Accuracy of the Ensemble Kalman Filter in the Near-Linear Setting
- Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models
- A Note on the Convergence of Denoising Diffusion Probabilistic Models
- Conditional sampling within generative diffusion models
- A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation and Blackwell's Theorem
- BM$^2$: Coupled Schr\"{o}dinger Bridge Matching
- Schr\"odinger Bridge Flow for Unpaired Data Translation
- Uniform-in-$N$ log-Sobolev inequality for the mean-field Langevin dynamics with convex energy
- Causal Tracking of Distributions in Wasserstein Space: A Model Predictive Control Scheme
- HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models
- Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models
- Optimal Low-dimensional Approximation of Transfer Operators via Flow Matching: Computation and Error Analysis
- Measure-Theoretic Time-Delay Embedding
- Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control
- Theoretical guarantees in KL for Diffusion Flow Matching
- Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent
- Revisiting Convergence of AdaGrad with Relaxed Assumptions
- Convergence Rate Bounds for the Mirror Descent Method: IQCs, Popov Criterion and Bregman Divergence
- Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control
- Tensor-train methods for sequential state and parameter learning in state-space models
- On the Concentration of the Minimizers of Empirical Risks
- Gaussian Interpolation Flows
- KL Convergence Guarantees for Score diffusion models under minimal data assumptions
- Localized Schr\"odinger Bridge Sampler
- Critically Damped Third-Order Langevin Dynamics
- Understanding Foundation Models: Are We Back in 1924?
- Projected gradient descent accumulates at Bouligand stationary points
- From optimal score matching to optimal sampling
- Proof mining and probability theory
- Convergence of Sinkhorn's Algorithm for Entropic Martingale Optimal Transport Problem
- A Short Information-Theoretic Analysis of Linear Auto-Regressive Learning
- Approximately Gaussian Replicator Flows: Nonconvex Optimization as a Nash-Convergent Evolutionary Game
- What happens to diffusion model likelihood when your model is conditional?
- Denoising: A Powerful Building-Block for Imaging, Inverse Problems, and Machine Learning
- Differentiable programming across the PDE and Machine Learning barrier
- A Unified Analysis of Saddle Flow Dynamics: Stability and Algorithm Design
- Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients
- Faster Sampling from Log-Concave Densities over Polytopes via Efficient Linear Solvers
- WarpAdam: A new Adam optimizer based on Meta-Learning approach
- Latent Space Energy-based Neural ODEs
- The Stochastic Proximal Distance Algorithm
- A Partition Function Estimator
- Relative-Translation Invariant Wasserstein Distance
- Guidance for twisted particle filter: a continuous-time perspective
- Generalized Continuous-Time Models for Nesterov's Accelerated Gradient Methods
- Subspace Diffusion Posterior Sampling for Travel-Time Tomography
- Probabilistic Decomposed Linear Dynamical Systems for Robust Discovery of Latent Neural Dynamics
- An Empirical Study of Scaling Laws for Transfer
- Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
- Non-Monotone Variational Inequalities
- Heat Death of Generative Models in Closed-Loop Learning
- Unlocking Global Optimality in Bilevel Optimization: A Pilot Study
- Force-Guided Bridge Matching for Full-Atom Time-Coarsened Dynamics of Peptides
- Statistical and Geometrical properties of regularized Kernel Kullback-Leibler divergence
- Local convergence rates for Wasserstein gradient flows and McKean-Vlasov equations with multiple stationary solutions
- Geometric Ergodicity and Wasserstein Continuity of Non-Linear Filters
- A Score-Based Density Formula, with Applications in Diffusion Generative Models
- A Tutorial on Brownian Motion for Biostatisticians
- Probabilistic Forecasting with Stochastic Interpolants and F\"ollmer Processes
- chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics
- Reconstructing dynamical systems as zero-noise limits
- Causal structure learning with momentum: Sampling distributions over Markov Equivalence Classes of DAGs
- On latent dynamics learning in nonlinear reduced order modeling
- Force-Guided Bridge Matching for Full-Atom Time-Coarsened Dynamics of Peptides
- Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
- An invitation to adaptive Markov chain Monte Carlo convergence theory
- Quantitative convergence of a discretization of dynamic optimal transport using the dual formulation
- Sub-Riemannian Geometry, Mixing, and the Holonomy of Optimal Mass Transport
- A higher-order Otto calculus approach to the Gaussian completely monotone conjecture
- Symmetry & Critical Points
- Geometric ergodicity of SGLD via reflection coupling
- Randomized Kaczmarz with geometrically smoothed momentum
- How Diffusion Models Learn to Factorize and Compose
- Symplectic Bregman divergences
- Asymptotics for Optimal Empirical Quantization of Measures
- A Geometric Perspective on Diffusion Models
- Posterior Sampling in High Dimension via Diffusion Processes
- Convergence of Unadjusted Langevin in High Dimensions: Delocalization of Bias
- Optimised Annealed Sequential Monte Carlo Samplers
- Geometrical structures of digital fluctuations in parameter space of neural networks trained with adaptive momentum optimization
- Policy-guided Monte Carlo on general state spaces: Application to glass-forming mixtures
- On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs)
- Stochastic Compositional Minimax Optimization with Provable Convergence Guarantees
- Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization
- Adaptive Stereographic MCMC
- Small Sample Behavior of Wasserstein Projections, Connections to Empirical Likelihood, and Other Applications
- Nonequilbrium physics of generative diffusion models
- Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models
- Recent Advances in Optimal Transport for Machine Learning
- Plug-in estimation of Schr\"odinger bridges
- A Markovian Model for Learning-to-Optimize
- Annealed Sinkhorn for Optimal Transport: convergence, regularization path and debiasing
- Learning Deep Dissipative Dynamics
- How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization
- Learning Multimodal Latent Space with EBM Prior and MCMC Inference
- Second-Order Forward-Mode Automatic Differentiation for Optimization
- Diffusion Model for Planning: A Systematic Literature Review
- Continuous Approximations of Projected Dynamical Systems via Control Barrier Functions
- A new perspective on the learning dynamics for a class of learning problems via averaged gradient systems coupled with diffusion-transmutation processes
- Sampling Foundational Transformer: A Theoretical Perspective
- Narrowing the Focus: Learned Optimizers for Pretrained Models
- Classifier-Free Guidance is a Predictor-Corrector
- Convergence in total variation for the kinetic Langevin algorithm
- Sharp $L^q$-Convergence Rate in $p$-Wasserstein Distance for Empirical Measures of Diffusion Processes
- Tractable Optimal Experimental Design using Transport Maps
- Accurate, scalable, and efficient Bayesian optimal experimental design with derivative-informed neural operators
- Quantized Distributed Nonconvex Optimization Algorithms with Linear Convergence under the Polyak--${\L}$ojasiewicz Condition
- Faster Adaptive Decentralized Learning Algorithms
- Learning to Optimally Stop a Diffusion Process
- Refining asymptotic complexity bounds for nonconvex optimization methods, including why steepest descent is $o(\epsilon^{-2})$ rather than $\mathcal{O}(\epsilon^{-2})$
- Explicit Convergence Rate of The Proximal Point Algorithm under R-Continuity
- On the convergence of adaptive approximations for stochastic differential equations
- Optimal transport natural gradient for statistical manifolds with continuous sample space
- AdamMCMC: Combining Metropolis Adjusted Langevin with Momentum-based Optimization
- Explore-then-Commit Algorithms for Decentralized Two-Sided Matching Markets
- Predictive performance of power posteriors
- Ensemble Kalman Methods: A Mean Field Perspective
- Rate of convergence of the smoothed empirical Wasserstein distance
- Understanding the Local Geometry of Generative Model Manifolds
- Mean-field limits for Consensus-Based Optimization and Sampling
- Two Completely Parameter-Free Alternating Gradient Projection Algorithms for Nonconvex-(strongly) Concave Minimax Problems
- Variational Analysis of Proximal Compositions and Integral Proximal Mixtures
- Error bounds, PL condition, and quadratic growth for weakly convex functions, and linear convergences of proximal point methods
- Blessing of Dimensionality for Approximating Sobolev Classes on Manifolds
- Exploring the generalizability of the optimal 0.234 acceptance rate in random-walk Metropolis and parallel tempering algorithms
- Quasi-Monte Carlo Beyond Hardy-Krause
- High-order Structure-preserving Methods for Damped Hamiltonian System
- Revisiting Inexact Fixed-Point Iterations for Min-Max Problems: Stochasticity and Structured Nonconvexity
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
- Convergence Properties of Score-Based Models for Linear Inverse Problems Using Graduated Optimisation
- Simplified Diffusion Schr\"odinger Bridge
- Rough stochastic differential equations
- Nonconvex Factorization and Manifold Formulations are Almost Equivalent in Low-rank Matrix Optimization
- A mathematical perspective on Transformers
- Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps
- The distribution of Bayes' ratio
- Statistically Optimal Uncertainty Quantification for Expensive Black-Box Models
- On the Convergence of a Federated Expectation-Maximization Algorithm
- Testing Elliptical Models in High Dimensions
- Moreau-Yoshida Variational Transport: A General Framework For Solving Regularized Distributional Optimization Problems
- Kernel Density Estimators in Large Dimensions
- Integral Resolvent and Proximal Mixtures
- The Foundations of Tokenization: Statistical and Computational Concerns
- Differentiable Annealed Importance Sampling Minimizes The Symmetrized Kullback-Leibler Divergence Between Initial and Target Distribution
- Searching for (sharp) thresholds in random structures: where are we now?
- Consistent expansion of the Langevin propagator with application to entropy production
- Geometric theory of (extended) time-reversal symmetries in stochastic processes -- Part I: finite dimension
- Connective Viewpoints of Signal-to-Noise Diffusion Models
- The Road Less Scheduled
- Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling
- Regularity Properties of Optimization-Based Controllers
- A Hessian-Aware Stochastic Differential Equation for Modelling SGD
- Weak convergence analysis in the particle limit of the McKean--Vlasov equations using stochastic flows of particle systems
- Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models
- Closed-loop Diffusion Control of Complex Physical Systems
- Randomized Transport Plans via Hierarchical Fully Probabilistic Design
- Locally Stationary Distributions: A Framework for Analyzing Slow-Mixing Markov Chains
- On the Low-Temperature MCMC threshold: the cases of sparse tensor PCA, sparse regression, and a geometric rule
- Uniform log-Sobolev inequalites for mean field particles with flat-convex energy
- Optimal Control of Underdamped Systems: An Analytic Approach
- Embedding generalization within the learning dynamics: An approach based-on sample path large deviation theory
- Kullback-Leibler-based characterizations of score-driven updates
- Molecular relaxation by reverse diffusion with time step prediction
- Learnability of Parameter-Bounded Bayes Nets
- Variational Flow Models: Flowing in Your Style
- On Probabilistic Embeddings in Optimal Dimension Reduction
- A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models
- Meta-Posterior Consistency for the Bayesian Inference of Metastable System
- Gradient flow in parameter space is equivalent to linear interpolation in output space
- Complexity of Minimizing Projected-Gradient-Dominated Functions with Stochastic First-order Oracles
- Using Linearized Optimal Transport to Predict the Evolution of Stochastic Particle Systems
- Autoencoders in Function Space
- Gradient-free optimization via integration
- Dilated convolution neural operator for multiscale partial differential equations
- On quantitative convergence for stochastic processes: Crossings, fluctuations and martingales
- The random timestep Euler method and its continuous dynamics
- Analysis of continuous data assimilation with large (or even infinite) nudging parameters
- Annealing approach to root-finding
- Potential Mean-Field Games and Gradient Flows
- Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
- Conditional Independence in Stationary Diffusions
- Bounding adapted Wasserstein metrics
- Total variation distance between SDEs with stable noise and Brownian motion with applications to Poisson PDEs
- Convergence rates for the Adam optimizer
- Entropy, Thermodynamics and the Geometrization of the Language Model
- Zigzag path connects two Monte Carlo samplers: Hamiltonian counterpart to a piecewise deterministic Markov process
- Importance Corrected Neural JKO Sampling
- Persistent Sampling: Unleashing the Potential of Sequential Monte Carlo
- Inverse Problems with Diffusion Models: A MAP Estimation Perspective
- Doubly nonlinear diffusive PDEs: new existence results via generalized Wasserstein gradient flows
- Introduction to Nonsmooth Analysis and Optimization
- Accelerated forward-backward and Douglas-Rachford splitting dynamics
- Unsupervised Training of Convex Regularizers using Maximum Likelihood Estimation
- An Approximation Theory Framework for Measure-Transport Sampling Algorithms
- Differentially Private Gradient Flow based on the Sliced Wasserstein Distance
- Piecewise deterministic generative models
- Perspectives on Contractivity in Control, Optimization, and Learning
- Time-Varying Convex Optimization: A Contraction and Equilibrium Tracking Approach
- Analysis of Gradient Descent with Varying Step Sizes using Integral Quadratic Constraints
- A Lyapunov Analysis of Accelerated PDHG Algorithms
- Score matching through the roof: linear, nonlinear, and latent variables causal discovery
- Ensemble Kalman inversion approximate Bayesian computation
- Variational Inference via Smoothed Particle Hydrodynamics
- Log-Concave Coupling for Sampling Neural Net Posteriors
- Mathematical theory of deep learning
- Statistical optimal transport
- Fast convergence of the Expectation Maximization algorithm under a logarithmic Sobolev inequality
- A New Scalar Auxiliary Variable Approach for Gradient Flows
- A unified law of robustness for Bregman divergence losses
- Amortized Posterior Sampling with Diffusion Prior Distillation
- Strong solution of stochastic differential equations with discontinuous and unbounded coefficients
- The Anytime Convergence of Stochastic Gradient Descent with Momentum: From a Continuous-Time Perspective
- Efficient Convex Optimization Requires Superlinear Memory
- A particle consensus approach to solving nonconvex-nonconcave min-max problems
- Algorithm-independent bounds on complex optimization through the statistics of marginal optima
- Provable Benefit of Annealed Langevin Monte Carlo for Non-log-concave Sampling
- The Elements of Differentiable Programming
- Explicit convergence rates of underdamped Langevin dynamics under weighted and weak Poincar\'e--Lions inequalities
- Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions
- Fisher-Rao Gradient Flow: Geodesic Convexity and Functional Inequalities
- Diffusion Models as Optimizers for Efficient Planning in Offline RL
- Score matching for bridges without time-reversals
- Convergence of Empirical Optimal Transport in Unbounded Settings
- A finite-dimensional approximation for partial differential equations on Wasserstein space
- Existence of stationary measures for partially damped SDEs with generic, Euler-type nonlinearities
- A new numerical scheme for It\^o stochastic differential equations based on Wick-type Wong-Zakai arguments
- Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
- Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings
- EM++: A parameter learning framework for stochastic switching systems
- Optimal Design of Resolvent Splitting Algorithms
- Discrete time crystals in the presence of non-Markovian dynamics
- Fast second-order dynamics with slow vanishing damping approaching the zeros of a monotone and continuous operator
- Metric extrapolation in the Wasserstein space
- Discrete Flow Matching
- Score matching for bridges without time-reversals
- Convergence of Sinkhorn's Algorithm for Entropic Martingale Optimal Transport Problem
- Stochastic Monotone Inclusion with Closed Loop Distributions
- Improved motif-scaffolding with SE(3) flow matching
- Conditional Generative Models are Provably Robust: Pointwise Guarantees for Bayesian Inverse Problems
- Learning Firmly Nonexpansive Operators
- An electrical engineering perspective on naturality in computational physics
- Constrained Approximate Optimal Transport Maps
- Sampling from mixture distributions based on regime-switching diffusions
- Importance Weighted Expectation-Maximization for Protein Sequence Design
- A Methodology Establishing Linear Convergence of Adaptive Gradient Methods under PL Inequality
- Kinetic based optimization enhanced by genetic dynamics
- Combining Wasserstein-1 and Wasserstein-2 proximals: robust manifold learning via well-posed generative flows
- A geometric integration approach to smooth optimisation: Foundations of the discrete gradient method
- Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
- An exactly solvable model for emergence and scaling laws
- Generative Modeling by Minimizing the Wasserstein-2 Loss
- Construction of the Kolmogorov-Arnold representation using the Newton-Kaczmarz method
- The Anytime Convergence of Stochastic Gradient Descent with Momentum: From a Continuous-Time Perspective
- Metric extrapolation in the Wasserstein space
- Structure preserving schemes for a class of Wasserstein gradient flows
- Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations
- Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models
- Convergence in total variation for the kinetic Langevin algorithm
- What's the score? Automated Denoising Score Matching for Nonlinear Diffusions
- New algorithms for sampling and diffusion models
- How to beat a Bayesian adversary
- Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations
- Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows
- Sliced Wasserstein Geodesics and Equivalence Wasserstein and Sliced Wasserstein metrics
- Analysis of Langevin Monte Carlo from Poincar\'e to Log-Sobolev
- Dynamical Measure Transport and Neural PDE Solvers for Sampling
- Accelerated Fully First-Order Methods for Bilevel and Minimax Optimization
- Gaussian Interpolation Flows
- Convergence of the Chambolle-Pock Algorithm in the Absence of Monotonicity
- Learning Diffusion Priors from Observations by Expectation Maximization
- UDPM: Upsampling Diffusion Probabilistic Models
- LPGD: A General Framework for Backpropagation through Embedded Optimization Layers
- Scaling Exponents Across Parameterizations and Optimizers
- Finite Time Explosion of Stochastic Differential Equations: A survey into Khasminskii's Lyapunov Method and its Consistency with the Osgood Criterion
- Einstein from Noise: Statistical Analysis
- pythOS: A Python library for solving IVPs by operator splitting
- Low-rank approximated Kalman filter using Oja's principal component flow for discrete-time linear systems
- A general framework for inexact splitting algorithms with relative errors and applications to Chambolle-Pock and Davis-Yin methods
- Alice's Adventures in a Differentiable Wonderland -- Volume I, A Tour of the Land
- Adaptive proximal gradient methods are universal without approximation
- Proximity Operators of Perspective Functions with Nonlinear Scaling
- Langevin Dynamics: A Unified Perspective on Optimization via Lyapunov Potentials
- Convergence of flow-based generative models via proximal gradient descent in Wasserstein space
- Weak Convergence Of Tamed Exponential Integrators for Stochastic Differential Equations
- Lower bounds on the rate of convergence for accept-reject-based Markov chains in Wasserstein and total variation distances
- Learning to Control Unknown Strongly Monotone Games
- Stochastic Optimal Control Matching
- Adam-mini: Use Fewer Learning Rates To Gain More
- An exit contract optimization problem
- Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models
- Sharper Exponential Convergence Rates for Sinkhorn's Algorithm in Continuous Settings
- Unified Control Framework for Optimization: A Fresh Perspective on Constrained Optimization, Optimization-based Control, and Parameter Estimation
- Particle Semi-Implicit Variational Inference
- Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
- Posterior Sampling with Denoising Oracles via Tilted Transport
- Quantitative relative entropy estimates for interacting particle systems with common noise
- Sampling from the Continuous Random Energy Model in Total Variation Distance
- Markov-bridge representation of ergodic large-deviation principles
- Importance Weighted Expectation-Maximization for Protein Sequence Design
- Discretized Gradient Flow for Manifold Learning in the Space of Embeddings
- Revisiting Kinetic Monte Carlo Algorithms for Time-dependent Processes: from open-loop control to feedback control
- To smooth a cloud or to pin it down: Guarantees and Insights on Score Matching in Denoising Diffusion Models
- JAXbind: Bind any function to JAX
- The Price of Adaptivity in Stochastic Convex Optimization
- Fast Sampling via Discrete Non-Markov Diffusion Models
- Characterization of optimization problems that are solvable iteratively with linear convergence
- Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows
- Analysis of learning a flow-based generative model from limited sample complexity
- Can independent Metropolis beat crude Monte Carlo?
- Low-rank approximated Kalman-Bucy filters using Oja's principal component flow for linear time-invariant systems
- Exact worst-case convergence rates of gradient descent: a complete analysis for all constant stepsizes over nonconvex and convex functions
- Logarithmic and power-law entropies from convexity
- A New Perspective on Shampoo's Preconditioner
- Step-by-Step Diffusion: An Elementary Tutorial
- Sampling Strategies in Bayesian Inversion: A Study of RTO and Langevin Methods
- Automating Variational Differentiation
- Scaling and renormalization in high-dimensional regression
- A Dynamical Model of Neural Scaling Laws
- Why Transformers Need Adam: A Hessian Perspective
- Positive concave deep equilibrium models
- Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial
- Generative Fractional Diffusion Models
- How to train your VAE
- Concentration Inequalities for $(f,\Gamma)$-GANs
- Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows
- Probabilistic Programming with Programmable Variational Inference
- Large Sample Theory for Bures-Wasserstein Barycentres
- Provable Adaptivity of Adam under Non-uniform Smoothness
- Jacobian Descent for Multi-Objective Optimization
- A four-operator splitting algorithm for nonconvex and nonsmooth optimization
- Long-time asymptotics of noisy SVGD outside the population limit
- Strong equivalence between metrics of Wasserstein type
- Forward-Backward algorithms for weakly convex problems
- Fast sampling from constrained spaces using the Metropolis-adjusted Mirror Langevin algorithm
- A Sherman--Morrison--Woodbury approach to solving least squares problems with low-rank updates
- Integral Probability Metrics on submanifolds: interpolation inequalities and optimal inference
- Computing the invariant distribution of McKean-Vlasov SDEs by ergodic simulation
- Open Problem: Anytime Convergence Rate of Gradient Descent
- The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability
- Variational Schr\"odinger Diffusion Models
- A Practical Diffusion Path for Sampling
- Exponential time differencing for matrix-valued dynamical systems
- Hitchhiker's guide on Energy-Based Models: a comprehensive review on the relation with other generative models, sampling and statistical physics
- Quantitative contraction rates for Sinkhorn algorithm: beyond bounded costs and compact marginals
- Gradient Estimation via Differentiable Metropolis-Hastings
- A variational perspective on the dissipative Hamiltonian structure of the Vlasov-Fokker-Planck equation
- Sharp detection of low-dimensional structure in probability measures via dimensional logarithmic Sobolev inequalities
- Using Autodiff to Estimate Posterior Moments, Marginals and Samples
- To smooth a cloud or to pin it down: Guarantees and Insights on Score Matching in Denoising Diffusion Models
- Efficient algorithms for implementing incremental proximal-point methods
- Mitigating Information Asymmetry in Two-Stage Contracts with Non-Myopic Agents
- On the Convergence of T\^atonnement for Linear Fisher Markets
- Evaluating the design space of diffusion-based generative models
- Sampling metastable systems using collective variables and Jarzynski-Crooks paths
- Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual Optimisation
- A Tutorial on the Non-Asymptotic Theory of System Identification
- Dynamical Low-Rank Approximation for Stochastic Differential Equations
- Auto-Encoding Bayesian Inverse Games
- Stochastic Neural Network Symmetrisation in Markov Categories
- The Implicit Bias of Adam on Separable Data
- Diffusion Model With Optimal Covariance Matching
- Sampling and estimation on manifolds using the Langevin diffusion
- Iterated Schr\"odinger bridge approximation to Wasserstein Gradient Flows
- Bayesian Conditioned Diffusion Models for Inverse Problems
- Sailing in high-dimensional spaces: Low-dimensional embeddings through angle preservation
- Differentiable Programming for Differential Equations: A Review
- An excursion onto Schr\"odinger's bridges: Stochastic flows with spatio-temporal marginals
- New algorithms for sampling and diffusion models
- Flora: Low-Rank Adapters Are Secretly Gradient Compressors
- Rethinking Score Distillation as a Bridge Between Image Distributions
- What is the long-run distribution of stochastic gradient descent? A large deviations analysis
- Operator-informed score matching for Markov diffusion models
- Probabilistic ODE Solvers for Integration Error-Aware Numerical Optimal Control
- Mirror and Preconditioned Gradient Descent in Wasserstein Space
- From Biased to Unbiased Dynamics: An Infinitesimal Generator Approach
- The Monge-Kantorovich problem on Wasserstein space
- Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors
- Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
- Optimal score estimation via empirical Bayes smoothing
- Scaling Laws in Linear Regression: Compute, Parameters, and Data
- Copy-composition for probabilistic graphical models
- Can Transformers Learn Optimal Filtering for Unknown Systems?
- Approximation properties relative to continuous scale space for hybrid discretizations of Gaussian derivative operators
- The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability
- Convergence rate of random scan Coordinate Ascent Variational Inference under log-concavity
- Mean-field Chaos Diffusion Models
- On Differential and Riemannian Calculus on Wasserstein Spaces
- Economic DAO Governance: A Contestable Control Approach
- Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
- Improved Performance of Stochastic Gradients with Gaussian Smoothing
- Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction
- Improving Antibody Design with Force-Guided Sampling in Diffusion Models
- Numerically robust square root implementations of statistical linear regression filters and smoothers
- Combinatorial Complex Score-based Diffusion Modelling through Stochastic Differential Equations
- Crafting Heavy-Tails in Weight Matrix Spectrum without Gradient Noise
- Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed
- Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors
- Generalised Diffusion Probabilistic Scale-Spaces
- Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution
- Discrete error dynamics of mini-batch gradient descent for least squares regression
- On Limitation of Transformer for Learning HMMs
- Slicing Mutual Information Generalization Bounds for Neural Networks
- Solving Inverse Problems in Protein Space Using Diffusion-Based Priors
- Entropy annealing for policy mirror descent in continuous time and space
- Symplectic Methods in Deep Learning
- Policy Optimization in Control: Geometry and Algorithmic Implications
- Simulating infinite-dimensional nonlinear diffusion bridges
- Benign Nonconvex Landscapes in Optimal and Robust Control, Part II: Extended Convex Lifting
- Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design
- Identifying latent state transition in non-linear dynamical systems
- Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds
- Sampling in Unit Time with Kernel Fisher-Rao Flow
- Grokfast: Accelerated Grokking by Amplifying Slow Gradients
- The Illusion of State in State-Space Models
- Event-horizon-scale Imaging of M87* under Different Assumptions via Deep Generative Image Priors
- Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching
- Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad
- Randomized Kaczmarz with geometrically smoothed momentum
- Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence Rates
- Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
- Sifting through the Noise: A Survey of Diffusion Probabilistic Models and Their Applications to Biomolecules
- Generative Conditional Distributions by Neural (Entropic) Optimal Transport
- An excursion onto Schr\"odinger's bridges: Stochastic flows with spatio-temporal marginals
- Consistency Model is an Effective Posterior Sample Approximation for Diffusion Inverse Solvers
- Variational Schr\"odinger Diffusion Models
- Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
- Interpreting and Improving Diffusion Models from an Optimization Perspective
- Learning to Solve Multiresolution Matrix Factorization by Manifold Optimization and Evolutionary Metaheuristics
- Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport
- Non-geodesically-convex optimization in the Wasserstein space
- Schr\"{o}dinger Bridge with Quadratic State Cost is Exactly Solvable
- Loss Symmetry and Noise Equilibrium of Stochastic Gradient Descent
- Interpreting and Improving Diffusion Models from an Optimization Perspective
- Gradient descent in matrix factorization: Understanding large initialization
- Weak convergence of adaptive Markov chain Monte Carlo
- Dirichlet Flow Matching with Applications to DNA Sequence Design
- Variance reduction techniques for stochastic proximal point algorithms
- Stochastic Online Fisher Markets: Static Pricing Limits and Adaptive Enhancements
- Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate
- Flow matching achieves minimax optimal convergence
- An excursion onto Schr\"odinger's bridges: Stochastic flows with spatio-temporal marginals
- Representing Molecules as Random Walks Over Interpretable Grammars
- Convergence Bounds for Sequential Monte Carlo on Multimodal Distributions using Soft Decomposition
- Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors
- Sampling metastable systems using collective variables and Jarzynski-Crooks paths
- Cascade of phase transitions in the training of Energy-based models
- Fast Samplers for Inverse Problems in Iterative Refinement Models
- Transition Path Sampling with Boltzmann Generator-based MCMC Moves
- Stochastic Localization via Iterative Posterior Sampling
- Learning Latent Space Hierarchical EBM Diffusion Models
- Role of Momentum in Smoothing Objective Function and Generalizability of Deep Neural Networks
- Diffusion Rejection Sampling
- Critical windows: non-asymptotic theory for feature emergence in diffusion models
- DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models
- Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
- A unified law of robustness for Bregman divergence losses
- Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective
- Simplified Diffusion Schr\"odinger Bridge
- Interaction-Force Transport Gradient Flows
- A Fisher-Rao gradient flow for entropic mean-field min-max games
- Gaussian Measures Conditioned on Nonlinear Observations: Consistency, MAP Estimators, and Simulation
- Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers
- Wasserstein Gradient Flow over Variational Parameter Space for Variational Inference
- Diffusive Gibbs Sampling
- Why is parameter averaging beneficial in SGD? An objective smoothing perspective
- Gradients of Functions of Large Matrices
- Categorical Flow Matching on Statistical Manifolds
- Diffusion Bridge Implicit Models
- A Separation in Heavy-Tailed Sampling: Gaussian vs. Stable Oracles for Proximal Samplers
- Faster Sampling via Stochastic Gradient Proximal Sampler
- A First Course in Monte Carlo Methods
- A Differential Equation Approach for Wasserstein GANs and Beyond
- Neural Fluidic System Design and Control with Differentiable Simulation
- Causal de Finetti: On the Identification of Invariant Causal Structure in Exchangeable Data
- Score-based generative models are provably robust: an uncertainty quantification perspective
- Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks
- Learning to Discretize Denoising Diffusion ODEs
- Log-Concave Sampling on Compact Supports: A Versatile Proximal Framework
- Metrizing Fairness
- Learning Latent Space Hierarchical EBM Diffusion Models
- Learning Diffusion Priors from Observations by Expectation Maximization
- Generalized Laplace Approximation
- Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target Distribution
- Generalised Bayes Linear Inference
- Parameterized Wasserstein Gradient Flow
- Adaptive tempering schedules with approximative intermediate measures for filtering problems
- Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target Distribution
- Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows
- Control, Transport and Sampling: Towards Better Loss Design
- Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models
- Fisher Flow Matching for Generative Modeling over Discrete Data
- Adversarial Schr\"odinger Bridge Matching
- ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models
- Learning Diffusion Priors from Observations by Expectation Maximization
- Higher-order propagation of chaos in $L^2$ for interacting diffusions
- Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors
- Gaussian Measures Conditioned on Nonlinear Observations: Consistency, MAP Estimators, and Simulation
- Can Large Language Models Understand Molecules?
- Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformers
- Learning the Infinitesimal Generator of Stochastic Diffusion Processes
- Keep the Momentum: Conservation Laws beyond Euclidean Gradient Flows
- FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information
- A description based on optimal transport for a class of stochastic McKean-Vlasov control problems
- Stochastic optimal transport and Hamilton-Jacobi-Bellman equations on the set of probability measures
- TimeGPT-1
- One-step data-driven generative model via Schr\"odinger Bridge
- Entropy Production by Underdamped Langevin Dynamics
- On the Trajectory Regularity of ODE-based Diffusion Sampling
- Majorization-minimization Bregman proximal gradient algorithms for nonnegative matrix factorization with the Kullback--Leibler divergence
- Can a Transformer Represent a Kalman Filter?
- Improving Diffusion Models for Inverse Problems using Manifold Constraints
- Diffusion Posterior Sampling for General Noisy Inverse Problems
- Nonequilbrium physics of generative diffusion models
- Memory corrections to Markovian Langevin dynamics
- Discrete approximations of Gaussian smoothing and Gaussian derivatives
- Transport based particle methods for the Fokker-Planck-Landau equation
- Convergence of flow-based generative models via proximal gradient descent in Wasserstein space
- Convergence of kinetic Langevin samplers for non-convex potentials
- An Exact Theory of Causal Emergence for Linear Stochastic Iteration Systems
- Computation-Aware Kalman Filtering and Smoothing
- Compositional imprecise probability
- Gradient Estimation and Variance Reduction in Stochastic and Deterministic Models
- On the convergence of adaptive approximations for stochastic differential equations
- An interacting particle consensus method for constrained global optimization
- Proximal Langevin Sampling With Inexact Proximal Mapping
- Stochastic Langevin Differential Inclusions with Applications to Machine Learning
- Bayesian sampling using interacting particles
- Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems
- Local convergence rates for Wasserstein gradient flows and McKean-Vlasov equations with multiple stationary solutions
- Second order quantitative bounds for unadjusted generalized Hamiltonian Monte Carlo
- Statistical Error of Numerical Integrators for Underdamped Langevin Dynamics with Deterministic And Stochastic Gradients
- Liouville Flow Importance Sampler
- Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds
- Long-Time Asymptotics of the Sliced-Wasserstein Flow
- Asymptotic Normality of $U$-Statistics is Equivalent to Convergence in the Wasserstein Distance
- Optimal schedules for annealing algorithms
- Regularized Stein Variational Gradient Flow
- On foundation of generative statistics with F-entropy: a gradient-based approach
- GD doesn't make the cut: Three ways that non-differentiability affects neural network training
- Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure
- Diffusion Models as Stochastic Quantization in Lattice Field Theory
- Regularized Stein Variational Gradient Flow
- Learning Structural Causal Models through Deep Generative Models: Methods, Guarantees, and Challenges
- The Riemannian geometry of Sinkhorn divergences
- Wasserstein Proximal Coordinate Gradient Algorithms
- Navigating Chemical Space with Latent Flows
- Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo
- Variational Schr\"odinger Diffusion Models
- Projective splitting with backward, half-forward and proximal-Newton steps
- Revisiting Kinetic Monte Carlo Algorithms for Time-dependent Processes: from open-loop control to feedback control
- Splitting Methods for differential equations
- Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
- Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond
- Score-based Generative Priors Guided Model-driven Network for MRI Reconstruction
- Towards a theory of model distillation
- Verlet Flows: Exact-Likelihood Integrators for Flow-Based Generative Models
- Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models
- Convergence of the Preconditioned Proximal Point Method and Douglas-Rachford Splitting in the Absence of Monotonicity
- A Symplectic Analysis of Alternating Mirror Descent
- Finite Sample Analysis and Bounds of Generalization Error of Gradient Descent in In-Context Linear Regression
- An interacting particle consensus method for constrained global optimization
- F$^3$low: Frame-to-Frame Coarse-grained Molecular Dynamics with SE(3) Guided Flow Matching
- Blue noise for diffusion models
- Diffusive Gibbs Sampling
- Strong convergence of the exponential Euler scheme for SDEs with superlinear growth coefficients and one-sided Lipschitz drift
- Backward Map for Filter Stability Analysis
- The Inverse of Exact Renormalization Group Flows as Statistical Inference
- U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models
- Gaussianity and the Kalman Filter: A Simple Yet Complicated Relationship
- Optimizing the diffusion coefficient of overdamped Langevin dynamics
- Data-driven approximation of Koopman operators and generators: Convergence rates and error bounds
- Stochastic Distinguishability of Markovian Trajectories
- Blurring Diffusion Models
- A variational approach to sampling in diffusion processes
- Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?
- A Unified Theory of Exact Inference and Learning in Exponential Family Latent Variable Models
- More Compute Is What You Need
- Towards a Systems Theory of Algorithms
- On the stability of the invariant probability measures of McKean-Vlasov equations
- Fisher Information Improved Training-Free Conditional Diffusion Model
- MinBackProp -- Backpropagating through Minimal Solvers
- Explaining Neural Scaling Laws
- Learning general Gaussian mixtures with efficient score matching
- Convergence Analysis of Flow Matching in Latent Space with Transformers
- Learning Mixtures of Gaussians Using Diffusion Models
- BiLO: Bilevel Local Operator Learning for PDE inverse problems
- Swarm-based gradient descent meets simulated annealing
- Bayesian sampling using interacting particles
- Aligned Diffusion Schr\"odinger Bridges
- Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo
- An exactly solvable model for emergence and scaling laws
- Beyond Linear Response: Equivalence between Thermodynamic Geometry and Optimal Transport
- Differentiating Through Linear Solvers
- Conditional Variational Diffusion Models
- Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo
- Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
- Heat flow, log-concavity, and Lipschitz transport maps
- Gradient Guidance for Diffusion Models: An Optimization Perspective
- Calculus rules for proximal {\epsilon}-subdifferentials and inexact proximity operators for weakly convex functions
- Generalized Score Matching
- Convergence Analyses of Davis-Yin Splitting via Scaled Relative Graphs
- Accelerating the Generation of Molecular Conformations with Progressive Distillation of Equivariant Latent Diffusion Models
- A General Continuous-Time Formulation of Stochastic ADMM and Its Variants
- Plug-and-Play Algorithm Convergence Analysis From The Standpoint of Stochastic Differential Equation
- Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
- Noise Stability Optimization for Flat Minima with Tight Rates
- Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling
- Optimizing the diffusion of overdamped Langevin dynamics
- Generalized Schr\"odinger Bridge Matching
- Stochastic Optimal Control Matching
- Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case
- Extending Mean-Field Variational Inference via Entropic Regularization: Theory and Computation
- A Minkowski space embedding to understand Markov models dynamics
- Solving Inverse Obstacle Scattering Problem with Latent Surface Representations
- EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence
- Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data
- Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance
- Unbiased Image Synthesis via Manifold Guidance in Diffusion Models
- 3D Gaussian Splatting as Markov Chain Monte Carlo
- Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives
- Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling
- Extending Mean-Field Variational Inference via Entropic Regularization: Theory and Computation
- Convergence Analysis of Probability Flow ODE for Score-based Generative Models
- Energetic Variational Neural Network Discretizations of Gradient Flows
- The Curse of Recursion: Training on Generated Data Makes Models Forget
- The Illusion of State in State-Space Models
- State-Space Systems as Dynamic Generative Models
- Convergence of coordinate ascent variational inference for log-concave measures via optimal transport
- Generalization in diffusion models arises from geometry-adaptive harmonic representations
- Beyond Bayesian Model Averaging over Paths in Probabilistic Programs with Stochastic Support
- Optimal Universal Quantum Encoding for Statistical Inference
- Score Matching for Truncated Density Estimation on a Manifold
- Accelerated Objective Gap and Gradient Norm Convergence for Gradient Descent via Long Steps
- Finite State Mean Field Games with Common Shocks
- Spurious Stationarity and Hardness Results for Mirror Descent
- SE(3)-Stochastic Flow Matching for Protein Backbone Generation
- Diffusion posterior sampling for simulation-based inference in tall data settings
- Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon
- An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
- Deep Generative Sampling in the Dual Divergence Space: A Data-efficient & Interpretative Approach for Generative AI
- Efficient Denoising using Score Embedding in Score-based Diffusion Models
- From latent dynamics to meaningful representations
- FiP: a Fixed-Point Approach for Causal Generative Modeling
- Variational Stochastic Gradient Descent for Deep Neural Networks
- Optimal Transport Divergences induced by Scoring Functions
- Global $\mathcal{L}^2$ minimization at uniform exponential rate via geometrically adapted gradient descent in Deep Learning
- Optimization methods for solving matrix equations
- Linear convergence of forward-backward accelerated algorithms without knowledge of the modulus of strong convexity
- An Edit Friendly DDPM Noise Space: Inversion and Manipulations
- High Noise Scheduling is a Must
- Sharp Propagation of Chaos for the Ensemble Langevin Sampler
- Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
- Statistical Inference of Optimal Allocations I: Regularities and their Implications
- Implicit Bias of AdamW: $\ell_\infty$ Norm Constrained Optimization
- Unsupervised Training of Convex Regularizers using Maximum Likelihood Estimation
- Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size
- Randomized matrix computations: Themes and variations
- The convergence of the EM scheme in empirical approximation of invariant probability measure for McKean-Vlasov SDEs
- Generative downscaling of PDE solvers with physics-guided diffusion models
- Compositional Estimation of Lipschitz Constants for Deep Neural Networks
- How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse
- Heat Death of Generative Models in Closed-Loop Learning
- The Underlying Scaling Laws and Universal Statistical Structure of Complex Datasets
- Poisson Equations with locally-Lipschitz coefficients and Uniform in Time Averaging for Stochastic Differential Equations via Strong Exponential Stability
- Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
- Gaussian-Smoothed Sliced Probability Divergences
- Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance
- A Theoretical and Empirical Study on the Convergence of Adam with an "Exact" Constant Step Size in Non-Convex Settings
- Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections
- Convergence Analysis of Flow Matching in Latent Space with Transformers
- Proximal Oracles for Optimization and Sampling
- Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
- Large Language Models for Mathematicians
- Model reduction on manifolds: A differential geometric framework
- Towards a turnkey approach to unbiased Monte Carlo estimation of smooth functions of expectations
- Protocols for Observational Studies: Methods and Open Problems
- Variational Flow Models: Flowing in Your Style
- Deep Equilibrium Diffusion Restoration with Parallel Sampling
- Scalable Diffusion Models with State Space Backbone
- What's in a Prior? Learned Proximal Networks for Inverse Problems
- Taming the Interactive Particle Langevin Algorithm -- the superlinear case
- Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence Rates
- SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
- Distilling ODE Solvers of Diffusion Models into Smaller Steps
- Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching
- Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later
- Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
- State Space Models as Foundation Models: A Control Theoretic Overview
- Convergence Analysis of Stochastic Gradient Descent with MCMC Estimators
- Fast ODE-based Sampling for Diffusion Models in Around 5 Steps
- Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey
- Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity
- Optimal First-Order Algorithms as a Function of Inequalities
- Considerations in the use of ML interaction potentials for free energy calculations
- The Elements of Differentiable Programming
- Physics-Informed Diffusion Models
- Convergence of Empirical Optimal Transport in Unbounded Settings
- Posterior concentrations of fully-connected Bayesian neural networks with general priors on the weights
- Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel
- Neural Wasserstein Gradient Flows for Maximum Mean Discrepancies with Riesz Kernels
- Analysing heavy-tail properties of Stochastic Gradient Descent by means of Stochastic Recurrence Equations
- CBX: Python and Julia packages for consensus-based interacting particle methods
- Energy diminishing implicit-explicit Runge--Kutta methods for gradient flows
- Optimal Flow Matching: Learning Straight Trajectories in Just One Step
- Analyzing and Improving the Training Dynamics of Diffusion Models
- Consistency Models Improve Diffusion Inverse Solvers
- Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors
- Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint
- Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory
- Understanding Diffusion Models by Feynman's Path Integral
- Properties of Discrete Sliced Wasserstein Losses
- Formalization of Complexity Analysis of the First-order Optimization Algorithms
- Convergence of Kinetic Langevin Monte Carlo on Lie groups
- Adaptive stepsize algorithms for Langevin dynamics
- Generalization in diffusion models arises from geometry-adaptive harmonic representations
- Discrete approximations of Gaussian smoothing and Gaussian derivatives
- OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language Models
- Intriguing Properties of Data Attribution on Diffusion Models
- Understanding the Double Descent Phenomenon in Deep Learning
- Long-time behavior for discretization schemes of Fokker-Planck equations via couplings
- Max-sliced 2-Wasserstein distance
- Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point
- Evolutionary Algorithms Simulating Molecular Evolution: A New Field Proposal
- Stable Training of Probabilistic Models Using the Leave-One-Out Maximum Log-Likelihood Objective
- Reverse em-problem based on Bregman divergence and its application to classical and quantum information theory
- The Price of Adaptivity in Stochastic Convex Optimization
- Efficient geometric Markov chain Monte Carlo for nonlinear Bayesian inversion enabled by derivative-informed neural operators
- Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
- Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data
- Multistep Consistency Models
- Sampling via Gradient Flows in the Space of Probability Measures
- From Posterior Sampling to Meaningful Diversity in Image Restoration
- Training-free Linear Image Inverses via Flows
- Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond
- Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising
- Scalable couplings for the random walk Metropolis algorithm
- Sliced-Wasserstein Distances and Flows on Cartan-Hadamard Manifolds
- Wasserstein Gradient Flows for Moreau Envelopes of f-Divergences in Reproducing Kernel Hilbert Spaces
- A Compositional Framework for First-Order Optimization
- Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks
- Simulating conditioned diffusions on manifolds
- Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models
- Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport
- Diffusion Models for Constrained Domains
- A stochastic optimisation unadjusted Langevin method for empirical Bayesian estimation in semi-blind image deblurring problems
- The Geometric Structure of Topic Models
- On the Origins of Linear Representations in Large Language Models
- Accelerating Convergence of Score-Based Diffusion Models, Provably
- Analysis of Kernel Mirror Prox for Measure Optimization
- Sharp spectral gap of adaptive Langevin dynamics
- Tuning-Free Maximum Likelihood Training of Latent Variable Models via Coin Betting
- Sharp bounds for the max-sliced Wasserstein distance
- Non-asymptotic analysis of Langevin-type Monte Carlo algorithms
- Empirical Bayes in Bayesian learning: understanding a common practice
- The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
- Score dynamics: scaling molecular dynamics with picosecond timesteps via conditional diffusion model
- Theoretical Foundations of Deep Selective State-Space Models
- Categorical Deep Learning: An Algebraic Theory of Architectures
- Diffusion Posterior Proximal Sampling for Image Restoration
- Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion
- Dynamical Regimes of Diffusion Models
- On the different regimes of Stochastic Gradient Descent
- Stochastic Approximation with Biased MCMC for Expectation Maximization
- Proximal Algorithms for a class of abstract convex functions
- A Dynamical View of the Question of Why
- Randomized matrix computations: Themes and variations
- Diffusion Posterior Proximal Sampling for Image Restoration
- The curse of dimensionality in operator learning
- On Independent Samples Along the Langevin Diffusion and the Unadjusted Langevin Algorithm
- Statistical Accuracy of Approximate Filtering Methods
- Generative AI for Bayesian Computation
- Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions
- Nonlinear Bayesian optimal experimental design using logarithmic Sobolev inequalities
- Optimal schedules for annealing algorithms
- A Language Model's Guide Through Latent Space
- Moonwalk: Inverse-Forward Differentiation
- Weak Poincar\'e inequality comparisons for ideal and hybrid slice sampling
- The Emergence of Reproducibility and Consistency in Diffusion Models
- Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate
- Linear Convergence of Black-Box Variational Inference: Should We Stick the Landing?
- Touring sampling with pushforward maps
- Diffusion Posterior Sampling is Computationally Intractable
- Euler-Maruyama schemes for stochastic differential equations driven by stable L\'{e}vy processes with i.i.d. stable components
- Stein Boltzmann Sampling: A Variational Approach for Global Optimization
- Contractivity of neural ODEs: an eigenvalue optimization problem
- SDEs for Minimax Optimization
- On Averaging and Extrapolation for Gradient Descent
- On the Posterior Distribution in Denoising: Application to Uncertainty Quantification
- From Denoising Diffusions to Denoising Markov Models
- Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems
- Adam-family Methods for Nonsmooth Optimization with Convergence Guarantees
- Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis
- Acousto-electric tomography by the convergence of Kaczamrz two-point gradient-$\Theta$ method
- Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context
- BlackJAX: Composable Bayesian inference in JAX
- Stochastic Localization via Iterative Posterior Sampling
- Efficient Sampling on Riemannian Manifolds via Langevin MCMC
- Optimal Transport with Tempered Exponential Measures
- Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
- The Price of Adaptivity in Stochastic Convex Optimization
- The Emergence of Reproducibility and Consistency in Diffusion Models
- R\'enyi Resolvability, Noise Stability, and Anti-contractivity
- Revisiting Stochastic Realization Theory using Functional It\^o Calculus
- Closed-form Filtering for Non-linear Systems
- MCMC-driven learning
- Correction to "Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations"
- Diffeomorphic Measure Matching with Kernels for Generative Modeling
- Sampling from the Mean-Field Stationary Distribution
- Towards Fast Stochastic Sampling in Diffusion Generative Models
- Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
- Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions
- Mirror Descent-Ascent for mean-field min-max problems
- Fast sampling from constrained spaces using the Metropolis-adjusted Mirror Langevin algorithm
- Towards a mathematical theory for consistency training in diffusion models
- Optimal score estimation via empirical Bayes smoothing
- Sampling from the Mean-Field Stationary Distribution
- Foundations of Monte Carlo methods and stochastic simulations -- From Monte Carlo Lebesgue integration to weak approximation of SDEs
- Score-Based Physics-Informed Neural Networks for High-Dimensional Fokker-Planck Equations
- Controllable seismic velocity synthesis using generative diffusion models
- The Complexity of Sequential Prediction in Dynamical Systems
- Particle Denoising Diffusion Sampler
- Wasserstein proximal operators describe score-based generative models and resolve memorization
- Adaptive proximal gradient methods are universal without approximation
- Entropy and curvature: beyond the Peres-Tetali conjecture
- Linearizability of flows by embeddings
- An Introduction to Transformers
- Scalable Diffusion Models with State Space Backbone
- Incentive-Theoretic Bayesian Inference for Collaborative Science
- JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows
- Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport
- Convergence of Alternating Gradient Descent for Matrix Factorization
- Cortical Surface Diffusion Generative Models
- Denoising Diffusion Probabilistic Models in Six Simple Steps
- An analysis of the noise schedule for score-based generative models
- Proximal-point-like algorithms for abstract convex minimisation problems
- To be or not to be stable, that is the question: understanding neural networks for inverse problems
- Variational Representations of Annealing Paths: Bregman Information under Monotonic Embedding
- The Information of Large Language Model Geometry
- Mean-field underdamped Langevin dynamics and its spacetime discretization
- The Anytime Convergence of Stochastic Gradient Descent with Momentum: From a Continuous-Time Perspective
- Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
- Diffusive Gibbs Sampling
- Spectral State Space Models
- A rigorous introduction to linear models
- Diffusive Gibbs Sampling
- Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation
- Denoising Diffusion-Based Control of Nonlinear Systems
- Universal Gradient Methods for Stochastic Convex Optimization
- Unconditional Latent Diffusion Models Memorize Patient Imaging Data
- Fisher information dissipation for time inhomogeneous stochastic differential equations
- Compositional Generative Modeling: A Single Model is Not All You Need
- Statistical Accuracy of Approximate Filtering Methods
- Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual Optimisation
- Geometry-Aware Normalizing Wasserstein Flows for Optimal Causal Inference
- A Theoretical Analysis of Noise Geometry in Stochastic Gradient Descent
- Hamilton--Jacobi equations for Wasserstein controlled gradient flows: existence of viscosity solutions
- A theoretical and empirical study of new adaptive algorithms with additional momentum steps and shifted updates for stochastic non-convex optimization
- A non-asymptotic error analysis for parallel Monte Carlo estimation from many short Markov chains
- On Inference Stability for Diffusion Models
- Convergence analysis of t-SNE as a gradient flow for point cloud on a manifold
- Enhancing Score-Based Sampling Methods with Ensembles
- Arrows of Time for Large Language Models
- Categorical probability spaces, ergodic decompositions, and transitions to equilibrium
- Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances
- Reversing information flow: retrodiction in semicartesian categories
- Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo
- Prompt Design and Engineering: Introduction and Advanced Methods
- On the Algorithmic Verification of Nonlinear Superposition for Systems of First Order Ordinary Differential Equations
- Time-uniform log-Sobolev inequalities and applications to propagation of chaos
- The Geometry of Monotone Operator Splitting Methods
- Sticky-reflecting diffusion as a Wasserstein gradient flow
- Solving, Tracking and Stopping Streaming Linear Inverse Problems
- Sticky-reflecting diffusion as a Wasserstein gradient flow. (arXiv:2401.16842v1 [math.AP])
- Ensemble-Based Annealed Importance Sampling. (arXiv:2401.15645v1 [stat.CO])
- Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization. (arXiv:2401.15604v1 [cs.LG])
- Particle-MALA and Particle-mGRAD: Gradient-based MCMC methods for high-dimensional state-space models. (arXiv:2401.14868v1 [stat.CO])
- Hidden Markov Models and the Bayes Filter in Categorical Probability. (arXiv:2401.14669v1 [math.ST])
- Estimation of partially known Gaussian graphical models with score-based structural priors. (arXiv:2401.14340v2 [stat.ML] UPDATED)
- Neural Sinkhorn Gradient Flow. (arXiv:2401.14069v1 [cs.LG])
- Towards a Systems Theory of Algorithms. (arXiv:2401.14029v1 [math.OC])
- Compositional Generative Inverse Design. (arXiv:2401.13171v1 [cs.LG])
- Contractive Diffusion Probabilistic Models. (arXiv:2401.13115v1 [cs.LG])
- Tensor train based sampling algorithms for approximating regularized Wasserstein proximal operators. (arXiv:2401.13125v2 [math.OC] UPDATED)
- Bayesian sampling using interacting particles. (arXiv:2401.13100v1 [math.NA])
- Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo. (arXiv:2401.11665v1 [stat.ML])
- Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity. (arXiv:2401.12764v2 [math.OC] UPDATED)
- Score-Based Generative Models for PET Image Reconstruction. (arXiv:2308.14190v2 [eess.IV] UPDATED)
- Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity. (arXiv:2401.12764v2 [math.OC] UPDATED)
- Wasserstein Diffusion on Multidimensional Spaces. (arXiv:2401.12721v1 [math.PR])
- The Ensemble Kalman Filter for Dynamic Inverse Problems. (arXiv:2401.11948v1 [math.NA])
- Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization. (arXiv:2308.03686v2 [stat.ML] UPDATED)
- Fast parallel sampling under isoperimetry. (arXiv:2401.09016v1 [cs.DS])
- Introduction to probability and statistics: a computational framework of randomness. (arXiv:2401.08622v2 [math.HO] UPDATED)
- Demystifying Variational Diffusion Models. (arXiv:2401.06281v1 [cs.LG])
- Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo. (arXiv:2401.06325v1 [stat.ML])
- Almost Sure Diffusion Approximation in Averaging: Direct Proofs with Rough Paths Flavors. (arXiv:2401.05038v2 [math.PR] UPDATED)
- Solving the Scattering Problem for Open Wave-Guides, III: Radiation Conditions and Uniqueness. (arXiv:2401.04674v1 [math.AP])
- Stable generative modeling using diffusion maps. (arXiv:2401.04372v1 [stat.ML])
- Image Inpainting via Tractable Steering of Diffusion Models. (arXiv:2401.03349v1 [cs.CV])
- Reflected Schr\"odinger Bridge for Constrained Generative Modeling. (arXiv:2401.03228v1 [stat.ML])
- Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors. (arXiv:2401.02739v1 [cs.LG])
- Hamilton--Jacobi equations for Wasserstein controlled gradient flows: existence of viscosity solutions. (arXiv:2401.02240v1 [math.AP])
- Sheaves of Probability. (arXiv:2401.01968v1 [math.PR])
- Nature-Inspired Algorithms in Optimization: Introduction, Hybridization and Insights. (arXiv:2401.00976v1 [cs.NE])
- A review of Monte Carlo-based versions of the EM algorithm. (arXiv:2401.00945v1 [stat.CO])
- Stochastic Optimization under Hidden Convexity. (arXiv:2401.00108v1 [math.OC])
- Fluctuation Theorem on Riemannian Manifold. (arXiv:2401.00046v1 [cond-mat.stat-mech])
- Mixing time of the conditional backward sampling particle filter. (arXiv:2312.17572v1 [stat.CO])
- Principled Gradient-based Markov Chain Monte Carlo for Text Generation. (arXiv:2312.17710v1 [cs.CL])
Saved in 2023
- Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance. (arXiv:2312.16519v1 [eess.IV])
- Micro-Macro Consistency in Multiscale Modeling: Score-Based Model Assisted Sampling of Fast/Slow Dynamical Systems. (arXiv:2312.05715v2 [cs.LG] UPDATED)
- Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting. (arXiv:2312.17077v2 [math.NA] UPDATED)
- Convergence and stability results for the particle system in the Stein gradient descent method. (arXiv:2312.16344v1 [math.AP])
- Mean-field underdamped Langevin dynamics and its spacetime discretization. (arXiv:2312.16360v4 [stat.CO] UPDATED)
- Unraveling the Temporal Dynamics of the Unet in Diffusion Models. (arXiv:2312.14965v1 [cs.CV])
- Diffusion Models for Generative Artificial Intelligence: An Introduction for Applied Mathematicians. (arXiv:2312.14977v1 [cs.LG])
- On the Trajectories of SGD Without Replacement. (arXiv:2312.16143v1 [cs.LG])
- Mini-batching error and adaptive Langevin dynamics
- Perturbation Analysis of Markov Chain Monte Carlo for Graphical Models. (arXiv:2312.14246v1 [math.PR])
- Time-changed normalizing flows for accurate SDE modeling. (arXiv:2312.14698v2 [cs.LG] UPDATED)
- Single-Cell RNA-seq Synthesis with Latent Diffusion Model. (arXiv:2312.14220v1 [q-bio.GN])
- Exploiting bias in optimal finite-time copying protocols. (arXiv:2312.14682v1 [cond-mat.stat-mech])
- Sampling and estimation on manifolds using the Langevin diffusion. (arXiv:2312.14882v1 [math.ST])
- Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise. (arXiv:2312.14567v1 [cs.LG])
- A mathematical perspective on Transformers. (arXiv:2312.10794v2 [cs.LG] UPDATED)
- Metropolis-adjusted interacting particle sampling. (arXiv:2312.13889v1 [stat.CO])
- Wave Physics-informed Matrix Factorizations. (arXiv:2312.13584v2 [cs.LG] UPDATED)
- A Survey of Emerging Applications of Diffusion Probabilistic Models in MRI. (arXiv:2311.11383v2 [cs.CV] UPDATED)
- How Good Are Deep Generative Models for Solving Inverse Problems?. (arXiv:2312.12691v1 [cs.LG])
- Gradient flows for empirical Bayes in high-dimensional linear models. (arXiv:2312.12708v1 [math.ST])
- Metropolis-adjusted interacting particle sampling. (arXiv:2312.13889v1 [stat.CO])
- On the ensemble Kalman inversion under inequality constraints. (arXiv:2312.13804v1 [math.NA])
- Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method. (arXiv:2312.12030v1 [cs.CV])
- Optimizing Diffusion Noise Can Serve As Universal Motion Priors. (arXiv:2312.11994v1 [cs.CV])
- Generalizing Adam to Manifolds for Efficiently Training Transformers. (arXiv:2305.16901v2 [cs.LG] UPDATED)
- Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach. (arXiv:2312.11865v1 [cs.AI])
- Stability of the numerical scheme for stochastic McKean-Vlasov equations. (arXiv:2312.12699v1 [math.NA])
- On inverse problems in predator-prey models. (arXiv:2312.09653v1 [math.AP] CROSS LISTED)
- Fitting a manifold to data in the presence of large noise. (arXiv:2312.10598v2 [math.ST] UPDATED)
- Nonlocal Approximation of Slow and Fast Diffusion. (arXiv:2312.11438v1 [math.AP])
- Anisotropic Proximal Point Algorithm. (arXiv:2312.09834v1 [math.OC])
- Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models. (arXiv:2312.09608v1 [cs.CV])
- Noise in the reverse process improves the approximation capabilities of diffusion models. (arXiv:2312.07851v2 [cs.LG] UPDATED)
- A New Perspective On Denoising Based On Optimal Transport. (arXiv:2312.08135v1 [math.ST])
- A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models. (arXiv:2312.07243v1 [cs.AI])
- Momentum Particle Maximum Likelihood. (arXiv:2312.07335v1 [cs.LG])
- Boosting Latent Diffusion with Flow Matching. (arXiv:2312.07360v1 [cs.CV])
- Mean-field limits for Consensus-Based Optimization and Sampling. (arXiv:2312.07373v1 [math.PR])
- Can a Transformer Represent a Kalman Filter?. (arXiv:2312.06937v2 [cs.LG] UPDATED)
- Towards Stability of Autoregressive Neural Operators. (arXiv:2306.10619v2 [cs.LG] UPDATED)
- The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets. (arXiv:2310.06824v2 [cs.AI] UPDATED)
- Consistency Models for Scalable and Fast Simulation-Based Inference. (arXiv:2312.05440v1 [cs.LG])
- FreeFlow: A Comprehensive Understanding on Diffusion Probabilistic Models via Optimal Transport. (arXiv:2312.05486v1 [cs.AI])
- KEEC: Embed to Control on An Equivariant Geometry. (arXiv:2312.01544v2 [cs.LG] UPDATED)
- Information divergences of Markov chains and their applications. (arXiv:2312.04863v1 [cs.IT])
- Unnatural Algorithms in Machine Learning. (arXiv:2312.04739v1 [stat.ML])
- How to guess a gradient. (arXiv:2312.04709v1 [cs.LG])
- Train 'n Trade: Foundations of Parameter Markets. (arXiv:2312.04740v1 [cs.LG])
- Second- and third-order properties of multidimensional Langevin equations. (arXiv:2312.04585v1 [cond-mat.stat-mech])
- Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis. (arXiv:2312.03491v1 [cs.SD])
- Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning. (arXiv:2312.03397v1 [cs.LG])
- Improving Gradient-guided Nested Sampling for Posterior Inference. (arXiv:2312.03911v1 [cs.LG])
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models. (arXiv:2312.04410v1 [cs.CV])
- Convergence Rate Analysis of Continuous- and Discrete-Time Smoothing Gradient Algorithms. (arXiv:2312.04192v1 [math.OC])
- Iterated invariance principle for random dynamical systems. (arXiv:2312.04550v1 [math.DS])
- Geometry-Aware Normalizing Wasserstein Flows for Optimal Causal Inference. (arXiv:2311.18826v3 [cs.LG] UPDATED)
- The Bayesian Stability Zoo. (arXiv:2310.18428v2 [cs.LG] UPDATED)
- Algorithms for mean-field variational inference via polyhedral optimization in the Wasserstein space. (arXiv:2312.02849v1 [math.ST])
- Training Chain-of-Thought via Latent-Variable Inference. (arXiv:2312.02179v1 [cs.LG])
- Conditional Variational Diffusion Models. (arXiv:2312.02246v3 [cs.CV] UPDATED)
- Adam-like Algorithm with Smooth Clipping Attains Global Minima: Analysis Based on Ergodicity of Functional SDEs. (arXiv:2312.02182v1 [cs.LG])
- Lecture Notes on Computerized Tomography. (arXiv:2312.02393v1 [math.NA])
- Particle-based algorithm for stochastic optimal control. (arXiv:2311.06906v3 [math.OC] UPDATED)
- Taming Latent Diffusion Models to See in the Dark. (arXiv:2312.01027v2 [cs.CV] UPDATED)
- Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion. (arXiv:2312.00852v1 [cs.LG])
- Convergence Of The Unadjusted Langevin Algorithm For Discontinuous Gradients. (arXiv:2312.01950v1 [math.PR])
- Local monotone operator learning using non-monotone operators: MnM-MOL. (arXiv:2312.00386v1 [eess.IV])
- Fast ODE-based Sampling for Diffusion Models in Around 5 Steps. (arXiv:2312.00094v1 [cs.CV])
- Geodesic slice sampling on Riemannian manifolds. (arXiv:2312.00417v1 [stat.CO])
- On Exact Inversion of DPM-Solvers. (arXiv:2311.18387v1 [cs.CV])
- Diffusion Models Without Attention. (arXiv:2311.18257v1 [cs.CV])
- Bayesian Imaging for Radio Interferometry with Score-Based Priors. (arXiv:2311.18012v1 [astro-ph.IM])
- Automatic Functional Differentiation in JAX. (arXiv:2311.18727v2 [cs.PL] UPDATED)
- Learning Exactly Linearizable Deep Dynamics Models. (arXiv:2311.18261v1 [eess.SY])
- Convergence Analysis of Fractional Gradient Descent. (arXiv:2311.18426v3 [math.OC] UPDATED)
- Using Ornstein-Uhlenbeck Process to understand Denoising Diffusion Probabilistic Model and its Noise Schedules. (arXiv:2311.17673v1 [stat.ML])
- Effective Quantization for Diffusion Models on CPUs. (arXiv:2311.16133v2 [cs.CV] UPDATED)
- Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm. (arXiv:2311.16706v1 [cs.LG])
- Riemannian Self-Attention Mechanism for SPD Networks. (arXiv:2311.16738v1 [cs.CV])
- Manifold Preserving Guided Diffusion. (arXiv:2311.16424v1 [cs.LG])
- General Derivative-Free Optimization Methods under Global and Local Lipschitz Continuity of Gradients. (arXiv:2311.16850v1 [math.OC])
- Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation. (arXiv:2311.15996v1 [cs.LG])
- Proximal Algorithms for Accelerated Langevin Dynamics. (arXiv:2311.14829v2 [cs.CE] UPDATED)
- How does the contraction property fail for convex functions on normed spaces?. (arXiv:2311.15152v1 [math.OC] CROSS LISTED)
- Sample as You Infer: Predictive Coding With Langevin Dynamics. (arXiv:2311.13664v1 [cs.LG])
- Sample-Efficient Training for Diffusion. (arXiv:2311.13745v1 [cs.LG])
- The Noise Geometry of Stochastic Gradient Descent: A Quantitative and Analytical Characterization. (arXiv:2310.00692v2 [cs.LG] UPDATED)
- A new use of the Kurdyka-Lojasiewicz property to study asymptotic behaviours of some stochastic optimization algorithms in a non-convex differentiable framework. (arXiv:2311.14627v3 [math.OC] UPDATED)
- Concentration and local smoothness of the averaging process. (arXiv:2311.14176v1 [math.PR])
- The attractive log gas: stability, uniqueness, and propagation of chaos. (arXiv:2311.14560v1 [math.AP])
- On the convergence of adaptive approximations for stochastic differential equations. (arXiv:2311.14201v1 [math.NA])
- Differentially Private Non-Convex Optimization under the KL Condition with Optimal Rates. (arXiv:2311.13447v1 [cs.LG])
- Inverse Problems with Learned Forward Operators. (arXiv:2311.12528v1 [math.NA])
- Inverse Problems with Learned Forward Operators. (arXiv:2311.12528v1 [math.NA])
- Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision. (arXiv:2306.11719v2 [cs.CV] UPDATED)
- The Hidden Linear Structure in Score-Based Models and its Application. (arXiv:2311.10892v1 [cs.AI])
- Wasserstein Convergence Guarantees for a General Class of Score-Based Generative Models. (arXiv:2311.11003v1 [cs.LG])
- Implicit Maximum a Posteriori Filtering via Adaptive Optimization. (arXiv:2311.10580v1 [cs.LG])
- Taming under isoperimetry. (arXiv:2311.09003v1 [math.PR])
- Mean-field variational inference with the TAP free energy: Geometric and statistical properties in linear models. (arXiv:2311.08442v1 [math.ST])
- Manifold learning in Wasserstein space. (arXiv:2311.08549v1 [stat.ML])
- Score-based generative models learn manifold-like structures with constrained mixing. (arXiv:2311.09952v1 [stat.ML])
- Unsupervised approaches based on optimal transport and convex analysis for inverse problems in imaging. (arXiv:2311.08972v2 [cs.CV] UPDATED)
- Taming under isoperimetry. (arXiv:2311.09003v1 [math.PR])
- A statistical perspective on algorithm unrolling models for inverse problems. (arXiv:2311.06395v1 [stat.ML])
- Particle-based algorithm for stochastic optimal control. (arXiv:2311.06906v3 [math.OC] UPDATED)
- Deep JKO: time-implicit particle methods for general nonlinear gradient flows. (arXiv:2311.06700v1 [math.NA])
- Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients. (arXiv:2311.05025v1 [stat.CO])
- Diffusion Based Causal Representation Learning. (arXiv:2311.05421v1 [cs.LG])
- Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics. (arXiv:2311.05061v1 [cs.LG])
- On the Consistency of Maximum Likelihood Estimation of Probabilistic Principal Component Analysis. (arXiv:2311.05046v2 [stat.ML] UPDATED)
- Improved DDIM Sampling with Moment Matching Gaussian Mixtures. (arXiv:2311.04938v2 [cs.CV] UPDATED)
- Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients. (arXiv:2311.05025v1 [stat.CO])
- Geometry and analytic properties of the sliced Wasserstein space. (arXiv:2311.05134v1 [math.AP])
- Integral Resolvent and Proximal Mixtures. (arXiv:2311.04790v1 [math.OC])
- The Linear Representation Hypothesis and the Geometry of Large Language Models. (arXiv:2311.03658v1 [cs.CL])
- Score-based Source Separation with Applications to Digital Communication Signals. (arXiv:2306.14411v3 [cs.LG] UPDATED)
- Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential Equations. (arXiv:2306.02063v2 [cs.LG] UPDATED)
- Latent Diffusion for Language Generation. (arXiv:2212.09462v2 [cs.CL] UPDATED)
- Sampling via F\"ollmer Flow. (arXiv:2311.03660v1 [stat.ME])
- Improved Convergence Rates of Anderson Acceleration for a Large Class of Fixed-Point Iterations. (arXiv:2311.02490v1 [math.NA])
- Approximating Langevin Monte Carlo with ResNet-like Neural Network architectures. (arXiv:2311.03242v2 [cs.LG] UPDATED)
- Non-convergence to unstable equilibriums for continuous-time and discrete-time stochastic processes. (arXiv:2311.02978v1 [math.PR])
- For how many iterations should we run Markov chain Monte Carlo?. (arXiv:2311.02726v1 [stat.CO])
- A Novel Catalyst Scheme for Stochastic Minimax Optimization. (arXiv:2311.02814v2 [math.OC] UPDATED)
- On the Convergence of Encoder-only Shallow Transformers. (arXiv:2311.01575v1 [cs.LG])
- High Probability Convergence of Adam Under Unbounded Gradients and Affine Variance Noise. (arXiv:2311.02000v1 [math.OC])
- On the Generalization Properties of Diffusion Models. (arXiv:2311.01797v3 [cs.LG] UPDATED)
- A Variational Perspective on High-Resolution ODEs. (arXiv:2311.02002v1 [math.OC])
- Operator learning with PCA-Net: upper and lower complexity bounds
- A Continuous-time Stochastic Gradient Descent Method for Continuous Data
- The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing. (arXiv:2311.01410v1 [cs.CV])
- Multi-Operational Mathematical Derivations in Latent Space. (arXiv:2311.01230v1 [cs.LG])
- On Feynman--Kac training of partial Bayesian neural networks. (arXiv:2310.19608v1 [cs.LG])
- Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case. (arXiv:2310.18774v1 [math.PR])
- Score Normalization for a Faster Diffusion Exponential Integrator Sampler. (arXiv:2311.00157v2 [cs.LG] UPDATED)
- Bridging the Gap Between Variational Inference and Wasserstein Gradient Flows. (arXiv:2310.20090v1 [stat.ML])
- Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent. (arXiv:2310.18455v1 [cs.LG])
- Conditional score-based diffusion models for Bayesian inference in infinite dimensions. (arXiv:2305.19147v2 [stat.ML] UPDATED)
- Trans-Dimensional Generative Modeling via Jump Diffusion Models. (arXiv:2305.16261v2 [stat.ML] UPDATED)
- Training Energy-Based Normalizing Flow with Score-Matching Objectives. (arXiv:2305.15267v2 [cs.LG] UPDATED)
- Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case. (arXiv:2310.18774v1 [math.PR])
- Diffusion processes as Wasserstein gradient flows via stochastic control of the volatility matrix. (arXiv:2310.18678v1 [math.PR])
- Sample Complexity Bounds for Score-Matching: Causal Discovery and Generative Modeling. (arXiv:2310.18123v1 [cs.LG])
- Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry. (arXiv:2307.12868v2 [cs.CV] UPDATED)
- Closing the Gap Between the Upper Bound and the Lower Bound of Adam's Iteration Complexity. (arXiv:2310.17998v1 [cs.LG])
- Generative Fractional Diffusion Models. (arXiv:2310.17638v1 [cs.LG])
- Monte Carlo guided Diffusion for Bayesian linear inverse problems. (arXiv:2308.07983v2 [stat.ML] UPDATED)
- Unifying GANs and Score-Based Diffusion as Generative Particle Models. (arXiv:2305.16150v3 [cs.LG] UPDATED)
- The statistical thermodynamics of generative diffusion models. (arXiv:2310.17467v1 [stat.ML])
- Causal Modeling with Stationary Diffusions. (arXiv:2310.17405v1 [cs.LG])
- Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models. (arXiv:2310.17086v1 [cs.LG])
- Benign Oscillation of Stochastic Gradient Descent with Large Learning Rates. (arXiv:2310.17074v1 [cs.LG])
- Probabilistic Integral Circuits. (arXiv:2310.16986v1 [cs.LG])
- Convergence of flow-based generative models via proximal gradient descent in Wasserstein space. (arXiv:2310.17582v1 [stat.ML])
- Covariance Operator Estimation: Sparsity, Lengthscale, and Ensemble Kalman Filters. (arXiv:2310.16933v1 [math.ST])
- Posterior Consistency for Missing Data in Variational Autoencoders. (arXiv:2310.16648v1 [cs.LG])
- Wasserstein Gradient Flow over Variational Parameter Space for Variational Inference. (arXiv:2310.16705v1 [cs.LG])
- Direct Diffusion Bridge using Data Consistency for Inverse Problems. (arXiv:2305.19809v2 [cs.CV] UPDATED)
- Learning Dynamics in Linear VAE: Posterior Collapse Threshold, Superfluous Latent Space Pitfalls, and Speedup with KL Annealing. (arXiv:2310.15440v1 [stat.ML])
- $L^2$-Wasserstein contraction for Euler schemes of elliptic diffusions and interacting particle systems. (arXiv:2310.15897v1 [math.PR])
- Introduction to Infinite Dimensional Statistics and Applications. (arXiv:2310.15818v1 [math.FA])
- Adam through a Second-Order Lens. (arXiv:2310.14963v1 [cs.LG])
- Random Flows of Covariance Operators and their Statistical Inference. (arXiv:2310.13764v1 [stat.ME])
- Randomized Forward Mode of Automatic Differentiation for Optimization Algorithms. (arXiv:2310.14168v2 [math.OC] UPDATED)
- The Fisher metric as a metric on the cotangent bundle. (arXiv:2310.13237v1 [cs.IT])
- Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models. (arXiv:2310.13102v2 [cs.LG] UPDATED)
- Stable Nonconvex-Nonconcave Training via Linear Interpolation. (arXiv:2310.13459v2 [cs.LG] UPDATED)
- Numerical approximation of McKean-Vlasov SDEs via stochastic gradient descent. (arXiv:2310.13579v1 [math.NA])
- Closed-Form Diffusion Models. (arXiv:2310.12395v1 [cs.LG])
- A connection between Tempering and Entropic Mirror Descent. (arXiv:2310.11914v1 [stat.CO])
- Why do autoencoders work?. (arXiv:2310.02250v2 [cs.LG] UPDATED)
- Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task. (arXiv:2310.09336v3 [cs.LG] UPDATED)
- On permutation symmetries in Bayesian neural network posteriors: a variational perspective. (arXiv:2310.10171v1 [stat.ML])
- Statistical guarantees for stochastic Metropolis-Hastings. (arXiv:2310.09335v1 [stat.ML])
- A Computational Framework for Solving Wasserstein Lagrangian Flows. (arXiv:2310.10649v2 [cs.LG] UPDATED)
- NF-ULA: Langevin Monte Carlo with Normalizing Flow Prior for Imaging Inverse Problems. (arXiv:2304.08342v2 [math.NA] UPDATED)
- On the convergence of discrete dynamic unbalanced transport models. (arXiv:2310.09420v1 [math.NA])
- Sampling from Mean-Field Gibbs Measures via Diffusion Processes. (arXiv:2310.08912v1 [math.PR])
- Time-vectorized numerical integration for systems of ODEs. (arXiv:2310.08649v1 [math.NA])
- The Geometry of Monotone Operator Splitting Methods. (arXiv:2310.08443v3 [math.OC] UPDATED)
- Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts. (arXiv:2310.05898v4 [cs.LG] UPDATED)
- Efficient Integrators for Diffusion Generative Models. (arXiv:2310.07894v1 [cs.LG])
- Efficient Integrators for Diffusion Generative Models. (arXiv:2310.07894v1 [cs.LG])
- Fast Sampling and Inference via Preconditioned Langevin Dynamics. (arXiv:2310.07542v1 [stat.CO])
- Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency. (arXiv:2307.08123v2 [cs.CV] UPDATED)
- Transformers and Large Language Models for Chemistry and Drug Discovery. (arXiv:2310.06083v1 [cs.LG])
- Economic Theory as Successive Approximations of Statistical Moments. (arXiv:2310.05971v1 [econ.GN])
- Ito Diffusion Approximation of Universal Ito Chains for Sampling, Optimization and Boosting. (arXiv:2310.06081v1 [math.OC])
- The foundations of statistical physics: entropy, irreversibility, and inference. (arXiv:2310.06070v1 [cond-mat.stat-mech])
- On variational inference and maximum likelihood estimation with the {\lambda}-exponential family. (arXiv:2310.05781v1 [math.ST])
- In-Context Convergence of Transformers. (arXiv:2310.05249v1 [cs.LG])
- What's the Magic Word? A Control Theory of LLM Prompting. (arXiv:2310.04444v3 [cs.CL] UPDATED)
- Gradient Descent Provably Solves Nonlinear Tomographic Reconstruction. (arXiv:2310.03956v1 [cs.CV])
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference. (arXiv:2310.04378v1 [cs.CV])
- Observation-Guided Diffusion Probabilistic Models. (arXiv:2310.04041v1 [cs.LG])
- Categorical probability spaces, ergodic decompositions, and transitions to equilibrium. (arXiv:2310.04267v2 [math.PR] UPDATED)
- A calculus for Markov chain Monte Carlo: studying approximations in algorithms. (arXiv:2310.03853v1 [math.PR])
- High Order Schemes for Gradient Flow with Respect to a Metric. (arXiv:2211.07011v2 [math.NA] UPDATED)
- Accelerating optimization over the space of probability measures. (arXiv:2310.04006v2 [math.OC] UPDATED)
- Plug-and-Play Posterior Sampling under Mismatched Measurement and Prior Models. (arXiv:2310.03546v1 [stat.ML])
- Molecule Design by Latent Prompt Transformer. (arXiv:2310.03253v1 [cs.LG])
- Learning Energy-Based Prior Model with Diffusion-Amortized MCMC. (arXiv:2310.03218v1 [cs.LG])
- Denoising Diffusion Step-aware Models. (arXiv:2310.03337v2 [cs.CV] UPDATED)
- Sampling via Gradient Flows in the Space of Probability Measures. (arXiv:2310.03597v2 [stat.ML] UPDATED)
- Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion. (arXiv:2310.02279v1 [cs.LG])
- Variational Gaussian approximation of the Kushner optimal filter. (arXiv:2310.01859v1 [stat.ML])
- Score-based Data Assimilation for a Two-Layer Quasi-Geostrophic Model. (arXiv:2310.01853v2 [stat.ML] UPDATED)
- Robustifying State-space Models for Long Sequences via Approximate Diagonalization. (arXiv:2310.01698v1 [cs.LG])
- Score dynamics: scaling molecular dynamics with picosecond timesteps via conditional diffusion model. (arXiv:2310.01678v2 [physics.comp-ph] UPDATED)
- How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization. (arXiv:2310.01769v3 [cs.LG] UPDATED)
- The Fisher-Rao geometry of CES distributions. (arXiv:2310.01032v1 [stat.ML])
- The Noise Geometry of Stochastic Gradient Descent: A Quantitative and Analytical Characterization. (arXiv:2310.00692v2 [cs.LG] UPDATED)
- A Geometric Perspective on Diffusion Models. (arXiv:2305.19947v2 [cs.CV] UPDATED)
- Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis. (arXiv:2310.00224v1 [cs.CV])
- Linear attention is (maybe) all you need (to understand transformer optimization). (arXiv:2310.01082v1 [cs.LG])
- Symmetric Solutions to Symmetric Partial Difference Equations. (arXiv:2310.00903v1 [math.CA])
- Unadjusted Langevin Algorithms for SDEs with Hoelder Drift. (arXiv:2310.00232v1 [math.PR])
- Denoising Diffusion Bridge Models. (arXiv:2309.16948v3 [cs.CV] UPDATED)
- Water Markets as a Coping Mechanism for Climate-Induced Water Changes on the Canadian Economy: A Computable General Equilibrium Approach. (arXiv:2309.16678v1 [econ.GN])
- Statistical physics, Bayesian inference and neural information processing. (arXiv:2309.17006v1 [cond-mat.dis-nn])
- Insight from the Kullback--Leibler divergence into adaptive importance sampling schemes for rare event analysis in high dimension. (arXiv:2309.16828v1 [math.ST])
- A Control Theoretical Approach to Online Constrained Optimization. (arXiv:2309.15498v1 [math.OC])
- Bayesian Cram\'er-Rao Bound Estimation with Score-Based Models. (arXiv:2309.16076v1 [math.ST])
- Learning Dissipative Neural Dynamical Systems. (arXiv:2309.16032v1 [cs.LG])
- Compositional Sculpting of Iterative Generative Processes. (arXiv:2309.16115v1 [cs.LG])
- Beauty beacon: correlated strategies for the Fisher runaway process. (arXiv:2309.15205v1 [q-bio.PE])
- Joint Sampling and Optimisation for Inverse Rendering. (arXiv:2309.15676v1 [cs.GR])
- Neural Operators for Accelerating Scientific Simulations and Design. (arXiv:2309.15325v5 [cs.LG] UPDATED)
- Beyond Log-Concavity: Theory and Algorithm for Sum-Log-Concave Optimization. (arXiv:2309.15298v1 [math.OC])
- Fantastic Generalization Measures are Nowhere to be Found. (arXiv:2309.13658v3 [cs.LG] UPDATED)
- On the Convergence of Black-Box Variational Inference. (arXiv:2305.15349v4 [cs.LG] UPDATED)
- AntiBARTy Diffusion for Property Guided Antibody Design. (arXiv:2309.13129v1 [q-bio.BM])
- Independent projections of diffusions: Gradient flows for variational inference and optimal mean field approximations. (arXiv:2309.13332v1 [math.PR])
- Self-Tuning Hamiltonian Monte Carlo for Accelerated Sampling. (arXiv:2309.13593v2 [physics.comp-ph] UPDATED)
- Langevin Quasi-Monte Carlo. (arXiv:2309.12664v1 [stat.CO])
- Flow Annealed Kalman Inversion for Gradient-Free Inference in Bayesian Inverse Problems. (arXiv:2309.11490v1 [stat.CO])
- The Topology and Geometry of Neural Representations. (arXiv:2309.11028v2 [q-bio.NC] UPDATED)
- General Optimal Step-size and Initializations for ADMM: A Proximal Operator View. (arXiv:2309.10124v1 [math.OC])
- Diffusion Methods for Generating Transition Paths. (arXiv:2309.10276v1 [physics.comp-ph])
- ChatGPT-4 with Code Interpreter can be used to solve introductory college-level vector calculus and electromagnetism problems. (arXiv:2309.08881v1 [cs.AI])
- High-dimensional manifold of solutions in neural networks: insights from statistical physics. (arXiv:2309.09240v1 [cond-mat.dis-nn])
- Diffusion Generative Inverse Design. (arXiv:2309.02040v2 [cs.LG] UPDATED)
- Finite Expression Methods for Discovering Physical Laws from Data. (arXiv:2305.08342v2 [cs.LG] UPDATED)
- Accelerated Gradient Descent via Long Steps. (arXiv:2309.09961v2 [math.OC] UPDATED)
- Computational Optimal Transport and Filtering on Riemannian manifolds. (arXiv:2309.08847v2 [math.OC] UPDATED)
- Sampling-Free Probabilistic Deep State-Space Models. (arXiv:2309.08256v1 [cs.LG])
- A Geometric Perspective on Autoencoders. (arXiv:2309.08247v2 [cs.LG] UPDATED)
- Projected Langevin dynamics and a gradient flow for entropic optimal transport. (arXiv:2309.08598v1 [math.PR])
- Optimization Algorithm Synthesis based on Integral Quadratic Constraints: A Tutorial. (arXiv:2306.00565v2 [math.OC] UPDATED)
- Bringing PDEs to JAX with forward and reverse modes automatic differentiation. (arXiv:2309.07137v1 [cs.MS])
- Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models. (arXiv:2309.05803v1 [cs.RO])
- Elucidating the solution space of extended reverse-time SDE for diffusion models. (arXiv:2309.06169v2 [cs.LG] UPDATED)
- Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood. (arXiv:2309.05153v2 [stat.ML] UPDATED)
- SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models. (arXiv:2309.05019v1 [cs.LG])
- Generalization Bounds: Perspectives from Information Theory and PAC-Bayes. (arXiv:2309.04381v1 [cs.LG])
- Computationally Efficient Data-Driven Discovery and Linear Representation of Nonlinear Systems For Control. (arXiv:2309.04074v1 [eess.SY])
- Early warning indicators via latent stochastic dynamical systems. (arXiv:2309.03842v2 [stat.ML] UPDATED)
- Shooting methods for computing geodesics on the Stiefel manifold. (arXiv:2309.03585v1 [math.NA])
- Large Language Models as Optimizers. (arXiv:2309.03409v2 [cs.LG] UPDATED)
- It\^o versus H\"anggi-Klimontovich. (arXiv:2309.03654v1 [math-ph])
- Concepts in Monte Carlo sampling. (arXiv:2309.03136v1 [cond-mat.stat-mech])
- Well-posedness and averaging principle for L\'evy-type McKean-Vlasov stochastic differential equations under local Lipschitz conditions. (arXiv:2309.02906v1 [math.PR])
- Exact Inference for Continuous-Time Gaussian Process Dynamics. (arXiv:2309.02351v2 [cs.LG] UPDATED)
- Backward error analysis and the qualitative behaviour of stochastic optimization algorithms: Application to stochastic coordinate descent. (arXiv:2309.02082v1 [math.OC])
- Accelerating Markov Chain Monte Carlo sampling with diffusion models. (arXiv:2309.01454v1 [hep-ph])
- CausalLM is not optimal for in-context learning. (arXiv:2308.06912v2 [cs.LG] UPDATED)
- An Ensemble Score Filter for Tracking High-Dimensional Nonlinear Dynamical Systems. (arXiv:2309.00983v1 [stat.ML])
- Controlled Martingale Problems And Their Markov Mimics. (arXiv:2309.00488v1 [math.PR])
- Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair. (arXiv:2309.00608v3 [cs.SE] UPDATED)
- Structure and Gradient Dynamics Near Global Minima of Two-layer Neural Networks. (arXiv:2309.00508v1 [cs.LG])
- Why do universal adversarial attacks work on large language models?: Geometry might be the answer. (arXiv:2309.00254v1 [cs.LG])
- Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution. (arXiv:2309.00287v2 [cs.CV] UPDATED)
- On the Implicit Bias of Adam. (arXiv:2309.00079v3 [cs.LG] UPDATED)
- Transformers as Support Vector Machines. (arXiv:2308.16898v2 [cs.LG] UPDATED)
- Exploring Model Transferability through the Lens of Potential Energy. (arXiv:2308.15074v1 [cs.CV])
- Noise-Free Sampling Algorithms via Regularized Wasserstein Proximals. (arXiv:2308.14945v3 [stat.ML] UPDATED)
- Score-Based Diffusion Models as Principled Priors for Inverse Imaging. (arXiv:2304.11751v2 [cs.CV] UPDATED)
- Compositional maps for registration in complex geometries. (arXiv:2308.15307v1 [math.NA])
- Continuous-time stochastic gradient descent for optimizing over the stationary distribution of stochastic differential equations. (arXiv:2202.06637v2 [cs.LG] UPDATED)
- Learning variational autoencoders via MCMC speed measures. (arXiv:2308.13731v1 [stat.ML])
- Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective. (arXiv:2308.14085v1 [cond-mat.dis-nn])
- Solving Forward and Inverse Problems of Contact Mechanics using Physics-Informed Neural Networks. (arXiv:2308.12716v1 [math.NA])
- Score diffusion models without early stopping: finite Fisher information is all you need. (arXiv:2308.12240v1 [math.ST])
- Compositional nonlinear audio signal processing with Volterra series. (arXiv:2308.07229v3 [eess.AS] UPDATED)
- On-Manifold Projected Gradient Descent. (arXiv:2308.12279v1 [cs.LG])
- System Identification for Continuous-time Linear Dynamical Systems. (arXiv:2308.11933v2 [cs.LG] UPDATED)
- Solving Elliptic Optimal Control Problems using Physics Informed Neural Networks. (arXiv:2308.11925v1 [math.OC])
- Boosting Diffusion Models with an Adaptive Momentum Sampler. (arXiv:2308.11941v1 [cs.CV])
- Understanding Hessian Alignment for Domain Generalization. (arXiv:2308.11778v1 [cs.LG])
- Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model. (arXiv:2307.00623v2 [cs.LG] UPDATED)
- Diffusion Model as Representation Learner. (arXiv:2308.10916v1 [cs.CV])
- Nonlinear Hamiltonian Monte Carlo & its Particle Approximation. (arXiv:2308.11491v1 [math.ST])
- Convergence guarantee for consistency models. (arXiv:2308.11449v1 [math.NA])
- Semi-Implicit Variational Inference via Score Matching. (arXiv:2308.10014v1 [stat.ML])
- Understanding Self-attention Mechanism via Dynamical System Perspective. (arXiv:2308.09939v1 [cs.CV])
- Unbiased Image Synthesis via Manifold-Driven Sampling in Diffusion Models. (arXiv:2307.08199v2 [cs.CV] UPDATED)
- A Principle for Global Optimization with Gradients. (arXiv:2308.09556v1 [math.OC])
- Accelerated Bayesian imaging by relaxed proximal-point Langevin sampling. (arXiv:2308.09460v2 [stat.CO] UPDATED)
- Ensemble Kalman Filters with Resampling. (arXiv:2308.08751v1 [eess.SY])
- Can Transformers Learn Optimal Filtering for Unknown Systems?. (arXiv:2308.08536v1 [eess.SY])
- On the generalized vectorization and its inverse. (arXiv:2308.07928v2 [math.NA] UPDATED)
- SynJax: Structured Probability Distributions for JAX. (arXiv:2308.03291v3 [cs.LG] UPDATED)
- Sign Gradient Descent Algorithms for Kinetostatic Protein Folding. (arXiv:2308.07453v1 [eess.SY])
- GANs as Gradient Flows that Converge
- How many samples are needed to leverage smoothness?. (arXiv:2305.16014v3 [stat.ML] UPDATED)
- Mirror Diffusion Models. (arXiv:2308.06342v2 [cs.LG] UPDATED)
- Sampling and Filtering with Markov Chains. (arXiv:2308.06192v1 [math.ST])
- Homogenization of conditional slow-fast McKean-Vlasov SDEs. (arXiv:2308.05874v1 [math.PR])
- A Law of Data Separation in Deep Learning. (arXiv:2210.17020v2 [cs.LG] UPDATED)
- Filtering Dynamical Systems Using Observations of Statistics. (arXiv:2308.05484v2 [stat.ME] UPDATED)
- On the Stability and Convergence of Physics Informed Neural Networks. (arXiv:2308.05423v1 [math.NA])
- Gaussian Cooling and Dikin Walks: The Interior-Point Method for Logconcave Sampling. (arXiv:2307.12943v3 [cs.DS] UPDATED)
- Finite Element Operator Network for Solving Parametric PDEs. (arXiv:2308.04690v2 [math.NA] UPDATED)
- Symplectic Discretization Approach for Developing New Proximal Point Algorithms. (arXiv:2308.03986v3 [math.OC] UPDATED)
- A Review of Change of Variable Formulas for Generative Modeling. (arXiv:2308.02652v1 [cs.LG])
- SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation. (arXiv:2308.02154v1 [cs.CV])
- Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling. (arXiv:2308.02157v1 [cs.LG])
- Strong convergence of multiscale truncated Euler-Maruyama method for super-linear slow-fast stochastic differential equations. (arXiv:2308.02110v1 [math.NA])
- Generative Modelling of L\'{e}vy Area for High Order SDE Simulation. (arXiv:2308.02452v1 [stat.ML])
- Statistical Estimation Under Distribution Shift: Wasserstein Perturbations and Minimax Theory. (arXiv:2308.01853v2 [stat.ML] UPDATED)
- Non-equilibrium physics: from spin glasses to machine and neural learning. (arXiv:2308.01538v1 [cond-mat.dis-nn])
- Divergence of the ADAM algorithm with fixed-stepsize: a (very) simple example. (arXiv:2308.00720v1 [cs.LG])
- Mirror Natural Evolution Strategies. (arXiv:2308.00469v1 [cs.LG])
- Exploring how a Generative AI interprets music. (arXiv:2308.00015v1 [cs.SD])
- Geometric Ergodicity and Wasserstein Continuity of Non-Linear Filters. (arXiv:2307.15764v2 [math.PR] UPDATED)
- Faster Stochastic Algorithms for Minimax Optimization under Polyak--{\L}ojasiewicz Conditions. (arXiv:2307.15868v1 [math.OC])
- How regularization affects the geometry of loss functions. (arXiv:2307.15744v1 [cs.LG])
- Inexact proximal methods for weakly convex functions. (arXiv:2307.15596v2 [math.OC] UPDATED)
- Nonlinear Convex Optimization: From Relaxed Proximal Point Algorithm to Prediction Correction Method. (arXiv:2307.14615v1 [math.OC])
- Linear Convergence of Black-Box Variational Inference: Should We Stick the Landing?. (arXiv:2307.14642v2 [stat.ML] UPDATED)
- A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot. (arXiv:2307.14397v1 [cs.CV])
- BayesDAG: Gradient-Based Posterior Inference for Causal Discovery. (arXiv:2307.13917v2 [cs.LG] UPDATED)
- Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization. (arXiv:2307.11007v2 [cs.LG] UPDATED)
- Convergence of Adam for Non-convex Objectives: Relaxed Hyperparameters and Non-ergodic Case. (arXiv:2307.11782v1 [math.OC])
- On the Fisher-Rao Gradient of the Evidence Lower Bound. (arXiv:2307.11249v1 [cs.LG])
- Optimal importance sampling for overdamped Langevin dynamics. (arXiv:2307.11744v1 [stat.ME])
- Amortized Variational Inference: When and Why?. (arXiv:2307.11018v2 [stat.ML] UPDATED)
- Properties of Discrete Sliced Wasserstein Losses. (arXiv:2307.10352v2 [stat.ML] UPDATED)
- LongNet: Scaling Transformers to 1,000,000,000 Tokens. (arXiv:2307.02486v2 [cs.CL] UPDATED)
- Gradient Surgery for One-shot Unlearning on Generative Model. (arXiv:2307.04550v2 [cs.LG] UPDATED)
- A Quick Guide for the Iterated Extended Kalman Filter on Manifolds. (arXiv:2307.09237v3 [eess.SY] UPDATED)
- A Survey of Techniques for Optimizing Transformer Inference. (arXiv:2307.07982v1 [cs.LG])
- Transformers are Universal Predictors. (arXiv:2307.07843v1 [cs.LG])
- Complexity Matters: Rethinking the Latent Space for Generative Modeling. (arXiv:2307.08283v2 [cs.LG] UPDATED)
- Variational Inference with Gaussian Score Matching. (arXiv:2307.07849v1 [stat.ML])
- Training Discrete Energy-Based Models with Energy Discrepancy. (arXiv:2307.07595v1 [stat.ML])
- The Future of Fundamental Science Led by Generative Closed-Loop Artificial Intelligence. (arXiv:2307.07522v3 [cs.AI] UPDATED)
- Control of neural transport for normalizing flows. (arXiv:2307.07817v2 [math.OC] UPDATED)
- Optimal contract design via relaxation: application to the problem of brokerage fee for a client with private signal. (arXiv:2307.07010v1 [q-fin.MF])
- Embracing the chaos: analysis and diagnosis of numerical instability in variational flows. (arXiv:2307.06957v2 [stat.ML] UPDATED)
- Accelerated gradient methods for nonconvex optimization: Escape trajectories from strict saddle points and convergence to local minima. (arXiv:2307.07030v1 [math.OC])
- Tensor Decompositions Meet Control Theory: Learning General Mixtures of Linear Dynamical Systems. (arXiv:2307.06538v2 [cs.LG] UPDATED)
- Energy Discrepancies: A Score-Independent Loss for Energy-Based Models. (arXiv:2307.06431v2 [stat.ML] UPDATED)
- Optimal Algorithms for Numerical Integration: Recent Results and Open Problems. (arXiv:2307.06787v1 [math.NA])
- Deep Generative Models for Decision-Making and Control. (arXiv:2306.08810v2 [cs.LG] UPDATED)
- DDGM: Solving inverse problems by Diffusive Denoising of Gradient-based Minimization. (arXiv:2307.04946v1 [cs.CV])
- Geometric Neural Diffusion Processes. (arXiv:2307.05431v1 [stat.ML])
- On the randomized Euler algorithm under inexact information. (arXiv:2307.04718v1 [math.NA])
- Stability Analysis for Electromagnetic Waveguides. Part 1: Acoustic and Homogeneous Electromagnetic Waveguides. (arXiv:2307.04521v1 [math.NA])
- A generative flow for conditional sampling via optimal transport. (arXiv:2307.04102v1 [stat.ML])
- Revisiting the Two-Filter Formula for Smoothing for State-Space Models. (arXiv:2307.03428v1 [stat.CO])
- Strategic Distribution Shift of Interacting Agents via Coupled Gradient Flows. (arXiv:2307.01166v3 [cs.LG] UPDATED)
- Adaptive Strategies in Non-convex Optimization. (arXiv:2306.10278v2 [cs.LG] UPDATED)
- Differentiable Turbulence. (arXiv:2307.03683v1 [physics.flu-dyn])
- Large Deviations and Metastability Analysis for Heavy-Tailed Dynamical Systems. (arXiv:2307.03479v1 [math.PR])
- On the convergence of dynamic implementations of Hamiltonian Monte Carlo and No U-Turn Samplers. (arXiv:2307.03460v1 [stat.CO])
- A Journey into Matrix Analysis. (arXiv:2307.03064v1 [math.FA])
- Convergence and concentration properties of constant step-size SGD through Markov chains. (arXiv:2306.11497v2 [stat.ML] UPDATED)
- DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks. (arXiv:2307.02159v1 [stat.ML])
- Training Energy-Based Models with Diffusion Contrastive Divergences. (arXiv:2307.01668v1 [cs.LG])
- Reverse Diffusion Monte Carlo. (arXiv:2307.02037v2 [stat.ML] UPDATED)
- Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows. (arXiv:2307.00144v1 [cs.LG])
- Transport meets Variational Inference: Controlled Monte Carlo Diffusions. (arXiv:2307.01050v5 [stat.ML] UPDATED)
- Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models. (arXiv:2307.00619v1 [cs.LG])
- An Introduction to Stochastic PDEs. (arXiv:0907.4178v2 [math.PR] UPDATED)
- Improved sampling via learned diffusions. (arXiv:2307.01198v1 [cs.LG])
- On Higher Order Drift and Diffusion Estimates for Stochastic SINDy. (arXiv:2306.17814v2 [math.NA] UPDATED)
- Proximal Langevin Sampling With Inexact Proximal Mapping. (arXiv:2306.17737v1 [stat.CO])
- Practical and Asymptotically Exact Conditional Sampling in Diffusion Models. (arXiv:2306.17775v1 [stat.ML])
- Designing Stable Neural Networks using Convex Analysis and ODEs. (arXiv:2306.17332v1 [cs.LG])
- Approximate Inference via Fibrations of Statistical Games. (arXiv:2306.17009v2 [math.CT] UPDATED)
- The Underlying Scaling Laws and Universal Statistical Structure of Complex Datasets. (arXiv:2306.14975v2 [cs.LG] UPDATED)
- Recent Advances in Optimal Transport for Machine Learning. (arXiv:2306.16156v1 [cs.LG])
- The curse of dimensionality in operator learning. (arXiv:2306.15924v1 [cs.LG])
- Improved error estimate for the order of strong convergence of the Euler method for random ordinary differential equations. (arXiv:2306.15418v4 [math.PR] UPDATED)
- Automating Steady and Unsteady Adjoints: Efficiently Utilizing Implicit and Algorithmic Differentiation. (arXiv:2306.15243v1 [math.OC])
- Linearizability of flows by embeddings. (arXiv:2305.18288v4 [math.DS] UPDATED)
- On Scalable Testing of Samplers. (arXiv:2306.13958v1 [cs.DS])
- Efficient preconditioned stochastic gradient descent for estimation in latent variable models. (arXiv:2306.12841v1 [math.ST])
- Balanced Training of Energy-Based Models with Adaptive Flow Sampling. (arXiv:2306.00684v3 [cs.LG] UPDATED)
- Thermodynamics of Information. (arXiv:2306.12447v1 [cond-mat.stat-mech])
- On the log-Sobolev constant of log-concave measures. (arXiv:2306.12997v1 [math.FA])
- Any Deep ReLU Network is Shallow. (arXiv:2306.11827v1 [cs.LG])
- Open Problem: Learning with Variational Objectives on Measures. (arXiv:2306.11928v2 [stat.ML] UPDATED)
- No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths. (arXiv:2306.11922v1 [cs.LG])
- Convergence and concentration properties of constant step-size SGD through Markov chains. (arXiv:2306.11497v2 [stat.ML] UPDATED)
- Towards Stability of Autoregressive Neural Operators. (arXiv:2306.10619v2 [cs.LG] UPDATED)
- Fast Conditional Mixing of MCMC Algorithms for Non-log-concave Distributions. (arXiv:2306.10506v2 [cs.LG] UPDATED)
- Trained Transformers Learn Linear Models In-Context. (arXiv:2306.09927v3 [stat.ML] UPDATED)
- Adjoint and Its roles in Sciences, Engineering, and Mathematics: A Tutorial. (arXiv:2306.09917v1 [math.FA])
- Nonlinear Fokker--Planck--Kolmogorov equations as gradient flows on the space of probability measures. (arXiv:2306.09530v2 [math.AP] UPDATED)
- Second order quantitative bounds for unadjusted generalized Hamiltonian Monte Carlo. (arXiv:2306.09513v1 [math.PR])
- Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima Flatness, and Transfer Learning. (arXiv:2306.09991v1 [cs.NE])
- Langevin Monte Carlo for strongly log-concave distributions: Randomized midpoint revisited. (arXiv:2306.08494v2 [math.ST] UPDATED)
- Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models. (arXiv:2306.09251v2 [stat.ML] UPDATED)
- New Methods for Parametric Optimization via Differential Equations. (arXiv:2306.08812v1 [math.OC])
- Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Diffusions. (arXiv:2306.09332v3 [cs.DS] UPDATED)
- Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning. (arXiv:2306.08590v1 [cs.LG])
- Contraction Rate Estimates of Stochastic Gradient Kinetic Langevin Integrators. (arXiv:2306.08592v1 [math.NA])
- Noise Stability Optimization for Flat Minima with Tight Rates. (arXiv:2306.08553v2 [cs.LG] UPDATED)
- Langevin Monte Carlo for strongly log-concave distributions: Randomized midpoint revisited. (arXiv:2306.08494v2 [math.ST] UPDATED)
- Differentiating Metropolis-Hastings to Optimize Intractable Densities. (arXiv:2306.07961v3 [stat.ML] UPDATED)
- Learning Unnormalized Statistical Models via Compositional Optimization. (arXiv:2306.07485v1 [cs.LG])
- Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning. (arXiv:2306.04815v1 [cs.LG] CROSS LISTED)
- Stability estimates for initial data in general Ornstein-Uhlenbeck equations. (arXiv:2306.06763v1 [math.AP])
- Quadratic models for understanding neural network dynamics. (arXiv:2205.11787v2 [cs.LG] CROSS LISTED)
- Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction. (arXiv:2306.07221v1 [cs.LG])
- Latent Dynamical Implicit Diffusion Processes. (arXiv:2306.07077v2 [cs.LG] UPDATED)
- Can Forward Gradient Match Backpropagation?. (arXiv:2306.06968v1 [cs.LG])
- On Kinetic Optimal Probability Paths for Generative Models. (arXiv:2306.06626v1 [cs.LG])
- Markov bases: a 25 year update. (arXiv:2306.06270v3 [stat.ME] UPDATED)
- Asymptotically efficient one-step stochastic gradient descent. (arXiv:2306.05896v1 [math.ST])
- Data-Adaptive Probabilistic Likelihood Approximation for Ordinary Differential Equations. (arXiv:2306.05566v2 [stat.ML] UPDATED)
- Beyond Vanilla Variational Autoencoders: Detecting Posterior Collapse in Conditional and Hierarchical Variational Autoencoders. (arXiv:2306.05023v2 [stat.ML] UPDATED)
- Causal normalizing flows: from theory to practice. (arXiv:2306.05415v2 [cs.LG] UPDATED)
- Interpreting and Improving Diffusion Models Using the Euclidean Distance Function. (arXiv:2306.04848v2 [cs.LG] UPDATED)
- Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models. (arXiv:2306.04675v2 [cs.LG] UPDATED)
- Unscented Autoencoder. (arXiv:2306.05256v1 [cs.LG])
- Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks. (arXiv:2306.04251v2 [cs.LG] UPDATED)
- Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models. (arXiv:2306.03249v1 [cs.LG])
- Provable convergence guarantees for black-box variational inference. (arXiv:2306.03638v3 [cs.LG] UPDATED)
- Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs. (arXiv:2306.03081v2 [cs.AI] UPDATED)
- Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence. (arXiv:2306.02572v1 [cs.LG])
- Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm. (arXiv:2306.02159v1 [math.ST])
- Correcting auto-differentiation in neural-ODE training. (arXiv:2306.02192v1 [cs.LG])
- Averaging principle for McKean-Vlasov SDEs driven by multiplicative fractional noise with highly oscillatory drift coefficient. (arXiv:2306.02028v1 [math.PR])
- GFlowNet-EM for learning compositional latent variable models. (arXiv:2302.06576v2 [cs.LG] UPDATED)
- Lifting Architectural Constraints of Injective Flows. (arXiv:2306.01843v3 [cs.LG] UPDATED)
- Entropic mean-field min-max problems via Best Response and Fisher-Rao flows. (arXiv:2306.03033v1 [math.OC])
- Curvature and complexity: Better lower bounds for geodesically convex optimization. (arXiv:2306.02959v2 [math.OC] UPDATED)
- Aiming towards the minimizers: fast convergence of SGD for overparametrized problems. (arXiv:2306.02601v1 [cs.LG])
- KL-Divergence Guided Temperature Sampling. (arXiv:2306.01286v2 [cs.CL] UPDATED)
- TIES-Merging: Resolving Interference When Merging Models. (arXiv:2306.01708v2 [cs.LG] UPDATED)
- Learning Transformer Programs. (arXiv:2306.01128v2 [cs.LG] UPDATED)
- The Fisher Geometry and Geodesics of the Multivariate Normals, without Differential Geometry. (arXiv:2306.01278v1 [math.ST])
- Convex and Non-convex Optimization Under Generalized Smoothness. (arXiv:2306.01264v2 [math.OC] UPDATED)
- Conditionally Strongly Log-Concave Generative Models. (arXiv:2306.00181v1 [stat.ML])
- From Perception to Programs: Regularize, Overparameterize, and Amortize. (arXiv:2206.05922v2 [cs.AI] UPDATED)
- The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent. (arXiv:2305.17490v2 [stat.ML] UPDATED)
- Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective. (arXiv:2305.15408v5 [cs.LG] UPDATED)
- Parameterized Wasserstein Hamiltonian Flow. (arXiv:2306.00191v2 [math.NA] UPDATED)
- Generalized Implicit Follow-The-Regularized-Leader. (arXiv:2306.00201v1 [cs.LG])
- Fast global convergence of gradient descent for low-rank matrix approximation. (arXiv:2305.19206v1 [math.OC])
- Forensic analysis of the Turkey 2023 presidential election reveals extreme vote swings in remote areas. (arXiv:2305.19168v2 [stat.AP] UPDATED)
- On the Role of Noise in the Sample Complexity of Learning Recurrent Neural Networks: Exponential Gaps for Long Sequences. (arXiv:2305.18423v1 [stat.ML])
- A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction. (arXiv:2305.19043v1 [cs.LG])
- The Curse of Recursion: Training on Generated Data Makes Models Forget. (arXiv:2305.17493v2 [cs.LG] UPDATED)
- Spontaneous Symmetry Breaking in Generative Diffusion Models. (arXiv:2305.19693v3 [cs.LG] UPDATED)
- A Unified Framework for U-Net Design and Analysis. (arXiv:2305.19638v2 [stat.ML] UPDATED)
- Chain of Log-Concave Markov Chains. (arXiv:2305.19473v2 [stat.ML] UPDATED)
- Resampling schemes in population annealing: Numerical and theoretical results. (arXiv:2305.19994v2 [cond-mat.stat-mech] UPDATED)
- Conditional score-based diffusion models for Bayesian inference in infinite dimensions. (arXiv:2305.19147v2 [stat.ML] UPDATED)
- A Measure-Theoretic Axiomatisation of Causality. (arXiv:2305.17139v2 [cs.AI] UPDATED)
- Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods. (arXiv:2305.18268v1 [math.PR])
- Fast and Minimax Optimal Estimation of Low-Rank Matrices via Non-Convex Gradient Descent. (arXiv:2305.17224v1 [math.OC])
- Stochastic resetting in interacting particle systems: A review. (arXiv:2305.16955v2 [cond-mat.stat-mech] UPDATED)
- Kaczmarz-Type Method for Solving Matrix Equation $AXB=C$. (arXiv:2305.16684v1 [math.NA])
- Unifying GANs and Score-Based Diffusion as Generative Particle Models. (arXiv:2305.16150v3 [cs.LG] UPDATED)
- How many samples are needed to leverage smoothness?. (arXiv:2305.16014v3 [stat.ML] UPDATED)
- Contracting Dynamics for Time-Varying Convex Optimization. (arXiv:2305.15595v1 [math.OC])
- CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models. (arXiv:2305.14916v1 [stat.ML])
- Training Energy-Based Normalizing Flow with Score-Matching Objectives. (arXiv:2305.15267v2 [cs.LG] UPDATED)
- Subsampling Error in Stochastic Gradient Langevin Diffusions. (arXiv:2305.13882v1 [stat.ML])
- On the Convergence of Black-Box Variational Inference. (arXiv:2305.15349v4 [cs.LG] UPDATED)
- A Rigorous Link between Deep Ensembles and (Variational) Bayesian Methods. (arXiv:2305.15027v2 [stat.ML] UPDATED)
- On progressive sharpening, flat minima and generalisation. (arXiv:2305.14683v4 [cs.LG] UPDATED)
- One-step differentiation of iterative algorithms. (arXiv:2305.13768v1 [math.OC])
- Improved rates of convergence for the multivariate Central Limit Theorem in Wasserstein distance. (arXiv:2305.14248v3 [math.PR] UPDATED)
- Towards Understanding the Dynamics of Gaussian-Stein Variational Gradient Descent. (arXiv:2305.14076v4 [math.ST] UPDATED)
- Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient Descent. (arXiv:2305.12056v2 [stat.ML] UPDATED)
- Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond. (arXiv:2305.13064v1 [cs.LG])
- Variational Diffusion Auto-encoder: Latent Space Extraction from Pre-trained Diffusion Models. (arXiv:2304.12141v2 [cs.LG] UPDATED)
- The probability flow ODE is provably fast. (arXiv:2305.11798v1 [cs.LG])
- Moment Matching Denoising Gibbs Sampling. (arXiv:2305.11650v5 [stat.ML] UPDATED)
- Sampling, Diffusions, and Stochastic Localization. (arXiv:2305.10690v1 [cs.LG])
- Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces. (arXiv:2305.11089v1 [cs.LG])
- A Theory of General Difference in Continuous and Discrete Domain. (arXiv:2305.08098v2 [cs.DM] UPDATED)
- Common Diffusion Noise Schedules and Sample Steps are Flawed. (arXiv:2305.08891v4 [cs.CV] UPDATED)
- On the connections between optimization algorithms, Lyapunov functions, and differential equations: theory and insights. (arXiv:2305.08658v1 [math.OC])
- A Dynamical Systems Perspective on Discrete Optimization. (arXiv:2305.08536v1 [math.OC])
- Generative AI: Implications and Applications for Education. (arXiv:2305.07605v3 [cs.CY] UPDATED)
- From Denoising Diffusions to Denoising Markov Models. (arXiv:2211.03595v2 [stat.ML] UPDATED)
- The Compositional Structure of Bayesian Inference. (arXiv:2305.06112v2 [math.CT] UPDATED)
- UAdam: Unified Adam-Type Algorithmic Framework for Non-Convex Stochastic Optimization. (arXiv:2305.05675v1 [cs.LG])
- The emergence of clusters in self-attention dynamics. (arXiv:2305.05465v4 [cs.LG] UPDATED)
- A Variational Perspective on Solving Inverse Problems with Diffusion Models. (arXiv:2305.04391v2 [cs.LG] UPDATED)
- Gradient descent with a general cost. (arXiv:2305.04917v2 [math.OC] UPDATED)
- Accelerated Stochastic Optimization Methods under Quasar-convexity. (arXiv:2305.04736v2 [math.OC] UPDATED)
- The complexity of first-order optimization methods from a metric perspective. (arXiv:2305.03208v1 [math.OC])
- A categorical treatment of the Radon-Nikodym theorem and martingales. (arXiv:2305.03421v2 [math.CT] UPDATED)
- On a Unified and Simplified Proof for the Ergodic Convergence Rates of PPM, PDHG and ADMM. (arXiv:2305.02165v2 [math.OC] UPDATED)
- The Pseudoinverse of $A=CR$ is $A^+=R^+C^+$ (?). (arXiv:2305.01716v2 [math.NA] UPDATED)
- Hessian-informed Hamiltonian Monte Carlo for high-dimensional problems. (arXiv:2305.01576v1 [stat.CO])
- Blended Latent Diffusion. (arXiv:2206.02779v2 [cs.CV] UPDATED)
- Class-Balancing Diffusion Models. (arXiv:2305.00562v2 [cs.CV] UPDATED)
- Predictions Based on Pixel Data: Insights from PDEs and Finite Differences. (arXiv:2305.00723v1 [math.NA])
- Generative Diffusion Models on Graphs: Methods and Applications. (arXiv:2302.02591v3 [cs.LG] UPDATED)
- On Underdamped Nesterov's Acceleration. (arXiv:2304.14642v1 [math.OC])
- Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be. (arXiv:2304.13960v1 [cs.LG])
- Categorical Foundations of Explainable AI: A Unifying Theory. (arXiv:2304.14094v3 [cs.AI] UPDATED)
- Statistical Learning Theory for Control: A Finite Sample Perspective. (arXiv:2209.05423v2 [eess.SY] UPDATED)
- Convergence of Adam Under Relaxed Assumptions. (arXiv:2304.13972v3 [math.OC] UPDATED)
- Score-based Generative Modeling Through Backward Stochastic Differential Equations: Inversion and Generation. (arXiv:2304.13224v1 [cs.LG])
- Energy-Based Sliced Wasserstein Distance. (arXiv:2304.13586v3 [stat.ML] UPDATED)
- Latent Traversals in Generative Models as Potential Flows. (arXiv:2304.12944v2 [cs.LG] UPDATED)
- The secret life of matrix factorizations: how matrix decompositions reveal and keep secrets of linear equations and what we can do about it. (arXiv:2304.12451v1 [math.NA])
- Fast Diffusion Probabilistic Model Sampling through the lens of Backward Error Analysis. (arXiv:2304.11446v1 [cs.CV])
- An Introduction to Transformers. (arXiv:2304.10557v4 [cs.LG] UPDATED)
- Analysis of a Computational Framework for Bayesian Inverse Problems: Ensemble Kalman Updates and MAP Estimators Under Mesh Refinement. (arXiv:2304.09933v1 [math.NA])
- A Latent Space Theory for Emergent Abilities in Large Language Models. (arXiv:2304.09960v3 [cs.CL] UPDATED)
- Understanding Accelerated Gradient Methods: Lyapunov Analyses and Hamiltonian Assisted Interpretations. (arXiv:2304.10063v1 [math.OC])
- A Theory on Adam Instability in Large-Scale Machine Learning. (arXiv:2304.09871v2 [cs.LG] UPDATED)
- Analysis of a Computational Framework for Bayesian Inverse Problems: Ensemble Kalman Updates and MAP Estimators Under Mesh Refinement. (arXiv:2304.09933v1 [math.NA])
- Weak Convergence Of Tamed Exponential Integrators for Stochastic Differential Equations. (arXiv:2304.09496v1 [math.NA])
- Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models. (arXiv:2304.09842v3 [cs.CL] UPDATED)
- Convergence of stochastic gradient descent under a local Lojasiewicz condition for deep neural networks. (arXiv:2304.09221v2 [cs.LG] UPDATED)
- Particle-based Variational Inference with Preconditioned Functional Gradient Flow. (arXiv:2211.13954v2 [stat.ML] UPDATED)
- Sliced Optimal Transport on the Sphere. (arXiv:2304.09092v2 [math.NA] UPDATED)
- Bayes Hilbert Spaces for Posterior Approximation. (arXiv:2304.09053v1 [math.ST])
- Learning in latent spaces improves the predictive accuracy of deep neural operators. (arXiv:2304.07599v1 [cs.LG])
- Non-asymptotic convergence bounds for Sinkhorn iterates and their gradients: a coupling approach. (arXiv:2304.06549v2 [math.PR] UPDATED)
- Delta Denoising Score. (arXiv:2304.07090v1 [cs.CV])
- What does self-attention learn from Masked Language Modelling?. (arXiv:2304.07235v2 [cond-mat.dis-nn] UPDATED)
- Energy-guided Entropic Neural Optimal Transport. (arXiv:2304.06094v3 [cs.LG] UPDATED)
- Forward-backward Gaussian variational inference via JKO in the Bures-Wasserstein Space. (arXiv:2304.05398v1 [math.ST])
- Diffusion models with location-scale noise. (arXiv:2304.05907v1 [cs.LG])
- Regulatory Markets: The Future of AI Governance. (arXiv:2304.04914v4 [cs.AI] UPDATED)
- Binary Latent Diffusion. (arXiv:2304.04820v1 [cs.CV])
- Diffusion Models for Constrained Domains. (arXiv:2304.05364v1 [cs.LG])
- Bayesian Optimization of Catalysts With In-context Learning. (arXiv:2304.05341v1 [physics.chem-ph])
- Simulated Annealing in Early Layers Leads to Better Generalization. (arXiv:2304.04858v1 [cs.LG])
- Criticality versus uniformity in deep neural networks. (arXiv:2304.04784v1 [cs.LG])
- Local Conditions for Global Convergence of Gradient Flows and Proximal Point Sequences in Metric Spaces. (arXiv:2304.05239v1 [math.OC])
- Gradient flows of interacting Laguerre cells as discrete porous media flows. (arXiv:2304.05069v1 [math.NA])
- A Family of Iteration Functions for General Linear Systems. (arXiv:2304.04940v2 [math.NA] UPDATED)
- A Simple Proof of the Mixing of Metropolis-Adjusted Langevin Algorithm under Smoothness and Isoperimetry. (arXiv:2304.04095v2 [stat.ML] UPDATED)
- Reflected Diffusion Models. (arXiv:2304.04740v3 [stat.ML] UPDATED)
- When does Metropolized Hamiltonian Monte Carlo provably outperform Metropolis-adjusted Langevin algorithm?. (arXiv:2304.04724v2 [stat.CO] UPDATED)
- Deep Generative Modeling with Backward Stochastic Differential Equations. (arXiv:2304.04049v1 [cs.LG])
- Interpretable statistical representations of neural population dynamics and geometry. (arXiv:2304.03376v2 [cs.LG] UPDATED)
- Transforming Butterflies into Graphs: Statistics of Chaotic and Turbulent Systems. (arXiv:2304.03362v1 [physics.flu-dyn])
- Deep linear networks can benignly overfit when shallow ones do
- Causal inference is not just a statistics problem. (arXiv:2304.02683v3 [stat.ME] UPDATED)
- Convergence Rates for Non-Log-Concave Sampling and Log-Partition Estimation. (arXiv:2303.03237v3 [stat.ML] UPDATED)
- Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry. (arXiv:2304.02902v1 [stat.ML])
- GenPhys: From Physical Processes to Generative Models. (arXiv:2304.02637v1 [cs.LG])
- Query lower bounds for log-concave sampling. (arXiv:2304.02599v2 [math.ST] UPDATED)
- Solidarity of Gibbs Samplers: the spectral gap. (arXiv:2304.02109v1 [stat.CO])
- Effective Theory of Transformers at Initialization. (arXiv:2304.02034v1 [cs.LG])
- Diffusion Bridge Mixture Transports, Schr\"odinger Bridge Problems and Generative Modeling. (arXiv:2304.00917v2 [stat.ML] UPDATED)
- Diffusion map particle systems for generative modeling. (arXiv:2304.00200v2 [stat.ML] UPDATED)
- DRIP: Deep Regularizers for Inverse Problems. (arXiv:2304.00015v2 [cs.LG] UPDATED)
- On the Sample Complexity of the Linear Quadratic Gaussian Regulator. (arXiv:2304.00381v2 [math.OC] UPDATED)
- Universal approximation of flows of control systems by recurrent neural networks. (arXiv:2304.00352v2 [eess.SY] UPDATED)
- Optimal Strategies to Steer and Control Water Waves. (arXiv:2304.00376v1 [math.OC])
- Optimal Transport Particle Filters. (arXiv:2304.00392v1 [math.OC])
- Speeding up Langevin Dynamics by Mixing. (arXiv:2303.18168v2 [math.PR] UPDATED)
- Fluctuation without dissipation: Microcanonical Langevin Monte Carlo. (arXiv:2303.18221v2 [hep-lat] UPDATED)
- Diffusion Schr\"odinger Bridge Matching. (arXiv:2303.16852v3 [stat.ML] UPDATED)
- Unified analysis of SGD-type methods. (arXiv:2303.16502v1 [math.OC])
- Diffusion Schr\"odinger Bridge Matching. (arXiv:2303.16852v3 [stat.ML] UPDATED)
- Speeding up backpropagation of gradients through the Kalman filter via closed-form expressions. (arXiv:2303.16846v2 [math.OC] UPDATED)
- Unified analysis of SGD-type methods. (arXiv:2303.16502v1 [math.OC])
- Uniform in time convergence of numerical schemes for stochastic differential equations via Strong Exponential stability: Euler methods, Split-Step and Tamed Schemes. (arXiv:2303.15463v1 [math.NA])
- Noisy dynamical systems evolve error correcting codes and modularity. (arXiv:2303.14448v1 [q-bio.PE])
- Particle Mean Field Variational Bayes. (arXiv:2303.13930v2 [stat.CO] UPDATED)
- How averaged is the composition of two linear projections?. (arXiv:2303.13738v1 [math.OC])
- Symmetries, flat minima, and the conserved quantities of gradient flow. (arXiv:2210.17216v2 [cs.LG] UPDATED)
- Stability is Stable: Connections between Replicability, Privacy, and Adaptive Generalization. (arXiv:2303.12921v2 [cs.LG] UPDATED)
- Non-asymptotic analysis of Langevin-type Monte Carlo algorithms. (arXiv:2303.12407v4 [math.ST] UPDATED)
- Seven open problems in applied combinatorics. (arXiv:2303.11464v1 [math.CO])
- Speech Modeling with a Hierarchical Transformer Dynamical VAE. (arXiv:2303.09404v2 [eess.AS] UPDATED)
- Variational Principles for Mirror Descent and Mirror Langevin Dynamics. (arXiv:2303.09532v1 [math.OC])
- Accelerated Gradient and Skew-Symmetric Splitting Methods for a Class of Monotone Operator Equations. (arXiv:2303.09009v1 [math.OC])
- High order spatial discretization for variational time implicit schemes: Wasserstein gradient flows and reaction-diffusion systems. (arXiv:2303.08950v1 [math.NA])
- Diffusion Models in NLP: A Survey. (arXiv:2303.07576v1 [cs.CL])
- General Loss Functions Lead to (Approximate) Interpolation in High Dimensions. (arXiv:2303.07475v1 [stat.ML])
- Generalised Scale-Space Properties for Probabilistic Diffusion Models. (arXiv:2303.07900v4 [eess.IV] UPDATED)
- Physics-driven machine learning models coupling PyTorch and Firedrake. (arXiv:2303.06871v3 [cs.LG] UPDATED)
- L\'evy Langevin Monte Carlo. (arXiv:2303.07743v1 [math.PR])
- Variational Gaussian filtering via Wasserstein gradient flows. (arXiv:2303.06398v2 [stat.CO] UPDATED)
- Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems. (arXiv:2303.05754v2 [cs.LG] UPDATED)
- Optimal foraging strategies can be learned. (arXiv:2303.06050v3 [cond-mat.stat-mech] UPDATED)
- Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE. (arXiv:2303.05323v2 [cs.CV] UPDATED)
- Unifying Layout Generation with a Decoupled Diffusion Model. (arXiv:2303.05049v1 [cs.CV])
- A Categorical Framework of General Intelligence. (arXiv:2303.04571v2 [cs.AI] UPDATED)
- Computing with Categories in Machine Learning. (arXiv:2303.04156v1 [cs.LG])
- Patched Diffusion Models for Unsupervised Anomaly Detection in Brain MRI. (arXiv:2303.03758v1 [eess.IV])
- Towards a Complete Analysis of Langevin Monte Carlo: Beyond Poincar\'e Inequality. (arXiv:2303.03589v2 [math.ST] UPDATED)
- Properties of Marginal Sequential Monte Carlo Methods. (arXiv:2303.03498v1 [stat.CO])
- Global convergence of the gradient method for functions definable in o-minimal structures. (arXiv:2303.03534v4 [math.OC] UPDATED)
- Extending the Wasserstein metric to positive measures. (arXiv:2303.02183v1 [math.MG])
- Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later. (arXiv:2303.02490v1 [cs.CV])
- Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. (arXiv:2303.02151v1 [cs.CV])
- Diffusion Models are Minimax Optimal Distribution Estimators. (arXiv:2303.01861v1 [stat.ML])
- A Complete Recipe for Diffusion Generative Models. (arXiv:2303.01748v2 [cs.LG] UPDATED)
- Bayesian Posterior Perturbation Analysis with Integral Probability Metrics. (arXiv:2303.01512v1 [stat.ML])
- Training neural networks with structured noise improves classification and generalization. (arXiv:2302.13417v4 [cond-mat.dis-nn] UPDATED)
- Consistency Models. (arXiv:2303.01469v2 [cs.LG] UPDATED)
- Continuous-Time Functional Diffusion Processes. (arXiv:2303.00800v3 [cs.LG] UPDATED)
- Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation. (arXiv:2303.00848v7 [cs.LG] UPDATED)
- Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation. (arXiv:2303.00848v7 [cs.LG] UPDATED)
- On the existence of minimizers in shallow residual ReLU neural network optimization landscapes. (arXiv:2302.14690v1 [math.OC])
- The Inverse of Exact Renormalization Group Flows as Statistical Inference. (arXiv:2212.11379v1 [hep-th] CROSS LISTED)
- Stochastic Gradient Descent under Markovian Sampling Schemes. (arXiv:2302.14428v3 [math.OC] UPDATED)
- Injectivity of ReLU networks: perspectives from statistical physics. (arXiv:2302.14112v1 [cond-mat.dis-nn])
- Denoising Diffusion Samplers. (arXiv:2302.13834v2 [cs.LG] UPDATED)
- Compositional Law Parsing with Latent Random Functions. (arXiv:2209.09115v2 [cs.CV] UPDATED)
- Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient. (arXiv:2302.13144v2 [math.OC] UPDATED)
- A Counterexample to the L\'evy Flight Foraging Hypothesis in the Narrow Capture Framework. (arXiv:2302.13976v2 [cond-mat.stat-mech] UPDATED)
- Direct Estimation of Parameters in ODE Models Using WENDy: Weak-form Estimation of Nonlinear Dynamics. (arXiv:2302.13271v1 [cs.LG])
- Asymptotic convergence of iterative optimization algorithms. (arXiv:2302.12544v1 [stat.ML])
- Graph signal processing with categorical perspective. (arXiv:2302.12421v1 [eess.SP])
- Efficiently handling constraints with Metropolis-adjusted Langevin algorithm. (arXiv:2302.11971v2 [stat.CO] UPDATED)
- An Explicit Expansion of the Kullback-Leibler Divergence along its Fisher-Rao Gradient Flow. (arXiv:2302.12229v1 [math.OC])
- Composer: Creative and Controllable Image Synthesis with Composable Conditions. (arXiv:2302.09778v2 [cs.CV] UPDATED)
- From Optimization to Sampling Through Gradient Flows. (arXiv:2302.11449v1 [stat.CO])
- Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC. (arXiv:2302.11552v4 [cs.LG] UPDATED)
- Gradient Flows for Sampling: Mean-Field Models, Gaussian Approximations and Affine Invariance. (arXiv:2302.11024v6 [stat.ML] UPDATED)
- Contraction and Convergence Rates for Discretized Kinetic Langevin Dynamics. (arXiv:2302.10684v4 [math.NA] UPDATED)
- On Robust Numerical Solver for ODE via Self-Attention Mechanism. (arXiv:2302.10184v1 [cs.LG])
- Variational Autoencoding Neural Operators. (arXiv:2302.10351v1 [cs.LG])
- Faster high-accuracy log-concave sampling via algorithmic warm starts. (arXiv:2302.10249v1 [math.ST])
- On Equivalent Optimization of Machine Learning Methods. (arXiv:2302.09160v1 [cs.LG])
- The Generalization Error of Stochastic Mirror Descent on Over-Parametrized Linear Models. (arXiv:2302.09433v1 [cs.LG])
- Parameter Averaging for SGD Stabilizes the Implicit Bias towards Flat Regions. (arXiv:2302.09376v1 [stat.ML])
- Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron. (arXiv:2302.10034v2 [cs.LG] UPDATED)
- Convergence rate in $\mathcal{L}^p$ sense of tamed EM scheme for highly nonlinear neutral multiple-delay stochastic McKean-Vlasov equations. (arXiv:2302.09724v1 [math.NA])
- SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance. (arXiv:2302.08783v2 [cs.LG] UPDATED)
- Deterministic Nonsmooth Nonconvex Optimization. (arXiv:2302.08300v1 [cs.LG])
- Improved Discretization Analysis for Underdamped Langevin Monte Carlo. (arXiv:2302.08049v1 [math.ST])
- Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions. (arXiv:2302.07261v2 [cs.LG] UPDATED)
- Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent. (arXiv:2302.07125v1 [math.PR])
- Universal Guidance for Diffusion Models. (arXiv:2302.07121v1 [cs.CV])
- A Modern Look at the Relationship between Sharpness and Generalization. (arXiv:2302.07011v2 [cs.LG] UPDATED)
- Score-based Diffusion Models in Function Space. (arXiv:2302.07400v2 [cs.LG] UPDATED)
- The Geometry of Neural Nets' Parameter Spaces Under Reparametrization. (arXiv:2302.07384v3 [cs.LG] UPDATED)
- Linearized Wasserstein dimensionality reduction with approximation guarantees. (arXiv:2302.07373v1 [cs.LG])
- Covariance-modulated optimal transport and gradient flows. (arXiv:2302.07773v1 [math.AP])
- A new unified framework for designing convex optimization methods with prescribed theoretical convergence estimates: A numerical analysis approach. (arXiv:2302.07404v1 [math.OC])
- Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design. (arXiv:2302.02913v4 [cs.LG] UPDATED)
- Vector Quantized Wasserstein Auto-Encoder. (arXiv:2302.05917v2 [cs.LG] UPDATED)
- From high-dimensional & mean-field dynamics to dimensionless ODEs: A unifying approach to SGD in two-layers networks. (arXiv:2302.05882v1 [stat.ML])
- Verifying Generalization in Deep Learning. (arXiv:2302.05745v2 [cs.LG] UPDATED)
- Automated tight Lyapunov analysis for first-order methods. (arXiv:2302.06713v1 [math.OC])
- Symbolic Discovery of Optimization Algorithms. (arXiv:2302.06675v4 [cs.LG] UPDATED)
- Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions. (arXiv:2302.07261v2 [cs.LG] UPDATED)
- Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data. (arXiv:2302.07194v1 [cs.LG])
- Breaking the Lower Bound with (Little) Structure: Acceleration in Non-Convex Stochastic Optimization with Heavy-Tailed Noise. (arXiv:2302.06763v2 [cs.LG] UPDATED)
- Transport map unadjusted Langevin algorithms: learning and discretizing perturbed samplers. (arXiv:2302.07227v3 [stat.ME] UPDATED)
- Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent. (arXiv:2302.07125v1 [math.PR])
- A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies. (arXiv:2302.06218v3 [cs.LG] UPDATED)
- Star-Shaped Denoising Diffusion Probabilistic Models. (arXiv:2302.05259v3 [stat.ML] UPDATED)
- The Monge Gap: A Regularizer to Learn All Transport Maps. (arXiv:2302.04953v1 [cs.LG])
- Heckerthoughts. (arXiv:2302.05449v5 [cs.AI] UPDATED)
- GFlowNet-EM for learning compositional latent variable models. (arXiv:2302.06576v2 [cs.LG] UPDATED)
- Achieving acceleration despite very noisy gradients. (arXiv:2302.05515v2 [stat.ML] UPDATED)
- Bregman-Wasserstein divergence: geometry and applications. (arXiv:2302.05833v1 [math.PR])
- DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets. (arXiv:2302.04178v4 [cs.LG] UPDATED)
- UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models. (arXiv:2302.04867v4 [cs.LG] UPDATED)
- On Sampling with Approximate Transport Maps. (arXiv:2302.04763v2 [stat.ML] UPDATED)
- Geometry of Score Based Generative Models. (arXiv:2302.04411v1 [cs.LG])
- On a continuous time model of gradient descent dynamics and instability in deep learning. (arXiv:2302.01952v3 [stat.ML] UPDATED)
- Wasserstein-$1$ distance between SDEs driven by Brownian motion and stable processes. (arXiv:2302.03372v1 [math.PR])
- Sharp Lower Bounds on Interpolation by Deep ReLU Neural Networks at Irregularly Spaced Data. (arXiv:2302.00834v1 [cs.LG])
- A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models. (arXiv:2302.01217v1 [stat.ML])
- High-Probability Bounds for Stochastic Optimization and Variational Inequalities: the Case of Unbounded Variance. (arXiv:2302.00999v2 [math.OC] UPDATED)
- Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent. (arXiv:2302.00849v1 [cs.LG])
- Stable Target Field for Reduced Variance Score Estimation in Diffusion Models. (arXiv:2302.00670v2 [cs.LG] UPDATED)
- Accelerated First-Order Optimization under Nonlinear Constraints. (arXiv:2302.00316v2 [math.OC] UPDATED)
- ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models. (arXiv:2301.12935v3 [cs.LG] UPDATED)
- On the Stability of General Bayesian Inference. (arXiv:2301.13701v1 [stat.ME])
- Unifying Generative Models with GFlowNets and Beyond. (arXiv:2209.02606v2 [cs.LG] UPDATED)
- Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning. (arXiv:2301.13703v2 [cs.LG] UPDATED)
- Continuous Spatiotemporal Transformers. (arXiv:2301.13338v2 [cs.LG] UPDATED)
- GFlowNets and variational inference. (arXiv:2210.00580v3 [cs.LG] UPDATED)
- Don't Play Favorites: Minority Guidance for Diffusion Models. (arXiv:2301.12334v1 [cs.LG])
- On the Lipschitz Constant of Deep Networks and Double Descent. (arXiv:2301.12309v4 [cs.LG] UPDATED)
- A theory of continuous generative flow networks. (arXiv:2301.12594v2 [cs.LG] UPDATED)
- Minimizing Trajectory Curvature of ODE-based Generative Models. (arXiv:2301.12003v3 [cs.LG] UPDATED)
- Accelerating Guided Diffusion Sampling with Splitting Numerical Methods. (arXiv:2301.11558v1 [cs.CV])
- Image Restoration with Mean-Reverting Stochastic Differential Equations. (arXiv:2301.11699v3 [cs.LG] UPDATED)
- A Denoising Diffusion Model for Fluid Field Prediction. (arXiv:2301.11661v2 [cs.LG] UPDATED)
- Simple diffusion: End-to-end diffusion for high resolution images. (arXiv:2301.11093v2 [cs.CV] UPDATED)
- On the Importance of Noise Scheduling for Diffusion Models. (arXiv:2301.10972v4 [cs.CV] UPDATED)
- Handbook of Convergence Theorems for (Stochastic) Gradient Methods. (arXiv:2301.11235v2 [math.OC] UPDATED)
- Simple diffusion: End-to-end diffusion for high resolution images. (arXiv:2301.11093v2 [cs.CV] UPDATED)
- On the Mathematics of Diffusion Models. (arXiv:2301.11108v3 [cs.LG] UPDATED)
- The Backpropagation algorithm for a math student. (arXiv:2301.09977v3 [cs.LG] UPDATED)
- A New Approach to Learning Linear Dynamical Systems. (arXiv:2301.09519v1 [math.OC])
- Deep Learning Meets Sparse Regularization: A Signal Processing Perspective. (arXiv:2301.09554v3 [stat.ML] UPDATED)
- On Investigating the Conservative Property of Score-Based Generative Models. (arXiv:2209.12753v3 [cs.LG] UPDATED)
- Latent Autoregressive Source Separation. (arXiv:2301.08562v1 [cs.LG])
- Discrete Variational Calculus for Accelerated Optimization
- A Nonstochastic Control Approach to Optimization. (arXiv:2301.07902v3 [cs.LG] UPDATED)
- Mathematical analysis of singularities in the diffusion model under the submanifold assumption. (arXiv:2301.07882v3 [cs.LG] UPDATED)
- On backpropagating Hessians through ODEs. (arXiv:2301.08085v1 [math.OC])
- Kinetic Langevin MCMC Sampling Without Gradient Lipschitz Continuity -- the Strongly Convex Case. (arXiv:2301.08039v1 [math.PR])
- Discrete Latent Structure in Neural Networks. (arXiv:2301.07473v1 [cs.LG])
- Image Embedding for Denoising Generative Models. (arXiv:2301.07485v1 [cs.CV])
- Transformers as Algorithms: Generalization and Stability in In-context Learning. (arXiv:2301.07067v2 [cs.LG] UPDATED)
- Word Embeddings as Statistical Estimators. (arXiv:2301.06710v1 [stat.ME])
- Geometric ergodicity of SGLD via reflection coupling. (arXiv:2301.06769v1 [math.PR])
- Computability of Optimizers. (arXiv:2301.06148v1 [math.OC])
- An Accelerated Lyapunov Function for Polyak's Heavy-Ball on Convex Quadratics. (arXiv:2301.05799v1 [math.OC])
- Min-Max Optimization Made Simple: Approximating the Proximal Point Method via Contraction Maps. (arXiv:2301.03931v2 [cs.GT] UPDATED)
- Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation. (arXiv:2301.03396v2 [cs.CV] UPDATED)
- Sharper Analysis for Minibatch Stochastic Proximal Point Methods: Stability, Smoothness, and Deviation. (arXiv:2301.03125v1 [stat.ML])
- Stochastic Langevin Monte Carlo for (weakly) log-concave posterior distributions. (arXiv:2301.03077v1 [stat.ML])
- Perron-Frobenius operator filter for stochastic dynamical systems. (arXiv:2301.03080v1 [math.NA])
- Training trajectories, mini-batch losses and the curious role of the learning rate. (arXiv:2301.02312v2 [cs.LG] UPDATED)
- Optimal Scaling Results for a Wide Class of Proximal MALA Algorithms. (arXiv:2301.02446v2 [stat.CO] UPDATED)
- Convergence rates of the stochastic alternating algorithm for bi-objective optimization. (arXiv:2203.10605v2 [math.OC] UPDATED)
- Restarts subject to approximate sharpness: A parameter-free and optimal scheme for first-order methods. (arXiv:2301.02268v1 [math.OC])
- Deep Learning and Computational Physics (Lecture Notes). (arXiv:2301.00942v1 [cs.LG])
- A Tutorial on Parametric Variational Inference. (arXiv:2301.01236v1 [stat.ML])
- A Survey of Feedback Particle Filter and related Controlled Interacting Particle Systems (CIPS). (arXiv:2301.00935v2 [eess.SY] UPDATED)
- Exploring Complex Dynamical Systems via Nonconvex Optimization. (arXiv:2301.00923v1 [cs.LG])
- Fast convex optimization via closed-loop time scaling of gradient dynamics. (arXiv:2301.00701v1 [math.OC])
- Posterior Collapse and Latent Variable Non-identifiability. (arXiv:2301.00537v1 [stat.ML])
- Pruning Before Training May Improve Generalization, Provably. (arXiv:2301.00335v3 [cs.LG] UPDATED)
- The Stable Artist: Steering Semantics in Diffusion Latent Space. (arXiv:2212.06013v3 [cs.CV] UPDATED)
- A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix Factorization. (arXiv:2212.14150v2 [cs.LG] UPDATED)
- Hungry Hungry Hippos: Towards Language Modeling with State Space Models. (arXiv:2212.14052v3 [cs.LG] UPDATED)
- Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?. (arXiv:2212.14511v1 [cs.LG])
- Gaussian Process Priors for Systems of Linear Partial Differential Equations with Constant Coefficients. (arXiv:2212.14319v4 [stat.ML] UPDATED)
Saved in 2022
- Exploring Vision Transformers as Diffusion Learners. (arXiv:2212.13771v1 [cs.CV])
- Continuous Depth Recurrent Neural Differential Equations. (arXiv:2212.13714v1 [cs.LG])
- Latent Discretization for Continuous-time Sequence Compression. (arXiv:2212.13659v1 [cs.LG])
- Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization. (arXiv:2212.13556v3 [cs.LG] UPDATED)
- Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods. (arXiv:2212.13468v1 [cs.LG] CROSS LISTED)
- The Forward-Forward Algorithm: Some Preliminary Investigations. (arXiv:2212.13345v1 [cs.LG])
- Convergence to good non-optimal critical points in the training of neural networks: Gradient descent optimization with one random initialization overcomes all bad non-global local minima with high probability. (arXiv:2212.13111v1 [math.OC])
- Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models. (arXiv:2212.12990v3 [cs.CV] UPDATED)
- Homophily modulates double descent generalization in graph convolution networks. (arXiv:2212.13069v3 [cs.LG] UPDATED)
- Deep Latent State Space Models for Time-Series Generation. (arXiv:2212.12749v3 [stat.ML] UPDATED)
- Concentration of the Langevin Algorithm's Stationary Distribution. (arXiv:2212.12629v1 [stat.ML])
- Your diffusion model secretly knows the dimension of the data manifold. (arXiv:2212.12611v5 [cs.LG] UPDATED)
- The Mean Field Ensemble Kalman Filter: Near-Gaussian Setting. (arXiv:2212.13239v5 [math.OC] UPDATED)
- Statistical Efficiency of Score Matching: The View from Isoperimetry. (arXiv:2210.00726v2 [cs.LG] UPDATED)
- Physics-Informed Gaussian Process Regression Generalizes Linear PDE Solvers. (arXiv:2212.12474v5 [cs.LG] UPDATED)
- Langevin algorithms for Markovian Neural Networks and Deep Stochastic control. (arXiv:2212.12018v2 [q-fin.CP] UPDATED)
- A Mathematical Framework for Learning Probability Distributions. (arXiv:2212.11481v2 [stat.ML] UPDATED)
- Stochastic differential variational inequalities with applications. (arXiv:2212.08366v1 [math.OC])
- Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics. (arXiv:2212.08989v3 [cs.LG] UPDATED)
- The Underlying Correlated Dynamics in Neural Training. (arXiv:2212.09040v1 [cs.LG])
- The Calder\'on's problem via DeepONets. (arXiv:2212.08941v2 [math.AP] UPDATED)
- Efficient Long Sequence Modeling via State Space Augmented Transformer. (arXiv:2212.08136v1 [cs.CL])
- Can We Find Strong Lottery Tickets in Generative Models?. (arXiv:2212.08311v1 [cs.CV])
- Learning threshold neurons via the "edge of stability". (arXiv:2212.07469v2 [cs.LG] UPDATED)
- Diffusion Probabilistic Models beat GANs on Medical Images. (arXiv:2212.07501v1 [eess.IV])
- Unadjusted Hamiltonian MCMC with Stratified Monte Carlo Time Integration. (arXiv:2211.11003v2 [math.PR] UPDATED)
- RT-1: Robotics Transformer for Real-World Control at Scale. (arXiv:2212.06817v2 [cs.RO] UPDATED)
- Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance. (arXiv:2212.06359v1 [cs.LG])
- On the Relationship Between Explanation and Prediction: A Causal View. (arXiv:2212.06925v4 [cs.LG] UPDATED)
- Neural Continuous-Time Markov Models. (arXiv:2212.05378v1 [stat.ML])
- Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance. (arXiv:2212.06359v1 [cs.LG])
- Revisiting the acceleration phenomenon via high-resolution differential equations. (arXiv:2212.05700v1 [math.OC])
- Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation. (arXiv:2212.05159v3 [cs.LG] UPDATED)
- evosax: JAX-based Evolution Strategies. (arXiv:2212.04180v1 [cs.NE])
- Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models. (arXiv:2212.03860v3 [cs.LG] UPDATED)
- Reconstructing Training Data from Model Gradient, Provably. (arXiv:2212.03714v3 [cs.LG] UPDATED)
- Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation. (arXiv:2212.02684v1 [cs.SE])
- Continuous diffusion for categorical data. (arXiv:2211.15089v3 [cs.CL] UPDATED)
- An algorithmic guide for finite-dimensional optimal control problems. (arXiv:2212.03157v1 [math.OC])
- Learning to Optimize in Model Predictive Control. (arXiv:2212.02603v1 [cs.RO])
- Uniform-in-time propagation of chaos for mean field Langevin dynamics. (arXiv:2212.03050v3 [math.PR] UPDATED)
- On the Overlooked Structure of Stochastic Gradients. (arXiv:2212.02083v3 [cs.LG] UPDATED)
- Covertly Controlling a Linear System. (arXiv:2212.01052v1 [cs.IT])
- DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models. (arXiv:2211.15029v2 [cs.CL] CROSS LISTED)
- Penalized Langevin and Hamiltonian Monte Carlo Algorithms for Constrained Sampling. (arXiv:2212.00570v1 [stat.ML])
- Are you using test log-likelihood correctly?. (arXiv:2212.00219v4 [stat.ML] UPDATED)
- Mechanism Design Theory in Control Engineering: A Tutorial and Overview of Applications in Communication, Power Grid, Transportation, and Security Systems. (arXiv:2212.00756v1 [eess.SY])
- On the Power of Foundation Models. (arXiv:2211.16327v4 [cs.AI] UPDATED)
- Iterated Function Systems: A Comprehensive Survey. (arXiv:2211.14661v1 [math.PR])
- Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data
- Intrinsic Dimension Estimation Using Wasserstein Distance
- Convexifying Transformers: Improving optimization and understanding of transformer networks. (arXiv:2211.11052v1 [cs.LG] CROSS LISTED)
- Unadjusted Hamiltonian MCMC with Stratified Monte Carlo Time Integration. (arXiv:2211.11003v2 [math.PR] UPDATED)
- Being Bayesian in the 2020s: opportunities and challenges in the practice of modern applied Bayesian statistics. (arXiv:2211.10029v2 [stat.AP] UPDATED)
- Convergence of Adapted Empirical Measures on $\mathbb{R}^{d}$. (arXiv:2211.10162v2 [math.PR] UPDATED)
- Why Deep Learning Generalizes. (arXiv:2211.09639v2 [cs.LG] UPDATED)
- Neural Langevin Dynamics: towards interpretable Neural Stochastic Differential Equations. (arXiv:2211.09537v1 [cs.LG])
- Introduction to Online Nonstochastic Control. (arXiv:2211.09619v2 [cs.LG] UPDATED)
- Graph Filters for Signal Processing and Machine Learning on Graphs. (arXiv:2211.08854v1 [eess.SP])
- Galactica: A Large Language Model for Science. (arXiv:2211.09085v1 [cs.CL])
- Explicit convergence bounds for Metropolis Markov chains: isoperimetry, spectral gaps and profiles. (arXiv:2211.08959v2 [math.PR] UPDATED)
- Explicit convergence bounds for Metropolis Markov chains: isoperimetry, spectral gaps and profiles. (arXiv:2211.08959v2 [math.PR] UPDATED)
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model. (arXiv:2211.08332v4 [cs.CV] UPDATED)
- Disentangling Variational Autoencoders. (arXiv:2211.07700v1 [cs.LG])
- FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation. (arXiv:2210.04296v4 [cs.LG] UPDATED)
- Follow the flow: Proximal flow inspired multi-step methods. (arXiv:2211.04653v2 [math.NA] UPDATED)
- On the Algorithmic Stability and Generalization of Adaptive Optimization Methods. (arXiv:2211.03970v1 [cs.LG])
- From Denoising Diffusions to Denoising Markov Models. (arXiv:2211.03595v2 [stat.ML] UPDATED)
- Convergence of the Inexact Langevin Algorithm and Score-based Generative Models in KL Divergence. (arXiv:2211.01512v2 [cs.LG] UPDATED)
- DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models. (arXiv:2211.01095v2 [cs.LG] UPDATED)
- On the detection of synthetic images generated by diffusion models. (arXiv:2211.00680v1 [cs.CV])
- An optimal control perspective on diffusion-based generative modeling. (arXiv:2211.01364v2 [cs.LG] UPDATED)
- A Close Look into the Calibration of Pre-trained Language Models. (arXiv:2211.00151v3 [cs.CL] UPDATED)
- Recurrent Neural Networks and Universal Approximation of Bayesian Filters. (arXiv:2211.00335v2 [stat.ML] UPDATED)
- High order splitting methods for SDEs satisfying a commutativity condition. (arXiv:2210.17543v5 [math.NA] UPDATED)
- Layer-wise Shared Attention Network on Dynamical System Perspective. (arXiv:2210.16101v1 [cs.CV])
- Contrastive Decoding: Open-ended Text Generation as Optimization. (arXiv:2210.15097v2 [cs.CL] UPDATED)
- Towards Language-driven Scientific AI. (arXiv:2210.15327v2 [cs.CL] UPDATED)
- Embrace the Gap: VAEs Perform Independent Mechanism Analysis. (arXiv:2206.02416v3 [stat.ML] UPDATED)
- Fast Rates for Noisy Interpolation Require Rethinking the Effects of Inductive Bias. (arXiv:2203.03597v2 [stat.ML] UPDATED)
- What Language Model to Train if You Have One Million GPU Hours?. (arXiv:2210.15424v2 [cs.CL] UPDATED)
- Opening the Black Box of wav2vec Feature Encoder. (arXiv:2210.15386v1 [cs.SD])
- Can language models handle recursively nested grammatical structures? A case study on comparing models and humans. (arXiv:2210.15303v3 [cs.CL] UPDATED)
- Scaling Laws Beyond Backpropagation. (arXiv:2210.14593v1 [cs.LG])
- Parameter-free Regret in High Probability with Heavy Tails. (arXiv:2210.14355v2 [stat.ML] UPDATED)
- DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models. (arXiv:2210.08933v3 [cs.CL] UPDATED)
- Categorical SDEs with Simplex Diffusion. (arXiv:2210.14784v1 [cs.LG])
- A single-cell gene expression language model. (arXiv:2210.14330v1 [q-bio.QM])
- JAX-DIPS: Neural bootstrapping of finite discretization methods and application to elliptic problems with discontinuities. (arXiv:2210.14312v3 [math.NA] UPDATED)
- From Points to Functions: Infinite-dimensional Representations in Diffusion Models. (arXiv:2210.13774v1 [cs.LG])
- A Dynamical System View of Langevin-Based Non-Convex Sampling. (arXiv:2210.13867v2 [cs.LG] UPDATED)
- A Control Theoretic Approach to Infrastructure-Centric Blockchain Tokenomics. (arXiv:2210.12881v1 [eess.SY])
- Representation Learning with Diffusion Models. (arXiv:2210.11058v1 [cs.CV])
- On Representations of Mean-Field Variational Inference. (arXiv:2210.11385v1 [stat.ML])
- Diffusion Models already have a Semantic Latent Space. (arXiv:2210.10960v2 [cs.CV] UPDATED)
- On Representations of Mean-Field Variational Inference. (arXiv:2210.11385v1 [stat.ML])
- Neural ODEs as Feedback Policies for Nonlinear Optimal Control. (arXiv:2210.11245v2 [math.OC] UPDATED)
- Optimisation & Generalisation in Networks of Neurons. (arXiv:2210.10101v1 [cs.NE])
- DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models. (arXiv:2210.10606v1 [cs.CL])
- Foundation Transformers. (arXiv:2210.06423v2 [cs.LG] UPDATED)
- Differentially Private Diffusion Models. (arXiv:2210.09929v3 [stat.ML] UPDATED)
- Automatic Differentiation of Programs with Discrete Randomness. (arXiv:2210.08572v3 [cs.LG] UPDATED)
- Gradient Descent: The Ultimate Optimizer. (arXiv:1909.13371v2 [cs.LG] UPDATED)
- A Continuous Time Framework for Discrete Denoising Models. (arXiv:2205.14987v2 [stat.ML] UPDATED)
- From Gradient Flow on Population Loss to Learning with Stochastic Gradient Descent. (arXiv:2210.06705v1 [cs.LG])
- Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics. (arXiv:2210.06226v2 [stat.ML] UPDATED)
- Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance. (arXiv:2210.05559v2 [cs.CV] UPDATED)
- GENIE: Higher-Order Denoising Diffusion Solvers. (arXiv:2210.05475v1 [stat.ML])
- How Large Language Models are Transforming Machine-Paraphrased Plagiarism. (arXiv:2210.03568v3 [cs.CL] UPDATED)
- Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion Models. (arXiv:2210.04872v2 [stat.ML] UPDATED)
- Rieoptax: Riemannian Optimization in JAX. (arXiv:2210.04840v1 [math.OC])
- Certified machine learning: Rigorous a posteriori error bounds for PDE defined PINNs. (arXiv:2210.03426v1 [cs.LG])
- On Distillation of Guided Diffusion Models. (arXiv:2210.03142v3 [cs.CV] UPDATED)
- Efficient Sequence Packing without Cross-contamination: Accelerating Large Language Models without Impacting Performance. (arXiv:2107.02027v2 [cs.CL] UPDATED)
- Language Models are Multilingual Chain-of-Thought Reasoners. (arXiv:2210.03057v1 [cs.CL])
- Flow Matching for Generative Modeling. (arXiv:2210.02747v2 [cs.LG] UPDATED)
- Fisher information lower bounds for sampling. (arXiv:2210.02482v1 [stat.ML])
- Soft Diffusion: Score Matching for General Corruptions. (arXiv:2209.05442v2 [cs.CV] UPDATED)
- Non-Convergence and Limit Cycles in the Adam optimizer. (arXiv:2210.02070v1 [cs.LG])
- Gradient Descent in the Absence of Global Lipschitz Continuity of the Gradients. (arXiv:2210.02418v2 [math.OC] UPDATED)
- Alternating Differentiation for Optimization Layers. (arXiv:2210.01802v2 [cs.LG] UPDATED)
- DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking. (arXiv:2210.01776v2 [q-bio.BM] UPDATED)
- Analysis of Gradient Descent with Varying Step Sizes using Integral Quadratic Constraints. (arXiv:2210.00644v3 [math.OC] UPDATED)
- Rethinking skip connection model as a learnable Markov chain. (arXiv:2209.15278v3 [cs.LG] UPDATED)
- Ensemble-based gradient inference for particle methods in optimization and sampling. (arXiv:2209.15420v2 [stat.ML] UPDATED)
- Bounding the Error of Discretized Langevin Algorithms for Non-Strongly Log-Concave Targets
- On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems
- Neural Networks Efficiently Learn Low-Dimensional Representations with SGD. (arXiv:2209.14863v2 [stat.ML] UPDATED)
- On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration. (arXiv:2209.14827v4 [cs.LG] UPDATED)
- Analyzing Diffusion as Serial Reproduction. (arXiv:2209.14821v1 [cs.LG])
- Neural Networks Efficiently Learn Low-Dimensional Representations with SGD. (arXiv:2209.14863v2 [stat.ML] UPDATED)
- Creative Painting with Latent Diffusion Models. (arXiv:2209.14697v2 [cs.CV] UPDATED)
- Denoising MCMC for Accelerating Diffusion-Based Generative Models. (arXiv:2209.14593v1 [cs.LG])
- Transformer Meets Boundary Value Inverse Problems. (arXiv:2209.14977v4 [cs.LG] UPDATED)
- Spectral Diffusion Processes. (arXiv:2209.14125v2 [stat.ML] UPDATED)
- Denoising Diffusion Error Correction Codes. (arXiv:2209.13533v1 [cs.IT])
- Deep Generative Multimedia Children's Literature. (arXiv:2209.13129v4 [cs.AI] UPDATED)
- Liquid Structural State-Space Models. (arXiv:2209.12951v1 [cs.LG])
- Tighter Variational Bounds are Not Necessarily Better. A Research Report on Implementation, Ablation Study, and Extensions. (arXiv:2209.11875v1 [stat.ML])
- Stochastic Gradient Descent Captures How Children Learn About Physics. (arXiv:2209.12344v1 [cs.LG])
- Convergence of score-based generative modeling for general data distributions. (arXiv:2209.12381v2 [cs.LG] UPDATED)
- All are Worth Words: A ViT Backbone for Diffusion Models. (arXiv:2209.12152v4 [cs.CV] UPDATED)
- On the Complexity of Deterministic Nonsmooth and Nonconvex Optimization. (arXiv:2209.12463v2 [math.OC] UPDATED)
- First-order Conditions for Optimization in the Wasserstein Space. (arXiv:2209.12197v1 [math.OC])
- Error analysis based on inverse modified differential equations for discovery of dynamics using linear multistep methods and deep learning. (arXiv:2209.12123v2 [math.NA] UPDATED)
- Differentiable physics-enabled closure modeling for Burgers' turbulence. (arXiv:2209.11614v1 [physics.flu-dyn])
- A new perspective on parameter study of optimization problems. (arXiv:2209.11580v1 [math.OC])
- Ensemble Kalman Methods: A Mean Field Perspective. (arXiv:2209.11371v1 [math.OC])
- Implementing and Experimenting with Diffusion Models for Text-to-Image Generation. (arXiv:2209.10948v1 [cs.CV])
- Vanilla feedforward neural networks as a discretization of dynamic systems. (arXiv:2209.10909v1 [cs.LG])
- Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions. (arXiv:2209.11215v3 [cs.LG] UPDATED)
- A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases. (arXiv:2209.11208v1 [cs.LG])
- Amortized Variational Inference: A Systematic Review. (arXiv:2209.10888v2 [cs.LG] UPDATED)
- Gaussian Process Hydrodynamics. (arXiv:2209.10707v3 [physics.flu-dyn] UPDATED)
- Solving Fredholm Integral Equations of the First Kind via Wasserstein Gradient Flows. (arXiv:2209.09936v2 [math.OC] UPDATED)
- Unbiased time-average estimators for Markov chains. (arXiv:2209.09581v1 [math.ST])
- Physics-Informed Machine Learning of Dynamical Systems for Efficient Bayesian Inference. (arXiv:2209.09349v1 [stat.ML])
- Deep Linear Networks can Benignly Overfit when Shallow Ones Do. (arXiv:2209.09315v2 [cs.LG] UPDATED)
- On the Theoretical Properties of Noise Correlation in Stochastic Optimization. (arXiv:2209.09162v1 [math.OC])
- Gradient Norm Minimization of Nesterov Acceleration: $o(1/k^3)$. (arXiv:2209.08862v1 [math.OC])
- A Geometric Perspective on Variational Autoencoders. (arXiv:2209.07370v2 [stat.ML] UPDATED)
- Langevin Autoencoders for Learning Deep Latent Variable Models. (arXiv:2209.07036v2 [cs.LG] UPDATED)
- Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models. (arXiv:2209.06970v2 [cs.CV] UPDATED)
- Flexible Diffusion Modeling of Long Videos. (arXiv:2205.11495v3 [cs.CV] UPDATED)
- Lossy Image Compression with Conditional Diffusion Models. (arXiv:2209.06950v8 [eess.IV] UPDATED)
- MDM: Molecular Diffusion Model for 3D Molecule Generation. (arXiv:2209.05710v1 [cs.LG])
- Blurring Diffusion Models. (arXiv:2209.05557v2 [cs.LG] UPDATED)
- Deep Relaxation of Controlled Stochastic Gradient Descent via Singular Perturbations. (arXiv:2209.05564v2 [math.OC] UPDATED)
- Gradient-Free Methods for Deterministic and Stochastic Nonsmooth Nonconvex Optimization. (arXiv:2209.05045v3 [math.OC] UPDATED)
- Diffusion Models in Vision: A Survey. (arXiv:2209.04747v5 [cs.CV] UPDATED)
- Statistical Learning Theory for Control: A Finite Sample Perspective. (arXiv:2209.05423v2 [eess.SY] UPDATED)
- Open-loop contraction design. (arXiv:2209.04440v1 [eess.SY])
- Most probable flows for Kunita SDEs. (arXiv:2209.03868v2 [math.PR] UPDATED)
- First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data. (arXiv:2209.01170v2 [cs.CV] UPDATED)
- Diffusion-based Molecule Generation with Informative Prior Bridges. (arXiv:2209.00865v1 [cs.LG])
- Diffusion Models: A Comprehensive Survey of Methods and Applications. (arXiv:2209.00796v11 [cs.LG] UPDATED)
- Convergence of the empirical measure in expected Wasserstein distance: non asymptotic explicit bounds in $\mathbb{R}^d$. (arXiv:2209.00923v2 [math.PR] UPDATED)
- Transformers are Sample-Efficient World Models. (arXiv:2209.00588v2 [cs.LG] UPDATED)
- The Geometry and Calculus of Losses. (arXiv:2209.00238v2 [cs.LG] UPDATED)
- Continuous-time Particle Filtering for Latent Stochastic Differential Equations. (arXiv:2209.00173v1 [cs.LG])
- A Comprehensive Review of Digital Twin -- Part 1: Modeling and Twinning Enabling Technologies. (arXiv:2208.14197v2 [cs.CE] UPDATED)
- Solving parametric partial differential equations with deep rectified quadratic unit neural networks. (arXiv:2203.06973v2 [math.NA] UPDATED)
- Data-Driven Influence Functions for Optimization-Based Causal Inference. (arXiv:2208.13701v4 [stat.ME] UPDATED)
- On the Implicit Bias in Deep-Learning Algorithms. (arXiv:2208.12591v3 [cs.LG] UPDATED)
- Understanding Diffusion Models: A Unified Perspective. (arXiv:2208.11970v1 [cs.LG])
- NeuralUQ: A comprehensive library for uncertainty quantification in neural differential equations and operators. (arXiv:2208.11866v1 [cs.LG])
- Adam Can Converge Without Any Modification On Update Rules. (arXiv:2208.09632v5 [cs.LG] UPDATED)
- Provable Adaptivity in Adam. (arXiv:2208.09900v1 [cs.LG])
- Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models. (arXiv:2208.09399v3 [cs.LG] UPDATED)
- Monte Carlo is a good sampling strategy for polynomial approximation in high dimensions. (arXiv:2208.09045v3 [math.NA] UPDATED)
- Riemannian Diffusion Models. (arXiv:2208.07949v1 [cs.LG])
- On the generalization of learning algorithms that do not converge. (arXiv:2208.07951v2 [cs.LG] UPDATED)
- Langevin Diffusion Variational Inference. (arXiv:2208.07743v2 [cs.LG] UPDATED)
- Score-Based Diffusion meets Annealed Importance Sampling. (arXiv:2208.07698v3 [stat.ML] UPDATED)
- Natural differentiable structures on statistical models and the Fisher metric. (arXiv:2208.06539v4 [math.DG] UPDATED)
- Inverse Extended Kalman Filter -- Part II: Highly Non-Linear and Uncertain Systems. (arXiv:2208.06683v2 [math.OC] UPDATED)
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models. (arXiv:2208.06677v4 [cs.LG] UPDATED)
- Duality for Nonlinear Filtering II: Optimal Control. (arXiv:2208.06587v1 [math.OC])
- Generative Models: An Interdisciplinary Perspective. (arXiv:2208.06011v1 [stat.ME])
- Incorporating History and Deviations in Forward--Backward Splitting. (arXiv:2208.05498v2 [math.OC] UPDATED)
- Foundations of Monte Carlo methods and stochastic simulations -- From Monte Carlo Lebesgue integration to weak approximation of SDEs. (arXiv:2208.05531v2 [math.NA] UPDATED)
- Wavelet Score-Based Generative Modeling. (arXiv:2208.05003v1 [cs.LG])
- Convergence of denoising diffusion models under the manifold hypothesis. (arXiv:2208.05314v2 [stat.ML] UPDATED)
- Poincar\'e inequalities for Markov chains: a meeting with Cheeger, Lyapunov and Metropolis. (arXiv:2208.05239v1 [math.PR])
- Simplified State Space Layers for Sequence Modeling. (arXiv:2208.04933v3 [cs.LG] UPDATED)
- Computable Contracts in the Financial Services Industry. (arXiv:2208.04685v1 [cs.CY])
- Sampling algorithms in statistical physics: a guide for statistics and machine learning. (arXiv:2208.04751v2 [stat.CO] UPDATED)
- Unbiased Estimation of the Vanilla and Deterministic Ensemble Kalman-Bucy Filters. (arXiv:2208.03947v1 [stat.ME])
- Non-Asymptotic Analysis of Ensemble Kalman Updates: Effective Dimension and Localization. (arXiv:2208.03246v3 [stat.ML] UPDATED)
- Fixed-Point Automatic Differentiation of Forward--Backward Splitting Algorithms for Partly Smooth Functions. (arXiv:2208.03107v1 [math.OC])
- Transformers as Meta-Learners for Implicit Neural Representations. (arXiv:2208.02801v2 [cs.LG] UPDATED)
- Smoothness of the density for McKean-Vlasov SDEs with measurable kernel. (arXiv:2208.02771v3 [math.PR] UPDATED)
- Differentiable Predictive Control with Safety Guarantees: A Control Barrier Function Approach. (arXiv:2208.02319v1 [eess.SY])
- Representer Theorem for Learning Koopman Operators. (arXiv:2208.01681v1 [eess.SY])
- Physics-informed Deep Super-resolution for Spatiotemporal Data. (arXiv:2208.01462v1 [cs.LG])
- Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs. (arXiv:2208.01565v1 [cs.LG])
- A Python-based Mixed Discrete-Continuous Simulation Framework for Digital Twins. (arXiv:2208.01408v1 [eess.SY])
- On the Power-Law Hessian Spectrums in Deep Learning. (arXiv:2201.13011v2 [cs.LG] UPDATED)
- Markov Chain Score Ascent: A Unifying Framework of Variational Inference with Markovian Gradients. (arXiv:2206.06295v4 [cs.LG] UPDATED)
- Encoder-Decoder Architecture for 3D Seismic Inversion. (arXiv:2207.14789v1 [physics.geo-ph])
- Language Models Can Teach Themselves to Program Better. (arXiv:2207.14502v4 [cs.LG] UPDATED)
- Adaptive Gradient Methods at the Edge of Stability. (arXiv:2207.14484v1 [cs.LG])
- Physics-Informed Neural Networks for Shell Structures. (arXiv:2207.14291v1 [cs.CE])
- Tangential Wasserstein Projections. (arXiv:2207.14727v2 [stat.ML] UPDATED)
- Bayesian quadrature for $H^1(\mu)$ with Poincar\'e inequality on a compact interval. (arXiv:2207.14564v1 [math.ST])
- Sharp High-dimensional Central Limit Theorems for Log-concave Distributions. (arXiv:2207.14536v4 [math.PR] UPDATED)
- Semi-supervised Learning of Partial Differential Operators and Dynamical Flows. (arXiv:2207.14366v1 [cs.LG])
- Generative Modelling With Inverse Heat Dissipation. (arXiv:2206.13397v7 [cs.CV] UPDATED)
- Statistics for stochastic differential equations and approximations of resolvent. (arXiv:2207.13831v2 [math.NA] UPDATED)
- Sliced Wasserstein Variational Inference. (arXiv:2207.13177v1 [stat.ML])
- UltimateKalman: Flexible Kalman Filtering and Smoothing Using Orthogonal Transformations. (arXiv:2207.13526v1 [math.NA])
- Thermodynamics of learning physical phenomena. (arXiv:2207.12749v3 [cs.LG] UPDATED)
- Analyzing Sharpness along GD Trajectory: Progressive Sharpening and Edge of Stability. (arXiv:2207.12678v2 [cs.LG] UPDATED)
- Multiscale Neural Operator: Learning Fast and Grid-independent PDE Solvers. (arXiv:2207.11417v1 [cs.LG])
- Tuning Stochastic Gradient Algorithms for Statistical Inference via Large-Sample Asymptotics. (arXiv:2207.12395v3 [stat.CO] UPDATED)
- Revisiting the central limit theorems for the SGD-type methods. (arXiv:2207.11755v3 [math.OC] UPDATED)
- Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution. (arXiv:2207.11152v1 [q-fin.TR])
- Transformer with Implicit Edges for Particle-based Physics Simulation. (arXiv:2207.10860v1 [cs.LG])
- Language Model Cascades. (arXiv:2207.10342v2 [cs.CL] UPDATED)
- The Unscented Transform Controller: a new model predictive control law for highly nonlinear systems. (arXiv:2207.10496v1 [eess.SY])
- Formal Algorithms for Transformers. (arXiv:2207.09238v1 [cs.LG])
- Data-driven initialization of deep learning solvers for Hamilton-Jacobi-Bellman PDEs. (arXiv:2207.09299v1 [math.OC])
- A sharp uniform-in-time error estimate for Stochastic Gradient Langevin Dynamics. (arXiv:2207.09304v2 [math.PR] UPDATED)
- Mean-field Variational Inference via Wasserstein Gradient Flow. (arXiv:2207.08074v2 [math.ST] UPDATED)
- The Importance Markov Chain. (arXiv:2207.08271v2 [stat.CO] UPDATED)
- Duality for nonlinear filtering. (arXiv:2207.07709v1 [math.OC])
- Blessing of Nonconvexity in Deep Linear Models: Depth Flattens the Optimization Landscape Around the True Solution. (arXiv:2207.07612v1 [cs.LG])
- Equivalent Conditions for Weak Continuity of Nonlinear Filters. (arXiv:2207.07544v2 [math.OC] UPDATED)
- Comparing the latent space of generative models. (arXiv:2207.06812v1 [cs.LG])
- Continuous-time Analysis for Variational Inequalities: An Overview and Desiderata. (arXiv:2207.07105v1 [stat.ML])
- Fourier Neural Operator with Learned Deformations for PDEs on General Geometries. (arXiv:2207.05209v1 [cs.LG])
- Differentiable Physics Simulations with Contacts: Do They Have Correct Gradients w.r.t. Position, Velocity and Control?. (arXiv:2207.05060v1 [cs.LG])
- AdamNODEs: When Neural ODE Meets Adaptive Moment Estimation. (arXiv:2207.06066v1 [cs.LG])
- Iterative Linear Quadratic Optimization for Nonlinear Control: Differentiable Programming Algorithmic Templates. (arXiv:2207.06362v1 [math.OC])
- Physics Informed Symbolic Networks. (arXiv:2207.06240v2 [cs.LG] UPDATED)
- Physics-Informed Deep Neural Operator Networks. (arXiv:2207.05748v2 [cs.LG] UPDATED)
- Normalized gradient flow optimization in the training of ReLU artificial neural networks. (arXiv:2207.06246v1 [math.OC])
- Automatic Differentiation: Theory and Practice. (arXiv:2207.06114v1 [cs.LG])
- Conservative SPDEs as fluctuating mean field limits of stochastic gradient descent. (arXiv:2207.05705v2 [math.PR] UPDATED)
- Safe Drone Flight with Time-Varying Backup Controllers. (arXiv:2207.05220v1 [eess.SY])
- Inverse medium scattering problems with Kalman filter techniques. (arXiv:2207.05398v1 [math.AP])
- Fourier Neural Operator with Learned Deformations for PDEs on General Geometries. (arXiv:2207.05209v1 [cs.LG])
- A Forward Propagation Algorithm for Online Optimization of Nonlinear Stochastic Differential Equations. (arXiv:2207.04496v1 [math.PR])
- Last-Iterate Convergence of Saddle-Point Optimizers via High-Resolution Differential Equations. (arXiv:2112.13826v3 [math.OC] UPDATED)
- Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent. (arXiv:2207.04036v1 [cs.LG])
- Transport of currents and geometric Rademacher-type theorems. (arXiv:2207.03922v1 [math.AP])
- The Positive Effects of Stochastic Rounding in Numerical Algorithms. (arXiv:2207.03837v1 [math.NA])
- The effect of smooth parametrizations on nonconvex optimization landscapes. (arXiv:2207.03512v4 [math.OC] UPDATED)
- SC2EGSet: StarCraft II Esport Replay and Game-state Dataset. (arXiv:2207.03428v2 [cs.LG] UPDATED)
- Riemannian Diffusion Schr\"odinger Bridge. (arXiv:2207.03024v1 [stat.ML])
- Verifying the Union of Manifolds Hypothesis for Image Data. (arXiv:2207.02862v3 [stat.ML] UPDATED)
- BFE and AdaBFE: A New Approach in Learning Rate Automation for Stochastic Optimization. (arXiv:2207.02763v1 [cs.LG])
- The alignment property of SGD noise and how it helps select flat minima: A stability analysis. (arXiv:2207.02628v3 [stat.ML] UPDATED)
- Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling. (arXiv:2207.02338v3 [cs.LG] UPDATED)
- A Tutorial on the Spectral Theory of Markov Chains. (arXiv:2207.02296v2 [cs.LG] UPDATED)
- Exponential integrators for non-linear diffusion. (arXiv:2207.02439v1 [math.NA])
- Non-asymptotic convergence bounds for modified tamed unadjusted Langevin algorithm in non-convex setting. (arXiv:2207.02600v1 [math.PR])
- An SDE perspective on stochastic convex optimization. (arXiv:2207.02750v1 [math.OC])
- Hierarchical modeling for an industrial implementation of a Digital Twin for electrical drives. (arXiv:2207.02171v3 [eess.SY] UPDATED)
- Improved Global Guarantees for the Nonconvex Burer--Monteiro Factorization via Rank Overparameterization. (arXiv:2207.01789v1 [math.OC])
- CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning. (arXiv:2207.01780v3 [cs.LG] UPDATED)
- An optimal transport based characterization of convex order. (arXiv:2207.01235v3 [math.PR] UPDATED)
- A Stochastic Contraction Mapping Theorem. (arXiv:2207.00618v1 [math.PR])
- Learning to correct spectral methods for simulating turbulent flows. (arXiv:2207.00556v2 [cs.LG] UPDATED)
- Can we learn from developer mistakes? Learning to localize and repair real bugs from real bug fixes. (arXiv:2207.00301v1 [cs.SE])
- Rethinking Optimization with Differentiable Simulation from a Global Perspective. (arXiv:2207.00167v1 [stat.ML])
- Language model compression with weighted low-rank factorization. (arXiv:2207.00112v1 [cs.LG])
- GitHub Copilot AI pair programmer: Asset or Liability?. (arXiv:2206.15331v2 [cs.SE] UPDATED)
- Theoretical Perspectives on Deep Learning Methods in Inverse Problems. (arXiv:2206.14373v2 [stat.ML] UPDATED)
- Optimal Estimation of Generic Dynamics by Path-Dependent Neural Jump ODEs. (arXiv:2206.14284v5 [stat.ML] UPDATED)
- Long Range Language Modeling via Gated State Spaces. (arXiv:2206.13947v3 [cs.LG] UPDATED)
- Learning the Solution Operator of Boundary Value Problems using Graph Neural Networks. (arXiv:2206.14092v2 [cs.LG] UPDATED)
- Pen and Paper Exercises in Machine Learning. (arXiv:2206.13446v1 [cs.LG])
- Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert. (arXiv:2206.12663v2 [stat.ML] UPDATED)
- Robustness Implies Generalization via Data-Dependent Generalization Bounds. (arXiv:2206.13497v4 [cs.LG] UPDATED)
- Learning stochastic filtering. (arXiv:2206.13018v1 [cond-mat.stat-mech])
- Score-based Generative Models for Calorimeter Shower Simulation. (arXiv:2206.11898v3 [hep-ph] UPDATED)
- An attempt to trace the birth of importance sampling. (arXiv:2206.12286v1 [math.HO])
- On the Parameterization and Initialization of Diagonal State Space Models. (arXiv:2206.11893v2 [cs.LG] UPDATED)
- Stochastic Langevin Differential Inclusions with Applications to Machine Learning. (arXiv:2206.11533v2 [math.OC] UPDATED)
- Understanding convolution on graphs via energies. (arXiv:2206.10991v5 [cs.LG] UPDATED)
- Diffusion models as plug-and-play priors. (arXiv:2206.09012v3 [cs.LG] UPDATED)
- Robust SDE-Based Variational Formulations for Solving Linear PDEs via Deep Learning. (arXiv:2206.10588v2 [cs.LG] UPDATED)
- Low-Precision Stochastic Gradient Langevin Dynamics. (arXiv:2206.09909v1 [cs.LG])
- Principled Acceleration of Iterative Numerical Methods Using Machine Learning. (arXiv:2206.08594v2 [math.NA] UPDATED)
- Mirror Descent with Relative Smoothness in Measure Spaces, with application to Sinkhorn and EM. (arXiv:2206.08873v2 [math.OC] UPDATED)
- Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching. (arXiv:2206.08265v2 [stat.ML] UPDATED)
- Diffusion Models for Video Prediction and Infilling. (arXiv:2206.07696v3 [cs.CV] UPDATED)
- Learning to Accelerate Partial Differential Equations via Latent Global Evolution. (arXiv:2206.07681v2 [cs.LG] UPDATED)
- Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models. (arXiv:2206.07309v1 [cs.LG])
- Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions. (arXiv:2206.07252v1 [stat.ML])
- Riemannian stochastic approximation algorithms. (arXiv:2206.06795v3 [math.OC] UPDATED)
- Why is constrained neural language generation particularly challenging?. (arXiv:2206.05395v1 [cs.CL])
- gDDIM: Generalized denoising diffusion implicit models. (arXiv:2206.05564v2 [cs.LG] UPDATED)
- Markov Chain Score Ascent: A Unifying Framework of Variational Inference with Markovian Gradients. (arXiv:2206.06295v4 [cs.LG] UPDATED)
- Convergence for score-based generative modeling with polynomial complexity. (arXiv:2206.06227v2 [cs.LG] UPDATED)
- Ornstein-Uhlenbeck Type Processes on Wasserstein Space. (arXiv:2206.05479v4 [math.PR] UPDATED)
- A Continuous-Time Perspective on Global Acceleration for Monotone Equation Problems. (arXiv:2206.04770v5 [math.OC] UPDATED)
- Probability flow solution of the Fokker-Planck equation. (arXiv:2206.04642v3 [cs.LG] UPDATED)
- Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse. (arXiv:2206.03126v1 [cs.LG])
- Computational Doob's h-transforms for Online Filtering of Discretely Observed Diffusions. (arXiv:2206.03369v2 [stat.ML] UPDATED)
- Accelerating Score-based Generative Models for High-Resolution Image Synthesis. (arXiv:2206.04029v3 [cs.CV] UPDATED)
- Neural Diffusion Processes. (arXiv:2206.03992v2 [stat.ML] UPDATED)
- High-dimensional limit theorems for SGD: Effective dynamics and critical scaling. (arXiv:2206.04030v4 [stat.ML] UPDATED)
- Motion control with optimal nonlinear damping: from theory to experiment. (arXiv:2206.03802v2 [eess.SY] UPDATED)
- A Unified Convergence Theorem for Stochastic Optimization Methods. (arXiv:2206.03907v2 [math.OC] UPDATED)
- Variational Monte Carlo Approach to Partial Differential Equations with Neural Networks. (arXiv:2206.01927v2 [math.NA] UPDATED)
- Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees. (arXiv:2206.01913v2 [eess.SY] UPDATED)
- First-Order Algorithms for Min-Max Optimization in Geodesic Metric Spaces. (arXiv:2206.02041v2 [math.OC] UPDATED)
- A Control Theoretic Framework for Adaptive Gradient Optimizers in Machine Learning. (arXiv:2206.02034v2 [cs.LG] UPDATED)
- On the complexity of nonsmooth automatic differentiation. (arXiv:2206.01730v2 [math.NA] UPDATED)
- Mean field approximations via log-concavity. (arXiv:2206.01260v1 [math.PR])
- A memory-efficient neural ODE framework based on high-level adjoint differentiation. (arXiv:2206.01298v3 [cs.LG] UPDATED)
- Regularization-wise double descent: Why it occurs and how to eliminate it. (arXiv:2206.01378v1 [cs.LG])
- Accelerated first-order methods for convex optimization with locally Lipschitz continuous gradient. (arXiv:2206.01209v3 [math.OC] UPDATED)
- Improving Diffusion Models for Inverse Problems using Manifold Constraints. (arXiv:2206.00941v2 [cs.LG] UPDATED)
- DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps. (arXiv:2206.00927v3 [cs.LG] UPDATED)
- Bayesian Learning to Discover Mathematical Operations in Governing Equations of Dynamic Systems. (arXiv:2206.00669v1 [cs.LG])
- Score-Based Generative Models Detect Manifolds. (arXiv:2206.01018v3 [stat.ML] UPDATED)
- Convergence of Stein Variational Gradient Descent under a Weaker Smoothness Condition. (arXiv:2206.00508v1 [math.ST])
- Amortized backward variational inference in nonlinear state-space models. (arXiv:2206.00319v1 [stat.ME])
- Elucidating the Design Space of Diffusion-Based Generative Models. (arXiv:2206.00364v2 [cs.CV] UPDATED)
- Control of Two-way Coupled Fluid Systems with Differentiable Solvers. (arXiv:2206.00342v1 [cs.LG])
- Mario Plays on a Manifold: Generating Functional Content in Latent Space through Differential Geometry. (arXiv:2206.00106v1 [cs.LG])
- Automatic differentiation of nonsmooth iterative algorithms. (arXiv:2206.00457v1 [math.OC])
- First-order conditions for the optimal control of learning-informed nonsmooth PDEs. (arXiv:2206.00297v2 [math.OC] UPDATED)
- Regular Convergence and Finite Element Methods for Eigenvalue Problems. (arXiv:2206.00626v2 [math.NA] UPDATED)
- Discrete Gradient Flow Approximations of High Dimensional Evolution Partial Differential Equations via Deep Neural Networks. (arXiv:2206.00290v1 [math.NA])
- HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations. (arXiv:2205.15479v3 [cs.SE] UPDATED)
- Variational inference via Wasserstein gradient flows. (arXiv:2205.15902v3 [stat.ML] UPDATED)
- Generalised Implicit Neural Representations. (arXiv:2205.15674v2 [cs.LG] UPDATED)
- Few-Shot Diffusion Models. (arXiv:2205.15463v1 [cs.CV])
- Transformers from an Optimization Perspective. (arXiv:2205.13891v2 [cs.LG] UPDATED)
- Global Convergence of Over-parameterized Deep Equilibrium Models. (arXiv:2205.13814v2 [cs.LG] UPDATED)
- A Continuous Time Framework for Discrete Denoising Models. (arXiv:2205.14987v2 [stat.ML] UPDATED)
- Machine Learning for Microcontroller-Class Hardware: A Review. (arXiv:2205.14550v5 [cs.LG] UPDATED)
- Experience report of physics-informed neural networks in fluid simulations: pitfalls and frustration. (arXiv:2205.14249v3 [physics.flu-dyn] UPDATED)
- The Analysis of Optimization Algorithms, A Dissipativity Approach. (arXiv:2205.14264v1 [math.OC])
- Deep neural networks overcome the curse of dimensionality in the numerical approximation of semilinear partial differential equations. (arXiv:2205.14398v1 [math.NA])
- Transformer for Partial Differential Equations' Operator Learning. (arXiv:2205.13671v3 [cs.LG] UPDATED)
- Leveraging Causal Inference for Explainable Automatic Program Repair. (arXiv:2205.13342v2 [cs.SE] UPDATED)
- Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency. (arXiv:2205.13476v1 [cs.LG])
- Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space. (arXiv:2205.12448v2 [stat.ML] UPDATED)
- Data driven gradient flows. (arXiv:2205.12172v1 [math.NA])
- Generalization Gap in Amortized Inference. (arXiv:2205.11640v2 [stat.ML] UPDATED)
- RL with KL penalties is better viewed as Bayesian inference. (arXiv:2205.11275v2 [cs.LG] UPDATED)
- Neural Lyapunov Differentiable Predictive Control. (arXiv:2205.10728v1 [eess.SY])
- Generic bounds on the approximation error for physics-informed (and) operator learning. (arXiv:2205.11393v2 [cs.LG] UPDATED)
- Towards Size-Independent Generalization Bounds for Deep Operator Nets. (arXiv:2205.11359v2 [cs.LG] UPDATED)
- Spectral Neural Operators. (arXiv:2205.10573v1 [math.NA])
- On the SDEs and Scaling Rules for Adaptive Gradient Algorithms. (arXiv:2205.10287v2 [cs.LG] UPDATED)
- Replicating Portfolios: Constructing Permissionless Derivatives. (arXiv:2205.09890v2 [q-fin.CP] UPDATED)
- Differential learning methods for solving fully nonlinear PDEs. (arXiv:2205.09815v1 [q-fin.CP])
- Foundation Posteriors for Approximate Probabilistic Inference. (arXiv:2205.09735v2 [cs.LG] UPDATED)
- Optimizing the optimizer for data driven deep neural networks and physics informed neural networks. (arXiv:2205.07430v1 [cs.LG])
- Differentiable programming: Generalization, characterization and limitations of deep learning. (arXiv:2205.06898v1 [cs.LG])
- Robustness of Control Design via Bayesian Learning. (arXiv:2205.06896v1 [cs.LG])
- Predicting Emotional Volatility Using 41,000 Participants in the United Kingdom. (arXiv:2205.07742v2 [econ.GN] UPDATED)
- Formal limitations of sample-wise information-theoretic generalization bounds. (arXiv:2205.06915v2 [cs.LG] UPDATED)
- Physics-informed machine learning techniques for edge plasma turbulence modelling in computational theory and experiment. (arXiv:2205.07838v1 [physics.plasm-ph])
- Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties. (arXiv:2205.07069v1 [math.ST])
- Neural-Fly Enables Rapid Learning for Agile Flight in Strong Winds. (arXiv:2205.06908v1 [cs.RO])
- Robust Fundamental Lemma for Data-driven Control. (arXiv:2205.06636v1 [math.OC])
- Virtual twins of nonlinear vibrating multiphysics microstructures: physics-based versus deep learning-based approaches. (arXiv:2205.05928v1 [math.DS])
- Dynamics of SGD with Stochastic Polyak Stepsizes: Truly Adaptive Variants and Convergence to Exact Solution. (arXiv:2205.04583v3 [math.OC] UPDATED)
- Optimizing a DIscrete Loss (ODIL) to solve forward and inverse problems for partial differential equations using machine learning tools. (arXiv:2205.04611v1 [math.NA])
- On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models. (arXiv:2205.03859v1 [cs.CV])
- On state-space representations of general discrete-time dynamical systems. (arXiv:2205.03366v1 [eess.SY])
- Probabilistic Control and Majorization of Optimal Control. (arXiv:2205.03279v5 [cs.LG] UPDATED)
- Physics-informed neural networks for PDE-constrained optimization and control. (arXiv:2205.03377v2 [cs.LG] UPDATED)
- GANs as Gradient Flows that Converge. (arXiv:2205.02910v2 [cs.LG] UPDATED)
- Making SGD Parameter-Free. (arXiv:2205.02160v2 [math.OC] UPDATED)
- Wavelet neural operator: a neural operator for parametric partial differential equations. (arXiv:2205.02191v1 [physics.comp-ph])
- Subspace Diffusion Generative Models. (arXiv:2205.01490v2 [cs.LG] UPDATED)
- Efficient implementation of incremental proximal-point methods. (arXiv:2205.01457v1 [cs.LG])
- High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation. (arXiv:2205.01445v1 [stat.ML])
- Smooth over-parameterized solvers for non-smooth structured optimization. (arXiv:2205.01385v1 [math.OC])
- Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies. (arXiv:2205.01324v1 [cs.LG])
- Optimal Projection Filters. (arXiv:2205.01594v2 [math.PR] UPDATED)
- Distilling Governing Laws and Source Input for Dynamical Systems from Videos. (arXiv:2205.01314v1 [cs.CV])
- Physics-aware Reduced-order Modeling of Transonic Flow via $\beta$-Variational Autoencoder. (arXiv:2205.00608v2 [physics.flu-dyn] UPDATED)
- Learning Effective SDEs from Brownian Dynamics Simulations of Colloidal Particles. (arXiv:2205.00286v3 [math.DS] UPDATED)
- Skill Induction and Planning with Latent Language. (arXiv:2110.01517v2 [cs.LG] UPDATED)
- Continual Learning with Foundation Models: An Empirical Study of Latent Replay. (arXiv:2205.00329v2 [cs.LG] UPDATED)
- Stochastic Online Fisher Markets: Static Pricing Limits and Adaptive Enhancements. (arXiv:2205.00825v3 [cs.GT] UPDATED)
- Gradient Descent, Stochastic Optimization, and Other Tales. (arXiv:2205.00832v2 [cs.LG] UPDATED)
- Solving PDEs by Variational Physics-Informed Neural Networks: an a posteriori error analysis. (arXiv:2205.00786v1 [math.NA])
- Fast Sampling of Diffusion Models with Exponential Integrator. (arXiv:2204.13902v4 [cs.LG] UPDATED)
- Theory and Algorithms for Diffusion Processes on Riemannian Manifolds. (arXiv:2204.13665v3 [math.PR] UPDATED)
- Particle algorithms for maximum likelihood training of latent variable models. (arXiv:2204.12965v5 [stat.CO] UPDATED)
- Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD. (arXiv:2204.12446v5 [stat.ML] UPDATED)
- gLaSDI: Parametric Physics-informed Greedy Latent Space Dynamics Identification. (arXiv:2204.12005v2 [eess.SY] UPDATED)
- Convergence of the Riemannian Langevin Algorithm. (arXiv:2204.10818v1 [cs.LG])
- Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective. (arXiv:2204.10479v6 [cs.LG] UPDATED)
- Physics-Informed Bayesian Learning of Electrohydrodynamic Polymer Jet Printing Dynamics. (arXiv:2204.09513v1 [cs.LG])
- Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs. (arXiv:2204.08621v1 [math.NA])
- Visual Attention Methods in Deep Learning: An In-Depth Survey. (arXiv:2204.07756v2 [cs.CV] UPDATED)
- A Convergence Analysis of Nesterov's Accelerated Gradient Method in Training Deep Linear Neural Networks. (arXiv:2204.08306v1 [cs.LG])
- Universal approximation property of invertible neural networks. (arXiv:2204.07415v1 [cs.LG])
- InCoder: A Generative Model for Code Infilling and Synthesis. (arXiv:2204.05999v3 [cs.SE] UPDATED)
- Physics-assisted Generative Adversarial Network for X-Ray Tomography. (arXiv:2204.03703v2 [eess.IV] UPDATED)
- Video Diffusion Models. (arXiv:2204.03458v2 [cs.CV] UPDATED)
- Neural Implicit Flow: a mesh-agnostic dimensionality reduction paradigm of spatio-temporal data. (arXiv:2204.03216v5 [cs.LG] UPDATED)
- Statistical Model Criticism of Variational Auto-Encoders. (arXiv:2204.03030v1 [cs.LG])
- Fundamental limits to learning closed-form mathematical models from data. (arXiv:2204.02704v2 [cs.LG] UPDATED)
- Random Features Model with General Convex Regularization: A Fine Grained Analysis with Precise Asymptotic Learning Curves. (arXiv:2204.02678v2 [stat.ML] UPDATED)
- Nonlinear gradient mappings and stochastic optimization: A general framework with applications to heavy-tail noise. (arXiv:2204.02593v1 [math.OC])
- The First Principles of Deep Learning and Compression. (arXiv:2204.01782v1 [eess.IV])
- Signal Propagation: A Framework for Learning and Inference In a Forward Pass. (arXiv:2204.01723v2 [cs.LG] UPDATED)
- Deep learning, stochastic gradient descent and diffusion maps. (arXiv:2204.01365v3 [stat.ML] UPDATED)
- Stability of estimates for fundamental solutions under Feynman-Kac perturbations for symmetric Markov processes. (arXiv:2204.01419v1 [math.PR])
- Covariance Representations, $L^p$-Poincar\'e Inequalities, Stein's Kernels and High Dimensional CLTs. (arXiv:2204.01088v1 [math.PR])
- Differentially Private Sampling from Rashomon Sets, and the Universality of Langevin Diffusion for Convex Optimization. (arXiv:2204.01585v4 [cs.LG] UPDATED)
- Stochastic filtering under model ambiguity. (arXiv:2204.01226v2 [math.OC] UPDATED)
- Understanding the unstable convergence of gradient descent. (arXiv:2204.01050v2 [math.OC] UPDATED)
- Physically Consistent Neural Networks for building thermal modeling: theory and analysis. (arXiv:2112.03212v3 [cs.LG] UPDATED)
- Physics-guided neural networks for feedforward control: From consistent identification to feedforward controller design. (arXiv:2204.00431v1 [eess.SY])
- Learning the conditional law: signatures and conditional GANs in filtering and prediction of diffusion processes. (arXiv:2204.00611v2 [stat.ML] UPDATED)
- Time-invariant Prefix Coding for LQG Control. (arXiv:2204.00588v6 [cs.IT] UPDATED)
- Convergence Rate Bounds for the Mirror Descent Method: IQCs and the Bregman Divergence. (arXiv:2204.00502v2 [math.OC] UPDATED)
- Neural representation of a time optimal, constant acceleration rendezvous. (arXiv:2203.15490v1 [astro-ph.EP])
- Disentangling speech from surroundings with neural embeddings. (arXiv:2203.15578v2 [cs.SD] UPDATED)
- State space models vs. multi-step predictors in predictive control: Are state space models complicating safe data-driven designs?. (arXiv:2203.15471v3 [math.OC] UPDATED)
- Robust, Automated, and Accurate Black-box Variational Inference. (arXiv:2203.15945v1 [stat.ML])
- Physics-constrained Unsupervised Learning of Partial Differential Equations using Meshes. (arXiv:2203.16628v1 [cs.LG])
- Equivariant Diffusion for Molecule Generation in 3D. (arXiv:2203.17003v2 [cs.LG] UPDATED)
- When Physics Meets Machine Learning: A Survey of Physics-Informed Machine Learning. (arXiv:2203.16797v1 [cs.LG])
- SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping. (arXiv:2203.16749v2 [eess.AS] UPDATED)
- Stochastic Calculus in Infinite Dimensions and SPDEs. (arXiv:2203.17206v3 [math.PR] UPDATED)
- An energy-based deep splitting method for the nonlinear filtering problem. (arXiv:2203.17153v3 [stat.CO] UPDATED)
- Neural Q-learning for solving PDEs. (arXiv:2203.17128v2 [math.NA] UPDATED)
- Gradient flow structure and convergence analysis of the ensemble Kalman inversion for nonlinear forward models. (arXiv:2203.17117v2 [math.NA] UPDATED)
- Wasserstein Distributionally Robust Control of Partially Observable Linear Systems: Tractable Approximation and Performance Guarantee. (arXiv:2203.17045v2 [eess.SY] UPDATED)
- A Derivation of Nesterov's Accelerated Gradient Algorithm from Optimal Control Theory. (arXiv:2203.17226v1 [math.OC])
- Gradient flow structure and convergence analysis of the ensemble Kalman inversion for nonlinear forward models. (arXiv:2203.17117v2 [math.NA] UPDATED)
- State space models vs. multi-step predictors in predictive control: Are state space models complicating safe data-driven designs?. (arXiv:2203.15471v3 [math.OC] UPDATED)
- Convergence of gradient descent for deep neural networks. (arXiv:2203.16462v4 [cs.LG] UPDATED)
- Nonequilibrium Statistical Mechanics and Optimal Prediction of Partially-Observed Complex Systems. (arXiv:2203.16048v1 [cond-mat.stat-mech])
- A New Diffusive Representation for Fractional Derivatives, Part II: Convergence Analysis of the Numerical Scheme. (arXiv:2203.16454v1 [math.NA])
- Blended Diffusion for Text-driven Editing of Natural Images. (arXiv:2111.14818v2 [cs.CV] UPDATED)
- Theoretical Connection between Locally Linear Embedding, Factor Analysis, and Probabilistic PCA. (arXiv:2203.13911v2 [stat.ML] UPDATED)
- JAX-FLUIDS: A fully-differentiable high-order computational fluid dynamics solver for compressible two-phase flows. (arXiv:2203.13760v1 [physics.flu-dyn])
- Efficient-VDVAE: Less is more. (arXiv:2203.13751v2 [cs.LG] UPDATED)
- On the Role of Fixed Points of Dynamical Systems in Training Physics-Informed Neural Networks. (arXiv:2203.13648v2 [cs.LG] UPDATED)
- Applications of physics informed neural operators. (arXiv:2203.12634v2 [physics.comp-ph] UPDATED)
- Economic Networks: Theory and Computation. (arXiv:2203.11972v5 [econ.GN] UPDATED)
- Variations and extensions of the Gaussian concentration inequality, Part II. (arXiv:2203.12523v3 [math.PR] UPDATED)
- NOSNOC: A Software Package for Numerical Optimal Control of Nonsmooth Systems. (arXiv:2203.11516v2 [math.OC] UPDATED)
- Safety of Sampled-Data Systems with Control Barrier Functions via Approximate Discrete Time Models. (arXiv:2203.11470v2 [eess.SY] UPDATED)
- PI-VAE: Physics-Informed Variational Auto-Encoder for stochastic differential equations. (arXiv:2203.11363v1 [stat.ML])
- On Neural Network Equivalence Checking using SMT Solvers. (arXiv:2203.11629v1 [cs.AI])
- Embedded Code Generation with CVXPY. (arXiv:2203.11419v2 [math.OC] UPDATED)
- A 3D Generative Model for Structure-Based Drug Design. (arXiv:2203.10446v2 [q-bio.BM] UPDATED)
- Deep Learning Generalization, Extrapolation, and Over-parameterization. (arXiv:2203.10366v1 [cs.LG])
- Geometric Methods for Sampling, Optimisation, Inference and Adaptive Agents. (arXiv:2203.10592v3 [stat.ML] UPDATED)
- On the Generalization Mystery in Deep Learning. (arXiv:2203.10036v3 [cs.LG] UPDATED)
- Alleviating Adversarial Attacks on Variational Autoencoders with MCMC. (arXiv:2203.09940v2 [cs.LG] UPDATED)
- Image Storage on Synthetic DNA Using Autoencoders. (arXiv:2203.09981v1 [cs.LG])
- Learning Stabilizable Deep Dynamics Models. (arXiv:2203.09710v1 [cs.LG])
- Diffusion Probabilistic Modeling for Video Generation. (arXiv:2203.09481v5 [cs.CV] UPDATED)
- Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks. (arXiv:2203.08852v1 [cs.LG])
- Dual Diffusion Implicit Bridges for Image-to-Image Translation. (arXiv:2203.08382v4 [cs.CV] UPDATED)
- Can A Neural Network Hear the Shape of A Drum?. (arXiv:2203.08073v2 [cs.SD] UPDATED)
- Neural Solvers for Fast and Accurate Numerical Optimal Control. (arXiv:2203.08072v1 [math.OC])
- Towards Neural Sparse Linear Solvers. (arXiv:2203.06944v1 [cs.LG])
- Parameter Inference of Time Series by Delay Embeddings and Learning Differentiable Operators. (arXiv:2203.06269v2 [cs.LG] UPDATED)
- Exponential convergence in Wasserstein metric for distribution dependent SDEs. (arXiv:2203.05856v1 [math.PR])
- The loss landscape of deep linear neural networks: a second-order analysis. (arXiv:2107.13289v2 [math.ST] CROSS LISTED)
- Reinforcement Learning for Linear Quadratic Control is Vulnerable Under Cost Manipulation. (arXiv:2203.05774v2 [eess.SY] UPDATED)
- Deep Learning for the Benes Filter. (arXiv:2203.05561v1 [stat.ML])
- Score-Based Generative Models for Molecule Generation. (arXiv:2203.04698v1 [cs.LG])
- Score matching enables causal discovery of nonlinear additive noise models. (arXiv:2203.04413v1 [cs.LG])
- Multi-trial Neural Architecture Search with Lottery Tickets. (arXiv:2203.04300v3 [cs.LG] UPDATED)
- Machine Learning based Optimal Feedback Control for Microgrid Stabilization. (arXiv:2203.04815v1 [eess.SY])
- On generative models as the basis for digital twins. (arXiv:2203.04384v1 [cs.LG])
- Quasi $\alpha$-Firmly Nonexpansive Mappings in Wasserstein Spaces. (arXiv:2203.04851v2 [math.FA] UPDATED)
- Geometric Aspects of Data-Processing of Markov Chains. (arXiv:2203.04575v3 [math.PR] UPDATED)
- Equivalences of Geometric Ergodicity of Markov Chains. (arXiv:2203.04395v5 [math.PR] UPDATED)
- Learning to Bound: A Generative Cram\'er-Rao Bound. (arXiv:2203.03695v2 [cs.LG] UPDATED)
- Variational methods for simulation-based inference. (arXiv:2203.04176v3 [stat.ML] UPDATED)
- Noisy Low-rank Matrix Optimization: Geometry of Local Minima and Convergence Rate. (arXiv:2203.03899v3 [math.OC] UPDATED)
- Flat minima generalize for low-rank matrix recovery. (arXiv:2203.03756v2 [cs.LG] UPDATED)
- Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets. (arXiv:2203.03684v1 [cs.LG])
- GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation. (arXiv:2203.02923v1 [q-bio.QM])
- Unbiased Estimation using a Class of Diffusion Processes. (arXiv:2203.03013v2 [stat.CO] UPDATED)
- Recursive Monte Carlo and Variational Inference with Auxiliary Variables. (arXiv:2203.02836v2 [cs.LG] UPDATED)
- LaSDI: Parametric Latent Space Dynamics Identification. (arXiv:2203.02076v2 [math.NA] UPDATED)
- Differentiable Causal Discovery Under Latent Interventions. (arXiv:2203.02336v1 [cs.LG])
- Interpretable Latent Variables in Deep State Space Models. (arXiv:2203.02057v2 [stat.ML] UPDATED)
- Whiplash Gradient Descent Dynamics. (arXiv:2203.02140v4 [math.OC] UPDATED)
- Sharper Bounds for Proximal Gradient Algorithms with Errors. (arXiv:2203.02204v1 [math.OC])
- WaveY-Net: Physics-augmented deep learning for high-speed electromagnetic simulation and optimization. (arXiv:2203.01248v1 [physics.app-ph])
- Path sampling of recurrent neural networks by incorporating known physics. (arXiv:2203.00597v2 [cond-mat.dis-nn] UPDATED)
- Particle-based Fast Jet Simulation at the LHC with Variational Autoencoders. (arXiv:2203.00520v1 [physics.comp-ph])
- Learning Neural Hamiltonian Dynamics: A Methodological Overview. (arXiv:2203.00128v1 [cs.LG])
- Neural Ordinary Differential Equations for Nonlinear System Identification. (arXiv:2203.00120v2 [cs.LG] UPDATED)
- Differentiable Matrix Elements with MadJax. (arXiv:2203.00057v1 [hep-ph])
- Variational Autoencoders Without the Variation. (arXiv:2203.00645v1 [cs.LG])
- On the sample complexity of stabilizing linear dynamical systems from data. (arXiv:2203.00474v2 [math.OC] UPDATED)
- On a linearization of quadratic Wasserstein distance. (arXiv:2201.13386v2 [math.NA] UPDATED)
- Parameter-free Mirror Descent. (arXiv:2203.00444v3 [cs.LG] UPDATED)
- Earthquake Control: An Emerging Application for Robust Control. Theory and Experimental Tests. (arXiv:2203.00296v2 [math.OC] UPDATED)
- Bregman three-operator splitting methods. (arXiv:2203.00252v4 [math.OC] UPDATED)
- Amortized Proximal Optimization. (arXiv:2203.00089v1 [cs.LG])
- On quantitative hypocoercivity estimates based on Harris-type theorems. (arXiv:2203.00096v2 [math.AP] UPDATED)
- Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations. (arXiv:2202.12932v2 [stat.ML] UPDATED)
- Conditional Simulation Using Diffusion Schr\"odinger Bridges. (arXiv:2202.13460v2 [stat.ML] UPDATED)
- Benchmarking Generative Latent Variable Models for Speech. (arXiv:2202.12707v2 [eess.AS] UPDATED)
- Directed Graph Auto-Encoders. (arXiv:2202.12449v1 [cs.LG])
- Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs. (arXiv:2202.12373v2 [cs.LG] UPDATED)
- Physics Informed RNN-DCT Networks for Time-Dependent Partial Differential Equations. (arXiv:2202.12358v1 [physics.comp-ph])
- Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum Likelihood. (arXiv:2202.12176v1 [cs.LG])
- Physics-informed neural networks for inverse problems in supersonic flows. (arXiv:2202.11821v1 [math.NA])
- Safe Control with Learned Certificates: A Survey of Neural Lyapunov, Barrier, and Contraction methods. (arXiv:2202.11762v2 [cs.RO] UPDATED)
- Mirror Descent Strikes Again: Optimal Stochastic Convex Optimization under Infinite Noise Variance. (arXiv:2202.11632v1 [stat.ML])
- On PAC-Bayesian reconstruction guarantees for VAEs. (arXiv:2202.11455v1 [cs.LG])
- Learning Filterbanks for End-to-End Acoustic Beamforming. (arXiv:2111.04614v2 [eess.AS] UPDATED)
- Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders. (arXiv:2202.09671v4 [stat.ML] UPDATED)
- Disentangling Autoencoders (DAE). (arXiv:2202.09926v2 [cs.LG] UPDATED)
- It's Raw! Audio Generation with State-Space Models. (arXiv:2202.09729v1 [cs.SD])
- Gradient Estimation with Discrete Stein Operators. (arXiv:2202.09497v6 [stat.ML] UPDATED)
- Finite-Time Analysis of Natural Actor-Critic for POMDPs. (arXiv:2202.09753v3 [cs.LG] UPDATED)
- Pseudo Numerical Methods for Diffusion Models on Manifolds. (arXiv:2202.09778v2 [cs.CV] UPDATED)
- Physics-informed neural networks for learning the homogenized coefficients of multiscale elliptic equations. (arXiv:2202.09712v1 [math.NA])
- Schr\"{o}dinger Meets Kuramoto via Feynman-Kac: Minimum Effort Distribution Steering for Noisy Nonuniform Kuramoto Oscillators. (arXiv:2202.09734v2 [math.OC] UPDATED)
- Signal Decomposition Using Masked Proximal Operators. (arXiv:2202.09338v6 [cs.LG] UPDATED)
- An alternative approach to train neural networks using monotone variational inequality. (arXiv:2202.08876v3 [stat.ML] UPDATED)
- Optimization flows landing on the Stiefel manifold. (arXiv:2202.09058v2 [math.OC] UPDATED)
- Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models. (arXiv:2201.06503v3 [cs.LG] UPDATED)
- Learning continuous models for continuous physics. (arXiv:2202.08494v2 [cs.LG] UPDATED)
- Gradients without Backpropagation. (arXiv:2202.08587v1 [cs.LG])
- Uniting Nesterov and Heavy Ball Methods for Uniform Global Asymptotic Stability of the Set of Minimizers. (arXiv:2202.07739v5 [math.OC] UPDATED)
- Deep Koopman Operator with Control for Nonlinear Systems. (arXiv:2202.08004v2 [cs.RO] UPDATED)
- Understanding DDPM Latent Codes Through Optimal Transport. (arXiv:2202.07477v2 [stat.ML] UPDATED)
- Machine Learning in Aerodynamic Shape Optimization. (arXiv:2202.07141v2 [cs.LG] UPDATED)
- Learned Turbulence Modelling with Differentiable Fluid Solvers: Physics-based Loss-functions and Optimisation Horizons. (arXiv:2202.06988v2 [physics.flu-dyn] UPDATED)
- A short note on an inequality between KL and TV. (arXiv:2202.07198v2 [math.PR] UPDATED)
- Reverse Back Propagation to Make Full Use of Derivative. (arXiv:2202.06316v1 [cs.LG])
- Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam. (arXiv:2202.06009v3 [cs.LG] UPDATED)
- Improved analysis for a proximal algorithm for sampling. (arXiv:2202.06386v1 [math.ST])
- scpi: Uncertainty Quantification for Synthetic Control Methods. (arXiv:2202.05984v3 [stat.ME] UPDATED)
- Learning by Doing: Controlling a Dynamical System using Causality, Control, and Reinforcement Learning. (arXiv:2202.06052v1 [cs.LG])
- Analysis of Dual-Based PID Controllers through Convolutional Mirror Descent. (arXiv:2202.06152v4 [math.OC] UPDATED)
- Black-Scholes-Merton Option Pricing Revisited: Did we Find a Fatal Flaw?. (arXiv:2202.05671v2 [q-fin.PR] UPDATED)
- NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy. (arXiv:2201.13396v2 [cs.LG] UPDATED)
- Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality. (arXiv:2202.05830v1 [cs.LG])
- The Power of Adaptivity in SGD: Self-Tuning Step Sizes with Unbounded Gradients and Affine Variance. (arXiv:2202.05791v2 [stat.ML] UPDATED)
- Nonlinear MCMC for Bayesian Machine Learning. (arXiv:2202.05621v2 [stat.ML] UPDATED)
- On change of measure inequalities for $f$-divergences. (arXiv:2202.05568v1 [stat.ML])
- Formal verification of iterative convergence of numerical algorithms. (arXiv:2202.05587v2 [math.NA] UPDATED)
- Non-stationary Anderson acceleration with optimized damping. (arXiv:2202.05295v1 [math.NA])
- Recovering Stochastic Dynamics via Gaussian Schr\"odinger Bridges. (arXiv:2202.05722v1 [cs.LG])
- On an Asymptotic Criterion for Blockchain Design: The Asynchronous Composition Model. (arXiv:2202.05080v1 [math.PR])
- Conditional Diffusion Probabilistic Model for Speech Enhancement. (arXiv:2202.05256v1 [eess.AS])
- Towards a Theory of Non-Log-Concave Sampling: First-Order Stationarity Guarantees for Langevin Monte Carlo. (arXiv:2202.05214v1 [math.ST])
- Generalization Bounds via Convex Analysis. (arXiv:2202.04985v3 [stat.ML] UPDATED)
- Diffusion bridges vector quantized Variational AutoEncoders. (arXiv:2202.04895v2 [stat.ML] UPDATED)
- Optimal learning rate schedules in high-dimensional non-convex optimization problems. (arXiv:2202.04509v1 [cs.LG])
- InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training. (arXiv:2202.03751v1 [eess.AS] CROSS LISTED)
- Turnpike in optimal control of PDEs, ResNets, and beyond. (arXiv:2202.04097v1 [math.OC])
- Differentiable Economics for Randomized Affine Maximizer Auctions. (arXiv:2202.02872v1 [cs.GT])
- Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-start. (arXiv:2202.03397v4 [stat.ML] UPDATED)
- Riemannian Score-Based Generative Modelling. (arXiv:2202.02763v3 [cs.LG] UPDATED)
- Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations. (arXiv:2202.02514v3 [cs.LG] UPDATED)
- Is High Variance Unavoidable in RL? A Case Study in Continuous Control. (arXiv:2110.11222v2 [cs.LG] UPDATED)
- LyaNet: A Lyapunov Framework for Training Neural ODEs. (arXiv:2202.02526v1 [cs.LG])
- Message Passing Neural PDE Solvers. (arXiv:2202.03376v3 [cs.LG] UPDATED)
- Learning the random variables in Monte Carlo simulations with stochastic gradient descent: Machine learning for parametric PDEs and financial derivative pricing. (arXiv:2202.02717v2 [math.NA] UPDATED)
- On Neural Differential Equations. (arXiv:2202.02435v1 [cs.LG])
- Dirty derivatives for output feedback stabilization. (arXiv:2202.01941v1 [math.OC])
- Non-Vacuous Generalisation Bounds for Shallow Neural Networks. (arXiv:2202.01627v3 [cs.LG] UPDATED)
- Fenrir: Physics-Enhanced Regression for Initial Value Problems. (arXiv:2202.01287v2 [cs.LG] UPDATED)
- Gradient estimators for normalising flows. (arXiv:2202.01314v2 [stat.ML] UPDATED)
- Adaptive Experimentation with Delayed Binary Feedback. (arXiv:2202.00846v1 [cs.IR])
- Physical Design using Differentiable Learned Simulators. (arXiv:2202.00728v1 [cs.LG])
- Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points. (arXiv:2006.01106v3 [math.OC] UPDATED)
- A Dynamical System Perspective for Lipschitz Neural Networks. (arXiv:2110.12690v2 [cs.LG] UPDATED)
- Understanding AdamW through Proximal Methods and Scale-Freeness. (arXiv:2202.00089v1 [cs.LG])
- Optimization Landscape of Gradient Descent for Discrete-time Static Output Feedback. (arXiv:2109.13132v4 [math.OC] UPDATED)
- Mean-Field Langevin Dynamics: Exponential Convergence and Annealing. (arXiv:2202.01009v3 [math.OC] UPDATED)
- HMC and underdamped Langevin united in the unadjusted convex smooth case. (arXiv:2202.00977v4 [math.PR] UPDATED)
- Extending FEniCS to Work in Higher Dimensions Using Tensor Product Finite Elements. (arXiv:2202.00762v2 [math.NA] UPDATED)
- Progressive Distillation for Fast Sampling of Diffusion Models. (arXiv:2202.00512v2 [cs.LG] UPDATED)
- Understanding AdamW through Proximal Methods and Scale-Freeness. (arXiv:2202.00089v1 [cs.LG])
- Deep Generative Models in Engineering Design: A Review. (arXiv:2110.10863v4 [cs.LG] UPDATED)
- Posterior Matching for Arbitrary Conditioning. (arXiv:2201.12414v4 [cs.LG] UPDATED)
- Learning Proximal Operators to Discover Multiple Optima. (arXiv:2201.11945v3 [cs.LG] UPDATED)
- Denoising Diffusion Restoration Models. (arXiv:2201.11793v3 [eess.IV] UPDATED)
- Improving Group Testing via Gradient Descent. (arXiv:2201.12325v1 [cs.IT])
- Optimal Transport Tools (OTT): A JAX Toolbox for all things Wasserstein. (arXiv:2201.12324v1 [cs.LG])
- On feedforward control using physics-guided neural networks: Training cost regularization and optimized initialization. (arXiv:2201.12088v1 [cs.LG])
- Bounding Kolmogorov distances through Wasserstein and related integral probability metrics. (arXiv:2201.12087v2 [math.PR] UPDATED)
- Half-space depth of log-concave probability measures. (arXiv:2201.11992v2 [math.PR] UPDATED)
- Towards Data-driven LQR with Koopmanizing Flows. (arXiv:2201.11640v2 [eess.SY] UPDATED)
- Uphill Roads to Variational Tightness: Monotonicity and Monte Carlo Objectives. (arXiv:2201.10989v1 [stat.ML])
- Ergodicity of supercritical SDEs driven by $\alpha$-stable processes and heavy-tailed sampling. (arXiv:2201.10158v1 [math.PR])
- Convex Analysis of the Mean Field Langevin Dynamics. (arXiv:2201.10469v2 [stat.ML] UPDATED)
- Exploring Differential Geometry in Neural Implicits. (arXiv:2201.09263v4 [cs.GR] UPDATED)
- Small-Signal Stability Analysis of Numerical Integration Methods. (arXiv:2201.09529v1 [eess.SY])
- The Forward-Backward Envelope for Sampling with the Overdamped Langevin Algorithm. (arXiv:2201.09096v1 [math.NA])
- Heavy-tailed Sampling via Transformed Unadjusted Langevin Algorithm. (arXiv:2201.08349v1 [math.ST])
- Geometrically adapted Langevin dynamics for Markov chain Monte Carlo simulations. (arXiv:2201.08072v1 [stat.AP])
- Accelerated Gradient Flow: Risk, Stability, and Implicit Regularization. (arXiv:2201.08311v1 [stat.ML])
- Error analysis for a statistical finite element method. (arXiv:2201.07543v1 [math.ST])
- Zero-Shot Machine Unlearning. (arXiv:2201.05629v3 [cs.LG] UPDATED)
- Optimal Algorithmic Monetary Policy. (arXiv:2104.07888v3 [econ.GN] CROSS LISTED)
- The Implicit Regularization of Momentum Gradient Descent with Early Stopping. (arXiv:2201.05405v1 [cs.LG])
- Probabilistic design of optimal sequential decision-making algorithms in learning and control. (arXiv:2201.05212v3 [math.OC] UPDATED)
- Roots of the identity operator and proximal mappings: (classical and phantom) cycles and gap vectors. (arXiv:2201.05189v2 [math.FA] UPDATED)
- Neural Koopman Lyapunov Control. (arXiv:2201.05098v2 [eess.SY] UPDATED)
- Boost your favorite Markov Chain Monte Carlo sampler using Kac's theorem: the Kick-Kac teleportation algorithm. (arXiv:2201.05002v2 [stat.CO] UPDATED)
- Flow selections for (nonlinear) Fokker-Planck-Kolmogorov equations. (arXiv:2201.04539v2 [math.PR] UPDATED)
- Stochastic Gradient Descent for Barycenters in Wasserstein Space. (arXiv:2201.04232v3 [math.OC] UPDATED)
- An Introduction to Autoencoders. (arXiv:2201.03898v1 [cs.LG])
- Backward error analysis for conjugate symplectic methods. (arXiv:2201.03911v2 [math.NA] UPDATED)
- Path differentiability of ODE flows. (arXiv:2201.03819v1 [cs.LG])
- Time-adaptive Lagrangian Variational Integrators for Accelerated Optimization on Manifolds. (arXiv:2201.03774v3 [math.OC] UPDATED)
- Data-driven Meets Geometric Control: Zero Dynamics, Subspace Stabilization, and Malicious Attacks. (arXiv:2201.03656v1 [eess.SY])
- Stability Based Generalization Bounds for Exponential Family Langevin Dynamics. (arXiv:2201.03064v2 [cs.LG] UPDATED)
- Computing optimal experimental designs on finite sets by log-determinant gradient flow. (arXiv:2201.03042v1 [math.NA])
- Accelerated Optimization on Riemannian Manifolds via Projected Variational Integrators. (arXiv:2201.02904v1 [math.OC])
- The dynamics of representation learning in shallow, non-linear autoencoders. (arXiv:2201.02115v2 [stat.ML] UPDATED)
- On the geometric convergence for MALA under verifiable conditions. (arXiv:2201.01951v1 [stat.CO])
- SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations. (arXiv:2108.01073v2 [cs.CV] UPDATED)
- Linear Variational State-Space Filtering. (arXiv:2201.01353v3 [cs.LG] UPDATED)
- Inverse Extended Kalman Filter -- Part I: Fundamentals. (arXiv:2201.01539v3 [math.OC] UPDATED)
- Transport type metrics on the space of probability measures involving singular base measures. (arXiv:2201.00875v3 [math.OC] UPDATED)
- Finite-Element Domain Approximation for Maxwell Variational Problems on Curved Domains. (arXiv:2201.00883v1 [math.NA])
- Fundamental Limitations of Control and Filtering in Continuous-Time Systems: An Information-Theoretic Analysis. (arXiv:2201.00995v2 [cs.IT] UPDATED)
- DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents. (arXiv:2201.00308v3 [cs.LG] UPDATED)
- High Dimensional Optimization through the Lens of Machine Learning. (arXiv:2112.15392v1 [math.OC])
- A Strongly Monotonic Polygonal Euler Scheme. (arXiv:2112.15596v2 [math.PR] UPDATED)
- Bayesian Inference for Fluid Dynamics: A Case Study for the Stochastic Rotating Shallow Water Model. (arXiv:2112.15216v1 [math.NA])
Saved in 2021
- Importance sampling for option pricing with feedforward neural networks. (arXiv:2112.14247v2 [q-fin.CP] UPDATED)
- The Economics of Interstellar Flight. (arXiv:2112.13911v2 [econ.GN] UPDATED)
- Bounding Wasserstein distance with couplings. (arXiv:2112.03152v3 [stat.CO] UPDATED)
- Efficient Automatic Differentiation of Implicit Functions. (arXiv:2112.14217v2 [stat.CO] UPDATED)
- Unbiased Parameter Inference for a Class of Partially Observed Levy-Process Models. (arXiv:2112.13874v2 [stat.CO] UPDATED)
- Controlling Chaos in Van Der Pol Dynamics Using Signal-Encoded Deep Learning. (arXiv:2112.14707v2 [eess.SY] UPDATED)
- Optimal Sampled-Data Control of a Nonlinear System. (arXiv:2112.14507v1 [eess.SY])
- Control Theoretic Analysis of Temporal Difference Learning. (arXiv:2112.14417v6 [cs.AI] UPDATED)
- Empirical approximation to invariant measures for McKean--Vlasov processes: mean-field interaction vs self-interaction. (arXiv:2112.14112v1 [math.PR])
- Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies. (arXiv:2112.13835v1 [cs.LG])
- Interpreting Dynamical Systems as Bayesian Reasoners. (arXiv:2112.13523v1 [cs.AI])
- Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives. (arXiv:2112.13339v2 [stat.ML] UPDATED)
- Last-Iterate Convergence of Saddle-Point Optimizers via High-Resolution Differential Equations. (arXiv:2112.13826v3 [math.OC] UPDATED)
- Analysis of Langevin Monte Carlo from Poincar\'e to Log-Sobolev. (arXiv:2112.12662v1 [math.ST])
- Variational Learning of Euler-Lagrange Dynamics from Data. (arXiv:2112.12619v3 [math.NA] UPDATED)
- POD-Galerkin reduced order models and physics-informed neural networks for solving inverse problems for the Navier-Stokes equations. (arXiv:2112.11950v2 [physics.flu-dyn] UPDATED)
- PyTracer: Automatically profiling numerical instabilities in Python. (arXiv:2112.11508v2 [cs.MS] UPDATED)
- Discrete fully probabilistic design: towards a control pipeline for the synthesis of policies from examples. (arXiv:2112.11210v2 [eess.SY] UPDATED)
- Dynamically Stable Poincar\'e Embeddings for Neural Manifolds. (arXiv:2112.11172v2 [cs.LG] UPDATED)
- SPDE bridges with observation noise and their spatial approximation. (arXiv:2112.11141v2 [math.NA] UPDATED)
- FlowPool: Pooling Graph Representations with Wasserstein Gradient Flows. (arXiv:2112.09990v2 [cs.LG] UPDATED)
- Heavy-tailed denoising score matching. (arXiv:2112.09788v2 [cs.LG] UPDATED)
- On the existence of global minima and convergence analyses for gradient descent methods in the training of deep neural networks. (arXiv:2112.09684v2 [math.OC] UPDATED)
- Unadjusted Langevin algorithm for sampling a mixture of weakly smooth potentials. (arXiv:2112.09311v2 [stat.CO] UPDATED)
- Composed Physics- and Data-driven System Identification for Non-autonomous Systems in Control Engineering. (arXiv:2112.08148v1 [math.OC])
- Data-Driven Models for Control Engineering Applications Using the Koopman Operator. (arXiv:2112.07983v2 [eess.SY] UPDATED)
- Tackling the Generative Learning Trilemma with Denoising Diffusion GANs. (arXiv:2112.07804v2 [cs.LG] UPDATED)
- Learning to track environment state via predictive autoencoding. (arXiv:2112.07745v1 [cs.LG])
- Data-Driven Models for Control Engineering Applications Using the Koopman Operator. (arXiv:2112.07983v2 [eess.SY] UPDATED)
- Score-Based Generative Modeling with Critically-Damped Langevin Diffusion. (arXiv:2112.07068v4 [stat.ML] UPDATED)
- Efficient differentiable quadratic programming layers: an ADMM approach. (arXiv:2112.07464v1 [math.OC])
- Programming with Neural Surrogates of Programs. (arXiv:2112.06148v1 [cs.PL])
- Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias. (arXiv:2112.06868v2 [cs.LG] UPDATED)
- The Past as a Stochastic Process. (arXiv:2112.05876v1 [stat.AP])
- Hedging Cryptocurrency Options. (arXiv:2112.06807v3 [q-fin.PR] UPDATED)
- Deterministic particle flows for constraining stochastic nonlinear systems. (arXiv:2112.05735v2 [cond-mat.stat-mech] UPDATED)
- A closed-measure approach to stochastic approximation. (arXiv:2112.05482v3 [math.OC] UPDATED)
- Optimal transport and control of active drops. (arXiv:2112.05676v1 [cond-mat.soft])
- A fully-differentiable compressible high-order computational fluid dynamics solver. (arXiv:2112.04979v1 [physics.flu-dyn])
- A More Stable Accelerated Gradient Method Inspired by Continuous-Time Perspective. (arXiv:2112.04922v2 [math.OC] UPDATED)
- Learning over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System. (arXiv:2112.04219v3 [eess.SY] UPDATED)
- Diffeomorphically Learning Stable Koopman Operators. (arXiv:2112.04085v2 [cs.LG] UPDATED)
- A Continuous-time Stochastic Gradient Descent Method for Continuous Data. (arXiv:2112.03754v1 [cs.LG])
- A Novel Convergence Analysis for Algorithms of the Adam Family. (arXiv:2112.03459v1 [cs.LG])
- Differentiable Generalised Predictive Coding. (arXiv:2112.03378v2 [cs.LG] UPDATED)
- Variational Wasserstein gradient flow. (arXiv:2112.02424v3 [cs.LG] UPDATED)
- Controllable and Compositional Generation with Latent-Space Energy-Based Models. (arXiv:2110.10873v2 [cs.CV] UPDATED)
- ProbNum: Probabilistic Numerics in Python. (arXiv:2112.02100v1 [cs.MS])
- A Structured Dictionary Perspective on Implicit Neural Representations. (arXiv:2112.01917v2 [cs.LG] UPDATED)
- Chronological Causal Bandits. (arXiv:2112.01819v1 [stat.ML])
- Convergence Properties of Monotone and Nonmonotone Proximal Gradient Methods Revisited. (arXiv:2112.01798v2 [math.OC] UPDATED)
- Data-Enabled Gradient Flow as Feedback Controller: Regulation of Linear Dynamical Systems to Minimizers of Unknown Functions. (arXiv:2112.01652v3 [math.OC] UPDATED)
- Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models. (arXiv:2112.01163v1 [cs.LG])
- The Physics of Machine Learning: An Intuitive Introduction for the Physical Scientist. (arXiv:2112.00851v1 [cond-mat.dis-nn])
- Optimal Control of the Kirchhoff Equation. (arXiv:2112.01067v1 [math.OC])
- Nonlinear Forward-Backward Splitting with Momentum Correction. (arXiv:2112.00481v5 [math.OC] UPDATED)
- Exact Asymptotics for Linear Quadratic Adaptive Control
- Path Integral Sampler: a stochastic control approach for sampling. (arXiv:2111.15141v2 [cs.LG] UPDATED)
- Rigorous data-driven computation of spectral properties of Koopman operators for dynamical systems. (arXiv:2111.14889v2 [math.NA] UPDATED)
- Stochastic Wasserstein Hamiltonian Flows. (arXiv:2111.15163v1 [math.PR])
- Encoding Causal Macrovariables. (arXiv:2111.14724v1 [cs.LG])
- Conditional Image Generation with Score-Based Diffusion Models. (arXiv:2111.13606v1 [cs.LG])
- On Recurrent Neural Networks for learning-based control: recent results and ideas for future developments. (arXiv:2111.13557v2 [eess.SY] UPDATED)
- Introduction to SPDEs from Probability and PDE. (arXiv:2111.13160v2 [math.PR] UPDATED)
- Learning Low-Dimensional Quadratic-Embeddings of High-Fidelity Nonlinear Dynamics using Deep Learning. (arXiv:2111.12995v1 [cs.LG])
- Generalized Normalizing Flows via Markov Chains. (arXiv:2111.12506v3 [cs.LG] UPDATED)
- Global Output Feedback Stabilization of Semilinear Reaction-Diffusion PDEs. (arXiv:2111.12649v1 [math.OC])
- An Expectation-Maximization Perspective on Federated Learning. (arXiv:2111.10192v1 [cs.LG])
- Kalman filters as the steady-state solution of gradient descent on variational free energy. (arXiv:2111.10530v1 [q-bio.NC])
- Ensemble-SINDy: Robust sparse model discovery in the low-data, high-noise limit, with active learning and control. (arXiv:2111.10992v1 [math.NA])
- Non-asymptotic and Accurate Learning of Nonlinear Dynamical Systems. (arXiv:2002.08538v2 [cs.LG] CROSS LISTED)
- Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting. (arXiv:2111.09982v4 [math.OC] UPDATED)
- Optimal control of PDEs using physics-informed neural networks. (arXiv:2111.09880v4 [math.OC] UPDATED)
- Gradient flows on graphons: existence, convergence, continuity equations. (arXiv:2111.09459v3 [math.PR] UPDATED)
- NeuralPDE: Modelling Dynamical Systems from Data. (arXiv:2111.07671v3 [cs.LG] UPDATED)
- Switching Recurrent Kalman Networks. (arXiv:2111.08291v1 [cs.LG])
- Solving Linear Algebra by Program Synthesis. (arXiv:2111.08171v1 [cs.LG])
- Solving Inverse Problems in Medical Imaging with Score-Based Generative Models. (arXiv:2111.08005v2 [eess.IV] UPDATED)
- Simulating Diffusion Bridges with Score Matching. (arXiv:2111.07243v2 [stat.CO] UPDATED)
- Theoretical Guarantees for the Statistical Finite Element Method. (arXiv:2111.07691v2 [math.NA] UPDATED)
- Data-Centric Engineering: integrating simulation, machine learning and statistics. Challenges and Opportunities. (arXiv:2111.06223v2 [cs.CE] UPDATED)
- Climate Modeling with Neural Diffusion Equations. (arXiv:2111.06011v1 [cs.LG])
- Kalman Filtering with Adversarial Corruptions. (arXiv:2111.06395v1 [stat.ML])
- Convergence and Stability of the Stochastic Proximal Point Algorithm with Momentum. (arXiv:2111.06171v5 [math.OC] UPDATED)
- Physics-enhanced deep surrogates for partial differential equations. (arXiv:2111.05841v4 [cs.LG] UPDATED)
- Gradients are Not All You Need. (arXiv:2111.05803v2 [cs.LG] UPDATED)
- Counterfactual Explanations for Models of Code. (arXiv:2111.05711v1 [cs.SE])
- A research framework for writing differentiable PDE discretizations in JAX. (arXiv:2111.05218v1 [cs.LG])
- Solving PDE-constrained Control Problems Using Operator Learning. (arXiv:2111.04941v3 [math.OC] UPDATED)
- Physics-Informed Neural Operator for Learning Partial Differential Equations. (arXiv:2111.03794v4 [cs.LG] UPDATED)
- Physics-Guided Generative Adversarial Networks for Sea Subsurface Temperature Prediction. (arXiv:2111.03064v1 [cs.LG])
- Learning Model Predictive Controllers for Real-Time Ride-Hailing Vehicle Relocation and Pricing Decisions. (arXiv:2111.03204v1 [cs.AI])
- MetaFEM: A Generic FEM Solver By Meta-expressions. (arXiv:2111.03541v2 [math.NA] UPDATED)
- Numerical Approximation in CFD Problems Using Physics Informed Machine Learning. (arXiv:2111.02987v1 [cs.LG])
- Variational Inference with Holder Bounds. (arXiv:2111.02947v2 [cs.LG] UPDATED)
- Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems. (arXiv:2111.02801v1 [cs.LG])
- Consensus-based Optimization and Ensemble Kalman Inversion for Global Optimization Problems with Constraints. (arXiv:2111.02970v1 [math.OC])
- Non-linear Gaussian smoothing with Taylor moment expansion. (arXiv:2110.01396v2 [math.NA] UPDATED)
- Accelerated replica exchange stochastic gradient Langevin diffusion enhanced Bayesian DeepONet for solving noisy parametric PDEs. (arXiv:2111.02484v1 [math.NA])
- Global Controllability for General Nonlinear Systems. (arXiv:2111.02645v1 [math.OC])
- Direct data-driven control of LTV systems. (arXiv:2111.02342v3 [math.OC] UPDATED)
- Finite element analysis of the Dirichlet boundary control problem governed by linear parabolic equation. (arXiv:2111.02039v1 [math.NA])
- Large-Scale Deep Learning Optimizations: A Comprehensive Survey. (arXiv:2111.00856v2 [cs.LG] UPDATED)
- Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics. (arXiv:2111.01365v2 [cs.LG] UPDATED)
- Deep neural networks as nested dynamical systems. (arXiv:2111.01297v1 [cs.LG])
- Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems. (arXiv:2111.01256v1 [cs.LG])
- Verifying Contracts for Perturbed Control Systems using Linear Programming. (arXiv:2111.01259v1 [eess.SY])
- Continuous Convolutional Neural Networks: Coupled Neural PDE and ODE. (arXiv:2111.00343v1 [cs.LG])
- Non-reversible processes: GENERIC, Hypocoercivity and fluctuations. (arXiv:2111.00286v2 [math.PR] UPDATED)
- Global Optimization via Schr{\"o}dinger-F{\"o}llmer Diffusion. (arXiv:2111.00402v6 [math.OC] UPDATED)
- A Dynamic Programming Formulation for the Nonlinear Filter. (arXiv:2111.00109v1 [math.OC])
- Scalable Inference in SDEs by Direct Matching of the Fokker-Planck-Kolmogorov Equation. (arXiv:2110.15739v1 [cs.LG])
- A Scalable Inference Method For Large Dynamic Economic Systems. (arXiv:2110.14346v1 [econ.EM])
- Deeptime: a Python library for machine learning dynamical models from time series data. (arXiv:2110.15013v2 [math.DS] UPDATED)
- Understanding How Encoder-Decoder Architectures Attend. (arXiv:2110.15253v1 [cs.LG])
- Stable Anderson Acceleration for Deep Learning. (arXiv:2110.14813v1 [cs.LG])
- Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems. (arXiv:2110.14296v2 [cs.LG] UPDATED)
- Towards a Theory of Evolution as Multilevel Learning. (arXiv:2110.14602v1 [q-bio.PE])
- Dynamics of Stochastic Momentum Methods on Large-scale, Quadratic Models. (arXiv:2106.03696v2 [math.OC] UPDATED)
- Safe Pontryagin Differentiable Programming. (arXiv:2105.14937v2 [cs.LG] UPDATED)
- Physics informed machine learning with Smoothed Particle Hydrodynamics: Hierarchy of reduced Lagrangian models of turbulence. (arXiv:2110.13311v7 [physics.flu-dyn] UPDATED)
- Fast PDE-constrained optimization via self-supervised operator learning. (arXiv:2110.13297v1 [cs.LG])
- Generative Flows as a General Purpose Solution for Inverse Problems. (arXiv:2110.13285v3 [cs.CV] UPDATED)
- Relay Variational Inference: A Method for Accelerated Encoderless VI. (arXiv:2110.13422v2 [cs.LG] UPDATED)
- Towards Realistic Market Simulations: a Generative Adversarial Networks Approach. (arXiv:2110.13287v1 [cs.AI])
- Online Variational Filtering and Parameter Learning. (arXiv:2110.13549v2 [stat.ML] UPDATED)
- Likelihood Training of Schr\"odinger Bridge using Forward-Backward SDEs Theory. (arXiv:2110.11291v5 [stat.ML] UPDATED)
- Deterministic particle flows for constraining SDEs. (arXiv:2110.13020v3 [cond-mat.stat-mech] UPDATED)
- Deep Learning Approximation of Diffeomorphisms via Linear-Control Systems. (arXiv:2110.12393v2 [math.OC] UPDATED)
- On Seven Fundamental Optimization Challenges in Machine Learning. (arXiv:2110.12281v1 [math.OC])
- pystorms: A simulation sandbox for the development and evaluation of stormwater control algorithms. (arXiv:2110.12289v1 [eess.SY])
- Neural Flows: Efficient Alternative to Neural ODEs. (arXiv:2110.13040v1 [cs.LG])
- Conditioning of Random Feature Matrices: Double Descent and Generalization Error. (arXiv:2110.11477v2 [stat.ML] UPDATED)
- Differentiability with respect to the initial condition for Hamilton-Jacobi equations. (arXiv:2110.11845v3 [math.OC] UPDATED)
- The convergence analysis of an accelerated iteration for solving algebraic Riccati equations. (arXiv:2110.11706v1 [math.OC])
- Pick-and-Mix Information Operators for Probabilistic ODE Solvers. (arXiv:2110.10770v1 [stat.ML])
- Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty. (arXiv:2110.08607v3 [cs.LG] UPDATED)
- Learning the Koopman Eigendecomposition: A Diffeomorphic Approach. (arXiv:2110.07786v2 [cs.LG] UPDATED)
- Learning disentangled representation for classical models. (arXiv:2110.08082v2 [cond-mat.str-el] UPDATED)
- Learning Mean-Field Equations from Particle Data Using WSINDy. (arXiv:2110.07756v1 [stat.ML])
- Diffusion Normalizing Flow. (arXiv:2110.07579v1 [cs.LG])
- Weak and Strong Convergence of Generalized Proximal Point Algorithms with Relaxed Parameters. (arXiv:2110.07015v2 [math.OC] UPDATED)
- Learning Stable Koopman Embeddings. (arXiv:2110.06509v1 [cs.LG])
- Randomized Extended Kaczmarz is a Limit Point of Sketch-and-Project. (arXiv:2110.05605v2 [math.NA] UPDATED)
- Heavy Ball Neural Ordinary Differential Equations. (arXiv:2110.04840v1 [cs.LG])
- A Proximal Algorithm for Sampling from Non-smooth Potentials. (arXiv:2110.04597v2 [cs.LG] UPDATED)
- Statistical Regeneration Guarantees of the Wasserstein Autoencoder with Latent Space Consistency. (arXiv:2110.03995v1 [stat.ML])
- De-randomizing MCMC dynamics with the diffusion Stein operator. (arXiv:2110.03768v1 [stat.ML])
- Robustness to Incorrect Priors and Controlled Filter Stability in Partially Observed Stochastic Control. (arXiv:2110.03720v1 [math.OC])
- Stochastic Online Optimization using Kalman Recursion
- Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations
- Evaluating model-based planning and planner amortization for continuous control. (arXiv:2110.03363v1 [cs.RO])
- On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications. (arXiv:2110.03128v2 [cs.LG] UPDATED)
- Generative Modeling with Optimal Transport Maps. (arXiv:2110.02999v2 [cs.LG] UPDATED)
- From Contraction Theory to Fixed Point Algorithms on Riemannian and Non-Euclidean Spaces. (arXiv:2110.03623v1 [math.OC])
- Iterate Averaging, the Kalman Filter, and 3DVAR for Linear Inverse Problem. (arXiv:2110.03045v2 [math.NA] UPDATED)
- Three Operator Splitting with Subgradients, Stochastic Gradients, and Adaptive Learning Rates. (arXiv:2110.03274v2 [math.OC] UPDATED)
- An optimal control approach to particle filtering. (arXiv:2110.03199v1 [math.OC])
- On gradient flows initialized near maxima. (arXiv:2110.03035v1 [math.OC])
- Coarsening Optimization for Differentiable Programming. (arXiv:2110.02307v1 [cs.PL])
- Randomized Nystr\"om Preconditioning. (arXiv:2110.02820v2 [math.NA] UPDATED)
- Autoregressive Diffusion Models. (arXiv:2110.02037v2 [cs.LG] UPDATED)
- When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint. (arXiv:2110.01827v1 [cs.LG])
- Global Convergence and Stability of Stochastic Gradient Descent. (arXiv:2110.01663v3 [cs.LG] UPDATED)
- From Control to Mathematics-Part II: Observability-Based Design for Iterative Methods in Solving Linear Equations. (arXiv:2110.01203v1 [eess.SY])
- Dynamic-Programming-Based Failure-Tolerant Control for Satellite with Thrusters in 6-DOF Motion. (arXiv:2110.00783v1 [eess.SY])
- NeuFENet: Neural Finite Element Solutions with Theoretical Bounds for Parametric PDEs. (arXiv:2110.01601v2 [cs.LG] UPDATED)
- Differentiable Spline Approximations. (arXiv:2110.01532v1 [cs.LG])
- Implicit Riemannian Concave Potential Maps. (arXiv:2110.01288v1 [stat.ML])
- Contraction Theory for Nonlinear Stability Analysis and Learning-based Control: A Tutorial Overview. (arXiv:2110.00675v2 [cs.LG] UPDATED)
- Induction, Popper, and machine learning. (arXiv:2110.00840v1 [cs.AI])
- A Lagged Particle Filter for Stable Filtering of certain High-Dimensional State-Space Models. (arXiv:2110.00884v2 [stat.CO] UPDATED)
- A survey on active noise control techniques -- Part I: Linear systems. (arXiv:2110.00531v1 [eess.SP])
- Inverse airfoil design method for generating varieties of smooth airfoils using conditional WGAN-gp. (arXiv:2110.00212v1 [cs.LG])
- Empirical measures and random walks on compact spaces in the quadratic Wasserstein metric. (arXiv:2110.00295v2 [math.PR] UPDATED)
- Finite-time and Fixed-time Convergence in Continuous-time Optimization. (arXiv:2109.15064v1 [math.OC])
- Optimal control of the Fokker-Planck equation under state constraints in the Wasserstein space. (arXiv:2109.14978v4 [math.OC] UPDATED)
- Second-Order Neural ODE Optimizer. (arXiv:2109.14158v2 [cs.LG] UPDATED)
- Variational Inference for Continuous-Time Switching Dynamical Systems. (arXiv:2109.14492v1 [cs.LG])
- Learning to Superoptimize Real-world Programs. (arXiv:2109.13498v2 [cs.LG] UPDATED)
- A unified differential equation solver approach for separable convex optimization: splitting, acceleration and nonergodic rate. (arXiv:2109.13467v2 [math.OC] UPDATED)
- Minimax Mixing Time of the Metropolis-Adjusted Langevin Algorithm for Log-Concave Sampling. (arXiv:2109.13055v2 [stat.ML] UPDATED)
- AdaInject: Injection Based Adaptive Gradient Descent Optimizers for Convolutional Neural Networks. (arXiv:2109.12504v2 [cs.LG] UPDATED)
- The Mirror Langevin Algorithm Converges with Vanishing Bias. (arXiv:2109.12077v2 [cs.DS] UPDATED)
- Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise. (arXiv:2109.11669v2 [math.PR] UPDATED)
- Modeling calcium dynamics in neurons with endoplasmic reticulum: existence, uniqueness and an implicit-explicit finite element scheme. (arXiv:2109.11673v3 [math.NA] UPDATED)
- Stochastic Normalizing Flows for Inverse Problems: a Markov Chains Viewpoint. (arXiv:2109.11375v4 [cs.LG] UPDATED)
- Improved variants of the Hutch++ algorithm for trace estimation. (arXiv:2109.10659v3 [math.NA] UPDATED)
- Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics. (arXiv:2109.09833v1 [cs.LG])
- Generalized Optimization: A First Step Towards Category Theoretic Learning Theory. (arXiv:2109.10262v1 [math.OC])
- Differentiable Physics: A Position Piece. (arXiv:2109.07573v1 [cs.LG])
- Reverse Differentiation via Predictive Coding. (arXiv:2103.04689v4 [cs.LG] UPDATED)
- Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data. (arXiv:2109.07117v7 [cs.LG] UPDATED)
- Deep Denerative Models for Drug Design and Response. (arXiv:2109.06469v1 [cs.LG])
- Physics-based Deep Learning. (arXiv:2109.05237v3 [cs.LG] UPDATED)
- Sqrt(d) Dimension Dependence of Langevin Monte Carlo. (arXiv:2109.03839v3 [cs.LG] UPDATED)
- Learning-based Moving Horizon Estimation through Differentiable Convex Optimization Layers. (arXiv:2109.03962v3 [eess.SY] UPDATED)
- Inverse Optimization: Theory and Applications. (arXiv:2109.03920v2 [math.OC] UPDATED)
- Analysis of Infinite Stiffness Using PID Controller. (arXiv:2109.03281v1 [eess.SY])
- Adaptive variational Bayes: Optimality, computation and applications. (arXiv:2109.03204v3 [math.ST] UPDATED)
- Finite Element Representations of Gaussian Processes: Balancing Numerical and Statistical Accuracy. (arXiv:2109.02777v2 [stat.CO] UPDATED)
- Narratives in economics. (arXiv:2109.02331v2 [econ.GN] UPDATED)
- Discrete-Time Linear-Quadratic Regulation via Optimal Transport. (arXiv:2109.02347v1 [math.OC])
- A generalized forward-backward splitting operator: Degenerate analysis and applications. (arXiv:2109.02064v2 [math.OC] UPDATED)
- On Faster Convergence of Scaled Sign Gradient Descent. (arXiv:2109.01806v1 [math.OC])
- Unbiased Estimation of the Hessian for Partially Observed Diffusions. (arXiv:2109.02371v1 [stat.ME])
- Optimization and Sampling Under Continuous Symmetry: Examples and Lie Theory. (arXiv:2109.01080v1 [cs.DS])
- Controlled Measure-Valued Martingales: a Viscosity Solution Approach. (arXiv:2109.00064v3 [math.PR] UPDATED)
- Deep $\mathcal{L}^1$ Stochastic Optimal Control Policies for Planetary Soft-landing. (arXiv:2109.00183v1 [eess.SY])
- The emergence of a concept in shallow neural networks. (arXiv:2109.00454v1 [cond-mat.dis-nn])
- Abstract strongly convergent variants of the proximal point algorithm. (arXiv:2108.13994v4 [math.OC] UPDATED)
- Learning to Synthesize Programs as Interpretable and Generalizable Policies. (arXiv:2108.13643v4 [cs.LG] UPDATED)
- Designing Rotationally Invariant Neural Networks from PDEs and Variational Methods. (arXiv:2108.13993v2 [cs.LG] UPDATED)
- A manifold learning perspective on representation learning: Learning decoder and representations without an encoder. (arXiv:2108.13910v2 [cs.LG] UPDATED)
- A New Approach to Multilinear Dynamical Systems and Control. (arXiv:2108.13583v1 [cs.LG])
- Uncertainty in Mechanism Design. (arXiv:2108.12633v1 [econ.TH])
- Convergence Rates for Learning Linear Operators from Noisy Data. (arXiv:2108.12515v3 [math.ST] UPDATED)
- Stochastic Uncertainty Propagation in Power System Dynamics using Measure-valued Proximal Recursions. (arXiv:2108.13405v2 [math.OC] UPDATED)
- SINDy with Control: A Tutorial. (arXiv:2108.13404v1 [math.OC])
- Online Stochastic Optimization for Unknown Linear Systems: Data-Driven Synthesis and Controller Analysis. (arXiv:2108.13040v1 [math.OC])
- Stochastic Approximation with Discontinuous Dynamics, Differential Inclusions, and Applications. (arXiv:2108.12652v1 [math.PR])
- Active manifolds, stratifications, and convergence to local minima in nonsmooth optimization. (arXiv:2108.11832v2 [math.OC] UPDATED)
- Disentangled Generative Models for Robust Prediction of System Dynamics. (arXiv:2108.11684v3 [cs.LG] UPDATED)
- Adaptive Control of Differentially Private Linear Quadratic Systems. (arXiv:2108.11563v1 [cs.LG])
- Active Inference for Stochastic Control. (arXiv:2108.12245v1 [cs.LG])
- Disentangled Generative Models for Robust Prediction of System Dynamics. (arXiv:2108.11684v3 [cs.LG] UPDATED)
- The Bregman proximal average. (arXiv:2108.11440v2 [math.OC] UPDATED)
- Physics-Based Causal Lifting Linearization of Nonlinear Control Systems Underpinned by the Koopman Operator. (arXiv:2108.10980v1 [eess.SY])
- A Historical Perspective of Adaptive Control and Learning. (arXiv:2108.11336v2 [math.OC] UPDATED)
- Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control. (arXiv:2108.10315v2 [math.OC] UPDATED)
- Conditional sequential Monte Carlo in high dimensions. (arXiv:2108.10277v1 [stat.CO])
- Beyond Linear Algebra. (arXiv:2108.09494v1 [math.AG])
- Introduction to Finite Element Methods. (arXiv:1709.08618v2 [math.NA] UPDATED)
- $\mathcal{L}_1$ Adaptive Control with Switched Reference Models: Application to Learn-to-Fly. (arXiv:2108.08462v2 [eess.SY] UPDATED)
- Mechanisms for the emergence of Gaussian correlations. (arXiv:2108.07829v2 [quant-ph] UPDATED)
- Schr\"{o}dinger PCA: On the Duality between Principal Component Analysis and Schr\"{o}dinger Equation. (arXiv:2006.04379v2 [physics.comp-ph] UPDATED)
- Existence, uniqueness, and convergence rates for gradient flows in the training of artificial neural networks with ReLU activation. (arXiv:2108.08106v1 [cs.LG])
- Moser Flow: Divergence-based Generative Modeling on Manifolds. (arXiv:2108.08052v2 [stat.ML] UPDATED)
- Geometry-informed irreversible perturbations for accelerated convergence of Langevin dynamics. (arXiv:2108.08247v2 [stat.ME] UPDATED)
- Quantitative Uniform Stability of the Iterative Proportional Fitting Procedure. (arXiv:2108.08129v2 [stat.ML] UPDATED)
- Finite-data error bounds for Koopman-based prediction and control. (arXiv:2108.07102v2 [math.OC] UPDATED)
- Huygens' equations and the gradient-flow equations in information geometry. (arXiv:2105.12824v6 [cs.IT] UPDATED)
- An Operator Splitting View of Federated Learning. (arXiv:2108.05974v1 [cs.LG])
- Ergodic properties of some Markov chains models in random environments. (arXiv:2108.06211v1 [math.PR])
- Optimal Regulators in Geometric Robotics. (arXiv:2108.06022v1 [math.OC])
- Bayesian Inference using the Proximal Mapping: Uncertainty Quantification under Varying Dimensionality. (arXiv:2108.04851v3 [stat.ME] UPDATED)
- Multilevel Estimation of Normalization Constants Using the Ensemble Kalman-Bucy Filter. (arXiv:2108.03935v2 [math.NA] UPDATED)
- On the Hyperparameters in Stochastic Gradient Descent with Momentum. (arXiv:2108.03947v2 [cs.LG] UPDATED)
- Advances in Trajectory Optimization for Space Vehicle Control. (arXiv:2108.02335v2 [math.OC] UPDATED)
- Differentiable Moving Horizon Estimation for Robust Flight Control. (arXiv:2108.03212v10 [cs.RO] UPDATED)
- Resolvent Splitting for Sums of Monotone Operators with Minimal Lifting. (arXiv:2108.02897v2 [math.OC] UPDATED)
- Compositional Abstraction Error and a Category of Causal Models. (arXiv:2103.15758v2 [stat.ML] UPDATED)
- Asymptotic bias of inexact Markov Chain Monte Carlo methods in high dimension. (arXiv:2108.00682v2 [math.PR] UPDATED)
- A Unified Convergence Analysis of First Order Convex Optimization Methods via Strong Lyapunov Functions. (arXiv:2108.00132v1 [math.OC])
- Connections between Numerical Algorithms for PDEs and Neural Networks. (arXiv:2107.14742v2 [math.NA] UPDATED)
- Continuous time limit of the stochastic ensemble Kalman inversion: Strong convergence analysis. (arXiv:2107.14508v1 [math.NA])
- Uniform minorization condition and convergence bounds for discretizations of kinetic Langevin dynamics. (arXiv:2107.14542v3 [math.PR] UPDATED)
- Pixyz: a Python library for developing deep generative models. (arXiv:2107.13109v3 [cs.LG] UPDATED)
- Numerical wave propagation aided by deep learning. (arXiv:2107.13184v2 [math.NA] UPDATED)
- Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel. (arXiv:2107.12723v2 [stat.ML] UPDATED)
- Inverse and Quanto Inverse Options in a Black-Scholes World. (arXiv:2107.12041v5 [q-fin.PR] UPDATED)
- Kolmogorov equations on spaces of measures associated to nonlinear filtering processes. (arXiv:2107.11865v2 [math.PR] UPDATED)
- Signature asymptotics, empirical processes, and optimal transport. (arXiv:2107.11203v4 [math.PR] UPDATED)
- Optimization on manifolds: A symplectic approach. (arXiv:2107.11231v2 [cond-mat.stat-mech] UPDATED)
- Information-Theoretic Generalization Bounds for Stochastic Gradient Descent. (arXiv:2102.00931v3 [cs.LG] UPDATED)
- Differentiable Annealed Importance Sampling and the Perils of Gradient Noise. (arXiv:2107.10211v2 [stat.ML] UPDATED)
- Disentanglement via Mechanism Sparsity Regularization: A New Principle for Nonlinear ICA. (arXiv:2107.10098v3 [stat.ML] UPDATED)
- Interpreting diffusion score matching using normalizing flow. (arXiv:2107.10072v1 [cs.LG])
- On the accept-reject mechanism for Metropolis-Hastings algorithms. (arXiv:2011.04493v2 [math.ST] UPDATED)
- On some information-theoretic aspects of non-linear statistical inverse problems. (arXiv:2107.09488v1 [math.ST])
- An induction proof of the backpropagation algorithm in matrix notation. (arXiv:2107.09384v1 [stat.ML])
- Wave-Informed Matrix Factorization with Global Optimality Guarantees. (arXiv:2107.09144v2 [cs.LG] UPDATED)
- An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients. (arXiv:2107.09359v1 [cs.LG])
- The Limiting Dynamics of SGD: Modified Loss, Phase Space Oscillations, and Anomalous Diffusion. (arXiv:2107.09133v4 [cs.LG] UPDATED)
- Linear quadratic mean field games: Decentralized $O(1/N)$-Nash equilibria. (arXiv:2107.09168v2 [math.OC] UPDATED)
- Active operator inference for learning low-dimensional dynamical-system models from noisy data. (arXiv:2107.09256v2 [cs.LG] UPDATED)
- Path Length Bounds for Gradient Descent and Flow. (arXiv:1908.01089v4 [cs.LG] CROSS LISTED)
- Equivariant Manifold Flows. (arXiv:2107.08596v2 [stat.ML] UPDATED)
- A Topological Perspective on Causal Inference. (arXiv:2107.08558v3 [cs.AI] UPDATED)
- Inverse Problem of Nonlinear Schr\"odinger Equation as Learning of Convolutional Neural Network. (arXiv:2107.08593v1 [math.NA])
- A Measure Theoretical Approach to the Mean-field Maximum Principle for Training NeurODEs. (arXiv:2107.08707v2 [math.OC] UPDATED)
- Improved Learning Rates for Stochastic Optimization: Two Theoretical Viewpoints. (arXiv:2107.08686v2 [cs.LG] UPDATED)
- On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems. (arXiv:2107.08225v3 [math.OC] UPDATED)
- PDE-constrained shape optimization: towards product shape spaces and stochastic models. (arXiv:2107.07744v1 [math.OC])
- On universal approximation and error bounds for Fourier Neural Operators. (arXiv:2107.07562v1 [math.NA])
- Adaptive first-order methods revisited: Convex optimization without Lipschitz requirements. (arXiv:2107.08011v1 [math.OC])
- Machine learning of Kondo physics using variational autoencoders and symbolic regression. (arXiv:2107.08013v2 [cond-mat.str-el] UPDATED)
- Systematic human learning and generalization from a brief tutorial with explanatory feedback. (arXiv:2107.06994v2 [cs.LG] UPDATED)
- Connections Between Finite Difference and Finite Element Approximations. (arXiv:2107.06965v1 [math.NA])
- Rough McKean-Vlasov dynamics for robust ensemble Kalman filtering. (arXiv:2107.06621v2 [math.PR] UPDATED)
- Continuous vs. Discrete Optimization of Deep Neural Networks. (arXiv:2107.06608v3 [cs.LG] UPDATED)
- Geometry and Generalization: Eigenvalues as predictors of where a network will fail to generalize. (arXiv:2107.06386v1 [cs.LG])
- Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability. (arXiv:2107.06277v1 [cs.LG])
- Efficient and Modular Implicit Differentiation. (arXiv:2105.15183v5 [cs.LG] UPDATED)
- Convergence Analysis of Schr{\"o}dinger-F{\"o}llmer Sampler without Convexity. (arXiv:2107.04766v1 [stat.CO])
- Convergence analysis for gradient flows in the training of artificial neural networks with ReLU activation. (arXiv:2107.04479v1 [cs.LG])
- Generalization by design: Shortcuts to Generalization in Deep Learning. (arXiv:2107.02253v1 [cs.LG])
- Physics-informed regularization and structure preservation for learning stable reduced models from data with operator inference. (arXiv:2107.02597v1 [math.NA])
- Numerical Matrix Decomposition. (arXiv:2107.02579v6 [math.HO] UPDATED)
- Learning ODEs via Diffeomorphisms for Fast and Robust Integration. (arXiv:2107.01650v1 [cs.LG])
- Physics-Guided Deep Learning for Dynamical Systems: A Survey. (arXiv:2107.01272v6 [cs.LG] UPDATED)
- A Differentiable Solver Approach to Operator Inference. (arXiv:2107.02093v1 [math.NA])
- A theoretical analysis of one-dimensional discrete generation ensemble Kalman particle filters. (arXiv:2107.01855v1 [math.PR])
- Improving black-box optimization in VAE latent space using decoder uncertainty. (arXiv:2107.00096v1 [cs.LG])
- Variational Diffusion Models. (arXiv:2107.00630v6 [cs.LG] UPDATED)
- Stochastic Gradient Descent-Ascent and Consensus Optimization for Smooth Games: Convergence Analysis under Expected Co-coercivity. (arXiv:2107.00052v2 [cs.LG] UPDATED)
- Does Collective Genetic Regulation exist?. (arXiv:2106.15732v1 [q-bio.MN])
- Challenges and Opportunities in High-dimensional Variational Inference. (arXiv:2103.01085v2 [cs.LG] UPDATED)
- Revisiting the Effects of Stochasticity for Hamiltonian Samplers. (arXiv:2106.16200v2 [cs.LG] UPDATED)
- Monte Carlo Variational Auto-Encoders. (arXiv:2106.15921v1 [stat.ML])
- Diffusion Priors In Variational Autoencoders. (arXiv:2106.15671v1 [cs.LG])
- A Mechanism for Producing Aligned Latent Spaces with Autoencoders. (arXiv:2106.15456v1 [cs.LG])
- Bitcoin, Currencies, and Fragility. (arXiv:2106.14204v2 [econ.GN] UPDATED)
- The Convergence Rate of SGD's Final Iterate: Analysis on Dimension Dependence. (arXiv:2106.14588v1 [math.OC])
- Parameter Estimation for the McKean-Vlasov Stochastic Differential Equation. (arXiv:2106.13751v3 [math.ST] UPDATED)
- Black Box Probabilistic Numerics. (arXiv:2106.13718v2 [math.NA] UPDATED)
- Hessian informed mirror descent. (arXiv:2106.13477v1 [math.OC])
- Optimal Control, Numerics, and Applications of Fractional PDEs. (arXiv:2106.13289v1 [math.OC])
- Sparse Flows: Pruning Continuous-depth Models. (arXiv:2106.12718v2 [cs.LG] UPDATED)
- Understanding Modern Techniques in Optimization: Frank-Wolfe, Nesterov's Momentum, and Polyak's Momentum. (arXiv:2106.12923v1 [math.OC])
- Exploring the Representational Power of Graph Autoencoder. (arXiv:2106.12005v1 [cs.LG])
- From Bachelier to Dupire via Optimal Transport. (arXiv:2106.12395v1 [q-fin.MF])
- Sampling with Mirrored Stein Operators. (arXiv:2106.12506v3 [stat.ML] UPDATED)
- Sparsistent Model Discovery. (arXiv:2106.11936v2 [stat.ML] UPDATED)
- Choice of Damping Coefficient in Langevin Dynamics. (arXiv:2106.11597v1 [stat.CO])
- Constrained Ensemble Langevin Monte Carlo. (arXiv:2102.04279v4 [stat.ML] UPDATED)
- Deep Generative Learning via Schr\"{o}dinger Bridge. (arXiv:2106.10410v2 [cs.LG] UPDATED)
- Nested Variational Inference. (arXiv:2106.11302v1 [stat.ML])
- Schr{\"o}dinger-F{\"o}llmer Sampler: Sampling without Ergodicity. (arXiv:2106.10880v3 [stat.CO] UPDATED)
- Rough stochastic differential equations. (arXiv:2106.10340v4 [math.PR] UPDATED)
- A Geometric Structure of Acceleration and Its Role in Making Gradients Small Fast. (arXiv:2106.10439v3 [math.OC] UPDATED)
- Riemannian Convex Potential Maps. (arXiv:2106.10272v1 [cs.LG])
- Deterministic Gibbs Sampling via Ordinary Differential Equations. (arXiv:2106.10188v1 [stat.CO])
- ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models. (arXiv:2106.10121v1 [cs.LG])
- Stochastic Bias-Reduced Gradient Methods. (arXiv:2106.09481v2 [math.OC] UPDATED)
- Fr\'{e}chet derivatives of expected functionals of solutions to stochastic differential equations. (arXiv:2106.09149v1 [math.PR])
- Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent. (arXiv:2106.08502v3 [math.OC] UPDATED)
- Multi-Resolution Continuous Normalizing Flows. (arXiv:2106.08462v5 [cs.CV] UPDATED)
- Towards Optimally Weighted Physics-Informed Neural Networks in Ocean Modelling. (arXiv:2106.08747v1 [cs.AI])
- A Feynman-Kac Type Theorem for ODEs: Solutions of Second Order ODEs as Modes of Diffusions. (arXiv:2106.08525v2 [math.CA] UPDATED)
- Optimization-friendly generic mechanisms without money. (arXiv:2106.07752v1 [cs.GT])
- A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip. (arXiv:2106.07644v2 [math.OC] UPDATED)
- Solving PDEs on Unknown Manifolds with Machine Learning. (arXiv:2106.06682v3 [math.NA] UPDATED)
- PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior. (arXiv:2106.06406v2 [stat.ML] UPDATED)
- Polynomial propagation of moments in stochastic differential equations. (arXiv:2106.06271v1 [math.NA])
- Asymptotic Properties of Monte Carlo Methods in Elliptic PDE-Constrained Optimization under Uncertainty. (arXiv:2106.06347v1 [math.OC])
- Differentiable Robust LQR Layers. (arXiv:2106.05535v1 [cs.RO])
- Score-based Generative Modeling in Latent Space. (arXiv:2106.05931v3 [stat.ML] UPDATED)
- Information Geometry of Reversible Markov Chains. (arXiv:2106.05669v2 [math.ST] UPDATED)
- Pulling back information geometry. (arXiv:2106.05367v2 [cs.LG] UPDATED)
- Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincar\'e Recurrence. (arXiv:2106.04748v2 [cs.LG] UPDATED)
- Nonlinear Filtering of Partially Observed Systems arising in Singular Stochastic Optimal Control. (arXiv:2106.04635v1 [math.OC])
- Bayesian Bellman Operators. (arXiv:2106.05012v3 [cs.LG] UPDATED)
- Expectation Programming: Adapting Probabilistic Programming Systems to Estimate Expectations Efficiently. (arXiv:2106.04953v2 [cs.LG] UPDATED)
- Fully differentiable model discovery. (arXiv:2106.04886v2 [stat.ML] UPDATED)
- General-order observation-driven models: ergodicity and consistency of the maximum likelihood estimator. (arXiv:2106.05201v1 [math.ST])
- Incorporating NODE with Pre-trained Neural Differential Operator for Learning Dynamics. (arXiv:2106.04166v3 [cs.LG] UPDATED)
- Differentiable Multiple Shooting Layers. (arXiv:2106.03885v1 [cs.LG])
- Approximation to stochastic variance reduced gradient Langevin dynamics by stochastic delay differential equations. (arXiv:2106.04357v2 [math.PR] UPDATED)
- Nonsmooth Implicit Differentiation for Machine Learning and Optimization. (arXiv:2106.04350v2 [cs.LG] UPDATED)
- A Variational Perspective on Diffusion-Based Generative Models and Score Matching. (arXiv:2106.02808v2 [cs.LG] UPDATED)
- Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation. (arXiv:2106.03273v1 [cs.LG])
- Ensemble Markov chain Monte Carlo with teleporting walkers. (arXiv:2106.02686v1 [stat.CO])
- Nonconvex Optimization via MM Algorithms: Convergence Theory. (arXiv:2106.02805v1 [math.OC])
- A Discrete Variational Derivation of Accelerated Methods in Optimization. (arXiv:2106.02700v3 [math.OC] UPDATED)
- Projective Splitting as a Warped Proximal Algorithm. (arXiv:2106.02661v1 [math.OC])
- Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms
- Optimizing Functionals on the Space of Probabilities with Input Convex Neural Networks. (arXiv:2106.00774v3 [stat.ML] UPDATED)
- Diffusion Schr\"odinger Bridge with Applications to Score-Based Generative Modeling. (arXiv:2106.01357v5 [stat.ML] UPDATED)
- On Fast Sampling of Diffusion Probabilistic Models. (arXiv:2106.00132v2 [cs.LG] UPDATED)
- The kernel perspective on dynamic mode decomposition. (arXiv:2106.00106v3 [math.FA] UPDATED)
- Generalized AdaGrad (G-AdaGrad) and Adam: A State-Space Perspective. (arXiv:2106.00092v2 [cs.LG] UPDATED)
- How could Neural Networks understand Programs?. (arXiv:2105.04297v2 [cs.PL] UPDATED)
- Diffusion-Based Representation Learning. (arXiv:2105.14257v3 [cs.LG] UPDATED)
- Variational Autoencoders: A Harmonic Perspective. (arXiv:2105.14866v4 [stat.ML] UPDATED)
- Efficient and Modular Implicit Differentiation. (arXiv:2105.15183v5 [cs.LG] UPDATED)
- TensorFlow RiemOpt: a library for optimization on Riemannian manifolds. (arXiv:2105.13921v2 [cs.MS] UPDATED)
- Geometry of Gene Regulatory Dynamics. (arXiv:2105.13722v1 [q-bio.QM])
- Efficient and Accurate Gradients for Neural SDEs. (arXiv:2105.13493v3 [cs.LG] UPDATED)
- Control and numerical approximation of fractional diffusion equations. (arXiv:2105.13671v2 [math.AP] UPDATED)
- On the Impossibility of Statistically Improving Empirical Optimization: A Second-Order Stochastic Dominance Perspective. (arXiv:2105.13419v1 [math.OC])
- Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks. (arXiv:2105.13937v2 [cs.LG] UPDATED)
- Operator Autoencoders: Learning Physical Operations on Encoded Molecular Graphs. (arXiv:2105.12295v1 [cs.LG])
- DiBS: Differentiable Bayesian Structure Learning. (arXiv:2105.11839v3 [cs.LG] UPDATED)
- Unbiased Estimation of the Gradient of the Log-Likelihood for a Class of Continuous-Time State-Space Models. (arXiv:2105.11522v2 [stat.ML] UPDATED)
- Gradient Descent in Materio. (arXiv:2105.11233v1 [cs.NE])
- Sampling error correction in ensemble Kalman inversion. (arXiv:2105.11341v1 [math.NA])
- Geometric variational inference. (arXiv:2105.10470v2 [stat.ME] UPDATED)
- Error Bounds of the Invariant Statistics in Machine Learning of Ergodic It\^o Diffusions. (arXiv:2105.10102v2 [cs.LG] UPDATED)
- Kernel Stein Discrepancy Descent. (arXiv:2105.09994v1 [stat.ML])
- Decomposing reverse-mode automatic differentiation. (arXiv:2105.09469v1 [cs.PL])
- Diffusion Approximations for Thompson Sampling. (arXiv:2105.09232v2 [cs.LG] UPDATED)
- Gradient Methods with Memory. (arXiv:2105.09241v1 [math.OC])
- A Contraction Theory Approach to Optimization Algorithms from Acceleration Flows. (arXiv:2105.08832v3 [math.OC] UPDATED)
- Learning stochastic dynamical systems with neural networks mimicking the Euler-Maruyama scheme. (arXiv:2105.08449v1 [cs.LG])
- Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics. (arXiv:2105.08164v2 [cs.LG] UPDATED)
- Parametrization invariant interpretation of priors and posteriors. (arXiv:2105.08304v2 [math.ST] UPDATED)
- Gradient Flows and Nonlinear Power Methods for the Computation of Nonlinear Eigenfunctions. (arXiv:2105.08405v2 [math.NA] UPDATED)
- Convergence Rates of Gradient Methods for Convex Optimization in the Space of Measures. (arXiv:2105.08368v3 [math.OC] UPDATED)
- On log-concave approximations of high-dimensional posterior measures and stability properties in non-linear inverse problems. (arXiv:2105.07835v2 [math.ST] UPDATED)
- Bayesian inference under model misspecification using transport-Lagrangian distances: an application to seismic inversion. (arXiv:2105.07027v1 [stat.ME])
- Efficient Stochastic Optimal Control through Approximate Bayesian Input Inference. (arXiv:2105.07693v2 [cs.LG] UPDATED)
- Adversarial Training for Gradient Descent: Analysis Through its Continuous-time Approximation. (arXiv:2105.08037v2 [cs.LG] UPDATED)
- On log-concave approximations of high-dimensional posterior measures and stability properties in non-linear inverse problems. (arXiv:2105.07835v2 [math.ST] UPDATED)
- Convergence and Implicit Bias of Gradient Flow on Overparametrized Linear Networks. (arXiv:2105.06351v2 [cs.LG] UPDATED)
- Good and Bad Optimization Models: Insights from Rockafellians. (arXiv:2105.06073v3 [math.OC] UPDATED)
- Leveraging Non-uniformity in First-order Non-convex Optimization. (arXiv:2105.06072v3 [cs.LG] UPDATED)
- A Langevinized Ensemble Kalman Filter for Large-Scale Static and Dynamic Learning. (arXiv:2105.05363v1 [stat.ME])
- Frank-Wolfe Methods in Probability Space. (arXiv:2105.05352v1 [stat.CO])
- On Unbiased Score Estimation for Partially Observed Diffusions. (arXiv:2105.04912v1 [stat.ME])
- Discovery of Nonlinear Dynamical Systems using a Runge-Kutta Inspired Dictionary-based Sparse Regression Approach. (arXiv:2105.04869v1 [cs.LG])
- Value Iteration in Continuous Actions, States and Time. (arXiv:2105.04682v1 [cs.LG])
- Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics. (arXiv:2105.03918v2 [cs.LG] UPDATED)
- A Small-Gain Theorem for Discrete-Time Convergent Systems and Its Applications. (arXiv:2105.02376v1 [eess.SY])
- Neural graphical modelling in continuous-time: consistency guarantees and algorithms. (arXiv:2105.02522v3 [stat.ML] UPDATED)
- A Unifying and Canonical Description of Measure-Preserving Diffusions. (arXiv:2105.02845v1 [math.PR])
- On Moment Matching for Stochastic Systems. (arXiv:2105.01680v1 [eess.SY])
- Regret-Optimal LQR Control. (arXiv:2105.01244v2 [math.OC] UPDATED)
- Variance Optimization and Control Regularity for Mean-Field Dynamics. (arXiv:2105.01158v2 [math.OC] UPDATED)
- Stochastic gradient descent with noise of machine learning type. Part I: Discrete time analysis. (arXiv:2105.01650v2 [stat.ML] UPDATED)
- On the stability of the stochastic gradient Langevin algorithm with dependent data stream. (arXiv:2105.01422v1 [math.PR])
- Hard Encoding of Physics for Learning Spatiotemporal Dynamics. (arXiv:2105.00557v1 [cs.LG])
- Data-driven discovery of Green's functions with human-understandable deep learning. (arXiv:2105.00266v2 [cs.LG] UPDATED)
- Mixing Time Guarantees for Unadjusted Hamiltonian Monte Carlo. (arXiv:2105.00887v1 [math.PR])
- The Wasserstein space of stochastic processes. (arXiv:2104.14245v2 [math.PR] UPDATED)
- A data-driven and model-based accelerated Hamiltonian Monte Carlo method for Bayesian elliptic inverse problems. (arXiv:2104.13070v1 [math.NA])
- Bringing Trimmed Serendipity Methods to Computational Practice in Firedrake. (arXiv:2104.12986v2 [math.NA] UPDATED)
- Bayesian Numerical Methods for Nonlinear Partial Differential Equations. (arXiv:2104.12587v2 [math.NA] UPDATED)
- Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations. (arXiv:2104.12384v2 [stat.ML] UPDATED)
- On convergence rates of adaptive ensemble Kalman inversion for linear ill-posed problems. (arXiv:2104.10895v5 [math.NA] UPDATED)
- Weighted $L^2$-contractivity of Langevin dynamics with singular potentials. (arXiv:2104.10574v2 [math.PR] UPDATED)
- Tuning symplectic integrators is easy and worthwhile. (arXiv:2104.10269v1 [physics.comp-ph])
- Accelerated Optimization on Riemannian Manifolds via Discrete Constrained Variational Integrators. (arXiv:2104.07176v2 [math.NA] UPDATED)
- The computational asymptotics of Gaussian variational inference and the Laplace approximation. (arXiv:2104.05886v3 [stat.CO] UPDATED)
- Weak topology and Opial property in Wasserstein spaces, with applications to Gradient Flows and Proximal Point Algorithms of geodesically convex functionals. (arXiv:2104.06121v1 [math.OC])
- Neural ODE control for classification, approximation and transport. (arXiv:2104.05278v1 [math.OC])
- Stochastic Gradient Descent on Nonconvex Functions with General Noise Models. (arXiv:2104.00423v1 [math.OC])
- Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise. (arXiv:2102.04297v4 [cs.LG] UPDATED)
- Robust Experimentation in the Continuous Time Bandit Problem. (arXiv:2104.00102v1 [econ.TH])
- Storchastic: A Framework for General Stochastic Automatic Differentiation. (arXiv:2104.00428v3 [stat.ML] UPDATED)
- Geometry of Program Synthesis. (arXiv:2103.16080v1 [cs.LG])
- On the Stability of Nonlinear Receding Horizon Control: A Geometric Perspective. (arXiv:2103.15010v3 [math.OC] UPDATED)
- Stiff Neural Ordinary Differential Equations. (arXiv:2103.15341v3 [math.NA] UPDATED)
- The Conditional Poincar\'e Inequality for Filter Stability. (arXiv:2103.14631v2 [math.PR] UPDATED)
- Why Do Local Methods Solve Nonconvex Problems?. (arXiv:2103.13462v1 [cs.LG])
- Minimization of magnetic forces on Stellarator coils. (arXiv:2103.13195v1 [math.OC])
- A High-order Tuner for Accelerated Learning and Control. (arXiv:2103.12868v1 [cs.LG])
- Learning to Optimize: A Primer and A Benchmark. (arXiv:2103.12828v2 [math.OC] UPDATED)
- Stochastic Optimal Control via Hilbert Space Embeddings of Distributions. (arXiv:2103.12759v1 [math.OC])
- Neural ODE Processes. (arXiv:2103.12413v2 [cs.LG] UPDATED)
- Optimization Algorithms as Robust Feedback Controllers. (arXiv:2103.11329v2 [math.OC] UPDATED)
- A Probabilistic State Space Model for Joint Inference from Differential Equations and Data. (arXiv:2103.10153v3 [stat.ML] UPDATED)
- A New Parameterized Family of Stochastic Particle Flow Filters. (arXiv:2103.09676v3 [eess.SP] UPDATED)
- Martingale Methods for Sequential Estimation of Convex Functionals and Divergences. (arXiv:2103.09267v4 [math.ST] UPDATED)
- Deep learning: a statistical viewpoint. (arXiv:2103.09177v1 [math.ST])
- A closed-form approximation for pricing geometric Istanbul options. (arXiv:2103.07440v1 [q-fin.PR])
- Differentiating densities on smooth manifolds. (arXiv:2103.07380v1 [math.NA])
- Proof that the Kalman gain minimizes the generalized variance. (arXiv:2103.07275v1 [eess.SY])
- Implicit energy regularization of neural ordinary-differential-equation control. (arXiv:2103.06525v1 [cs.LG])
- Advancing Trajectory Optimization with Approximate Inference: Exploration, Covariance Control and Adaptive Risk. (arXiv:2103.06319v1 [eess.SY])
- Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients. (arXiv:2009.13447v3 [cs.LG] UPDATED)
- Accelerated differential inclusion for convex optimization. (arXiv:2103.06629v4 [math.OC] UPDATED)
- Monotonic Alpha-divergence Minimisation for Variational Inference. (arXiv:2103.05684v4 [stat.CO] UPDATED)
- Unbiased approximation of posteriors via coupled particle Markov chain Monte Carlo. (arXiv:2103.05176v2 [stat.CO] UPDATED)
- Program Synthesis Over Noisy Data with Guarantees. (arXiv:2103.05030v4 [cs.PL] UPDATED)
- Stochastic gradient descent and fast relaxation to thermodynamic equilibrium: a stochastic control approach. (arXiv:2103.05096v1 [math.OC])
- The Proximity Operator of the Log-Sum Penalty. (arXiv:2103.02681v2 [math.OC] UPDATED)
- A unified formulation of splitting-based implicit time integration schemes. (arXiv:2103.00757v4 [math.NA] UPDATED)
- Categorical Foundations of Gradient-Based Learning. (arXiv:2103.01931v2 [cs.LG] UPDATED)
- Information-geometry of physics-informed statistical manifolds and its use in data assimilation. (arXiv:2103.01160v1 [math.ST])
- Moment-Based Variational Inference for Stochastic Differential Equations. (arXiv:2103.00988v1 [cs.LG])
- Rate of convergence for particle approximation of PDEs in Wasserstein space. (arXiv:2103.00837v3 [math.OC] UPDATED)
- Learning to Make Compiler Optimizations More Effective. (arXiv:2102.13514v1 [cs.PL])
- On the Generalization of Stochastic Gradient Descent with Momentum. (arXiv:2102.13653v2 [cs.LG] UPDATED)
- Moreau-Yosida $f$-divergences. (arXiv:2102.13416v2 [cs.LG] UPDATED)
- Stein Variational Gradient Descent: many-particle and long-time asymptotics. (arXiv:2102.12956v1 [stat.ML])
- On the Validity of Modeling SGD with Stochastic Differential Equations (SDEs). (arXiv:2102.12470v2 [cs.LG] UPDATED)
- On Unbiased Estimation for Discretized Models. (arXiv:2102.12230v1 [stat.CO])
- Differentiable Logic Machines. (arXiv:2102.11529v5 [cs.AI] UPDATED)
- Actor-Critic Method for High Dimensional Static Hamilton--Jacobi--Bellman Partial Differential Equations based on Neural Networks. (arXiv:2102.11379v2 [math.OC] UPDATED)
- Revisiting the Role of Euler Numerical Integration on Acceleration and Stability in Convex Optimization. (arXiv:2102.11537v1 [math.OC])
- Online Stochastic Gradient Descent Learns Linear Dynamical Systems from A Single Trajectory. (arXiv:2102.11822v1 [cs.LG])
- On Proximal Policy Optimization's Heavy-tailed Gradients. (arXiv:2102.10264v2 [cs.LG] UPDATED)
- Optimal Transportation Methods in Nonlinear Filtering: The feedback particle filter. (arXiv:2102.10712v1 [eess.SY])
- Generalized Bregman envelopes and proximity operators. (arXiv:2102.10730v3 [math.FA] UPDATED)
- Optimal Transport of Information. (arXiv:2102.10909v4 [econ.GN] UPDATED)
- AI-SARAH: Adaptive and Implicit Stochastic Recursive Gradient Methods. (arXiv:2102.09700v3 [cs.LG] UPDATED)
- Permutation-Based SGD: Is Random Optimal?. (arXiv:2102.09718v2 [cs.LG] UPDATED)
- LEAD: Min-Max Optimization from a Physical Perspective. (arXiv:2010.13846v4 [cs.LG] UPDATED)
- SVRG Meets AdaGrad: Painless Variance Reduction. (arXiv:2102.09645v2 [cs.LG] UPDATED)
- On Riemannian Stochastic Approximation Schemes with Fixed Step-Size. (arXiv:2102.07586v2 [stat.ML] UPDATED)
- A Differential Geometry Perspective on Orthogonal Recurrent Models. (arXiv:2102.09589v1 [cs.LG])
- When Are Solutions Connected in Deep Networks?. (arXiv:2102.09671v2 [cs.LG] UPDATED)
- A Variance Controlled Stochastic Method with Biased Estimation for Faster Non-convex Optimization. (arXiv:2102.09893v1 [cs.LG])
- Auction Type Resolution on Smart Derivatives. (arXiv:2102.10099v1 [q-fin.TR])
- Stein variational gradient descent on infinite-dimensional space and applications to statistical inverse problems. (arXiv:2102.09741v4 [math.NA] UPDATED)
- Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking. (arXiv:2102.09968v1 [cs.RO])
- Local Convergence of Adaptive Gradient Descent Optimizers. (arXiv:2102.09804v1 [cs.LG])
- On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent. (arXiv:2102.09769v1 [cs.LG])
- Differentiable Particle Filtering via Entropy-Regularized Optimal Transport. (arXiv:2102.07850v3 [stat.ML] UPDATED)
- On Riemannian Stochastic Approximation Schemes with Fixed Step-Size. (arXiv:2102.07586v2 [stat.ML] UPDATED)
- Barriers for recent methods in geodesic optimization. (arXiv:2102.06652v2 [cs.CC] UPDATED)
- Jacobian Determinant of Normalizing Flows. (arXiv:2102.06539v2 [cs.LG] UPDATED)
- Stability and Convergence of Stochastic Gradient Clipping: Beyond Lipschitz Continuity and Smoothness. (arXiv:2102.06489v2 [math.OC] UPDATED)
- Projected Wasserstein gradient descent for high-dimensional Bayesian inference. (arXiv:2102.06350v2 [cs.LG] UPDATED)
- Higher Order Generalization Error for First Order Discretization of Langevin Diffusion. (arXiv:2102.06229v1 [stat.ML])
- Convex Synthesis of Accelerated Gradient Algorithms. (arXiv:2102.06520v2 [math.OC] UPDATED)
- Continuous Time Analysis of Momentum Methods
- A Continuized View on Nesterov Acceleration. (arXiv:2102.06035v1 [cs.DC])
- Noisy Recurrent Neural Networks. (arXiv:2102.04877v3 [stat.ML] UPDATED)
- Proximal Gradient Descent-Ascent: Variable Convergence under K{\L} Geometry. (arXiv:2102.04653v2 [math.OC] UPDATED)
- Neural SDEs as Infinite-Dimensional GANs. (arXiv:2102.03657v2 [cs.LG] UPDATED)
- Primal dual methods for Wasserstein gradient flows. (arXiv:1901.08081v2 [math.NA] UPDATED)
- Analysis of the Optimization Landscape of Linear Quadratic Gaussian (LQG) Control. (arXiv:2102.04393v1 [math.OC])
- SGD in the Large: Average-case Analysis, Asymptotics, and Stepsize Criticality. (arXiv:2102.04396v1 [math.OC])
- Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise. (arXiv:2102.04297v4 [cs.LG] UPDATED)
- Koopman Operator Dynamical Models: Learning, Analysis and Control. (arXiv:2102.02522v2 [eess.SY] UPDATED)
- Algorithmic Instabilities of Accelerated Gradient Descent. (arXiv:2102.02167v2 [cs.LG] UPDATED)
- Exact Langevin Dynamics with Stochastic Gradients. (arXiv:2102.01691v1 [stat.ML])
- Iterated Kalman Methodology For Inverse Problems. (arXiv:2102.01580v5 [math.NA] UPDATED)
- A probabilistic Taylor expansion with Gaussian processes. (arXiv:2102.00877v2 [cs.LG] UPDATED)
- Asymptotic optimality in stochastic optimization
- Adaptivity without Compromise: A Momentumized, Adaptive, Dual Averaged Gradient Method for Stochastic Optimization. (arXiv:2101.11075v3 [cs.LG] UPDATED)
- On the Origin of Implicit Regularization in Stochastic Gradient Descent. (arXiv:2101.12176v1 [cs.LG])
- Generalized Doubly Reparameterized Gradient Estimators. (arXiv:2101.11046v2 [stat.ML] UPDATED)
- The Langevin Monte Carlo algorithm in the non-smooth log-concave case. (arXiv:2101.10695v3 [math.ST] UPDATED)
- A Concise Introduction to Control Theory for Stochastic Partial Differential Equations. (arXiv:2101.10678v1 [math.OC])
- Regret-Optimal Filtering for Prediction and Estimation. (arXiv:2101.10357v3 [math.OC] UPDATED)
- Online Adjoint Methods for Optimization of PDEs. (arXiv:2101.09621v4 [math.OC] UPDATED)
- Acceleration Methods. (arXiv:2101.09545v3 [math.OC] UPDATED)
- Nine Challenges in Modern Algorithmic Trading and Controls. (arXiv:2101.08813v1 [q-fin.TR])
- Differentiable Trust Region Layers for Deep Reinforcement Learning. (arXiv:2101.09207v2 [cs.LG] UPDATED)
- Neural networks-based algorithms for stochastic control and PDEs in finance. (arXiv:2101.08068v2 [math.OC] UPDATED)
- A short proof on the rate of convergence of the empirical measure for the Wasserstein distance. (arXiv:2101.08126v1 [math.ST])
- Automatic Differentiation via Effects and Handlers: An Implementation in Frank. (arXiv:2101.08095v1 [cs.PL])
- Multisymplectic Hamiltonian Variational Integrators. (arXiv:2101.07536v2 [math.NA] UPDATED)
- Unadjusted Langevin algorithm for non-convex weakly smooth potentials. (arXiv:2101.06369v3 [stat.CO] UPDATED)
- A Variational Formulation of Accelerated Optimization on Riemannian Manifolds. (arXiv:2101.06552v2 [math.OC] UPDATED)
- The Geometry of Deep Generative Image Models and its Applications. (arXiv:2101.06006v2 [cs.LG] UPDATED)
- Theoretical and numerical comparison of first-order algorithms for cocoercive equations and smooth convex optimization. (arXiv:2101.06152v4 [math.OC] UPDATED)
- Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch Acceleration. (arXiv:2101.05471v2 [cs.LG] UPDATED)
- Convex Smoothed Autoencoder-Optimal Transport model. (arXiv:2101.05679v1 [stat.ML])
- A Dimension-free Computational Upper-bound for Smooth Optimal Transport Estimation. (arXiv:2101.05380v4 [math.ST] UPDATED)
- From Control to Mathematics-Part I: Controllability-Based Design for Iterative Methods in Solving Linear Equations. (arXiv:2101.04345v1 [eess.SY])
- Differential Invariants. (arXiv:2101.03334v1 [math.NA])
- The shifted ODE method for underdamped Langevin MCMC. (arXiv:2101.03446v2 [math.NA] UPDATED)
- How to Train Your Energy-Based Models. (arXiv:2101.03288v2 [cs.LG] UPDATED)
- Time-Varying Optimization of LTI Systems via Projected Primal-Dual Gradient Flows. (arXiv:2101.01799v3 [math.OC] UPDATED)
- Cauchy-Schwarz Regularized Autoencoder. (arXiv:2101.02149v2 [cs.LG] UPDATED)
- Minibatch optimal transport distances; analysis and applications. (arXiv:2101.01792v1 [stat.ML])
- Minimax Statistical Learning with Wasserstein Distances. (arXiv:1705.07815v2 [cs.LG] CROSS LISTED)
- Control of Stochastic Quantum Dynamics by Differentiable Programming. (arXiv:2101.01190v2 [quant-ph] UPDATED)
- Time fractional gradient flows: Theory and numerics. (arXiv:2101.00541v1 [math.AP])
- The structure of conservative gradient fields. (arXiv:2101.00699v1 [math.OC])
- First-Order Methods for Convex Optimization. (arXiv:2101.00935v2 [math.OC] UPDATED)
- Transport information Bregman divergences. (arXiv:2101.01162v1 [cs.IT])
- Adam revisited: a weighted past gradients perspective. (arXiv:2101.00238v1 [cs.LG])
- Optimizing Optimizers: Regret-optimal gradient descent algorithms. (arXiv:2101.00041v2 [cs.LG] UPDATED)
- Nonreversible MCMC from conditional invertible transforms: a complete recipe with convergence guarantees. (arXiv:2012.15550v1 [stat.CO])
- Differentiable Programming \`a la Moreau. (arXiv:2012.15458v2 [math.OC] UPDATED)
Saved in 2020
- Solving non-linear Kolmogorov equations in large dimensions by using deep learning: a numerical comparison of discretization schemes. (arXiv:2012.07747v3 [math.NA] UPDATED)
- Unadjusted Langevin algorithm with multiplicative noise: Total variation and Wasserstein bounds. (arXiv:2012.14310v2 [math.PR] UPDATED)
- Differentiable Molecular Simulations for Control and Learning. (arXiv:2003.00868v2 [physics.comp-ph] UPDATED)
- Newton acceleration on manifolds identified by proximal-gradient methods. (arXiv:2012.12936v4 [math.OC] UPDATED)
- State-Dependent Temperature Control for Langevin Diffusions. (arXiv:2011.07456v3 [math.OC] UPDATED)
- Optimal dimension dependence of the Metropolis-Adjusted Langevin Algorithm. (arXiv:2012.12810v1 [math.ST])
- Learning emergent PDEs in a learned emergent space. (arXiv:2012.12738v1 [nlin.AO])
- Stochastic Gradient Variance Reduction by Solving a Filtering Problem. (arXiv:2012.12418v2 [cs.LG] UPDATED)
- Assume/Guarantee Contracts for Dynamical Systems: Theory and Computational Tools. (arXiv:2012.12657v2 [eess.SY] UPDATED)
- Finding Global Minima via Kernel Approximations. (arXiv:2012.11978v1 [math.OC])
- Learning to Initialize Gradient Descent Using Gradient Descent. (arXiv:2012.12141v1 [cs.LG])
- Projected Stochastic Gradient Langevin Algorithms for Constrained Sampling and Non-Convex Learning. (arXiv:2012.12137v1 [cs.LG])
- L\'evy processes on smooth manifolds with a connection. (arXiv:2012.11633v2 [math.PR] UPDATED)
- Evolving the Behavior of Machines: From Micro to Macroevolution. (arXiv:2012.11692v1 [cs.NE])
- Complexity of zigzag sampling algorithm for strongly log-concave distributions. (arXiv:2012.11094v2 [stat.ML] UPDATED)
- Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization. (arXiv:2012.11554v1 [cs.LG])
- A deep learning method for solving Fokker-Planck equations. (arXiv:2012.10696v1 [math.NA])
- Entropic-Wasserstein barycenters: PDE characterization, regularity and CLT. (arXiv:2012.10701v1 [math.AP])
- Convergence dynamics of Generative Adversarial Networks: the dual metric flows. (arXiv:2012.10410v2 [stat.ML] UPDATED)
- Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning. (arXiv:2012.09839v2 [cs.LG] UPDATED)
- A fresh take on 'Barker dynamics' for MCMC. (arXiv:2012.09731v3 [stat.CO] UPDATED)
- Riemannian Stochastic Fixed Point Optimization Algorithm. (arXiv:2012.09346v1 [math.OC])
- On The Verification of Neural ODEs with Stochastic Guarantees. (arXiv:2012.08863v1 [cs.LG])
- Calibrated Adaptive Probabilistic ODE Solvers. (arXiv:2012.08202v2 [math.NA] UPDATED)
- Derivation of Ensemble Kalman-Bucy Filters with unbounded nonlinear coefficients. (arXiv:2012.07572v3 [math.PR] UPDATED)
- Bayesian Neural Ordinary Differential Equations. (arXiv:2012.07244v4 [cs.LG] UPDATED)
- Optimization and Learning With Nonlocal Calculus. (arXiv:2012.07013v2 [math.OC] UPDATED)
- Recent Theoretical Advances in Non-Convex Optimization. (arXiv:2012.06188v3 [math.OC] UPDATED)
- Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization. (arXiv:2012.05942v2 [cs.LG] UPDATED)
- Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance. (arXiv:2012.04002v3 [math.OC] UPDATED)
- Variational Autoencoders for Learning Nonlinear Dynamics of Physical Systems. (arXiv:2012.03448v2 [cs.LG] UPDATED)
- Penalised t-walk MCMC. (arXiv:2012.02293v1 [stat.CO])
- Integrable Nonparametric Flows. (arXiv:2012.02035v1 [stat.ML])
- Online learning with dynamics: A minimax perspective. (arXiv:2012.01705v1 [cs.LG])
- Refining Deep Generative Models via Discriminator Gradient Flow. (arXiv:2012.00780v4 [cs.LG] UPDATED)
- Deep learning based numerical approximation algorithms for stochastic partial differential equations and high-dimensional nonlinear filtering problems. (arXiv:2012.01194v1 [math.NA])
- Convergence of Gradient Algorithms for Nonconvex C^{1+alpha} Cost Functions. (arXiv:2012.00628v3 [math.OC] UPDATED)
- Sheaf-theoretic framework for optimal network control. (arXiv:2012.00120v1 [math.AT])
- Convergence and Sample Complexity of SGD in GANs. (arXiv:2012.00732v1 [cs.LG])
- Probabilistic Grammars for Equation Discovery. (arXiv:2012.00428v2 [cs.LG] UPDATED)
- Latent Programmer: Discrete Latent Codes for Program Synthesis. (arXiv:2012.00377v2 [cs.LG] UPDATED)
- On Generalization of Adaptive Methods for Over-parameterized Linear Regression. (arXiv:2011.14066v1 [stat.ML])
- A new approach to posterior contraction rates via Wasserstein dynamics. (arXiv:2011.14425v2 [math.ST] UPDATED)
- A Grassmann Manifold Handbook: Basic Geometry and Computational Aspects. (arXiv:2011.13699v3 [math.NA] UPDATED)
- Score-Based Generative Modeling through Stochastic Differential Equations. (arXiv:2011.13456v2 [cs.LG] UPDATED)
- A Unification of Weighted and Unweighted Particle Filters. (arXiv:2011.13804v3 [math.OC] UPDATED)
- Computation of Feedback Control Laws Based on Switched Tracking of Demonstrations. (arXiv:2011.12639v3 [eess.SY] UPDATED)
- Adam$^+$: A Stochastic Method with Adaptive Variance Reduction. (arXiv:2011.11985v1 [cs.LG])
- Linear Convergence of Distributed Mirror Descent with Integral Feedback for Strongly Convex Problems. (arXiv:2011.12233v1 [math.OC])
- Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence. (arXiv:2011.11852v1 [math.OC])
- Geometry-Aware Universal Mirror-Prox. (arXiv:2011.11203v1 [cs.LG])
- Automatic differentiation of Sylvester, Lyapunov, and algebraic Riccati equations. (arXiv:2011.11430v2 [math.OC] UPDATED)
- Continuous-Time Convergence Rates in Potential and Monotone Games. (arXiv:2011.10682v3 [math.OC] UPDATED)
- Anderson acceleration of coordinate descent. (arXiv:2011.10065v3 [stat.ML] UPDATED)
- Deep reinforcement learning for feedback control in a collective flashing ratchet. (arXiv:2011.10357v3 [cs.LG] UPDATED)
- Variational Laplace for Bayesian neural networks. (arXiv:2011.10443v2 [stat.ML] UPDATED)
- Entropic regularization of Wasserstein distance between infinite-dimensional Gaussian measures and Gaussian processes. (arXiv:2011.07489v3 [stat.ML] UPDATED)
- The back-and-forth method for Wasserstein gradient flows. (arXiv:2011.08151v1 [math.NA])
- Mixing ADAM and SGD: a Combined Optimization Method. (arXiv:2011.08042v1 [cs.LG])
- State-Dependent Temperature Control for Langevin Diffusions. (arXiv:2011.07456v3 [math.OC] UPDATED)
- $(f,\Gamma)$-Divergences: Interpolating between $f$-Divergences and Integral Probability Metrics. (arXiv:2011.05953v3 [stat.ML] UPDATED)
- Towards a Better Global Loss Landscape of GANs. (arXiv:2011.04926v1 [cs.LG])
- Double Descent Risk and Volume Saturation Effects: A Geometric Perspective. (arXiv:2006.04366v2 [stat.ML] UPDATED)
- Particles to Partial Differential Equations Parsimoniously. (arXiv:2011.04517v1 [stat.ML])
- Pathwise Conditioning of Gaussian Processes. (arXiv:2011.04026v3 [stat.ML] UPDATED)
- On the accept-reject mechanism for Metropolis-Hastings algorithms. (arXiv:2011.04493v2 [math.ST] UPDATED)
- Understanding Double Descent Requires a Fine-Grained Bias-Variance Decomposition. (arXiv:2011.03321v1 [stat.ML])
- Neural Controlled Differential Equations for Irregular Time Series. (arXiv:2005.08926v2 [cs.LG] UPDATED)
- Concentration Inequalities for Statistical Inference. (arXiv:2011.02258v3 [math.ST] UPDATED)
- Arbitrary Order Fixed-Time Differentiators. (arXiv:2011.02012v1 [math.OC])
- On the Convergence of Gradient Descent in GANs: MMD GAN As a Gradient Flow. (arXiv:2011.02402v1 [cs.LG])
- EAdam Optimizer: How $\epsilon$ Impact Adam. (arXiv:2011.02150v1 [cs.LG])
- Reverse engineering learned optimizers reveals known and novel mechanisms. (arXiv:2011.02159v2 [cs.LG] UPDATED)
- Exact Asymptotics for Linear Quadratic Adaptive Control. (arXiv:2011.01364v1 [cs.LG])
- ControlVAE: Tuning, Analytical Properties, and Performance Analysis. (arXiv:2011.01754v1 [cs.LG])
- Analytical aspects of non-differentiable neural networks. (arXiv:2011.01858v1 [cs.LG])
- Strengthened Splitting Methods for Computing Resolvents. (arXiv:2011.01796v3 [math.OC] UPDATED)
- Optimal 1-NN Prototypes for Pathological Geometries. (arXiv:2011.00228v1 [cs.LG])
- Distances between probability distributions of different dimensions. (arXiv:2011.00629v3 [math.ST] UPDATED)
- Data-Driven Approximation of the Perron-Frobenius Operator Using the Wasserstein Metric. (arXiv:2011.00759v1 [math.OC])
- Efficient constrained sampling via the mirror-Langevin algorithm. (arXiv:2010.16212v2 [math.ST] UPDATED)
- High-dimensional inference: a statistical mechanics perspective. (arXiv:2010.14863v1 [cond-mat.dis-nn])
- Faster Differentially Private Samplers via R\'enyi Divergence Analysis of Discretized Langevin MCMC. (arXiv:2010.14658v2 [cs.LG] UPDATED)
- Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning. (arXiv:2010.14498v2 [cs.LG] UPDATED)
- Scientific intuition inspired by machine learning generated hypotheses. (arXiv:2010.14236v2 [cs.LG] UPDATED)
- Controlled Molecule Generator for Optimizing Multiple Chemical Properties. (arXiv:2010.13908v1 [cs.LG])
- LEAD: Least-Action Dynamics for Min-Max Optimization. (arXiv:2010.13846v1 [cs.LG])
- Data-Driven Stabilization of Periodic Orbits. (arXiv:2010.13896v2 [math.DS] UPDATED)
- General higher-order majorization-minimization algorithms for (non)convex optimization. (arXiv:2010.13893v3 [math.OC] UPDATED)
- Battery-assisted Electric Vehicle Charging: Data Driven Performance Analysis. (arXiv:2010.14455v1 [eess.SY])
- Convergence Acceleration via Chebyshev Step: Plausible Interpretation of Deep-Unfolded Gradient Descent. (arXiv:2010.13335v1 [cs.LG])
- Variational Bayesian Unlearning. (arXiv:2010.12883v1 [cs.LG])
- Geometric Exploration for Online Control. (arXiv:2010.13178v2 [cs.LG] UPDATED)
- A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks. (arXiv:2010.13165v2 [cs.LG] UPDATED)
- State space models for building control: how deep should you go?. (arXiv:2010.12257v1 [eess.SY])
- Fast and Smooth Interpolation on Wasserstein Space. (arXiv:2010.12101v1 [math.ST])
- Train simultaneously, generalize better: Stability of gradient-based minimax learners. (arXiv:2010.12561v1 [cs.LG])
- Sub-linear convergence of a stochastic proximal iteration method in Hilbert space. (arXiv:2010.12348v3 [math.OC] UPDATED)
- Geometry-Aware Hamiltonian Variational Auto-Encoder. (arXiv:2010.11518v1 [stat.ML])
- Random Coordinate Underdamped Langevin Monte Carlo. (arXiv:2010.11366v1 [stat.ML])
- Riemannian Langevin Algorithm for Solving Semidefinite Programs. (arXiv:2010.11176v6 [stat.ML] UPDATED)
- Limiting Behaviors of Nonconvex-Nonconcave Minimax Optimization via Continuous-Time Systems. (arXiv:2010.10628v2 [math.OC] UPDATED)
- Regret-optimal control in dynamic environments. (arXiv:2010.10473v2 [cs.LG] UPDATED)
- Optimality vs Stability Trade-off in Ensemble Kalman Filters. (arXiv:2010.09920v2 [math.OC] UPDATED)
- Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling. (arXiv:2010.09597v2 [cs.LG] UPDATED)
- AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients. (arXiv:2010.07468v5 [cs.LG] UPDATED)
- Differentiable Implicit Layers. (arXiv:2010.07078v2 [cs.LG] UPDATED)
- CAPD::DynSys: a flexible C++ toolbox for rigorous numerical analysis of dynamical systems. (arXiv:2010.07097v1 [math.NA])
- Probabilistic simulation of partial differential equations. (arXiv:2010.06583v1 [math.NA])
- On the cost of Bayesian posterior mean strategy for log-concave models. (arXiv:2010.06420v2 [math.PR] UPDATED)
- Gradient Descent Ascent for Minimax Problems on Riemannian Manifolds. (arXiv:2010.06097v5 [cs.LG] UPDATED)
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights. (arXiv:2006.08217v3 [cs.LG] UPDATED)
- Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning. (arXiv:2010.05627v2 [cs.LG] UPDATED)
- Fast Convergence of Langevin Dynamics on Manifold: Geodesics meet Log-Sobolev. (arXiv:2010.05263v2 [cs.LG] UPDATED)
- Dissecting Hessian: Understanding Common Structure of Hessian in Neural Networks. (arXiv:2010.04261v6 [cs.LG] UPDATED)
- DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates. (arXiv:2010.04017v2 [cs.LG] UPDATED)
- Learning Partially Observed Linear Dynamical Systems from Logarithmic Number of Samples. (arXiv:2010.04015v1 [cs.LG])
- Learning the Linear Quadratic Regulator from Nonlinear Observations. (arXiv:2010.03799v1 [cs.LG])
- Physics-constrained Bayesian inference of state functions in classical density-functional theory. (arXiv:2010.03374v4 [cond-mat.stat-mech] UPDATED)
- First-Order Optimization Inspired from Finite-Time Convergent Flows. (arXiv:2010.02990v4 [cs.LG] UPDATED)
- Denoising Diffusion Implicit Models. (arXiv:2010.02502v4 [cs.LG] UPDATED)
- Improved Analysis of Clipping Algorithms for Non-convex Optimization. (arXiv:2010.02519v2 [cs.LG] UPDATED)
- Machine-Learned Preconditioners for Linear Solvers in Geophysical Fluid Flows. (arXiv:2010.02866v1 [physics.ao-ph])
- Making Non-Stochastic Control (Almost) as Easy as Stochastic. (arXiv:2006.05910v2 [cs.LG] UPDATED)
- Understanding How Over-Parametrization Leads to Acceleration: A case of learning a single teacher neuron. (arXiv:2010.01637v3 [cs.LG] UPDATED)
- An adaptive Hessian approximated stochastic gradient MCMC method. (arXiv:2010.01384v1 [math.NA])
- $\xi$-torch: differentiable scientific computing library. (arXiv:2010.01921v1 [cs.LG])
- A Modular Analysis of Provable Acceleration via Polyak's Momentum: Training a Wide ReLU Network and a Deep Linear Network. (arXiv:2010.01618v6 [cs.LG] UPDATED)
- High-dimensional Gaussian sampling: a review and a unifying approach based on a stochastic proximal point algorithm. (arXiv:2010.01510v2 [stat.CO] UPDATED)
- Random Coordinate Langevin Monte Carlo. (arXiv:2010.01405v1 [stat.ML])
- Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties. (arXiv:2010.01356v2 [cs.LG] UPDATED)
- Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction. (arXiv:2010.01084v2 [stat.ML] UPDATED)
- Variance-Reduced Methods for Machine Learning. (arXiv:2010.00892v1 [cs.LG])
- Nonsmoothness in Machine Learning: specific structure, proximal identification, and applications. (arXiv:2010.00848v2 [math.OC] UPDATED)
- Tracking and regret bounds for online zeroth-order Euclidean and Riemannian optimisation. (arXiv:2010.00211v3 [math.OC] UPDATED)
- Momentum via Primal Averaging: Theoretical Insights and Learning Rate Schedules for Non-Convex Optimization. (arXiv:2010.00406v4 [cs.LG] UPDATED)
- Active Inference or Control as Inference? A Unifying View. (arXiv:2010.00262v1 [cs.LG])
- Unbalanced Sobolev Descent. (arXiv:2009.14148v1 [cs.LG])
- Hybrid Heavy-Ball Systems: Reset Methods for Optimization with Uncertainty. (arXiv:2009.13770v2 [eess.SY] UPDATED)
- Adaptive Non-reversible Stochastic Gradient Langevin Dynamics. (arXiv:2009.12690v1 [cs.LG])
- Stein Variational Gaussian Processes. (arXiv:2009.12141v3 [stat.ML] UPDATED)
- A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints. (arXiv:2009.11359v4 [math.OC] UPDATED)
- A Personal Perspective on Numerical Analysis and Optimization. (arXiv:2009.11369v1 [math.NA])
- Neural Identification for Control. (arXiv:2009.11782v4 [eess.SY] UPDATED)
- Gradient Flows for Regularized Stochastic Control Problems. (arXiv:2006.05956v5 [math.OC] UPDATED)
- ContactNets: Learning Discontinuous Contact Dynamics with Smooth, Implicit Representations. (arXiv:2009.11193v2 [cs.RO] UPDATED)
- Implicit Gradient Regularization. (arXiv:2009.11162v3 [cs.LG] UPDATED)
- Bayesian Update with Importance Sampling: Required Sample Size. (arXiv:2009.10831v1 [stat.CO])
- Operator-valued formulas for Riemannian Gradient and Hessian and families of tractable metrics. (arXiv:2009.10159v2 [math.OC] UPDATED)
- Curved Schemes for SDEs on Manifolds. (arXiv:2009.10113v2 [math.NA] UPDATED)
- "Hey, that's not an ODE": Faster ODE Adjoints via Seminorms. (arXiv:2009.09457v2 [cs.LG] UPDATED)
- DiffWave: A Versatile Diffusion Model for Audio Synthesis. (arXiv:2009.09761v3 [eess.AS] UPDATED)
- Stochastic Gradient Langevin Dynamics Algorithms with Adaptive Drifts. (arXiv:2009.09535v1 [stat.ML])
- Lagrangian and Hamiltonian Mechanics for Probabilities on the Statistical Manifold. (arXiv:2009.09431v2 [math.ST] UPDATED)
- Automatic Differentiation to Simultaneously Identify Nonlinear Dynamics and Extract Noise Probability Distributions from Data. (arXiv:2009.08810v2 [eess.SP] UPDATED)
- Convergence of unadjusted Hamiltonian Monte Carlo for mean-field models. (arXiv:2009.08735v4 [math.PR] UPDATED)
- SISTA: learning optimal transport costs under sparsity constraints. (arXiv:2009.08564v2 [math.OC] UPDATED)
- DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control. (arXiv:2009.04278v1 [cs.LG])
- Probabilistic Gradients for Fast Calibration of Differential Equation Models. (arXiv:2009.04239v2 [math.OC] UPDATED)
- Optimal Control of Convection-Cooling and Numerical Implementation. (arXiv:2009.01936v2 [math.OC] UPDATED)
- WaveGrad: Estimating Gradients for Waveform Generation. (arXiv:2009.00713v2 [eess.AS] UPDATED)
- The connections between Lyapunov functions for some optimization algorithms and differential equations. (arXiv:2009.00673v2 [math.NA] UPDATED)
- Algorithms for Solving High Dimensional PDEs: From Nonlinear Monte Carlo to Machine Learning. (arXiv:2008.13333v2 [math.NA] UPDATED)
- Control on the Manifolds of Mappings with a View to the Deep Learning. (arXiv:2008.12702v2 [math.OC] UPDATED)
- Market-making with reinforcement-learning (SAC). (arXiv:2008.12275v1 [q-fin.PR])
- Bellman filtering and smoothing for state-space models. (arXiv:2008.11477v16 [stat.ME] UPDATED)
- Data-Driven Aerospace Engineering: Reframing the Industry with Machine Learning. (arXiv:2008.10740v1 [cs.LG])
- Introducing students to research codes: A short course on solving partial differential equations in Python. (arXiv:2008.10931v2 [physics.ed-ph] UPDATED)
- A Discrete-Time Matching Filtering Differentiator. (arXiv:2008.09863v1 [eess.SY])
- Non-convex Min-Max Optimization: Applications, Challenges, and Recent Theoretical Advances. (arXiv:2006.08141v2 [math.OC] CROSS LISTED)
- Augmenting Neural Differential Equations to Model Unknown Dynamical Systems with Incomplete State Information. (arXiv:2008.08226v3 [q-bio.NC] UPDATED)
- Non-Canonical Hamiltonian Monte Carlo. (arXiv:2008.08191v1 [stat.ML])
- Variance Contracts. (arXiv:2008.07103v1 [q-fin.RM])
- Whitening and second order optimization both make information in the dataset unusable during training, and can reduce or prevent generalization. (arXiv:2008.07545v4 [cs.LG] UPDATED)
- Complexity aspects of local minima and related notions. (arXiv:2008.06148v2 [math.OC] UPDATED)
- Multiple Descent: Design Your Own Generalization Curve. (arXiv:2008.01036v7 [cs.LG] UPDATED)
- Distributed Gradient Flow: Nonsmoothness, Nonconvexity, and Saddle Point Evasion. (arXiv:2008.05387v1 [math.OC])
- Darwinian evolution as Brownian motion on the simplex: A geometric perspective on stochastic replicator dynamics. (arXiv:2008.05410v1 [math.PR])
- Non-Stochastic Control with Bandit Feedback. (arXiv:2008.05523v1 [cs.LG])
- Riemannian stochastic recursive momentum method for non-convex optimization. (arXiv:2008.04555v1 [math.OC])
- Hierarchial Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection. (arXiv:2008.03444v2 [cs.AI] UPDATED)
- Do ideas have shape? Idea registration as the continuous limit of artificial neural networks. (arXiv:2008.03920v3 [stat.ML] UPDATED)
- Convex Q-Learning, Part 1: Deterministic Optimal Control. (arXiv:2008.03559v1 [math.OC])
- MCMC Algorithms for Posteriors on Matrix Spaces. (arXiv:2008.02906v3 [stat.CO] UPDATED)
- A Gentle Lecture Note on Filtrations in Reinforcement Learning. (arXiv:2008.02622v1 [cs.LG])
- Curvature-Dependant Global Convergence Rates for Optimization on Manifolds of Bounded Geometry. (arXiv:2008.02517v1 [math.OC])
- Fast optimization via inertial dynamics with closed-loop damping. (arXiv:2008.02261v3 [math.OC] UPDATED)
- An accelerated first-order method for non-convex optimization on manifolds. (arXiv:2008.02252v2 [math.OC] UPDATED)
- Optimal Bounds between $f$-Divergences and Integral Probability Metrics. (arXiv:2006.05973v3 [math.ST] UPDATED)
- Proximal Deterministic Policy Gradient. (arXiv:2008.00759v1 [cs.LG])
- On the Convergence of SGD with Biased Gradients. (arXiv:2008.00051v2 [cs.LG] UPDATED)
- The proximal alternating direction method of multipliers in the nonconvex setting: convergence analysis and rates. (arXiv:1801.01994v2 [math.OC] UPDATED)
- ResNet After All? Neural ODEs and Their Numerical Solution. (arXiv:2007.15386v2 [cs.LG] UPDATED)
- A new framework for the computation of Hessians. (arXiv:2007.15040v1 [math.OC])
- Ergodicity of the underdamped mean-field Langevin dynamics. (arXiv:2007.14660v3 [math.PR] UPDATED)
- A High Probability Analysis of Adaptive SGD with Momentum. (arXiv:2007.14294v1 [stat.ML])
- Langevin Monte Carlo: random coordinate descent and variance reduction. (arXiv:2007.14209v8 [stat.ML] UPDATED)
- A Comparison of Optimization Algorithms for Deep Learning. (arXiv:2007.14166v1 [cs.LG])
- Optimal control of COVID-19 infection rate with social costs. (arXiv:2007.13811v1 [math.OC])
- Learning Compositional Neural Programs for Continuous Control. (arXiv:2007.13363v2 [cs.AI] UPDATED)
- A finite sample analysis of the benign overfitting phenomenon for ridge function estimation. (arXiv:2007.12882v5 [stat.ML] UPDATED)
- Alternating proximal-gradient steps for (stochastic) nonconvex-concave minimax problems. (arXiv:2007.13605v4 [math.OC] UPDATED)
- McKean-Vlasov SDEs in nonlinear filtering. (arXiv:2007.12658v3 [math.OC] UPDATED)
- An invitation to sequential Monte Carlo samplers. (arXiv:2007.11936v3 [stat.CO] UPDATED)
- Convergence of Langevin Monte Carlo in Chi-Squared and Renyi Divergence. (arXiv:2007.11612v4 [stat.ML] UPDATED)
- Wasserstein Statistics in One-dimensional Location-Scale Model. (arXiv:2007.11401v2 [math.ST] UPDATED)
- Optimal policies for mitigating pandemic costs. (arXiv:2007.11178v1 [physics.soc-ph])
- Weak error analysis for stochastic gradient descent optimization algorithms. (arXiv:2007.02723v2 [math.NA] UPDATED)
- Automating Involutive MCMC using Probabilistic and Differentiable Programming. (arXiv:2007.09871v2 [stat.CO] UPDATED)
- CoNES: Convex Natural Evolutionary Strategies. (arXiv:2007.08601v2 [cs.LG] UPDATED)
- Differential Inclusions in Wasserstein Spaces: The Cauchy-Lipschitz Framework. (arXiv:2007.08906v2 [math.OC] UPDATED)
- Adaptive Gradient Methods for Constrained Convex Optimization and Variational Inequalities. (arXiv:2007.08840v3 [cs.LG] UPDATED)
- A Fourier State Space Model for Bayesian ODE Filters. (arXiv:2007.09118v2 [stat.ML] UPDATED)
- On stochastic mirror descent with interacting particles: convergence properties and variance reduction. (arXiv:2007.07704v2 [math.OC] UPDATED)
- Global Convergence of Second-order Dynamics in Two-layer Neural Networks. (arXiv:2007.06852v1 [math.OC] CROSS LISTED)
- Black-Box Control for Linear Dynamical Systems. (arXiv:2007.06650v3 [cs.LG] UPDATED)
- Artificial Neural Networks Jamming on the Beat. (arXiv:2007.06284v3 [eess.AS] UPDATED)
- Identifying Latent Stochastic Differential Equations. (arXiv:2007.06075v5 [stat.ML] UPDATED)
- Control as Hybrid Inference. (arXiv:2007.05838v1 [cs.LG])
- High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin diffusion. (arXiv:2007.05455v4 [math.PR] UPDATED)
- A Global Stochastic Optimization Particle Filter Algorithm. (arXiv:2007.04803v9 [stat.ML] UPDATED)
- On Entropy Regularized Path Integral Control for Trajectory Optimization. (arXiv:2007.03960v2 [math.OC] UPDATED)
- Variational Representations and Neural Network Estimation of R\'enyi Divergences. (arXiv:2007.03814v4 [stat.ML] UPDATED)
- Momentum Accelerates Evolutionary Dynamics. (arXiv:2007.02449v1 [cs.LG])
- Stochastic Stein Discrepancies. (arXiv:2007.02857v4 [stat.ML] UPDATED)
- Novel min-max reformulations of Linear Inverse Problems. (arXiv:2007.02448v1 [math.OC])
- Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion. (arXiv:2007.01990v1 [stat.ML])
- Weak error analysis for stochastic gradient descent optimization algorithms. (arXiv:2007.02723v2 [math.NA] UPDATED)
- Lecture Notes on Control System Theory and Design. (arXiv:2007.01367v1 [math.OC])
- Continuous-Time Multi-Armed Bandits with Controlled Restarts. (arXiv:2007.00081v1 [cs.LG])
- Double-Loop Unadjusted Langevin Algorithm. (arXiv:2007.01147v1 [math.ST])
- Molecular Latent Space Simulators. (arXiv:2007.00728v1 [physics.comp-ph])
- Efficient Proximal Mapping of the 1-path-norm of Shallow Networks. (arXiv:2007.01003v2 [cs.LG] UPDATED)
- On Linear Identifiability of Learned Representations. (arXiv:2007.00810v3 [stat.ML] UPDATED)
- Bandit Linear Control. (arXiv:2007.00759v1 [cs.LG])
- Continuous dynamics related to monotone inclusions and non-smooth optimization problems. (arXiv:2007.00460v1 [math.OC])
- On the strong concavity of the dual function of an optimization problem. (arXiv:2006.16781v6 [math.OC] UPDATED)
- AdaSGD: Bridging the gap between SGD and Adam. (arXiv:2006.16541v1 [cs.LG])
- Spectral Gap of Replica Exchange Langevin Diffusion on Mixture Distributions. (arXiv:2006.16193v2 [math.PR] UPDATED)
- Symplectic Euler scheme for Hamiltonian stochastic differential equations driven by Levy noise. (arXiv:2006.15500v1 [math.NA])
- A Stabilization of a Continuous Limit of the Ensemble Kalman Inversion. (arXiv:2006.15390v4 [math.NA] UPDATED)
- Understanding Gradient Clipping in Private SGD: A Geometric Perspective. (arXiv:2006.15429v2 [cs.LG] UPDATED)
- On the Generalization Benefit of Noise in Stochastic Gradient Descent. (arXiv:2006.15081v1 [cs.LG])
- Understanding Notions of Stationarity in Non-Smooth Optimization. (arXiv:2006.14901v1 [math.OC])
- Stochastic Online Optimization using Kalman Recursion. (arXiv:2002.03636v2 [cs.LG] CROSS LISTED)
- Prediction with Approximated Gaussian Process Dynamical Models. (arXiv:2006.14551v2 [eess.SY] UPDATED)
- Taming neural networks with TUSLA: Non-convex learning via adaptive stochastic gradient Langevin algorithms. (arXiv:2006.14514v4 [cs.LG] UPDATED)
- Penalized Langevin dynamics with vanishing penalty for smooth and log-concave targets. (arXiv:2006.13998v1 [math.ST])
- Hypocoercivity of Langevin-type dynamics on abstract smooth manifolds. (arXiv:2006.11567v1 [math.PR])
- Greedy Adversarial Equilibrium: An Efficient Alternative to Nonconvex-Nonconcave Min-Max Optimization. (arXiv:2006.12363v4 [cs.DS] UPDATED)
- On the Relationship Between Active Inference and Control as Inference. (arXiv:2006.12964v3 [cs.AI] UPDATED)
- A Proximal-Gradient Algorithm for Crystal Surface Evolution. (arXiv:2006.12528v1 [math.NA])
- An operator view of policy gradient methods. (arXiv:2006.11266v3 [cs.LG] UPDATED)
- On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems. (arXiv:2006.11144v1 [math.OC])
- Stochastic Bandits with Linear Constraints. (arXiv:2006.10185v1 [cs.LG])
- Competitive Policy Optimization. (arXiv:2006.10611v1 [cs.LG])
- Competitive Mirror Descent. (arXiv:2006.10179v1 [math.OC])
- Reinforcement Learning as Iterative and Amortised Inference. (arXiv:2006.10524v3 [cs.LG] UPDATED)
- Go with the Flow: Adaptive Control for Neural ODEs. (arXiv:2006.09545v3 [cs.LG] UPDATED)
- A Non-Asymptotic Analysis for Stein Variational Gradient Descent. (arXiv:2006.09797v4 [stat.ML] UPDATED)
- The limits of min-max optimization algorithms: convergence to spurious non-critical sets. (arXiv:2006.09065v2 [math.OC] UPDATED)
- Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm. (arXiv:2006.09270v2 [stat.ML] UPDATED)
- Learning Linear Programs from Optimal Decisions. (arXiv:2006.08923v1 [cs.LG])
- Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control. (arXiv:2006.08718v2 [cs.LG] UPDATED)
- Time Discretizations of Wasserstein-Hamiltonian Flows. (arXiv:2006.09187v1 [math.NA])
- Entropic gradient descent algorithms and wide flat minima. (arXiv:2006.07897v4 [cs.LG] UPDATED)
- Proximal Mapping for Deep Regularization. (arXiv:2006.07822v1 [cs.LG])
- Learning Causal Models Online. (arXiv:2006.07461v1 [cs.LG])
- Time-Varying Convex Optimization: Time-Structured Algorithms and Applications. (arXiv:2006.08500v1 [math.OC])
- Non-convex Min-Max Optimization: Applications, Challenges, and Recent Theoretical Advances. (arXiv:2006.08141v2 [math.OC] UPDATED)
- Almost sure convergence rates for Stochastic Gradient Descent and Stochastic Heavy Ball. (arXiv:2006.07867v2 [cs.LG] UPDATED)
- Projection Robust Wasserstein Distance and Riemannian Optimization. (arXiv:2006.07458v10 [cs.LG] UPDATED)
- Preconditioned accelerated gradient descent methods for locally Lipschitz smooth objectives with applications to the solution of nonlinear PDEs. (arXiv:2006.06732v2 [math.NA] UPDATED)
- Multiplicative noise and heavy tails in stochastic optimization. (arXiv:2006.06293v1 [stat.ML])
- Gradient Flows for Regularized Stochastic Control Problems. (arXiv:2006.05956v5 [math.OC] UPDATED)
- Machine Learning and Control Theory. (arXiv:2006.05604v1 [cs.LG])
- Identifying Causal Structure in Dynamical Systems. (arXiv:2006.03906v2 [cs.LG] UPDATED)
- The Heavy-Tail Phenomenon in SGD. (arXiv:2006.04740v5 [math.OC] UPDATED)
- Triple descent and the two kinds of overfitting: Where & why do they appear?. (arXiv:2006.03509v2 [cs.LG] UPDATED)
- Least $k$th-Order and R\'{e}nyi Generative Adversarial Networks. (arXiv:2006.02479v3 [cs.LG] UPDATED)
- Asymptotic Analysis of Conditioned Stochastic Gradient Descent. (arXiv:2006.02745v5 [math.ST] UPDATED)
- Some Theoretical Insights into Wasserstein GANs. (arXiv:2006.02682v2 [cs.LG] UPDATED)
- SVGD as a kernelized Wasserstein gradient flow of the chi-squared divergence. (arXiv:2006.02509v1 [math.ST])
- SDE approximations of GANs training and its long-run behavior. (arXiv:2006.02047v5 [cs.LG] UPDATED)
- A mathematical model for automatic differentiation in machine learning. (arXiv:2006.02080v2 [cs.LG] UPDATED)
- The Power of Factorial Powers: New Parameter settings for (Stochastic) Optimization. (arXiv:2006.01244v3 [cs.LG] UPDATED)
- Carath\'eodory Sampling for Stochastic Gradient Descent. (arXiv:2006.01819v2 [cs.LG] UPDATED)
- The Power of Factorial Powers: New Parameter settings for (Stochastic) Optimization. (arXiv:2006.01244v3 [cs.LG] UPDATED)
- Interacting particle solutions of Fokker-Planck equations through gradient-log-density estimation. (arXiv:2006.00702v1 [cond-mat.stat-mech])
- CoolMomentum: A Method for Stochastic Optimization by Langevin Dynamics with Simulated Annealing. (arXiv:2005.14605v2 [stat.ML] UPDATED)
- Euclideanizing Flows: Diffeomorphic Reduction for Learning Stable Dynamical Systems. (arXiv:2005.13143v2 [cs.RO] UPDATED)
- On the Convergence of Langevin Monte Carlo: The Interplay between Tail Growth and Smoothness. (arXiv:2005.13097v1 [stat.ML])
- Convergence Analysis of Riemannian Stochastic Approximation Schemes. (arXiv:2005.13284v3 [stat.ML] UPDATED)
- Stochastic control liaisons: Richard Sinkhorn meets Gaspard Monge on a Schroedinger bridge. (arXiv:2005.10963v3 [math.OC] UPDATED)
- Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping. (arXiv:2005.10785v2 [math.OC] UPDATED)
- Infinite-dimensional gradient-based descent for alpha-divergence minimisation. (arXiv:2005.10618v2 [math.ST] UPDATED)
- Exponential ergodicity of mirror-Langevin diffusions. (arXiv:2005.09669v2 [math.ST] UPDATED)
- Riemannian Proximal Policy Optimization. (arXiv:2005.09195v1 [cs.LG])
- Neural Controlled Differential Equations for Irregular Time Series. (arXiv:2005.08926v2 [cs.LG] UPDATED)
- Convergence of constant step stochastic gradient descent for non-smooth non-convex functions. (arXiv:2005.08513v3 [math.NA] UPDATED)
- Understanding Nesterov's Acceleration via Proximal Point Method. (arXiv:2005.08304v3 [math.OC] UPDATED)
- Convergence of Online Adaptive and Recurrent Optimization Algorithms. (arXiv:2005.05645v2 [math.DS] UPDATED)
- Solving high-dimensional Hamilton-Jacobi-Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space. (arXiv:2005.05409v2 [math.OC] UPDATED)
- Revealing hidden dynamics from time-series data by ODENet. (arXiv:2005.04849v2 [math.DS] UPDATED)
- FedSplit: An algorithmic framework for fast federated optimization. (arXiv:2005.05238v1 [cs.LG])
- Accurate and efficient splitting methods for dissipative particle dynamics. (arXiv:2005.05260v2 [math.NA] UPDATED)
- Robust Filtering and Propagation of Uncertainty in Hidden Markov Models. (arXiv:2005.04982v2 [math.PR] UPDATED)
- Asymptotics of smoothed Wasserstein distances. (arXiv:2005.00738v2 [math.PR] UPDATED)
- Learning nonlinear dynamical systems from a single trajectory. (arXiv:2004.14681v1 [cs.LG])
- SLSpy: Python-Based System-Level Controller Synthesis Framework. (arXiv:2004.12565v3 [eess.SY] UPDATED)
- Differentiating through Log-Log Convex Programs. (arXiv:2004.12553v3 [math.OC] UPDATED)
- A noise-induced transition in the Lorenz system. (arXiv:2004.12815v3 [math.PR] UPDATED)
- A Bayesian perspective on classical control. (arXiv:2004.10288v2 [math.OC] UPDATED)
- Simulation of non-Lipschitz stochastic differential equations driven by $\alpha$-stable noise: a method based on deterministic homogenisation. (arXiv:2004.09914v1 [math.DS])
- Stochastic gradient algorithms from ODE splitting perspective. (arXiv:2004.08981v1 [stat.ML])
- On Linear Optimization over Wasserstein Balls. (arXiv:2004.07162v2 [math.OC] UPDATED)
- Analysis of Stochastic Gradient Descent in Continuous Time. (arXiv:2004.07177v3 [math.PR] UPDATED)
- On dissipative symplectic integration with applications to gradient-based optimization. (arXiv:2004.06840v4 [math.OC] UPDATED)
- On Learning Rates and Schr\"odinger Operators. (arXiv:2004.06977v1 [cs.LG])
- Relations among Open-loop Control Ability, Control Strategy Space and Closed-loop Performance for Linear Discrte-time Systems. (arXiv:2004.05619v3 [eess.SY] UPDATED)
- ControlVAE: Controllable Variational Autoencoder. (arXiv:2004.05988v5 [cs.LG] UPDATED)
- Explicit Estimation of Derivatives from Data and Differential Equations by Gaussian Process Regression. (arXiv:2004.05796v2 [stat.CO] UPDATED)
- Geomstats: A Python Package for Riemannian Geometry in Machine Learning. (arXiv:2004.04667v1 [cs.LG])
- Convergence rates and approximation results for SGD and its continuous-time counterpart. (arXiv:2004.04193v2 [math.OC] UPDATED)
- Mirror Descent Algorithms for Minimizing Interacting Free Energy. (arXiv:2004.04555v1 [math.OC])
- Densities of Almost Surely Terminating Probabilistic Programs are Differentiable Almost Everywhere. (arXiv:2004.03924v2 [cs.LO] UPDATED)
- First order convergence of Milstein schemes for McKean-Vlasov equations and interacting particle systems. (arXiv:2004.03325v2 [math.PR] UPDATED)
- $\beta$-High Resolution ODE and Phase Transition between NAG-SC and Heavy Ball Method. (arXiv:2004.03121v1 [math.OC])
- The equivalence between Stein variational gradient descent and black-box variational inference. (arXiv:2004.01822v1 [cs.LG])
- Controllability of a Linear System with Nonnegative Sparse Controls. (arXiv:2002.03978v5 [eess.SY] UPDATED)
- Mirrorless Mirror Descent: A Natural Derivation of Mirror Descent. (arXiv:2004.01025v3 [cs.LG] UPDATED)
- Bayesian ODE Solvers: The Maximum A Posteriori Estimate. (arXiv:2004.00623v2 [math.NA] UPDATED)
- Second-Order Guarantees in Centralized, Federated and Decentralized Nonconvex Optimization. (arXiv:2003.14366v1 [cs.MA])
- Deep State Space Models for Nonlinear System Identification. (arXiv:2003.14162v3 [eess.SY] UPDATED)
- Minimax control of ambiguous linear stochastic systems using the Wasserstein metric. (arXiv:2003.13258v1 [eess.SY])
- Runge-Kutta methods for rough differential equations. (arXiv:2003.12626v1 [math.NA])
- Planning as Inference in Epidemiological Models. (arXiv:2003.13221v1 [q-bio.PE])
- Online Smoothing for Diffusion Processes Observed with Noise. (arXiv:2003.12247v4 [stat.CO] UPDATED)
- Breaking the $O(1/\epsilon)$ Optimal Rate for a Class of Minimax Problems. (arXiv:2003.11758v1 [math.OC])
- Physics and Derivatives -- Interview Questions and Answers. (arXiv:2003.11471v1 [q-fin.GN])
- Stochastic Zeroth-order Riemannian Derivative Estimation and Optimization. (arXiv:2003.11238v3 [math.OC] UPDATED)
- A Poisson Kalman filter for disease surveillance. (arXiv:2003.11194v5 [stat.ME] UPDATED)
- Yet another introduction to linear dynamical systems control: From identification and approximation to digital control. (arXiv:2003.10481v1 [eess.SY])
- Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees. (arXiv:2003.10309v2 [math.OC] UPDATED)
- Complexity of randomized algorithms for underdamped Langevin dynamics. (arXiv:2003.09906v3 [math.NA] UPDATED)
- Non-asymptotic control of the cumulative distribution function of L\'evy processes. (arXiv:2003.09281v1 [math.PR])
- A simple SIR model with a large set of asymptomatic infectives. (arXiv:2003.08720v1 [q-bio.PE])
- Mixing Rates for Hamiltonian Monte Carlo Algorithms in Finite and Infinite Dimensions. (arXiv:2003.07980v1 [math.ST])
- Options on infectious diseases. (arXiv:2003.07992v3 [q-fin.CP] UPDATED)
- Acceleration with a Ball Optimization Oracle. (arXiv:2003.08078v1 [math.OC])
- Stable Neural Flows. (arXiv:2003.08063v1 [cs.LG])
- The Implicit Regularization of Stochastic Gradient Flow for Least Squares. (arXiv:2003.07802v2 [stat.ML] UPDATED)
- The Elliptical Processes: a Family of Fat-tailed Stochastic Processes. (arXiv:2003.07201v2 [stat.ME] UPDATED)
- On the Generalised Langevin Equation for Simulated Annealing. (arXiv:2003.06448v3 [math.PR] UPDATED)
- Interpolation Technique to Speed Up Gradients Propagation in Neural ODEs. (arXiv:2003.05271v2 [cs.NE] UPDATED)
- Unbiased Estimation of the Gradient of the Log-Likelihood in Inverse Problems. (arXiv:2003.04896v2 [stat.ME] UPDATED)
- Mean Field Games and Applications: Numerical Aspects. (arXiv:2003.04444v1 [math.OC])
- Geometry of First-Order Methods and Adaptive Acceleration. (arXiv:2003.03910v2 [math.OC] UPDATED)
- Lagrangian schemes for Wasserstein gradient flows. (arXiv:2003.03803v1 [math.NA])
- Stochastic Recursive Momentum for Policy Gradient Methods. (arXiv:2003.04302v1 [stat.ML])
- Stochastic Modified Equations for Continuous Limit of Stochastic ADMM. (arXiv:2003.03532v1 [math.OC])
- Ensemble Kalman Inversion for nonlinear problems: weights, consistency, and variance bounds. (arXiv:2003.02316v3 [math.NA] UPDATED)
- On the Differentiability of Projected Trajectories and the Robust Convergence of Non-convex Anti-Windup Gradient Flows. (arXiv:2003.02551v2 [math.OC] UPDATED)
- Bounds for the tracking error of first-order online optimization methods. (arXiv:2003.02400v2 [math.OC] UPDATED)
- A Simple Convergence Proof of Adam and Adagrad. (arXiv:2003.02395v3 [stat.ML] UPDATED)
- Gaussianization Flows. (arXiv:2003.01941v1 [cs.LG])
- Iterative Averaging in the Quest for Best Test Error. (arXiv:2003.01247v5 [stat.ML] UPDATED)
- Variational inference formulation for a model-free simulation of a dynamical system with unknown parameters by a recurrent neural network. (arXiv:2003.01184v2 [cs.LG] UPDATED)
- General convergence analysis of stochastic first order methods for composite optimization. (arXiv:2003.01666v2 [math.OC] UPDATED)
- Online Sinkhorn: Optimal Transport distances from sample streams. (arXiv:2003.01415v2 [math.OC] UPDATED)
- Batch Normalization Provably Avoids Rank Collapse for Randomly Initialised Deep Networks. (arXiv:2003.01652v3 [stat.ML] UPDATED)
- Differentiable Causal Backdoor Discovery. (arXiv:2003.01461v1 [cs.LG])
- Exactly Computing the Local Lipschitz Constant of ReLU Networks. (arXiv:2003.01219v2 [stat.ML] UPDATED)
- Stochastically Differentiable Probabilistic Programs. (arXiv:2003.00704v2 [cs.LG] UPDATED)
- Differentiating through the Fr\'echet Mean. (arXiv:2003.00335v4 [stat.ML] UPDATED)
- TAdam: A Robust Stochastic Gradient Optimizer. (arXiv:2003.00179v2 [cs.LG] UPDATED)
- Dimension-free convergence rates for gradient Langevin dynamics in RKHS. (arXiv:2003.00306v2 [math.PR] UPDATED)
- First Order Methods take Exponential Time to Converge to Global Minimizers of Non-Convex Functions. (arXiv:2002.12911v2 [cs.LG] UPDATED)
- On the Convergence of Nesterov's Accelerated Gradient Method in Stochastic Settings. (arXiv:2002.12414v2 [cs.LG] UPDATED)
- Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic Perspectives. (arXiv:2002.12493v2 [math.OC] UPDATED)
- Lipschitz and Comparator-Norm Adaptivity in Online Learning. (arXiv:2002.12242v2 [cs.LG] UPDATED)
- Disentangling Adaptive Gradient Methods from Learning Rates. (arXiv:2002.11803v1 [cs.LG])
- Optimality and Stability in Non-Convex Smooth Games. (arXiv:2002.11875v3 [cs.LG] UPDATED)
- Can We Find Near-Approximately-Stationary Points of Nonsmooth Nonconvex Functions?. (arXiv:2002.11962v3 [math.OC] UPDATED)
- Statistical Adaptive Stochastic Gradient Methods. (arXiv:2002.10597v1 [stat.ML])
- Non-asymptotic bounds for stochastic optimization with biased noisy gradient oracles. (arXiv:2002.11440v2 [cs.LG] UPDATED)
- Acceleration for Compressed Gradient Descent in Distributed and Federated Optimization. (arXiv:2002.11364v2 [math.OC] UPDATED)
- Neural Parametric Fokker-Planck Equations. (arXiv:2002.11309v4 [math.NA] UPDATED)
- Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows. (arXiv:2002.10516v4 [cs.LG] UPDATED)
- Biased Stochastic Gradient Descent for Conditional Stochastic Optimization. (arXiv:2002.10790v1 [math.OC])
- Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization. (arXiv:2002.10726v1 [math.OC])
- Stein variational reduced basis Bayesian inversion. (arXiv:2002.10924v1 [math.NA])
- Stochastic Normalizing Flows. (arXiv:2002.09547v2 [stat.ML] UPDATED)
- Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games. (arXiv:2002.09806v5 [math.OC] UPDATED)
- On Thompson Sampling with Langevin Algorithms. (arXiv:2002.10002v2 [cs.LG] UPDATED)
- Generalized Bayesian Filtering via Sequential Monte Carlo. (arXiv:2002.09998v2 [stat.ME] UPDATED)
- Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems. (arXiv:2002.09621v1 [math.OC])
- Scalable Second Order Optimization for Deep Learning. (arXiv:2002.09018v2 [cs.LG] UPDATED)
- Stochastic Runge-Kutta methods and adaptive SGD-G2 stochastic gradient descent. (arXiv:2002.09304v1 [cs.LG])
- Differentiable Likelihoods for Fast Inversion of 'Likelihood-Free' Dynamical Systems. (arXiv:2002.09301v2 [stat.ML] UPDATED)
- Parallel and distributed asynchronous adaptive stochastic gradient methods. (arXiv:2002.09095v3 [math.OC] UPDATED)
- Dissipative SymODEN: Encoding Hamiltonian Dynamics with Dissipation and Control into Deep Learning. (arXiv:2002.08860v3 [cs.LG] UPDATED)
- DDPNOpt: Differential Dynamic Programming Neural Optimizer. (arXiv:2002.08809v3 [cs.LG] UPDATED)
- Stochastic Optimization for Regularized Wasserstein Estimators. (arXiv:2002.08695v1 [cs.LG])
- Learning with Differentiable Perturbed Optimizers. (arXiv:2002.08676v2 [cs.LG] UPDATED)
- Scalable Constrained Bayesian Optimization. (arXiv:2002.08526v3 [cs.LG] UPDATED)
- Non-asymptotic and Accurate Learning of Nonlinear Dynamical Systems. (arXiv:2002.08538v2 [cs.LG] UPDATED)
- Scalable Constrained Bayesian Optimization. (arXiv:2002.08526v3 [cs.LG] UPDATED)
- Stochastic Optimization for Regularized Wasserstein Estimators. (arXiv:2002.08695v1 [cs.LG])
- Efficient Search of First-Order Nash Equilibria in Nonconvex-Concave Smooth Min-Max Problems. (arXiv:2002.07919v7 [math.OC] UPDATED)
- A Unified Convergence Analysis for Shuffling-Type Gradient Methods. (arXiv:2002.08246v2 [math.OC] UPDATED)
- The Geometry of Sign Gradient Descent. (arXiv:2002.08056v1 [cs.LG])
- On the Trackability of Stochastic Processes. (arXiv:2002.08142v2 [cs.IT] UPDATED)
- Dissecting Neural ODEs. (arXiv:2002.08071v1 [cs.LG])
- Coarse graining of a Fokker-Planck equation with excluded volume effects preserving the gradient-flow structure. (arXiv:2002.07513v2 [math.AP] UPDATED)
- Estimating processes in adapted Wasserstein distance. (arXiv:2002.07261v2 [math.PR] UPDATED)
- Convex Optimization on Functionals of Probability Densities. (arXiv:2002.06488v2 [cs.IT] UPDATED)
- Distributed Averaging Methods for Randomized Second Order Optimization. (arXiv:2002.06540v1 [stat.ML])
- Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling. (arXiv:2002.06286v2 [cs.LG] UPDATED)
- Sampling and Update Frequencies in Proximal Variance-Reduced Stochastic Gradient Methods. (arXiv:2002.05545v3 [math.OC] UPDATED)
- Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization. (arXiv:2002.05359v1 [cs.LG])
- Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization. (arXiv:2002.05309v2 [math.OC] UPDATED)
- A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance. (arXiv:2002.05273v4 [stat.ML] UPDATED)
- Stochastic Approximate Gradient Descent via the Langevin Algorithm. (arXiv:2002.05519v1 [cs.LG])
- A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance. (arXiv:2002.05273v4 [stat.ML] UPDATED)
- Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization. (arXiv:2002.05359v1 [cs.LG])
- Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth Non-Convex Optimization. (arXiv:2002.05466v2 [math.OC] UPDATED)
- Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization. (arXiv:2002.05309v2 [math.OC] UPDATED)
- Fast Convergence for Langevin Diffusion with Manifold Structure. (arXiv:2002.05576v2 [math.PR] UPDATED)
- Structure-preserving integrators for dissipative systems based on reversible-irreversible splitting. (arXiv:1804.05114v2 [math.NA] UPDATED)
- Variance Reduced Coordinate Descent with Acceleration: New Method With a Surprising Application to Finite-Sum Problems. (arXiv:2002.04670v1 [math.OC])
- Momentum Improves Optimization on Riemannian Manifolds. (arXiv:2002.04144v2 [math.OC] UPDATED)
- Logsmooth Gradient Concentration and Tighter Runtimes for Metropolized Hamiltonian Monte Carlo. (arXiv:2002.04121v3 [cs.LG] UPDATED)
- Wasserstein Control of Mirror Langevin Monte Carlo. (arXiv:2002.04363v1 [math.ST])
- Better Theory for SGD in the Nonconvex World. (arXiv:2002.03329v3 [math.OC] UPDATED)
- Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework. (arXiv:1912.12970v5 [cs.LG] UPDATED)
- A gradient descent perspective on Sinkhorn. (arXiv:2002.03758v3 [math.OC] UPDATED)
- Momentum Improves Normalized SGD. (arXiv:2002.03305v2 [cs.LG] UPDATED)
- Compositional ADAM: An Adaptive Compositional Solver. (arXiv:2002.03755v2 [cs.LG] UPDATED)
- Stochastic Online Optimization using Kalman Recursion. (arXiv:2002.03636v2 [cs.LG] UPDATED)
- A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima. (arXiv:2002.03495v14 [cs.LG] UPDATED)
- The Wasserstein Proximal Gradient Algorithm. (arXiv:2002.03035v3 [math.OC] UPDATED)
- Continuous-time Lower Bounds for Gradient-based Algorithms. (arXiv:2002.03546v2 [math.OC] UPDATED)
- Beyond $R_0$: the importance of contact tracing when predicting epidemics. (arXiv:2002.04004v1 [q-bio.PE])
- First attempts to model the dynamics of the Coronavirus outbreak 2020. (arXiv:2002.03821v1 [q-bio.PE])
- Quadratic growth during the 2019 novel coronavirus epidemic. (arXiv:2002.03638v1 [q-bio.PE])
- The Novel Coronavirus, 2019-nCoV, is Highly Contagious and More Infectious Than Initially Estimated. (arXiv:2002.03268v1 [q-bio.PE])
- Unbiased Filtering of a Class of Partially Observed Diffusions. (arXiv:2002.03747v2 [math.NA] UPDATED)
- Convergence Rates of Accelerated Markov Gradient Descent with Applications in Reinforcement Learning. (arXiv:2002.02873v3 [math.OC] UPDATED)
- The Power of Linear Controllers in LQR Control. (arXiv:2002.02574v1 [math.OC])
- A deep-learning view of chemical space designed to facilitate drug discovery. (arXiv:2002.02948v1 [cs.LG])
- How to train your neural ODE: the world of Jacobian and kinetic regularization. (arXiv:2002.02798v3 [stat.ML] UPDATED)
- How Good is the Bayes Posterior in Deep Neural Networks Really?. (arXiv:2002.02405v2 [stat.ML] UPDATED)
- Global Convergence of Frank Wolfe on One Hidden Layer Networks. (arXiv:2002.02208v1 [math.OC])
- Near-Optimal Algorithms for Minimax Optimization. (arXiv:2002.02417v6 [math.OC] UPDATED)
- Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian Noise. (arXiv:2002.01268v1 [stat.ML])
- Automatic structured variational inference. (arXiv:2002.00643v3 [stat.ML] UPDATED)
- An Equivalence between Bayesian Priors and Penalties in Variational Inference. (arXiv:2002.00178v2 [cs.LG] UPDATED)
- The Statistical Complexity of Early-Stopped Mirror Descent. (arXiv:2002.00189v2 [stat.ML] UPDATED)
- Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems. (arXiv:2002.00057v2 [cs.LG] UPDATED)
- Analysis and optimal control of a malaria mathematical model under resistance and population movement. (arXiv:2002.00070v1 [q-bio.PE])
- Consensus-Based Optimization on Hypersurfaces: Well-Posedness and Mean-Field Limit. (arXiv:2001.11994v4 [math.AP] UPDATED)
- Non-reversibly updating a uniform [0,1] value for Metropolis accept/reject decisions. (arXiv:2001.11950v1 [stat.CO])
- Faster Projection-free Online Learning. (arXiv:2001.11568v2 [cs.LG] UPDATED)
- Non-explosion by Stratonovich noise for ODEs. (arXiv:2001.11598v2 [math.PR] UPDATED)
- Hamiltonian neural networks for solving equations of motion. (arXiv:2001.11107v5 [physics.comp-ph] UPDATED)
- Counterfactual Programming for Optimal Control. (arXiv:2001.11116v2 [math.OC] UPDATED)
- Variational Autoencoders for Opponent Modeling in Multi-Agent Systems. (arXiv:2001.10829v1 [cs.LG])
- Reproducibility Challenge NeurIPS 2019 Report on "Competitive Gradient Descent". (arXiv:2001.10820v1 [cs.LG])
- Explainable Machine Learning Control -- robust control and stability analysis. (arXiv:2001.10056v1 [cs.LG])
- Ensemble Rejection Sampling. (arXiv:2001.09188v1 [stat.CO])
- Variance Reduction with Sparse Gradients. (arXiv:2001.09623v1 [cs.LG])
- Exact rate of convergence of the mean Wasserstein distance between the empirical and true Gaussian distribution. (arXiv:2001.09817v1 [math.PR])
- A Sharp Convergence Rate for the Asynchronous Stochastic Gradient Descent. (arXiv:2001.09126v1 [math.NA])
- Discrete graphical models -- an optimization perspective. (arXiv:2001.09017v1 [math.OC])
- From Nesterov's Estimate Sequence to Riemannian Acceleration. (arXiv:2001.08876v1 [math.OC])
- An $O(s^r)$-Resolution ODE Framework for Understanding Discrete-Time Algorithms and Applications to the Linear Convergence of Minimax Problems. (arXiv:2001.08826v7 [math.OC] UPDATED)
- Learning the Non-Equilibrium Dynamics of Brownian Movies. (arXiv:2001.08642v1 [physics.bio-ph])
- A Unified Optimization Framework for Low-Rank Inducing Penalties. (arXiv:2001.08415v1 [math.OC])
- Gradient and Hessian approximations in Derivative Free Optimization. (arXiv:2001.08355v1 [math.OC])
- Replica Exchange for Non-Convex Optimization. (arXiv:2001.08356v4 [math.OC] UPDATED)
- TPFA Finite Volume Approximation of Wasserstein Gradient Flows. (arXiv:2001.07005v2 [math.NA] UPDATED)
- Fitting a Linear Control Policy to Demonstrations with a Kalman Constraint. (arXiv:2001.07572v1 [math.OC])
- Dual Stochastic Natural Gradient Descent and convergence of interior half-space gradient approximations. (arXiv:2001.06744v2 [math.OC] UPDATED)
- Adaptive Stochastic Optimization. (arXiv:2001.06699v1 [math.OC])
- Understanding the stochastic partial differential equation approach to smoothing. (arXiv:2001.07623v2 [stat.ME] UPDATED)
- Learning to Control PDEs with Differentiable Physics. (arXiv:2001.07457v1 [cs.LG])
- SGLB: Stochastic Gradient Langevin Boosting. (arXiv:2001.07248v5 [cs.LG] UPDATED)
- Counterexamples to "The Blessings of Multiple Causes" by Wang and Blei. (arXiv:2001.06555v3 [stat.ME] UPDATED)
- Markov Chain Monte Carlo Methods, a survey with some frequent misunderstandings. (arXiv:2001.06249v1 [stat.CO])
- Causal models for dynamical systems. (arXiv:2001.06208v1 [stat.ME])
- Exponential contraction in Wasserstein distance on static and evolving manifolds. (arXiv:2001.06187v1 [math.DG])
- On the Trend-corrected Variant of Adaptive Stochastic Optimization Methods. (arXiv:2001.06130v2 [cs.LG] UPDATED)
- Learning Stable Deep Dynamics Models. (arXiv:2001.06116v1 [cs.LG])
- Gradient descent with momentum --- to accelerate or to super-accelerate?. (arXiv:2001.06472v1 [cs.LG])
- i-flow: High-dimensional Integration and Sampling with Normalizing Flows. (arXiv:2001.05486v2 [physics.comp-ph] UPDATED)
- Smooth markets: A basic mechanism for organizing gradient-based learners. (arXiv:2001.04678v2 [cs.LG] UPDATED)
- DDSP: Differentiable Digital Signal Processing. (arXiv:2001.04643v1 [cs.LG])
- One Method to Rule Them All: Variance Reduction for Data, Parameters and Many New Methods. (arXiv:1905.11266v2 [math.OC] UPDATED)
- Smooth markets: A basic mechanism for organizing gradient-based learners. (arXiv:2001.04678v2 [cs.LG] UPDATED)
- DDSP: Differentiable Digital Signal Processing. (arXiv:2001.04643v1 [cs.LG])
- Asymptotic behavior of a nonautonomous evolution equation governed by a quasi-nonexpansive operator. (arXiv:2001.04628v3 [math.OC] UPDATED)
- Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems. (arXiv:2001.03724v2 [cs.LG] UPDATED)
- Log-Linear Dynamical Systems. (arXiv:2001.03649v3 [math.OC] UPDATED)
- Manifold Learning for Accelerating Coarse-Grained Optimization. (arXiv:2001.03518v2 [math.OC] UPDATED)
- Accelerated and nonaccelerated stochastic gradient descent with model conception. (arXiv:2001.03443v5 [math.OC] UPDATED)
- Dynamic Gauss Newton Metropolis Algorithm. (arXiv:2001.03530v1 [stat.CO])
- The Bayesian Update: Variational Formulations and Gradient Flows
- Lifted Hybrid Variational Inference. (arXiv:2001.02773v2 [cs.LG] UPDATED)
- How to trap a gradient flow. (arXiv:2001.02968v3 [math.OC] UPDATED)
- First-Order Algorithms for Constrained Nonlinear Dynamic Games. (arXiv:2001.01826v1 [eess.SY])
- Evolution Strategies Converges to Finite Differences. (arXiv:2001.01684v1 [cs.NE])
- The troublesome kernel -- On hallucinations, no free lunches and the accuracy-stability trade-off in inverse problems. (arXiv:2001.01258v3 [cs.LG] UPDATED)
- Scalable Gradients for Stochastic Differential Equations. (arXiv:2001.01328v6 [cs.LG] UPDATED)
- Decentralized Langevin Dynamics. (arXiv:2001.00665v2 [math.OC] UPDATED)
- Optimization of Mean-field Spin Glasses. (arXiv:2001.00904v1 [math.PR])
- Introduction to Nonsmooth Analysis and Optimization. (arXiv:2001.00216v4 [math.OC] UPDATED)
- Learning from Learning Machines: Optimisation, Rules, and Social Norms. (arXiv:2001.00006v1 [cs.CY])
- The L\'evy State Space Model. (arXiv:1912.12524v2 [math.PR] UPDATED)
- Stochastic gradient-free descents. (arXiv:1912.13305v4 [math.OC] UPDATED)
Saved in 2019
- Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation. (arXiv:1912.10583v2 [cs.LG] UPDATED)
- Second-order Information in First-order Optimization Methods. (arXiv:1912.09926v1 [cs.LG])
- Learning Convex Optimization Control Policies. (arXiv:1912.09529v1 [math.OC])
- Numerical Optimal Control of HIV Transmission in Octave/MATLAB. (arXiv:1912.09510v1 [math.OC])
- Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach. (arXiv:1912.09135v3 [eess.SY] UPDATED)
- On the Geometry of Bayesian Inference
- Finite-Time Convergence of Continuous-Time Optimization Algorithms via Differential Inclusions. (arXiv:1912.08342v1 [math.OC])
- Temporal Normalizing Flows. (arXiv:1912.09092v1 [physics.comp-ph])
- Online Gradient Descent for Linear Dynamical Systems. (arXiv:1912.09311v2 [math.OC] UPDATED)
- First order optimization methods based on Hessian-driven Nesterov accelerated gradient flow. (arXiv:1912.09276v2 [math.OC] UPDATED)
- On theoretical upper limits for valid timesteps of implicit ODE methods. (arXiv:1912.08900v1 [math.NA])
- Strong equivalence between metrics of Wasserstein type. (arXiv:1912.08247v3 [math.PR] UPDATED)
- Finite-Time Convergence of Continuous-Time Optimization Algorithms via Differential Inclusions. (arXiv:1912.08342v1 [math.OC])
- Thermodynamic interpretation of Wasserstein distance. (arXiv:1912.08405v1 [cond-mat.stat-mech])
- Continuous Limits for Constrained Ensemble Kalman Filter. (arXiv:1912.08406v2 [math.NA] UPDATED)
- A Control-Theoretic Perspective on Optimal High-Order Optimization. (arXiv:1912.07168v4 [math.OC] UPDATED)
- Optimal control of nonlinear stochastic differential equations on Hilbert spaces. (arXiv:1912.06541v1 [math.PR])
- Finite sample properties of parametric MMD estimation: robustness to misspecification and dependence. (arXiv:1912.05737v5 [math.ST] UPDATED)
- Mean-Field Neural ODEs via Relaxed Optimal Control. (arXiv:1912.05475v3 [math.PR] UPDATED)
- Neural Networks as Geometric Chaotic Maps. (arXiv:1912.05081v4 [cs.LG] UPDATED)
- Advances and Open Problems in Federated Learning. (arXiv:1912.04977v3 [cs.LG] UPDATED)
- differint: A Python Package for Numerical Fractional Calculus. (arXiv:1912.05303v1 [cs.MS])
- Deep Latent Factor Model for Collaborative Filtering. (arXiv:1912.04754v1 [cs.LG])
- From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions. (arXiv:1912.03513v2 [cs.AI] UPDATED)
- Temporal Wasserstein non-negative matrix factorization for non-rigid motion segmentation and spatiotemporal deconvolution. (arXiv:1912.03463v1 [stat.AP])
- Non-asymptotic error bounds for scaled underdamped Langevin MCMC. (arXiv:1912.03154v1 [stat.ML])
- Manifold Markov chain Monte Carlo methods for Bayesian inference in diffusion models. (arXiv:1912.02982v3 [stat.CO] UPDATED)
- Bregman dynamics, contact transformations and convex optimization. (arXiv:1912.02928v4 [math.OC] UPDATED)
- Affine invariant interacting Langevin dynamics for Bayesian inference. (arXiv:1912.02859v2 [math.NA] UPDATED)
- McKean Feynman-Kac probabilistic representations of non-linear partial differential equations. (arXiv:1912.03146v1 [math.PR])
- Stochastic proximal splitting algorithm for composite minimization. (arXiv:1912.02039v3 [math.OC] UPDATED)
- Lower Bounds for Non-Convex Stochastic Optimization. (arXiv:1912.02365v2 [math.OC] UPDATED)
- Deep Double Descent: Where Bigger Models and More Data Hurt. (arXiv:1912.02292v1 [cs.LG])
- Inferring the Optimal Policy using Markov Chain Monte Carlo. (arXiv:1912.02714v1 [cs.LG])
- Analysis of the Optimization Landscapes for Overcomplete Representation Learning. (arXiv:1912.02427v1 [cs.LG])
- A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms. (arXiv:1912.02270v3 [math.OC] UPDATED)
- Variability as a better characterization of Shannon entropy. (arXiv:1912.02012v2 [cond-mat.stat-mech] UPDATED)
- Dream to Control: Learning Behaviors by Latent Imagination. (arXiv:1912.01603v3 [cs.LG] UPDATED)
- Numerical Gaussian process Kalman filtering. (arXiv:1912.01234v2 [stat.ML] UPDATED)
- Wasserstein Proximal Algorithms for the Schr\"{o}dinger Bridge Problem: Density Control with Nonlinear Drift. (arXiv:1912.01244v2 [math.OC] UPDATED)
- On the geometry of Stein variational gradient descent. (arXiv:1912.00894v2 [stat.ML] UPDATED)
- On the Heavy-Tailed Theory of Stochastic Gradient Descent for Deep Neural Networks. (arXiv:1912.00018v1 [stat.ML])
- Proximal Splitting Algorithms for Convex Optimization: A Tour of Recent Advances, with New Twists. (arXiv:1912.00137v8 [math.OC] UPDATED)
- Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents. (arXiv:1912.00498v1 [cs.LG])
- The Nonstochastic Control Problem. (arXiv:1911.12178v2 [cs.LG] UPDATED)
- Functional Bayesian Filter. (arXiv:1911.10606v1 [eess.SP])
- JKO estimates in linear and non-linear Fokker-Planck equations, and Keller-Segel: L p and Sobolev bounds. (arXiv:1911.10999v1 [math.AP])
- Causality for Machine Learning. (arXiv:1911.10500v2 [cs.LG] UPDATED)
- Fokker-Planck particle systems for Bayesian inference: Computational approaches. (arXiv:1911.10832v3 [math.NA] UPDATED)
- Neural Integration of Continuous Dynamics. (arXiv:1911.10309v1 [cs.LG])
- Nonlinear Covariance Control via Differential Dynamic Programming. (arXiv:1911.09283v1 [eess.SY])
- On the Discretization of Robust Exact Filtering Differentiators. (arXiv:1911.09232v1 [eess.SY])
- vqSGD: Vector Quantized Stochastic Gradient Descent. (arXiv:1911.07971v4 [cs.LG] UPDATED)
- Bayesian interpretation of SGD as Ito process. (arXiv:1911.09011v1 [stat.ML])
- Adaptive Gradient Descent for Convex and Non-Convex Stochastic Optimization. (arXiv:1911.08380v5 [math.OC] UPDATED)
- Convergence Analysis of a Momentum Algorithm with Adaptive Step Size for Non Convex Optimization. (arXiv:1911.07596v2 [math.OC] UPDATED)
- Gradientless Descent: High-Dimensional Zeroth-Order Optimization. (arXiv:1911.06317v4 [cs.LG] UPDATED)
- Coarse-graining of non-reversible stochastic differential equations: quantitative results and connections to averaging. (arXiv:1911.06081v2 [math.AP] UPDATED)
- Shadowing Properties of Optimization Algorithms. (arXiv:1911.05206v1 [math.OC])
- Error bounds for some approximate posterior measures in Bayesian inference. (arXiv:1911.05669v2 [math.ST] UPDATED)
- Constructing Gradient Controllable Recurrent Neural Networks Using Hamiltonian Dynamics. (arXiv:1911.05035v2 [cs.LG] UPDATED)
- A Simple Differentiable Programming Language. (arXiv:1911.04523v4 [cs.PL] UPDATED)
- Fitness Optimization and Evolution of Permanent Replicator Systems. (arXiv:1911.02893v1 [q-bio.PE])
- Online learning-based Model Predictive Control with Gaussian Process Models and Stability Guarantees. (arXiv:1911.03315v4 [eess.SY] UPDATED)
- Estimating Normalizing Constants for Log-Concave Distributions: Algorithms and Lower Bounds. (arXiv:1911.03043v2 [cs.DS] UPDATED)
- Optimizing Millions of Hyperparameters by Implicit Differentiation. (arXiv:1911.02590v1 [cs.LG])
- Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse. (arXiv:1911.02469v1 [cs.LG])
- Information-Theoretic Generalization Bounds for SGLD via Data-Dependent Estimates. (arXiv:1911.02151v3 [stat.ML] UPDATED)
- Convergence Acceleration of Ensemble Kalman Inversion in Nonlinear Settings. (arXiv:1911.02424v2 [math.NA] UPDATED)
- Cooperative transport with selective kinetic constraints. (arXiv:1911.02252v1 [cond-mat.stat-mech])
- Stein Variational Gradient Descent With Matrix-Valued Kernels. (arXiv:1910.12794v2 [stat.ML] UPDATED)
- A Rule for Gradient Estimator Selection, with an Application to Variational Inference. (arXiv:1911.01894v1 [cs.LG])
- Importance Sampling via Local Sensitivity. (arXiv:1911.01575v2 [math.OC] UPDATED)
- Proximal Langevin Algorithm: Rapid Convergence Under Isoperimetry. (arXiv:1911.01469v1 [stat.ML])
- Online matrix factorization for Markovian data and applications to Network Dictionary Learning. (arXiv:1911.01931v6 [cs.LG] UPDATED)
- Acceleration via Symplectic Discretization of High-Resolution Differential Equations. (arXiv:1902.03694v2 [math.OC] UPDATED)
- On the convergence of stochastic primal-dual hybrid gradient. (arXiv:1911.00799v3 [math.OC] UPDATED)
- Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations. (arXiv:1911.00756v1 [cs.LG])
- Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations. (arXiv:1911.00756v1 [cs.LG])
- Gradient-based Adaptive Markov Chain Monte Carlo. (arXiv:1911.01373v2 [stat.ML] UPDATED)
- Laplacian Smoothing Stochastic Gradient Markov Chain Monte Carlo. (arXiv:1911.00782v1 [cs.LG])
- Does Adam optimizer keep close to the optimal point?. (arXiv:1911.00289v1 [cs.LG])
- Stability of Non-linear Filter for Deterministic Dynamics. (arXiv:1910.14348v4 [math.OC] UPDATED)
- Statistical Estimation of the Poincar{\'e} constant and Application to Sampling Multimodal Distributions. (arXiv:1910.14564v2 [math.PR] UPDATED)
- Mixing of Stochastic Accelerated Gradient Descent. (arXiv:1910.14616v1 [math.OC])
- Backward Nonlinear Smoothing Diffusions. (arXiv:1910.14511v1 [math.PR])
- On the Convergence of Local Descent Methods in Federated Learning. (arXiv:1910.14425v2 [cs.LG] UPDATED)
- A Decentralized Proximal Point-type Method for Saddle Point Problems. (arXiv:1910.14380v1 [math.OC])
- Parameter elimination in particle Gibbs sampling. (arXiv:1910.14145v1 [stat.CO])
- Unifying mirror descent and dual averaging. (arXiv:1910.13742v4 [math.OC] UPDATED)
- Adaptive Sampling Quasi-Newton Methods for Derivative-Free Stochastic Optimization. (arXiv:1910.13516v1 [math.OC])
- Understanding the Role of Momentum in Stochastic Gradient Methods. (arXiv:1910.13962v1 [cs.LG])
- Continuous Control with Contexts, Provably. (arXiv:1910.13614v1 [cs.LG])
- Jump Markov Chains and Rejection-Free Metropolis Algorithms. (arXiv:1910.13316v3 [math.ST] UPDATED)
- Bridging the ELBO and MMD. (arXiv:1910.13181v1 [cs.LG])
- Efficiently avoiding saddle points with zero order methods: No gradients required. (arXiv:1910.13021v1 [math.OC])
- Ensemble Kalman Sampler: mean-field limit and convergence analysis. (arXiv:1910.12923v3 [math.NA] UPDATED)
- On the Global Convergence of (Fast) Incremental Expectation Maximization Methods. (arXiv:1910.12521v1 [stat.ML])
- Differentiable Convex Optimization Layers. (arXiv:1910.12430v1 [cs.LG])
- Asynchronous Decentralized SGD with Quantized and Local Updates. (arXiv:1910.12308v4 [cs.LG] UPDATED)
- Improved Zeroth-Order Variance Reduced Algorithms and Analysis for Nonconvex Optimization. (arXiv:1910.12166v1 [cs.LG])
- Mirror Natural Evolution Strategies. (arXiv:1910.11490v1 [math.OC])
- Optimizer Benchmarking Needs to Account for Hyperparameter Tuning. (arXiv:1910.11758v4 [cs.LG] UPDATED)
- Variational Predictive Information Bottleneck. (arXiv:1910.10831v1 [cs.LG])
- A Continuous-time Perspective for Modeling Acceleration in Riemannian Optimization. (arXiv:1910.10782v3 [math.OC] UPDATED)
- Wasserstein information matrix. (arXiv:1910.11248v5 [math.ST] UPDATED)
- Wasserstein total variation filtering. (arXiv:1910.10822v1 [eess.SP])
- Non-Gaussianity of Stochastic Gradient Noise. (arXiv:1910.09626v2 [cs.LG] UPDATED)
- Proximal Adam: Robust Adaptive Update Scheme for Constrained Optimization. (arXiv:1910.10094v2 [math.OC] UPDATED)
- On Distributed Stochastic Gradient Algorithms for Global Optimization. (arXiv:1910.09587v2 [math.OC] UPDATED)
- Image processing in DNA. (arXiv:1910.10095v2 [eess.IV] UPDATED)
- Causal bootstrapping. (arXiv:1910.09648v3 [cs.LG] UPDATED)
- Bridging the Gap Between $f$-GANs and Wasserstein GANs. (arXiv:1910.09779v2 [cs.LG] UPDATED)
- Kernelized Wasserstein Natural Gradient. (arXiv:1910.09652v4 [stat.ML] UPDATED)
- Collapsed Amortized Variational Inference for Switching Nonlinear Dynamical Systems. (arXiv:1910.09588v2 [cs.LG] UPDATED)
- Is There an Analog of Nesterov Acceleration for MCMC?. (arXiv:1902.00996v2 [stat.ML] UPDATED)
- Convergence Guarantees for a Class of Non-convex and Non-smooth Optimization Problems
- All-Action Policy Gradient Methods: A Numerical Integration Approach. (arXiv:1910.09093v1 [cs.LG])
- Variational Integrator Networks for Physically Structured Embeddings. (arXiv:1910.09349v2 [stat.ML] UPDATED)
- Integrals over Gaussians under Linear Domain Constraints. (arXiv:1910.09328v2 [cs.LG] UPDATED)
- Aggregated Gradient Langevin Dynamics. (arXiv:1910.09223v1 [cs.LG])
- From Importance Sampling to Doubly Robust Policy Gradient. (arXiv:1910.09066v3 [cs.LG] UPDATED)
- Fitting a Kalman Smoother to Data. (arXiv:1910.08615v1 [math.OC])
- Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks. (arXiv:1910.08701v4 [math.OC] UPDATED)
- Anderson Acceleration of Proximal Gradient Methods. (arXiv:1910.08590v2 [math.OC] UPDATED)
- Adaptive Gradient Descent without Descent. (arXiv:1910.09529v2 [math.OC] UPDATED)
- Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited. (arXiv:1910.07663v1 [cs.LG])
- Stochastic Resetting and Applications. (arXiv:1910.07993v2 [cond-mat.stat-mech] UPDATED)
- Sharper bounds for uniformly stable algorithms. (arXiv:1910.07833v2 [cs.LG] UPDATED)
- First-Order Preconditioning via Hypergradient Descent. (arXiv:1910.08461v2 [cs.LG] UPDATED)
- On Connections between Constrained Optimization and Reinforcement Learning. (arXiv:1910.08476v2 [cs.LG] UPDATED)
- MultiVerse: Causal Reasoning using Importance Sampling in Probabilistic Programming. (arXiv:1910.08091v2 [cs.AI] UPDATED)
- Sharper bounds for uniformly stable algorithms. (arXiv:1910.07833v2 [cs.LG] UPDATED)
- A Stochastic Variance Reduced Nesterov's Accelerated Quasi-Newton Method. (arXiv:1910.07939v1 [cs.LG])
- SCAFFOLD: Stochastic Controlled Averaging for Federated Learning. (arXiv:1910.06378v4 [cs.LG] UPDATED)
- On Higher-order Moments in Adam. (arXiv:1910.06878v1 [cs.LG])
- A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme. (arXiv:1910.06419v1 [cs.LG])
- Bayesian Temporal Factorization for Multidimensional Time Series Prediction. (arXiv:1910.06366v2 [stat.ML] UPDATED)
- ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization. (arXiv:1910.06513v2 [cs.LG] UPDATED)
- A parameterized Douglas-Rachford Splitting algorithm for nonconvex optimization. (arXiv:1910.05544v2 [math.OC] UPDATED)
- TorchBeast: A PyTorch Platform for Distributed RL. (arXiv:1910.03552v1 [cs.LG])
- Stochastic Optimal Control as Approximate Input Inference. (arXiv:1910.03003v2 [cs.LG] UPDATED)
- Validated Variational Inference via Practical Posterior Error Bounds. (arXiv:1910.04102v4 [stat.ML] UPDATED)
- Projection-free nonconvex stochastic optimization on Riemannian manifolds. (arXiv:1910.04194v3 [math.OC] UPDATED)
- Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods. (arXiv:1910.04295v1 [math.OC])
- One Sample Stochastic Frank-Wolfe. (arXiv:1910.04322v1 [math.OC])
- Robust Convergence Analysis of Three-Operator Splitting. (arXiv:1910.04229v3 [math.OC] UPDATED)
- Black-box Optimizer with Implicit Natural Gradient. (arXiv:1910.04301v3 [cs.LG] UPDATED)
- Asymmetric Multiresolution Matrix Factorization. (arXiv:1910.05132v1 [math.NA])
- Demon: Improved Neural Network Training with Momentum Decay. (arXiv:1910.04952v4 [cs.LG] UPDATED)
- Stochastic Optimal Control as Approximate Input Inference. (arXiv:1910.03003v2 [cs.LG] UPDATED)
- Policy Optimization Through Approximate Importance Sampling. (arXiv:1910.03857v2 [cs.LG] UPDATED)
- The fastest $\ell_{1,\infty}$ prox in the west. (arXiv:1910.03749v1 [cs.LG])
- Variance reduction for Markov chains with application to MCMC. (arXiv:1910.03643v2 [math.ST] UPDATED)
- Distilling Importance Sampling for Likelihood Free Inference. (arXiv:1910.03632v6 [stat.CO] UPDATED)
- Bregman Proximal Framework for Deep Linear Neural Networks. (arXiv:1910.03638v1 [math.OC])
- Frame Soft Shrinkage as Proximity Operator. (arXiv:1910.02843v2 [math.OC] UPDATED)
- An Optimal Transport Formulation of the Ensemble Kalman Filter. (arXiv:1910.02338v1 [eess.SY])
- Nonasymptotic estimates for Stochastic Gradient Langevin Dynamics under local conditions in nonconvex optimization. (arXiv:1910.02008v5 [math.ST] UPDATED)
- Scalable Global Optimization via Local Bayesian Optimization. (arXiv:1910.01739v4 [cs.LG] UPDATED)
- Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent. (arXiv:1910.01277v1 [math.OC])
- Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator. (arXiv:1910.01249v1 [cs.LG])
- The Neural Moving Average Model for Scalable Variational Inference of State Space Models. (arXiv:1910.00879v2 [stat.ML] UPDATED)
- Learning Neural Causal Models from Unknown Interventions. (arXiv:1910.01075v2 [stat.ML] UPDATED)
- Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach. (arXiv:1910.00783v1 [math.OC])
- First-order primal-dual methods for nonsmooth nonconvex optimisation. (arXiv:1910.00115v3 [math.OC] UPDATED)
- An Efficient Sampling Algorithm for Non-smooth Composite Potentials. (arXiv:1910.00551v1 [stat.ML])
- DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs. (arXiv:1909.13003v4 [cs.LG] UPDATED)
- Linearly implicit structure-preserving schemes for Hamiltonian systems. (arXiv:1901.03573v3 [math.NA] UPDATED)
- Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control. (arXiv:1909.12077v4 [cs.LG] UPDATED)
- Necessary and Sufficient Geometries for Gradient Methods. (arXiv:1909.10455v2 [math.OC] UPDATED)
- Finding the forward-Douglas-Rachford-forward method. (arXiv:1909.09747v2 [math.OC] UPDATED)
- Contractivity of Runge-Kutta methods for convex gradient systems. (arXiv:1909.09971v3 [math.NA] UPDATED)
- Particle Smoothing Variational Objectives. (arXiv:1909.09734v1 [stat.ML])
- On the Interplay between Acceleration and Identification for the Proximal Gradient algorithm. (arXiv:1909.08944v2 [math.OC] UPDATED)
- Unconditional convergence for discretizations of dynamical optimal transport. (arXiv:1909.08790v2 [math.NA] UPDATED)
- The Generalized Bregman Distance. (arXiv:1909.08206v2 [math.FA] UPDATED)
- Langevin Markov Chain Monte Carlo with stochastic gradients. (arXiv:1805.08863v2 [stat.ME] UPDATED)
- Riemannian Proximal Gradient Methods (extended version). (arXiv:1909.06065v4 [math.OC] UPDATED)
- Introduction to Online Convex Optimization. (arXiv:1909.05207v3 [cs.LG] UPDATED)
- Accelerated Information Gradient flow. (arXiv:1909.02102v3 [math.OC] UPDATED)
- A Diffusion Process Perspective on Posterior Contraction Rates for Parameters. (arXiv:1909.00966v2 [math.ST] UPDATED)
- Neural Policy Gradient Methods: Global Optimality and Rates of Convergence. (arXiv:1909.01150v3 [cs.LG] UPDATED)
- Accelerating ADMM for Efficient Simulation and Optimization. (arXiv:1909.00470v1 [cs.GR])
- Anderson Accelerated Douglas-Rachford Splitting. (arXiv:1908.11482v4 [math.OC] UPDATED)
- Inexact Proximal-Point Penalty Methods for Constrained Non-Convex Optimization. (arXiv:1908.11518v4 [math.OC] UPDATED)
- Note on Interacting Langevin Diffusions: Gradient Structure and Ensemble Kalman Sampler by Garbuno-Inigo, Hoffmann, Li and Stuart. (arXiv:1908.10890v1 [math.DS])
- High-Order Langevin Diffusion Yields an Accelerated MCMC Algorithm. (arXiv:1908.10859v2 [stat.ML] UPDATED)
- On the stability of optimization algorithms given by discretizations of the Euler-Lagrange ODE. (arXiv:1908.10426v1 [math.OC])
- Forward-Mode Differentiation of Maxwell's Equations. (arXiv:1908.10507v1 [physics.optics])
- Hypocoercivity properties of adaptive Langevin dynamics. (arXiv:1908.09363v3 [math.PR] UPDATED)
- Wasserstein Gradient Flow Formulation of the Time-Fractional Fokker-Planck Equation. (arXiv:1908.09055v2 [math.NA] UPDATED)
- Proximal gradient flow and Douglas-Rachford splitting dynamics: global exponential stability via integral quadratic constraints. (arXiv:1908.09043v2 [math.OC] UPDATED)
- Normalizing Flows: An Introduction and Review of Current Methods. (arXiv:1908.09257v4 [stat.ML] UPDATED)
- Variational Extrapolation of Implicit Schemes for General Gradient Flows. (arXiv:1908.10246v2 [math.NA] UPDATED)
- Accelerating proximal Markov chain Monte Carlo by using an explicit stabilised method. (arXiv:1908.08845v3 [stat.CO] UPDATED)
- Wasserstein Distributionally Robust Optimization: Theory and Applications in Machine Learning. (arXiv:1908.08729v1 [stat.ML])
- Automatic and Simultaneous Adjustment of Learning Rate and Momentum for Stochastic Gradient Descent. (arXiv:1908.07607v1 [stat.ML])
- Nesterov's method with decreasing learning rate leads to accelerated stochastic gradient descent. (arXiv:1908.07861v5 [math.OC] UPDATED)
- Causality from the Point of View of Statistics. (arXiv:1908.07301v8 [math.ST] UPDATED)
- Warped Proximal Iterations for Monotone Inclusions. (arXiv:1908.07077v5 [math.OC] UPDATED)
- Nonlinear Forward-Backward Splitting with Projection Correction. (arXiv:1908.07449v3 [math.OC] UPDATED)
- A Framework for Population-Based Stochastic Optimization on Abstract Riemannian Manifolds. (arXiv:1908.06783v3 [math.OC] UPDATED)
- A reflected forward-backward splitting method for monotone inclusions involving Lipschitzian operators. (arXiv:1908.05912v1 [math.OC])
- A discretization of Caputo derivatives with application to time fractional SDEs and gradient flows. (arXiv:1901.03159v2 [math.NA] UPDATED)
- Pearson Distance is not a Distance. (arXiv:1908.06029v1 [stat.ME])
- Ensemble Kalman Inversion: mean-field limit and convergence analysis. (arXiv:1908.05575v5 [math.NA] UPDATED)
- Convergence Rates of Variational Inference in Sparse Deep Learning. (arXiv:1908.04847v2 [math.ST] UPDATED)
- On explicit $L^2$-convergence rate estimate for underdamped Langevin dynamics. (arXiv:1908.04746v7 [math.AP] UPDATED)
- A multi-level ADMM algorithm for elliptic PDE-constrained optimization problems. (arXiv:1908.04652v1 [math.OC])
- Bregman Forward-Backward Operator Splitting. (arXiv:1908.03878v3 [math.OC] UPDATED)
- Gradient flows and proximal splitting methods: A unified view on accelerated and stochastic optimization. (arXiv:1908.00865v5 [math.OC] UPDATED)
- Approximation Capabilities of Neural ODEs and Invertible Residual Networks. (arXiv:1907.12998v2 [cs.LG] UPDATED)
- Bayesian Robustness: A Nonasymptotic Viewpoint. (arXiv:1907.11826v1 [stat.ML])
- The Wang-Landau Algorithm as Stochastic Optimization and Its Acceleration. (arXiv:1907.11985v2 [stat.CO] UPDATED)
- Variational f-divergence Minimization. (arXiv:1907.11891v1 [stat.ML])
- Improved Bounds for Discretization of Langevin Diffusions: Near-Optimal Rates without Convexity. (arXiv:1907.11331v2 [math.PR] UPDATED)
- Convergence rates for the stochastic gradient descent method for non-convex objective functions. (arXiv:1904.01517v2 [math.NA] UPDATED)
- Variable Metric Forward-Backward Algorithm for Composite Minimization Problems. (arXiv:1907.11486v3 [math.OC] UPDATED)
- Transport Monte Carlo: High-Accuracy Posterior Approximation via Random Transport. (arXiv:1907.10448v6 [stat.CO] UPDATED)
- First-order optimization algorithms via inertial systems with Hessian driven damping. (arXiv:1907.10536v2 [math.OC] UPDATED)
- On importance-weighted autoencoders. (arXiv:1907.10477v2 [stat.ML] UPDATED)
- On URANS Congruity with Time Averaging: Analytical laws suggest improved models. (arXiv:1907.10092v1 [math.NA])
- LQR through the Lens of First Order Methods: Discrete-time Case. (arXiv:1907.08921v2 [eess.SY] UPDATED)
- A New computation reduction based nonlinear Kalman filter. (arXiv:1907.09450v1 [eess.SY])
- Lookahead Optimizer: k steps forward, 1 step back. (arXiv:1907.08610v2 [cs.LG] UPDATED)
- Stochastic gradient Markov chain Monte Carlo. (arXiv:1907.06986v1 [stat.CO])
- Dynamical Systems as Temporal Feature Spaces. (arXiv:1907.06382v3 [cs.LG] UPDATED)
- Variational Autoencoders and Nonlinear ICA: A Unifying Framework. (arXiv:1907.04809v4 [stat.ML] UPDATED)
- Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model. (arXiv:1907.04164v2 [cs.LG] UPDATED)
- Entropic Regularization of Markov Decision Processes. (arXiv:1907.04214v2 [cs.LG] UPDATED)
- GP-VAE: Deep Probabilistic Time Series Imputation. (arXiv:1907.04155v5 [stat.ML] UPDATED)
- Latent ODEs for Irregularly-Sampled Time Series. (arXiv:1907.03907v1 [cs.LG])
- Unified Optimal Analysis of the (Stochastic) Gradient Method. (arXiv:1907.04232v2 [cs.LG] UPDATED)
- Policy-Gradient Algorithms Have No Guarantees of Convergence in Linear Quadratic Games. (arXiv:1907.03712v2 [cs.LG] UPDATED)
- Stochastic Gradient and Langevin Processes. (arXiv:1907.03215v7 [cs.LG] UPDATED)
- Nesterov's acceleration and Polyak's heavy ball method in continuous time: convergence rate analysis under geometric conditions and perturbations. (arXiv:1907.02710v1 [math.OC])
- Algorithms of Robust Stochastic Optimization Based on Mirror Descent Method. (arXiv:1907.02707v1 [math.ST])
- Learning Latent Dynamics for Partially-Observed Chaotic Systems. (arXiv:1907.02452v1 [stat.ML])
- Fisher information regularization schemes for Wasserstein gradient flows. (arXiv:1907.02152v2 [math.NA] UPDATED)
- Distributed Learning in Non-Convex Environments -- Part I: Agreement at a Linear Rate. (arXiv:1907.01848v1 [math.OC])
- Distributed Learning in Non-Convex Environments -- Part II: Polynomial Escape from Saddle-Points. (arXiv:1907.01849v1 [cs.MA])
- Causal models on probability spaces. (arXiv:1907.01672v1 [math.ST])
- The Role of Memory in Stochastic Optimization. (arXiv:1907.01678v2 [cs.LG] UPDATED)
- Gradient flow formulations of discrete and continuous evolutionary models: a unifying perspective. (arXiv:1907.01681v2 [q-bio.PE] UPDATED)
- Learning the Arrow of Time. (arXiv:1907.01285v1 [cs.LG])
- Lecture Notes on Stochastic Processes. (arXiv:1907.01060v6 [math.PR] UPDATED)
- An Introduction to Mean Field Games using probabilistic methods. (arXiv:1907.01411v1 [math.OC])
- Conjugate Gradients and Accelerated Methods Unified: The Approximate Duality Gap View. (arXiv:1907.00289v3 [math.OC] UPDATED)
- Branching Particle Pricers with Heston Examples. (arXiv:1907.00219v2 [q-fin.CP] UPDATED)
- The Thermodynamic Variational Objective. (arXiv:1907.00031v5 [cs.LG] UPDATED)
- Convergence Rates of Gaussian ODE Filters. (arXiv:1807.09737v3 [math.NA] UPDATED)
- Causal Regularization. (arXiv:1906.12179v1 [stat.ML])
- Neural ODEs as the Deep Limit of ResNets with constant weights. (arXiv:1906.12183v2 [stat.ML] UPDATED)
- Accelerated Symmetric ADMM and Its Applications in Signal Processing. (arXiv:1906.12015v2 [math.NA] UPDATED)
- Combining Stochastic Adaptive Cubic Regularization with Negative Curvature for Nonconvex Optimization. (arXiv:1906.11417v1 [math.OC])
- Complexity of Highly Parallel Non-Smooth Convex Optimization. (arXiv:1906.10655v2 [math.OC] UPDATED)
- A Theoretical Connection Between Statistical Physics and Reinforcement Learning. (arXiv:1906.10228v2 [cs.LG] UPDATED)
- A story of balls, randomness and PDEs. (arXiv:1906.09830v1 [math.PR] CROSS LISTED)
- A Unifying Framework for Variance Reduction Algorithms for Finding Zeroes of Monotone Operators. (arXiv:1906.09437v2 [stat.ML] UPDATED)
- Discrete gradients for computational Bayesian inference. (arXiv:1903.00186v4 [math.NA] UPDATED)
- Logarithmic divergences: geometry and interpretation of curvature. (arXiv:1906.09103v1 [math.DG])
- Bounding the error of discretized Langevin algorithms for non-strongly log-concave targets. (arXiv:1906.08530v3 [math.ST] UPDATED)
- Minimum Stein Discrepancy Estimators. (arXiv:1906.08283v3 [math.ST] UPDATED)
- Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies. (arXiv:1906.08383v3 [math.OC] UPDATED)
- Stochastic Runge-Kutta Accelerates Langevin Monte Carlo and Beyond. (arXiv:1906.07868v3 [stat.ML] UPDATED)
- Stochastic Runge-Kutta Accelerates Langevin Monte Carlo and Beyond. (arXiv:1906.07868v3 [stat.ML] UPDATED)
- Differentiable probabilistic models of scientific imaging with the Fourier slice theorem. (arXiv:1906.07582v2 [cs.LG] UPDATED)
- Escaping from saddle points on Riemannian manifolds. (arXiv:1906.07355v1 [math.OC])
- SNODE: Spectral Discretization of Neural ODEs for System Identification. (arXiv:1906.07038v2 [cs.NE] UPDATED)
- Is the Policy Gradient a Gradient?. (arXiv:1906.07073v2 [cs.LG] UPDATED)
- Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces. (arXiv:1906.06062v2 [cs.LG] UPDATED)
- Non-convex optimization via strongly convex majoirziation-minimization. (arXiv:1906.05608v1 [math.OC])
- Is Deep Learning a Renormalization Group Flow?. (arXiv:1906.05212v2 [cs.LG] UPDATED)
- Maximum Mean Discrepancy Gradient Flow. (arXiv:1906.04370v2 [stat.ML] UPDATED)
- ANODEV2: A Coupled Neural ODE Evolution Framework. (arXiv:1906.04596v1 [cs.LG])
- Learning to Score Behaviors for Guided Policy Optimization. (arXiv:1906.04349v4 [cs.LG] UPDATED)
- Adaptively Preconditioned Stochastic Gradient Langevin Dynamics. (arXiv:1906.04324v2 [cs.LG] UPDATED)
- Continuous Time Analysis of Momentum Methods. (arXiv:1906.04285v2 [cs.LG] UPDATED)
- Analysis of Optimization Algorithms via Sum-of-Squares. (arXiv:1906.04648v4 [math.OC] UPDATED)
- Efficiently escaping saddle points on manifolds. (arXiv:1906.04321v3 [math.OC] UPDATED)
- On a Combination of Alternating Minimization and Nesterov's Momentum. (arXiv:1906.03622v5 [math.OC] UPDATED)
- Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods. (arXiv:1902.08297v3 [math.OC] UPDATED)
- Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization. (arXiv:1906.03830v1 [cs.LG])
- Disentangled State Space Representations. (arXiv:1906.03255v1 [stat.ML])
- Hamiltonian descent for composite objectives. (arXiv:1906.02608v2 [math.OC] UPDATED)
- An Introduction to Variational Autoencoders. (arXiv:1906.02691v3 [cs.LG] UPDATED)
- Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise. (arXiv:1906.02355v1 [cs.LG])
- Last-iterate convergence rates for min-max optimization. (arXiv:1906.02027v3 [math.OC] UPDATED)
- Streaming Variational Monte Carlo. (arXiv:1906.01549v4 [stat.ML] UPDATED)
- On the Efficiency of Entropic Regularized Algorithms for Optimal Transport. (arXiv:1906.01437v9 [cs.DS] UPDATED)
- A Generic Acceleration Framework for Stochastic Composite Optimization. (arXiv:1906.01164v3 [math.OC] UPDATED)
- Generalized Momentum-Based Methods: A Hamiltonian Perspective. (arXiv:1906.00436v3 [math.OC] UPDATED)
- On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems. (arXiv:1906.00331v9 [cs.LG] UPDATED)
- An implicit gradient-descent procedure for minimax problems. (arXiv:1906.00233v1 [math.OC])
- Pseudo-Riemannian geometry embeds information geometry in optimal transport. (arXiv:1906.00030v4 [math.DG] UPDATED)
- A Stochastic Derivative Free Optimization Method with Momentum. (arXiv:1905.13278v2 [math.OC] UPDATED)
- Implicit Regularization in Deep Matrix Factorization. (arXiv:1905.13655v3 [cs.LG] UPDATED)
- Langevin Monte Carlo without smoothness. (arXiv:1905.13285v3 [stat.ML] UPDATED)
- Stochastic Sign Descent Methods: New Algorithms and Better Theory. (arXiv:1905.12938v5 [math.OC] UPDATED)
- On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case. (arXiv:1905.13142v4 [math.ST] UPDATED)
- Zeroth-Order Stochastic Alternating Direction Method of Multipliers for Nonconvex Nonsmooth Optimization. (arXiv:1905.12729v2 [math.OC] UPDATED)
- Acceleration in First Order Quasi-strongly Convex Optimization by ODE Discretization. (arXiv:1905.12436v1 [math.OC])
- Switching Linear Dynamics for Variational Bayes Filtering. (arXiv:1905.12434v1 [stat.ML])
- Fast mixing of Metropolized Hamiltonian Monte Carlo: Benefits of multi-step gradients. (arXiv:1905.12247v3 [stat.ML] UPDATED)
- Approximate Guarantees for Dictionary Learning. (arXiv:1905.12091v1 [cs.LG])
- Efficient MCMC Sampling with Dimension-Free Convergence Rate using ADMM-type Splitting. (arXiv:1905.11937v6 [stat.CO] UPDATED)
- Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates. (arXiv:1905.11768v2 [stat.ML] UPDATED)
- An Accelerated Decentralized Stochastic Proximal Algorithm for Finite Sums. (arXiv:1905.11394v2 [math.OC] UPDATED)
- Direct Nonlinear Acceleration. (arXiv:1905.11692v1 [math.OC])
- Revisiting Stochastic Extragradient. (arXiv:1905.11373v2 [math.OC] UPDATED)
- ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems. (arXiv:1905.10899v3 [cs.LG] UPDATED)
- Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models. (arXiv:1905.10857v2 [cs.LG] UPDATED)
- Infinitely deep neural networks as diffusion processes. (arXiv:1905.11065v3 [stat.ML] UPDATED)
- ODE$^2$VAE: Deep generative second order ODEs with Bayesian neural networks. (arXiv:1905.10994v2 [stat.ML] UPDATED)
- Physics-informed Autoencoders for Lyapunov-stable Fluid Flow Prediction. (arXiv:1905.10866v1 [physics.comp-ph])
- Regularity as Regularization: Smooth and Strongly Convex Brenier Potentials in Optimal Transport. (arXiv:1905.10812v5 [stat.ML] UPDATED)
- Neural Jump Stochastic Differential Equations. (arXiv:1905.10403v3 [cs.LG] UPDATED)
- Robustness of accelerated first-order algorithms for strongly convex optimization problems. (arXiv:1905.11011v2 [math.OC] UPDATED)
- Neural ODEs with stochastic vector field mixtures. (arXiv:1905.09905v1 [cs.LG])
- Neural Stochastic Differential Equations: Deep Latent Gaussian Models in the Diffusion Limit. (arXiv:1905.09883v2 [cs.LG] UPDATED)
- Accelerating Langevin Sampling with Birth-death. (arXiv:1905.09863v1 [stat.ML])
- Stochastic Proximal Methods for Non-Smooth Non-Convex Constrained Sparse Optimization. (arXiv:1905.10188v1 [math.OC])
- A First-Order Approach To Accelerated Value Iteration. (arXiv:1905.09963v7 [math.OC] UPDATED)
- Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates. (arXiv:1905.09997v5 [cs.LG] UPDATED)
- Relaxation Runge-Kutta Methods: Conservation and stability for Inner-Product Norms. (arXiv:1905.09847v1 [math.NA])
- A Condition Number for Hamiltonian Monte Carlo. (arXiv:1905.09813v3 [stat.CO] UPDATED)
- Kernel Wasserstein Distance. (arXiv:1905.09314v1 [cs.LG])
- A Linearly Convergent Proximal Gradient Algorithm for Decentralized Optimization. (arXiv:1905.07996v2 [math.OC] UPDATED)
- Mean-Field Langevin Dynamics and Energy Landscape of Neural Networks. (arXiv:1905.07769v3 [math.PR] UPDATED)
- A Dynamical Systems Perspective on Nesterov Acceleration. (arXiv:1905.07436v1 [math.OC])
- Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. (arXiv:1905.07357v1 [cs.LG])
- Stability of Linear Structural Equation Models of Causal Inference. (arXiv:1905.06836v3 [cs.LG] UPDATED)
- Hybrid Stochastic Gradient Descent Algorithms for Stochastic Nonconvex Optimization. (arXiv:1905.05920v1 [math.OC])
- Minimax estimation of smooth optimal transport maps. (arXiv:1905.05828v3 [math.ST] UPDATED)
- Differentiable Linearized ADMM. (arXiv:1905.06179v1 [cs.LG])
- Variational approximations using Fisher divergence. (arXiv:1905.05284v1 [stat.ML])
- The sharp, the flat and the shallow: Can weakly interacting agents learn to escape bad minima?. (arXiv:1905.04121v1 [stat.ML])
- A Contrastive Divergence for Combining Variational Inference and MCMC. (arXiv:1905.04062v2 [stat.ML] UPDATED)
- Inexact Block Coordinate Descent Algorithms for Nonsmooth Nonconvex Optimization. (arXiv:1905.04211v5 [math.OC] UPDATED)
- The sharp, the flat and the shallow: Can weakly interacting agents learn to escape bad minima?. (arXiv:1905.04121v1 [stat.ML])
- A Contrastive Divergence for Combining Variational Inference and MCMC. (arXiv:1905.04062v2 [stat.ML] UPDATED)
- Inverse optimal transport. (arXiv:1905.03950v1 [math.OC])
- Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting. (arXiv:1905.03806v2 [stat.ML] UPDATED)
- Proximal Distance Algorithms: Theory and Practice
- Importance Weighted Hierarchical Variational Inference. (arXiv:1905.03290v1 [stat.ML])
- Approximate Bayesian computation with the Wasserstein distance. (arXiv:1905.03747v1 [stat.ME])
- Generative Model with Dynamic Linear Flow. (arXiv:1905.03239v1 [cs.LG])
- Optimal Convergence Rate of Hamiltonian Monte Carlo for Strongly Logconcave Distributions. (arXiv:1905.02313v1 [cs.DS])
- Learning Causality: Synthesis of Large-Scale Causal Networks from High-Dimensional Time Series Data. (arXiv:1905.02291v1 [cs.LG])
- Learning to Control in Metric Space with Optimal Regret. (arXiv:1905.01576v1 [cs.LG])
- A robust Kalman-Bucy filtering problem. (arXiv:1905.01791v3 [math.OC] UPDATED)
- A Latent Variational Framework for Stochastic Optimization. (arXiv:1905.01707v5 [cs.LG] UPDATED)
- TensorNetwork: A Library for Physics and Machine Learning. (arXiv:1905.01330v1 [physics.comp-ph])
- Deep Learning for Audio Signal Processing. (arXiv:1905.00078v2 [cs.SD] UPDATED)
- Inertial Three-Operator Splitting Method and Applications. (arXiv:1904.12980v1 [math.OC])
- Implicit Regularization of Discrete Gradient Dynamics in Linear Neural Networks. (arXiv:1904.13262v2 [cs.LG] UPDATED)
- On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics. (arXiv:1904.13016v4 [stat.ML] UPDATED)
- New optimization algorithms for neural network training using operator splitting techniques. (arXiv:1904.12952v5 [cs.LG] UPDATED)
- Recurrent Neural Networks in the Eye of Differential Equations. (arXiv:1904.12933v1 [cs.LG])
- The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares. (arXiv:1904.12838v2 [cs.LG] UPDATED)
- Making the Last Iterate of SGD Information Theoretically Optimal. (arXiv:1904.12443v2 [math.OC] UPDATED)
- Linearized two-layers neural networks in high dimension. (arXiv:1904.12191v3 [math.ST] UPDATED)
- Wave Physics as an Analog Recurrent Neural Network. (arXiv:1904.12831v2 [physics.comp-ph] UPDATED)
- On Exact Computation with an Infinitely Wide Neural Net. (arXiv:1904.11955v2 [cs.LG] UPDATED)
- Stability Optimization of Positive Semi-Markov Jump Linear Systems via Convex Optimization. (arXiv:1904.11690v2 [cs.SY] UPDATED)
- Derivative-free optimization methods. (arXiv:1904.11585v2 [math.OC] UPDATED)
- Layer Dynamics of Linearised Neural Nets. (arXiv:1904.10689v1 [cs.LG])
- On the Kullback-Leibler divergence between location-scale densities. (arXiv:1904.10428v3 [math.ST] UPDATED)
- Convergence of diffusions and their discretizations: from continuous to discrete processes and back. (arXiv:1904.09808v4 [math.PR] UPDATED)
- Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process. (arXiv:1904.09080v2 [cs.LG] UPDATED)
- On the Convergence of Adam and Beyond. (arXiv:1904.09237v1 [cs.LG])
- Linear convergence of accelerated conditional gradient algorithms in spaces of measures. (arXiv:1904.09218v2 [math.OC] UPDATED)
- Stochastic nonlinear Fokker-Planck equations. (arXiv:1904.07894v1 [math.PR])
- On Structured Filtering-Clustering: Global Error Bound and Optimal First-Order Algorithms. (arXiv:1904.07462v3 [stat.ML] UPDATED)
- Iterated Extended Kalman Smoother-based Variable Splitting for $L_1$-Regularized State Estimation. (arXiv:1903.08605v3 [cs.IT] UPDATED)
- Copula-like Variational Inference. (arXiv:1904.07153v2 [stat.ML] UPDATED)
- Markov chain Monte Carlo importance samplers for Bayesian models with intractable likelihoods. (arXiv:1904.05886v1 [stat.CO])
- Connections Between Adaptive Control and Optimization in Machine Learning. (arXiv:1904.05856v1 [math.OC])
- Deep learning as optimal control problems: models and numerical methods. (arXiv:1904.05657v3 [math.OC] UPDATED)
- Coconuts and Islanders: A Statistics-First Guide to the Boltzmann Distribution. (arXiv:1904.04669v1 [cond-mat.stat-mech])
- On the Adaptivity of Stochastic Gradient-Based Optimization. (arXiv:1904.04480v3 [math.OC] UPDATED)
- Integration-by-Parts Characterizations of Gaussian Processes. (arXiv:1904.02890v1 [math.PR])
- Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations
- A Stochastic Interpretation of Stochastic Mirror Descent: Risk-Sensitive Optimality. (arXiv:1904.01855v1 [math.OC])
- Augmented Neural ODEs. (arXiv:1904.01681v3 [stat.ML] UPDATED)
- Generalized Variational Inference: Three arguments for deriving new Posteriors. (arXiv:1904.02063v4 [stat.ML] UPDATED)
- Convergence rates for the stochastic gradient descent method for non-convex objective functions. (arXiv:1904.01517v2 [math.NA] UPDATED)
- Perturbative estimation of stochastic gradients. (arXiv:1904.00469v4 [stat.ML] UPDATED)
- Implicit Langevin Algorithms for Sampling From Log-concave Densities. (arXiv:1903.12322v2 [stat.ML] UPDATED)
- Convergence rates for optimised adaptive importance samplers. (arXiv:1903.12044v4 [stat.CO] UPDATED)
- What is the Lagrangian for Nonlinear Filtering?. (arXiv:1903.11195v3 [math.OC] UPDATED)
- Filtering of Gaussian processes in Hilbert spaces. (arXiv:1903.11464v2 [math.PR] UPDATED)
- State and Parameter Estimation from Observed Signal Increments. (arXiv:1903.10717v2 [math.NA] UPDATED)
- Inequalities between $L^p$-norms for log-concave distributions. (arXiv:1903.10101v1 [math.ST])
- Parametric Fokker-Planck equation. (arXiv:1903.10076v2 [math.OC] UPDATED)
- Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Learning. (arXiv:1903.10328v3 [stat.ML] UPDATED)
- The Hitchhiker's Guide to Nonlinear Filtering. (arXiv:1903.09247v2 [stat.ME] UPDATED)
- LMI Properties and Applications in Systems, Stability, and Control Theory. (arXiv:1903.08599v3 [cs.SY] UPDATED)
- Safe and adaptive importance sampling: a mixture approach. (arXiv:1903.08507v4 [math.ST] UPDATED)
- Rapid Convergence of the Unadjusted Langevin Algorithm: Isoperimetry Suffices. (arXiv:1903.08568v4 [cs.DS] UPDATED)
- Potential-based analyses of first-order methods for constrained and composite optimization. (arXiv:1903.08497v1 [math.OC])
- The importance of better models in stochastic optimization. (arXiv:1903.08619v1 [math.OC])
- Distributed Kalman-filtering: Distributed optimization viewpoint. (arXiv:1903.07807v2 [math.OC] UPDATED)
- Online Non-Convex Learning: Following the Perturbed Leader is Optimal. (arXiv:1903.08110v2 [cs.LG] UPDATED)
- Truly Proximal Policy Optimization. (arXiv:1903.07940v2 [cs.LG] UPDATED)
- Convergence Analysis of Inexact Randomized Iterative Methods. (arXiv:1903.07971v1 [math.OC])
- Signal recovery by Stochastic Optimization. (arXiv:1903.07349v1 [math.ST])
- Annealing for Distributed Global Optimization. (arXiv:1903.07258v1 [math.OC])
- Inefficiency of K-FAC for Large Batch Size Training. (arXiv:1903.06237v3 [cs.LG] UPDATED)
- A nonasymptotic law of iterated logarithm for general M-estimators. (arXiv:1903.06576v2 [math.ST] UPDATED)
- Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets. (arXiv:1903.05662v4 [cs.LG] UPDATED)
- Accelerated First-Order Methods: Differential Equations and Lyapunov Functions. (arXiv:1903.05671v6 [math.OC] UPDATED)
- Elements of Sequential Monte Carlo. (arXiv:1903.04797v2 [stat.ML] UPDATED)
- Stochastic Gradient Descent for Nonconvex Learning without Bounded Gradient Assumptions. (arXiv:1902.00908v3 [cs.LG] UPDATED)
- Neural Empirical Bayes. (arXiv:1903.02334v2 [stat.ML] UPDATED)
- A machine learning framework for data driven acceleration of computations of differential equations. (arXiv:1807.09519v1 [math.NA] CROSS LISTED)
- The Variational Predictive Natural Gradient. (arXiv:1903.02984v3 [cs.LG] UPDATED)
- Mean-field Analysis of Batch Normalization. (arXiv:1903.02606v1 [cs.LG])
- Accelerated convergence to equilibrium and reduced asymptotic variance for Langevin dynamics using Stratonovich perturbations. (arXiv:1903.03024v2 [math.NA] UPDATED)
- A new regularization approach for numerical differentiation. (arXiv:1903.02762v5 [math.NA] UPDATED)
- Image Dependent Conditional McKean-Vlasov SDEs for Measure-Valued Diffusion Processes. (arXiv:1903.02148v4 [math.PR] UPDATED)
- Convergence of gradient descent-ascent analyzed as a Newtonian dynamical system with dissipation. (arXiv:1903.02536v1 [math.OC])
- Accelerated Stochastic Algorithms for Convex-Concave Saddle-Point Problems. (arXiv:1903.01687v4 [math.OC] UPDATED)
- SGD without Replacement: Sharper Rates for General Smooth Convex Functions. (arXiv:1903.01463v2 [math.OC] UPDATED)
- Theoretical guarantees for sampling and inference in generative models with latent diffusions. (arXiv:1903.01608v2 [math.PR] UPDATED)
- Kinetic walks for sampling. (arXiv:1903.00550v4 [math.PR] UPDATED)
- An Optimistic Acceleration of AMSGrad for Nonconvex Optimization. (arXiv:1903.01435v3 [stat.ML] UPDATED)
- Bernoulli Race Particle Filters. (arXiv:1903.00939v1 [stat.CO])
- Scalable optimization-based sampling on function space. (arXiv:1903.00870v2 [stat.CO] UPDATED)
- On the power of random information. (arXiv:1903.00681v1 [math.NA])
- Introduction to geometric control. (arXiv:1903.00211v2 [math.OC] UPDATED)
- Proximal algorithms for constrained composite optimization, with applications to solving low-rank SDPs. (arXiv:1903.00184v1 [math.OC])
- Discrete gradients for computational Bayesian inference. (arXiv:1903.00186v4 [math.NA] UPDATED)
- Distributed Linear Quadratic Optimal Control: Compute Locally and Act Globally. (arXiv:1902.11244v2 [math.OC] UPDATED)
- A survey of hidden convex optimization. (arXiv:1902.10921v1 [math.OC])
- Data-driven approximations of dynamical systems operators for control. (arXiv:1902.10239v1 [math.DS])
- Adaptive Gradient Methods with Dynamic Bound of Learning Rate. (arXiv:1902.09843v1 [cs.LG])
- Gradient Methods for Problems with Inexact Model of the Objective. (arXiv:1902.09001v2 [math.OC] UPDATED)
- Accelerating Rescaled Gradient Descent: Fast Optimization of Smooth Functions. (arXiv:1902.08825v3 [math.OC] UPDATED)
- Single-Forward-Step Projective Splitting: Exploiting Cocoercivity. (arXiv:1902.09025v3 [math.OC] UPDATED)
- A Formalization of The Natural Gradient Method for General Similarity Measures. (arXiv:1902.08959v1 [stat.ML])
- Beating SGD Saturation with Tail-Averaging and Minibatching. (arXiv:1902.08668v2 [stat.ML] UPDATED)
- Nonconvex sampling with the Metropolis-adjusted Langevin algorithm. (arXiv:1902.08452v2 [cs.DS] UPDATED)
- Analysis of the alternating direction method of multipliers for nonconvex problems. (arXiv:1902.07815v2 [math.OC] UPDATED)
- Non-asymptotic Analysis of Stochastic Methods for Non-Smooth Non-Convex Regularized Problems. (arXiv:1902.07672v4 [math.OC] UPDATED)
- A Mean Field Theory of Batch Normalization. (arXiv:1902.08129v2 [cs.NE] UPDATED)
- Online Sampling from Log-Concave Distributions. (arXiv:1902.08179v4 [cs.LG] UPDATED)
- On the anticipative nonlinear filtering problem and its stability. (arXiv:1902.08168v1 [math.PR])
- Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization. (arXiv:1902.07557v1 [cs.LG])
- Diffusion map-based algorithm for Gain function approximation in the Feedback Particle Filter. (arXiv:1902.07263v2 [math.OC] UPDATED)
- Faster Gradient-Free Proximal Stochastic Methods for Nonconvex Nonsmooth Optimization. (arXiv:1902.06158v1 [math.OC])
- Convergence Rate of a Simulated Annealing Algorithm with Noisy Observations
- Robust Accelerated Gradient Methods for Smooth Strongly Convex Functions. (arXiv:1805.10579v4 [math.OC] UPDATED)
- The Optimal Approximation Factor in Density Estimation. (arXiv:1902.05876v3 [cs.LG] UPDATED)
- Mean-field optimal control and optimality conditions in the space of probability measures. (arXiv:1902.05339v3 [math.OC] UPDATED)
- Pathwise Stochastic Control with Applications to Robust Filtering. (arXiv:1902.05434v2 [math.PR] UPDATED)
- On Nonconvex Optimization for Machine Learning: Gradients, Stochasticity, and Saddle Points. (arXiv:1902.04811v2 [cs.LG] UPDATED)
- The Complexity of Making the Gradient Small in Stochastic Convex Optimization. (arXiv:1902.04686v2 [cs.LG] UPDATED)
- Acceleration via Symplectic Discretization of High-Resolution Differential Equations. (arXiv:1902.03694v2 [math.OC] UPDATED)
- Acceleration via Symplectic Discretization of High-Resolution Differential Equations. (arXiv:1902.03694v2 [math.OC] UPDATED)
- Unnormalized Optimal Transport. (arXiv:1902.03367v1 [math.OC])
- The Riemannian barycentre as a proxy for global optimisation. (arXiv:1902.03885v1 [math.ST])
- Mode Collapse and Regularity of Optimal Transportation Maps. (arXiv:1902.02934v1 [cs.LG])
- The Actor-Advisor: Policy Gradient With Off-Policy Advice. (arXiv:1902.02556v1 [cs.AI])
- Total stochastic gradient algorithms and applications in reinforcement learning. (arXiv:1902.01722v1 [cs.LG])
- Dual Space Preconditioning for Gradient Descent. (arXiv:1902.02257v4 [math.OC] UPDATED)
- Exponentiated Gradient Meets Gradient Descent. (arXiv:1902.01903v1 [cs.LG])
- Is There an Analog of Nesterov Acceleration for MCMC?. (arXiv:1902.00996v2 [stat.ML] UPDATED)
- Non-asymptotic Analysis of Biased Stochastic Approximation Scheme. (arXiv:1902.00629v4 [stat.ML] UPDATED)
- A Stochastic Derivative-Free Optimization Method with Importance Sampling: Theory and Learning to Control. (arXiv:1902.01272v3 [math.OC] UPDATED)
- Stochastic Gradient Descent for Nonconvex Learning without Bounded Gradient Assumptions. (arXiv:1902.00908v3 [cs.LG] UPDATED)
- Non-asymptotic Analysis of Biased Stochastic Approximation Scheme. (arXiv:1902.00629v4 [stat.ML] UPDATED)
- A dual Newton based preconditioned proximal point algorithm for exclusive lasso models. (arXiv:1902.00151v2 [math.OC] UPDATED)
- Understanding MCMC Dynamics as Flows on the Wasserstein Space. (arXiv:1902.00282v3 [stat.ML] UPDATED)
- Optimal mini-batch and step sizes for SAGA. (arXiv:1902.00071v3 [math.OC] UPDATED)
- A Theory of Regularized Markov Decision Processes. (arXiv:1901.11275v2 [cs.LG] UPDATED)
- Transport map accelerated adaptive importance sampling, and application to inverse problems arising from multiscale stochastic reaction networks. (arXiv:1901.11269v2 [stat.CO] UPDATED)
- Metric Gaussian Variational Inference. (arXiv:1901.11033v3 [stat.ML] UPDATED)
- New Tricks for Estimating Gradients of Expectations. (arXiv:1901.11311v4 [cs.LG] UPDATED)
- Lower Bounds for Smooth Nonconvex Finite-Sum Optimization. (arXiv:1901.11224v1 [math.OC])
- Memory-Efficient Adaptive Optimization. (arXiv:1901.11150v2 [cs.LG] UPDATED)
- An optimal transport approach for solving dynamic inverse problems in spaces of measures. (arXiv:1901.10162v2 [math.FA] UPDATED)
- Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets. (arXiv:1901.09881v3 [stat.ML] UPDATED)
- 99% of Distributed Optimization is a Waste of Time: The Issue and How to Fix it. (arXiv:1901.09437v2 [cs.LG] UPDATED)
- Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization. (arXiv:1901.09068v2 [cs.LG] UPDATED)
- Asynchronous Accelerated Proximal Stochastic Gradient for Strongly Convex Distributed Finite Sums. (arXiv:1901.09865v3 [math.OC] UPDATED)
- SGD: General Analysis and Improved Rates. (arXiv:1901.09401v4 [cs.LG] UPDATED)
- Escaping Saddle Points with Adaptive Gradient Methods. (arXiv:1901.09149v2 [cs.LG] UPDATED)
- Projected Stein Variational Newton: A Fast and Scalable Bayesian Inference Method in High Dimensions. (arXiv:1901.08659v2 [math.OC] UPDATED)
- Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise. (arXiv:1901.08788v4 [stat.ML] UPDATED)
- New nonasymptotic convergence rates of stochastic proximal pointalgorithm for convex optimization problems. (arXiv:1901.08663v4 [math.OC] UPDATED)
- Primal dual methods for Wasserstein gradient flows. (arXiv:1901.08081v2 [math.NA] UPDATED)
- A Universally Optimal Multistage Accelerated Stochastic Gradient Method. (arXiv:1901.08022v3 [math.OC] UPDATED)
- Accelerated Linear Convergence of Stochastic Momentum Methods in Wasserstein Distances. (arXiv:1901.07445v2 [stat.ML] UPDATED)
- Unreasonable effectiveness of Monte Carlo. (arXiv:1901.06428v1 [stat.CO])
- Ensemble transform algorithms for nonlinear smoothing problems. (arXiv:1901.06300v3 [math.NA] UPDATED)
- A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks. (arXiv:1901.06053v1 [cs.LG])
- Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points. (arXiv:1809.06474v2 [math.OC] CROSS LISTED)
- A Modern Retrospective on Probabilistic Numerics. (arXiv:1901.04457v3 [math.NA] UPDATED)
- Optimality Criteria for Probabilistic Numerical Methods. (arXiv:1901.04326v2 [stat.ME] UPDATED)
- On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator. (arXiv:1901.03674v1 [cs.LG])
- A discretization of Caputo derivatives with application to time fractional SDEs and gradient flows. (arXiv:1901.03159v2 [math.NA] UPDATED)
- Accelerated Flow for Probability Distributions. (arXiv:1901.03317v2 [cs.LG] UPDATED)
- The square root rule for adaptive importance sampling. (arXiv:1901.02976v2 [math.ST] UPDATED)
- Graphical model inference: Sequential Monte Carlo meets deterministic approximations. (arXiv:1901.02374v1 [stat.ML])
- Primal-dual proximal splitting and generalized conjugation in non-smooth non-convex optimization. (arXiv:1901.02746v4 [math.OC] UPDATED)
- The Extended Kalman Filter is a Natural Gradient Descent in Trajectory Space. (arXiv:1901.00696v1 [math.OC])
- A Simple Algorithm for Scalable Monte Carlo Inference. (arXiv:1901.00533v4 [stat.CO] UPDATED)
- Kernel Density Estimation Bias under Minimal Assumptions. (arXiv:1901.00331v1 [math.ST])
- A Geometric Theory of Higher-Order Automatic Differentiation. (arXiv:1812.11592v1 [stat.CO])
Saved in 2018
- A continuous-time analysis of distributed stochastic gradient. (arXiv:1812.10995v5 [math.OC] UPDATED)
- Random batch methods (RBM) for interacting particle systems. (arXiv:1812.10575v2 [math.NA] UPDATED)
- Sampling on the sphere from $f(x) \propto x^TAx$. (arXiv:1812.10612v1 [stat.CO])
- Perturbed Fenchel duality and first-order methods. (arXiv:1812.10198v7 [math.OC] UPDATED)
- Asymptotic distribution and convergence rates of stochastic algorithms for entropic optimal transportation between probability measures. (arXiv:1812.09150v5 [math.ST] UPDATED)
- A Universal Sampling Method for Reconstructing Signals with Simple Fourier Transforms. (arXiv:1812.08723v1 [cs.DS] CROSS LISTED)
- Marvels and Pitfalls of the Langevin Algorithm in Noisy High-dimensional Inference. (arXiv:1812.09066v4 [cs.LG] UPDATED)
- Stochastic Doubly Robust Gradient. (arXiv:1812.08997v1 [cs.LG])
- First-order algorithms converge faster than $O(1/k)$ on convex problems. (arXiv:1812.08485v4 [math.OC] UPDATED)
- Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems. (arXiv:1812.08305v3 [cs.LG] UPDATED)
- Invariance, Causality and Robustness. (arXiv:1812.08233v1 [stat.ME])
- Breaking Reversibility Accelerates Langevin Dynamics for Global Non-Convex Optimization. (arXiv:1812.07725v4 [math.OC] UPDATED)
- Near-optimal method for highly smooth convex optimization. (arXiv:1812.08026v2 [math.OC] UPDATED)
- Wasserstein Covariance for Multiple Random Densities. (arXiv:1812.07694v1 [stat.ME])
- On The Chain Rule Optimal Transport Distance. (arXiv:1812.08113v3 [cs.LG] UPDATED)
- Inference with Hamiltonian Sequential Monte Carlo Simulators. (arXiv:1812.07978v1 [stat.CO])
- Semi-Riemannian Manifold Optimization. (arXiv:1812.07643v1 [math.OC])
- Geometric Scattering on Manifolds. (arXiv:1812.06968v4 [stat.ML] UPDATED)
- Limit theorems for filtered long-range dependent random fields. (arXiv:1812.07290v1 [math.PR])
- The one-dimensional log-gas free energy has a unique minimiser. (arXiv:1812.06929v1 [math.PR])
- An efficient adaptive accelerated inexact proximal point method for solving linearly constrained nonconvex composite problems. (arXiv:1812.06352v3 [math.OC] UPDATED)
- Algorithmic Theory of ODEs and Sampling from Well-conditioned Logconcave Densities. (arXiv:1812.06243v1 [cs.DS])
- Non-Factorised Variational Inference in Dynamical Systems. (arXiv:1812.06067v1 [stat.ML])
- Numerical Methods for Fast Nonlinear Fourier Transformation, Part I: Exponential Runge-Kutta and Linear Multistep Methods. (arXiv:1812.04701v1 [math.NA])
- Gradient Descent Happens in a Tiny Subspace. (arXiv:1812.04754v1 [cs.LG])
- On the Curved Geometry of Accelerated Optimization. (arXiv:1812.04634v2 [cs.LG] UPDATED)
- A variational approach to nonlinear and interacting diffusions. (arXiv:1812.04269v3 [math.PR] UPDATED)
- Convex integral functionals of cadlag processes. (arXiv:1812.04086v1 [math.OC])
- Sampling-based Bayesian Inference with gradient uncertainty. (arXiv:1812.03285v2 [cs.LG] UPDATED)
- Signal Recovery From 1-Bit Quantized Noisy Samples via Adaptive Thresholding. (arXiv:1812.03977v1 [cs.IT])
- A gentle introduction to SPDEs: the random field approach. (arXiv:1812.02812v1 [math.PR])
- Uncertainty Sampling is Preconditioned Stochastic Gradient Descent on Zero-One Loss. (arXiv:1812.01815v1 [cs.LG])
- Coarse-Graining Auto-Encoders for Molecular Dynamics. (arXiv:1812.02706v2 [physics.chem-ph] UPDATED)
- On stochastic gradient Langevin dynamics with dependent data streams in the logconcave case. (arXiv:1812.02709v3 [math.ST] UPDATED)
- A Framework for Adaptive MCMC Targeting Multimodal Distributions. (arXiv:1812.02609v2 [stat.CO] UPDATED)
- Accelerated finite elements schemes for parabolic stochastic partial differential equations. (arXiv:1812.02225v2 [math.PR] UPDATED)
- Information geometry for approximate Bayesian computation. (arXiv:1812.02127v2 [stat.ME] UPDATED)
- Statistics with improper posteriors. (arXiv:1812.01314v1 [math.ST])
- Simple Confidence Intervals for MCMC Without CLTs. (arXiv:1812.00126v1 [math.PR])
- Stochastic Gradient MCMC with Repulsive Forces. (arXiv:1812.00071v2 [stat.ML] UPDATED)
- Beyond Log-concavity: Provable Guarantees for Sampling Multi-modal Distributions using Simulated Tempering Langevin Monte Carlo. (arXiv:1710.02736v2 [cs.LG] CROSS LISTED)
- Eigenvalue Corrected Noisy Natural Gradient. (arXiv:1811.12565v1 [cs.LG])
- Bayesian fractional posteriors
- Improved Calibration of Numerical Integration Error in Sigma-Point Filters. (arXiv:1811.11474v2 [stat.ML] UPDATED)
- New Convergence Aspects of Stochastic Gradient Algorithms. (arXiv:1811.12403v2 [math.OC] UPDATED)
- Adaptive Stochastic Variance Reduction for Subsampled Newton Method with Cubic Regularization. (arXiv:1811.11637v1 [math.OC])
- On circumcenter mappings induced by nonexpansive operators. (arXiv:1811.11420v1 [math.OC])
- Hessian Riemannian gradient flows in convex programming. (arXiv:1811.10331v1 [math.OC])
- Inexact SARAH Algorithm for Stochastic Optimization. (arXiv:1811.10105v2 [math.OC] UPDATED)
- The promises and pitfalls of Stochastic Gradient Langevin Dynamics. (arXiv:1811.10072v1 [stat.ML])
- Non-deterministic inference using random set models: theory, approximation, and sampling method. (arXiv:1811.10446v1 [math.NA])
- Rejoinder for "Probabilistic Integration: A Role in Statistical Computation?". (arXiv:1811.10275v1 [stat.CO])
- Spread Divergence. (arXiv:1811.08968v5 [stat.ML] UPDATED)
- Trajectorial Otto calculus. (arXiv:1811.08686v4 [math.PR] UPDATED)
- An ODE Method to Prove the Geometric Convergence of Adaptive Stochastic Algorithms. (arXiv:1811.06703v3 [math.OC] UPDATED)
- Minimum weight norm models do not always generalize well for over-parameterized problems. (arXiv:1811.07055v3 [stat.ML] UPDATED)
- Sampling Can Be Faster Than Optimization. (arXiv:1811.08413v2 [stat.ML] UPDATED)
- Variance Reduction in Stochastic Particle-Optimization Sampling. (arXiv:1811.08052v1 [stat.ML])
- Economics of disagreement -- financial intuition for the R\'enyi divergence. (arXiv:1811.08308v7 [q-fin.GN] UPDATED)
- Stability of Gaussian Process State Space Models. (arXiv:1811.06646v1 [cs.SY])
- The Theory and Algorithm of Ergodic Inference. (arXiv:1811.07192v1 [cs.LG])
- Semi-dual Regularized Optimal Transport. (arXiv:1811.05527v1 [cs.LG])
- A General Method for Amortizing Variational Filtering. (arXiv:1811.05090v1 [stat.ML])
- Subsampled Inexact Newton methods for minimizing large sums of convex functions. (arXiv:1811.05730v1 [math.NA])
- Deep Nonlinear Non-Gaussian Filtering for Dynamical Systems. (arXiv:1811.05933v1 [cs.LG])
- Stochastic Algorithmic Differentiation of (Expectations of) Discontinuous Functions (Indicator Functions). (arXiv:1811.05741v5 [q-fin.CP] UPDATED)
- Gaussian AutoEncoder. (arXiv:1811.04751v4 [cs.LG] UPDATED)
- Plug-In Stochastic Gradient Method. (arXiv:1811.03659v1 [eess.SP])
- Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods. (arXiv:1811.03679v3 [stat.ML] UPDATED)
- On exponential convergence of SGD in non-convex over-parametrized learning. (arXiv:1811.02564v1 [math.OC])
- On Convex Envelopes and Regularization of Non-Convex Functionals without moving Global Minima. (arXiv:1811.03439v1 [math.OC])
- An Optimal Transport View on Generalization. (arXiv:1811.03270v1 [stat.ML])
- Wasserstein variational gradient descent: From semi-discrete optimal transport to ensemble variational inference. (arXiv:1811.02827v2 [stat.ML] UPDATED)
- On exponential convergence of SGD in non-convex over-parametrized learning. (arXiv:1811.02564v1 [math.OC])
- Achieving Acceleration in Distributed Optimization via Direct Discretization of the Heavy-Ball ODE. (arXiv:1811.02521v1 [math.OC])
- Quantifying Uncertainty in High Dimensional Inverse Problems by Convex Optimisation. (arXiv:1811.02514v2 [eess.SP] UPDATED)
- Double Adaptive Stochastic Gradient Optimization. (arXiv:1811.02525v1 [stat.ML])
- Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations. (arXiv:1811.01558v1 [cs.LG])
- Non-Asymptotic Guarantees For Sampling by Stochastic Gradient Descent. (arXiv:1811.00781v1 [math.ST])
- Policy Optimization via Importance Sampling. (arXiv:1809.06098v2 [cs.LG] UPDATED)
- On Exploration, Exploitation and Learning in Adaptive Importance Sampling. (arXiv:1810.13296v1 [stat.ML])
- Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization. (arXiv:1810.13245v3 [math.OC] UPDATED)
- A general system of differential equations to model first order adaptive algorithms. (arXiv:1810.13108v2 [cs.LG] UPDATED)
- Diagnosing Forward Operator Error Using Optimal Transport. (arXiv:1810.12993v1 [math.NA])
- Divergence Network: Graphical calculation method of divergence functions. (arXiv:1810.12794v2 [cs.LG] UPDATED)
- Global Non-convex Optimization with Discretized Diffusions. (arXiv:1810.12361v2 [stat.ML] UPDATED)
- Systemic Greeks: Measuring risk in financial networks. (arXiv:1810.11849v1 [q-fin.RM])
- Variational Inference with Tail-adaptive f-Divergence. (arXiv:1810.11943v3 [cs.LG] UPDATED)
- Stein Variational Gradient Descent as Moment Matching. (arXiv:1810.11693v1 [stat.ML])
- Kalman Gradient Descent: Adaptive Variance Reduction in Stochastic Optimization. (arXiv:1810.12273v1 [stat.ML])
- Automatic differentiation in ML: Where we are and where we should be going. (arXiv:1810.11530v2 [cs.LG] UPDATED)
- Linear Convergence of Cyclic SAGA. (arXiv:1810.11167v2 [math.OC] UPDATED)
- Robust Importance Sampling with Adaptive Winsorization. (arXiv:1810.11130v2 [stat.CO] UPDATED)
- Uniform Convergence of Gradients for Non-Convex Learning and Optimization. (arXiv:1810.11059v2 [cs.LG] UPDATED)
- signProx: One-Bit Proximal Algorithm for Nonconvex Stochastic Optimization. (arXiv:1807.08023v2 [math.OC] UPDATED)
- Posterior Convergence of Gaussian and General Stochastic Process Regression Under Possible Misspecifications. (arXiv:1810.10495v2 [math.ST] UPDATED)
- Understanding and correcting pathologies in the training of learned optimizers. (arXiv:1810.10180v5 [cs.NE] UPDATED)
- A Continuous-Time View of Early Stopping for Least Squares. (arXiv:1810.10082v4 [stat.ML] UPDATED)
- A Proximal Zeroth-Order Algorithm for Nonconvex Nonsmooth Problems. (arXiv:1810.10085v1 [math.OC])
- Negative results for approximation using single layer and multilayer feedforward neural networks. (arXiv:1810.10032v4 [cs.LG] UPDATED)
- A jamming transition from under- to over-parametrization affects loss landscape and generalization. (arXiv:1810.09665v5 [cs.LG] UPDATED)
- Understanding the Acceleration Phenomenon via High-Resolution Differential Equations. (arXiv:1810.08907v3 [math.OC] UPDATED)
- The Price equation program: simple invariances unify population dynamics, thermodynamics, probability, information and inference. (arXiv:1810.09262v1 [q-bio.PE])
- Optimality of the final model found via Stochastic Gradient Descent. (arXiv:1810.09418v1 [cs.LG])
- The Bregman chord divergence. (arXiv:1810.09113v1 [cs.LG])
- Stochastic Gradient MCMC for State Space Models. (arXiv:1810.09098v2 [stat.ML] UPDATED)
- The total variation distance between high-dimensional Gaussians with the same mean. (arXiv:1810.08693v7 [math.ST] UPDATED)
- On the Poincar{\'e} constant of log-concave measures. (arXiv:1810.08369v1 [math.PR])
- Generalized Lyapunov criteria on finite-time stability of stochastic nonlinear systems. (arXiv:1810.07927v1 [math.PR])
- The Wasserstein transform. (arXiv:1810.07793v1 [cs.LG])
- Efficient Proximal Mapping Computation for Unitarily Invariant Low-Rank Inducing Norms. (arXiv:1810.07570v1 [math.OC])
- Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron. (arXiv:1810.07288v3 [cs.LG] UPDATED)
- Quasi-hyperbolic momentum and Adam for deep learning. (arXiv:1810.06801v4 [cs.LG] UPDATED)
- Inverse Problems and Data Assimilation. (arXiv:1810.06191v5 [stat.ME] UPDATED)
- Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension. (arXiv:1810.05935v3 [math.ST] UPDATED)
- Stochastic (Approximate) Proximal Point Methods: Convergence, Optimality, and Adaptivity. (arXiv:1810.05633v2 [math.OC] UPDATED)
- Variational Bayesian Monte Carlo. (arXiv:1810.05558v1 [stat.ML])
- Contracts as specifications for dynamical systems in driving variable form. (arXiv:1810.05542v1 [cs.SY])
- Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD. (arXiv:1810.04100v2 [math.OC] UPDATED)
- Frank-Wolfe Method is Automatically Adaptive to Error Bound Condition. (arXiv:1810.04765v1 [math.OC])
- Nonlinear Acceleration of Momentum and Primal-Dual Algorithms. (arXiv:1810.04539v2 [math.OC] UPDATED)
- Taming a non-convex landscape with dynamical long-range order: memcomputing Ising benchmarks. (arXiv:1810.03712v3 [cond-mat.dis-nn] UPDATED)
- Gradient flows and Evolution Variational Inequalities in metric spaces. I: structural properties. (arXiv:1810.03939v1 [math.FA])
- Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives. (arXiv:1810.04152v2 [cs.LG] UPDATED)
- Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD. (arXiv:1810.04100v2 [math.OC] UPDATED)
- Deep learning with differential Gaussian process flows. (arXiv:1810.04066v2 [cs.LG] UPDATED)
- Information Geometry of Orthogonal Initializations and Training. (arXiv:1810.03785v2 [stat.ML] UPDATED)
- Practical Bayesian Optimization for Transportation Simulators. (arXiv:1810.03688v2 [stat.CO] UPDATED)
- Cubic Regularization with Momentum for Nonconvex Optimization. (arXiv:1810.03763v2 [math.OC] UPDATED)
- Towards Gradient Free and Projection Free Stochastic Optimization. (arXiv:1810.03233v3 [math.OC] UPDATED)
- Proximal Online Gradient is Optimum for Dynamic Regret. (arXiv:1810.03594v6 [cs.LG] UPDATED)
- Probabilistic Solutions To Ordinary Differential Equations As Non-Linear Bayesian Filtering: A New Perspective. (arXiv:1810.03440v4 [stat.ME] UPDATED)
- Accelerating Stochastic Gradient Descent Using Antithetic Sampling. (arXiv:1810.03124v1 [cs.LG])
- ASVRG: Accelerated Proximal SVRG. (arXiv:1810.03105v2 [cs.LG] UPDATED)
- Anytime Stochastic Gradient Descent: A Time to Hear from all the Workers. (arXiv:1810.02976v1 [cs.LG])
- Probabilistic Linear Solvers: A Unifying View. (arXiv:1810.03398v2 [stat.CO] UPDATED)
- Hybrid Active Inference. (arXiv:1810.02647v1 [cs.AI])
- Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods. (arXiv:1810.02525v1 [cs.LG])
- Continuous-time Models for Stochastic Optimization Algorithms. (arXiv:1810.02565v3 [math.OC] UPDATED)
- Accelerated Decentralized Optimization with Local Updates for Smooth and Strongly Convex Objectives. (arXiv:1810.02660v3 [math.OC] UPDATED)
- Restarting Frank-Wolfe: Faster Rates Under H\"olderian Error Bounds. (arXiv:1810.02429v4 [math.OC] UPDATED)
- A Convergence Analysis of Gradient Descent for Deep Linear Neural Networks. (arXiv:1810.02281v3 [cs.LG] UPDATED)
- Convergence and Dynamical Behavior of the ADAM Algorithm for Non-Convex Stochastic Optimization. (arXiv:1810.02263v4 [stat.ML] UPDATED)
- Gradient Descent Provably Optimizes Over-parameterized Neural Networks. (arXiv:1810.02054v2 [cs.LG] UPDATED)
- Gradient descent aligns the layers of deep linear networks. (arXiv:1810.02032v2 [cs.LG] UPDATED)
- Convergence of the Expectation-Maximization Algorithm Through Discrete-Time Lyapunov Stability Theory. (arXiv:1810.02022v1 [math.OC])
- Sum decomposition of divergence into three divergences. (arXiv:1810.01720v2 [stat.ME] UPDATED)
- Learning with Random Learning Rates. (arXiv:1810.01322v3 [cs.LG] UPDATED)
- A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption. (arXiv:1810.00997v2 [cs.LG] UPDATED)
- Inference Over Programs That Make Predictions. (arXiv:1810.01190v1 [cs.AI])
- Hypocoercivity in Wasserstein-1 for the kinetic Fokker-Planck equation via Malliavin Calculus. (arXiv:1810.01324v1 [math.PR])
- Geometry of quadratic maps via convex relaxation. (arXiv:1810.00896v1 [math.OC])
- ProxQuant: Quantized Neural Networks via Proximal Operators. (arXiv:1810.00861v3 [cs.LG] UPDATED)
- Optimal Adaptive and Accelerated Stochastic Gradient Descent. (arXiv:1810.00553v1 [stat.ML])
- Riemannian Adaptive Optimization Methods. (arXiv:1810.00760v2 [cs.LG] UPDATED)
- Directional Analysis of Stochastic Gradient Descent via von Mises-Fisher Distributions in Deep learning. (arXiv:1810.00150v2 [cs.LG] UPDATED)
- AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods. (arXiv:1810.00143v4 [cs.LG] UPDATED)
- Fluctuation-dissipation relations for stochastic gradient descent. (arXiv:1810.00004v2 [stat.ML] UPDATED)
- Parallels and promising directions in the study of genetic, cultural, and moral evolution. (arXiv:1810.00501v1 [q-bio.PE])
- Accelerated PDE's for efficient solution of regularized inversion problems. (arXiv:1810.00410v1 [cs.NA])
- Computational Convergence Analysis of Distributed Gradient Tracking for Smooth Convex Optimization Using Dissipativity Theory. (arXiv:1810.00257v2 [math.OC] UPDATED)
- An Approach to Duality in Nonlinear Filtering. (arXiv:1809.10762v2 [math.PR] UPDATED)
- Proximal Recursion for Solving the Fokker-Planck Equation. (arXiv:1809.10844v2 [math.OC] UPDATED)
- An Introduction to Probabilistic Programming. (arXiv:1809.10756v2 [stat.ML] UPDATED)
- A Fast Splitting Method for efficient Split Bregman Iterations. (arXiv:1809.11135v1 [math.NA])
- An exploration-exploitation tradeoff dictates the optimal distribution of phenotypes for populations in presence of fitness fluctuations. (arXiv:1809.11030v1 [q-bio.PE])
- Monge-Amp\`ere Flow for Generative Modeling. (arXiv:1809.10188v1 [cs.LG])
- The jamming transition as a paradigm to understand the loss landscape of deep neural networks. (arXiv:1809.09349v4 [cond-mat.dis-nn] UPDATED)
- Asynchronous decentralized accelerated stochastic gradient descent. (arXiv:1809.09258v1 [math.OC])
- Practical bounds on the error of Bayesian posterior approximations: A nonasymptotic approach. (arXiv:1809.09505v2 [math.ST] UPDATED)
- Exact Solutions for a GBM-type Stochastic Volatility Model having a Stationary Distribution. (arXiv:1809.08635v2 [q-fin.CP] UPDATED)
- Gaussian fluctuations for linear eigenvalue statistics of products of independent iid random matrices. (arXiv:1809.08367v2 [math.PR] UPDATED)
- Implicit Maximum Likelihood Estimation. (arXiv:1809.09087v2 [cs.LG] UPDATED)
- Exponential Convergence Time of Gradient Descent for One-Dimensional Deep Linear Neural Networks. (arXiv:1809.08587v4 [cs.LG] UPDATED)
- A Canonical Form for First-Order Distributed Optimization Algorithms. (arXiv:1809.08709v2 [math.OC] UPDATED)
- On the performance of the Euler-Maruyama scheme for SDEs with discontinuous drift coefficient. (arXiv:1809.08423v1 [math.NA])
- Wasserstein Distributionally Robust Kalman Filtering. (arXiv:1809.08830v3 [math.OC] UPDATED)
- Provably Correct Automatic Subdifferentiation for Qualified Programs. (arXiv:1809.08530v2 [math.OC] UPDATED)
- Mirror Descent and Constrained Online Optimization Problems. (arXiv:1809.08329v2 [math.OC] UPDATED)
- Twist-bend coupling and the statistical mechanics of the twistable worm-like chain model of DNA: perturbation theory and beyond. (arXiv:1809.07050v2 [cond-mat.stat-mech] UPDATED)
- Projective Splitting with Forward Steps only Requires Continuity. (arXiv:1809.07180v1 [math.OC])
- Survey: Sixty Years of Douglas--Rachford. (arXiv:1809.07181v3 [math.OC] UPDATED)
- Competing evolutionary paths in growing populations with applications to multidrug resistance. (arXiv:1809.06806v2 [q-bio.PE] UPDATED)
- Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points. (arXiv:1809.06474v2 [math.OC] CROSS LISTED)
- Primal-dual accelerated gradient methods with small-dimensional relaxation oracle. (arXiv:1809.05895v3 [math.OC] UPDATED)
- On stability of a class of filters for non-linear stochastic systems. (arXiv:1809.05667v2 [stat.ME] UPDATED)
- On-Line Learning of Linear Dynamical Systems: Exponential Forgetting in Kalman Filters. (arXiv:1809.05870v1 [math.ST])
- Zap Meets Momentum: Stochastic Approximation Algorithms with Optimal Convergence Rate. (arXiv:1809.06277v1 [math.OC])
- Stochastic Variational Optimization. (arXiv:1809.04855v1 [stat.ML])
- Hamiltonian Descent Methods. (arXiv:1809.05042v1 [math.OC])
- Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and Momentum-Based Acceleration. (arXiv:1809.04618v4 [math.OC] UPDATED)
- On Markov Chain Gradient Descent. (arXiv:1809.04216v1 [math.OC])
- Wasserstein Gradients for the Temporal Evolution of Probability Distributions. (arXiv:1809.03498v7 [stat.ME] UPDATED)
- Analysis of the Generalization Error: Empirical Risk Minimization over Deep Artificial Neural Networks Overcomes the Curse of Dimensionality in the Numerical Approximation of Black-Scholes Partial Differential Equations. (arXiv:1809.03062v3 [cs.LG] UPDATED)
- Decentralized Differentially Private Without-Replacement Stochastic Gradient Descent. (arXiv:1809.02727v4 [cs.LG] UPDATED)
- Inexact Proximal Gradient Methods for Non-convex and Non-smooth Optimization. (arXiv:1612.06003v2 [cs.LG] UPDATED)
- Communication-Efficient Distributed Strongly Convex Stochastic Optimization: Non-Asymptotic Rates. (arXiv:1809.02920v1 [math.OC])
- Online Adaptive Methods, Universality and Acceleration. (arXiv:1809.02864v1 [cs.LG])
- Scalable Monte Carlo inference for state-space models. (arXiv:1809.02527v1 [stat.ME])
- Gaussian statistics as an emergent symmetry of the stochastic Burgers equation. (arXiv:1809.02158v2 [cond-mat.stat-mech] UPDATED)
- Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory. (arXiv:1809.01293v5 [stat.ML] UPDATED)
- $hp$-Multilevel Monte Carlo Methods for Uncertainty Quantification of Compressible Flows. (arXiv:1808.10626v3 [math.NA] UPDATED)
- Random Bit Multilevel Algorithms for Stochastic Differential Equations. (arXiv:1808.10623v2 [math.NA] UPDATED)
- Fixed-Time Stable Gradient Flows: Applications to Continuous-Time Optimization. (arXiv:1808.10474v5 [math.OC] UPDATED)
- Proximal boosting: aggregating weak learners to minimize non-differentiable losses. (arXiv:1808.09670v4 [cs.LG] UPDATED)
- Online ICA: Understanding Global Dynamics of Nonconvex Optimization via Diffusion Processes. (arXiv:1808.09642v1 [stat.ML])
- An elementary introduction to information geometry. (arXiv:1808.08271v2 [cs.LG] UPDATED)
- TAP free energy, spin glasses, and variational inference. (arXiv:1808.07890v2 [math.PR] UPDATED)
- On the Normality of Negative Interest Rates. (arXiv:1808.07909v1 [econ.GN])
- Continuous time Gaussian process dynamical models in gene regulatory network inference. (arXiv:1808.08161v3 [math.OC] UPDATED)
- Adaptive Tuning Of Hamiltonian Monte Carlo Within Sequential Monte Carlo. (arXiv:1808.07730v2 [stat.CO] UPDATED)
- Non-asymptotic bounds for sampling algorithms without log-concavity. (arXiv:1808.07105v3 [math.PR] UPDATED)
- Limit order books, diffusion approximations and reflected SPDEs: from microscopic to macroscopic models. (arXiv:1808.07107v2 [q-fin.MF] UPDATED)
- Convergence of Cubic Regularization for Nonconvex Optimization under KL Property. (arXiv:1808.07382v1 [math.OC])
- A Note on Inexact Condition for Cubic Regularized Newton's Method. (arXiv:1808.07384v1 [math.OC])
- Quantitative contraction rates for Markov chains on general state spaces. (arXiv:1808.07033v1 [math.PR])
- A note on the approximate symmetry of Bregman distances. (arXiv:1808.06790v1 [math.OC])
- Universal Stagewise Learning for Non-Convex Problems with Convergence on Averaged Solutions. (arXiv:1808.06296v3 [math.OC] UPDATED)
- Non-equilibrium time dynamics of genetic evolution. (arXiv:1808.06083v1 [cond-mat.stat-mech])
- Generalized Bregman and Jensen divergences which include some f-divergences. (arXiv:1808.06148v5 [math.ST] UPDATED)
- Optimal proposals for Approximate Bayesian Computation. (arXiv:1808.06040v1 [math.ST])
- Adaptive Cubic Regularization Methods with Dynamic Inexact Hessian Information and Applications to Finite-Sum Minimization. (arXiv:1808.06239v3 [math.OC] UPDATED)
- On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization. (arXiv:1808.05671v3 [cs.LG] UPDATED)
- On the time-dependent Fisher information of a density function
- Unifying Markov properties for graphical models
- Frank-Wolfe Style Algorithms for Large Scale Optimization. (arXiv:1808.05274v1 [math.OC])
- A Proximal Operator for Multispectral Phase Retrieval Problems. (arXiv:1808.05194v1 [math.OC])
- An Analysis of Asynchronous Stochastic Accelerated Coordinate Descent. (arXiv:1808.05156v1 [math.OC])
- Gradient descent in some simple settings. (arXiv:1808.04839v2 [math.OC] UPDATED)
- Kernel Flows: from learning kernels from data into the abyss. (arXiv:1808.04475v2 [stat.ML] UPDATED)
- Self-avoiding walk, spin systems, and renormalization. (arXiv:1808.04476v2 [math-ph] UPDATED)
- Weight-Preserving Simulated Tempering. (arXiv:1808.04782v3 [stat.CO] UPDATED)
- Adaptive Sampling for Convex Regression. (arXiv:1808.04523v3 [cs.LG] UPDATED)
- An Adaptive Primal-Dual Framework for Nonsmooth Convex Minimization. (arXiv:1808.04648v1 [math.OC])
- Neural Importance Sampling. (arXiv:1808.03856v5 [cs.LG] UPDATED)
- Randomized Hamiltonian Monte Carlo as Scaling Limit of the Bouncy Particle Sampler and Dimension-Free Convergence Rates. (arXiv:1808.04299v5 [stat.CO] UPDATED)
- Parallelization does not Accelerate Convex Optimization: Adaptivity Lower Bounds for Non-smooth Convex Minimization. (arXiv:1808.03880v2 [cs.LG] UPDATED)
- A Nonsmooth Dynamical Systems Perspective on Accelerated Extensions of ADMM. (arXiv:1808.04048v7 [math.OC] UPDATED)
- Globally Convergent Type-I Anderson Acceleration for Non-Smooth Fixed-Point Iterations. (arXiv:1808.03971v1 [math.OC])
- The Stochastic Fej\'er-Monotone Hybrid Steepest Descent Method and the Hierarchical RLS. (arXiv:1808.03895v4 [math.OC] UPDATED)
- The contractivity of cone-preserving multilinear mappings. (arXiv:1808.04180v2 [math.SP] UPDATED)
- Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks. (arXiv:1808.03620v1 [cs.LG])
- A Unified Analysis of AdaGrad with Weighted Aggregation and Momentum Acceleration. (arXiv:1808.03408v4 [cs.LG] UPDATED)
- Policy Optimization as Wasserstein Gradient Flows. (arXiv:1808.03030v1 [cs.LG])
- Accelerated Bregman Proximal Gradient Methods for Relatively Smooth Convex Optimization. (arXiv:1808.03045v3 [math.OC] UPDATED)
- Distributed heavy-ball: A generalization and acceleration of first-order methods with gradient tracking. (arXiv:1808.02942v2 [math.OC] UPDATED)
- On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization. (arXiv:1808.02941v2 [cs.LG] UPDATED)
- Asynchronous Variance-reduced Block Schemes for Composite Nonconvex Stochastic Optimization: Block-specific Steplengths and Adapted Batch-sizes. (arXiv:1808.02543v4 [math.OC] UPDATED)
- signSGD: Compressed Optimisation for Non-Convex Problems. (arXiv:1802.04434v3 [cs.LG] CROSS LISTED)
- Random directions stochastic approximation with deterministic perturbations. (arXiv:1808.02871v2 [math.OC] UPDATED)
- Composite Convex Optimization with Global and Local Inexact Oracles. (arXiv:1808.02121v2 [math.OC] UPDATED)
- Unbiased Implicit Variational Inference. (arXiv:1808.02078v3 [stat.ML] UPDATED)
- Diffusion approximations and control variates for MCMC. (arXiv:1808.01665v2 [stat.ME] UPDATED)
- Higher Order Langevin Monte Carlo Algorithm. (arXiv:1808.00728v3 [math.ST] UPDATED)
- Upper and lower bounds for the Bregman divergence. (arXiv:1808.00772v1 [math.NA])
- Data-driven nonsmooth optimization. (arXiv:1808.00946v1 [math.OC])
- Geometry of energy landscapes and the optimizability of deep neural networks. (arXiv:1808.00408v1 [cond-mat.dis-nn])
- On the stability of matrix-valued Riccati diffusions. (arXiv:1808.00235v4 [math.PR] UPDATED)
- Multiscale analysis of accelerated gradient methods. (arXiv:1807.11354v3 [math.OC] UPDATED)
- Stochastic Gradient Descent with Biased but Consistent Gradient Estimators. (arXiv:1807.11880v4 [cs.LG] UPDATED)
- On the use of bootstrap with variational inference: Theory, interpretation, and a two-sample test example
- Unbiased inference for discretely observed hidden Markov model diffusions. (arXiv:1807.10259v8 [stat.ME] UPDATED)
- Stability of the Bakry-Emery theorem on $\mathbb{R}^n$. (arXiv:1807.09845v2 [math.FA] UPDATED)
- Convergence Rates of Gaussian ODE Filters. (arXiv:1807.09737v3 [math.NA] UPDATED)
- On sampling from a log-concave density using kinetic Langevin diffusions. (arXiv:1807.09382v6 [math.PR] UPDATED)
- Global consensus Monte Carlo. (arXiv:1807.09288v3 [stat.CO] UPDATED)
- Inexact Variable Metric Stochastic Block-Coordinate Descent for Regularized Optimization. (arXiv:1807.09146v3 [math.OC] UPDATED)
- Proximal Averages for Minimization of Entropy Functionals. (arXiv:1807.08878v1 [math.OC])
- Atomic Swaptions: Cryptocurrency Derivatives. (arXiv:1807.08644v2 [cs.CR] UPDATED)
- Particle Filtering Methods for Stochastic Optimization with Application to Large-Scale Empirical Risk Minimization. (arXiv:1807.08534v11 [cs.LG] UPDATED)
- Subsampling MCMC - An introduction for the survey statistician. (arXiv:1807.08409v4 [stat.ME] UPDATED)
- On the Analysis of Trajectories of Gradient Descent in the Optimization of Deep Neural Networks. (arXiv:1807.08140v1 [cs.LG])
- On the rate of convergence of empirical measure in $\infty-$Wasserstein distance for unbounded density function. (arXiv:1807.08365v2 [math.PR] UPDATED)
- Towards a general mathematical theory of experimental science. (arXiv:1807.07896v2 [cs.AI] UPDATED)
- Adaptive Variational Particle Filtering in Non-stationary Environments. (arXiv:1807.07612v1 [cs.LG])
- Generalized Stochastic Frank-Wolfe Algorithm with Stochastic "Substitute" Gradient for Structured Convex Optimization. (arXiv:1807.07680v5 [math.OC] UPDATED)
- A geometric integration approach to nonsmooth, nonconvex optimisation. (arXiv:1807.07554v1 [math.OC])
- Bayesian filtering unifies adaptive and non-adaptive neural network optimization methods. (arXiv:1807.07540v5 [stat.ML] UPDATED)
- Convergence guarantees for RMSProp and ADAM in non-convex optimization and an empirical comparison to Nesterov acceleration. (arXiv:1807.06766v3 [cs.LG] UPDATED)
- Time-Varying Optimization: Algorithms and Engineering Applications. (arXiv:1807.07032v2 [math.OC] UPDATED)
- Parallel Restarted SGD with Faster Convergence and Less Communication: Demystifying Why Model Averaging Works for Deep Learning. (arXiv:1807.06629v3 [math.OC] UPDATED)
- Bregman Monotone Operator Splitting. (arXiv:1807.04871v2 [math.OC] UPDATED)
- Fast yet Simple Natural-Gradient Descent for Variational Inference in Complex Models. (arXiv:1807.04489v2 [stat.ML] UPDATED)
- On bayesian estimation and proximity operators. (arXiv:1807.04021v2 [math.ST] UPDATED)
- Variable metric algorithms driven by averaged operators. (arXiv:1807.04027v1 [math.OC])
- NAPS: Natural Program Synthesis Dataset. (arXiv:1807.03168v1 [cs.LG])
- On the convergence time of some non-reversible Markov chain Monte Carlo methods. (arXiv:1807.02614v3 [stat.CO] UPDATED)
- A Tutorial on Bayesian Optimization. (arXiv:1807.02811v1 [stat.ML])
- Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile. (arXiv:1807.02629v2 [cs.LG] UPDATED)
- Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences. (arXiv:1807.02582v1 [stat.ML])
- Understanding and Accelerating Particle-Based Variational Inference. (arXiv:1807.01750v4 [stat.ML] UPDATED)
- A State-Space Modeling Framework for Engineering Blockchain-Enabled Economic Systems. (arXiv:1807.00955v1 [cs.SY])
- Limit theorems for sequential MCMC methods. (arXiv:1807.01057v2 [stat.CO] UPDATED)
- Direct Acceleration of SAGA using Sampled Negative Momentum. (arXiv:1806.11048v4 [cs.LG] UPDATED)
- Constructing sampling schemes via coupling: Markov semigroups and optimal transport. (arXiv:1806.11026v1 [math.PR])
- Guided evolutionary strategies: Augmenting random search with surrogate gradients. (arXiv:1806.10230v4 [cs.NE] UPDATED)
- The decoupled extended Kalman filter for dynamic exponential-family factorization models. (arXiv:1806.09976v2 [stat.ML] UPDATED)
- Efficient Projection onto the $\ell_{\infty,1}$ Mixed-Norm Ball using a Newton root search method. (arXiv:1806.10041v2 [math.OC] UPDATED)
- Random Shuffling Beats SGD after Finite Epochs. (arXiv:1806.10077v2 [math.OC] UPDATED)
- Stochastic natural gradient descent draws posterior samples in function space. (arXiv:1806.09597v4 [cs.LG] UPDATED)
- Bias of Particle Approximations to Optimal Filter Derivative. (arXiv:1806.09590v5 [math.ST] UPDATED)
- A Tour of Reinforcement Learning: The View from Continuous Control. (arXiv:1806.09460v2 [math.OC] UPDATED)
- Asymptotic Properties of Recursive Maximum Likelihood Estimation in Non-Linear State-Space Models. (arXiv:1806.09571v3 [math.ST] UPDATED)
- Stability of Optimal Filter Higher-Order Derivatives. (arXiv:1806.09595v3 [math.PR] UPDATED)
- Tensor Monte Carlo: particle methods for the GPU era. (arXiv:1806.08593v3 [stat.ML] UPDATED)
- Mutation rate variability as a driving force in adaptive evolution. (arXiv:1806.08454v1 [q-bio.PE])
- Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions. (arXiv:1806.08141v2 [stat.ML] UPDATED)
- Variational Formulations for Explicit Runge-Kutta Methods. (arXiv:1806.07803v1 [math.NA])
- Neural Ordinary Differential Equations. (arXiv:1806.07366v5 [cs.LG] UPDATED)
- Laplacian Smoothing Gradient Descent. (arXiv:1806.06317v5 [cs.LG] UPDATED)
- Distributed learning with compressed gradients. (arXiv:1806.06573v2 [math.OC] UPDATED)
- Geodesic Convex Optimization: Differentiation on Manifolds, Geodesics, and Convexity. (arXiv:1806.06373v1 [math.OC])
- Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice; Hongzhou Lin, Julien Mairal, Zaid Harchaoui
- Only Bayes should learn a manifold (on the estimation of differential geometric structure from data). (arXiv:1806.04994v3 [stat.ML] UPDATED)
- Generalized Mirror Prox for Monotone Variational Inequalities: Universality and Inexact Oracle. (arXiv:1806.05140v3 [math.OC] UPDATED)
- A survey on fractional variational calculus. (arXiv:1806.05092v1 [math.OC])
- Approximate inference with Wasserstein gradient flows. (arXiv:1806.04542v1 [stat.ML])
- Meta-Learning for Stochastic Gradient MCMC. (arXiv:1806.04522v1 [stat.ML])
- Stein operators, kernels and discrepancies for multivariate continuous distributions. (arXiv:1806.03478v2 [math.PR] UPDATED)
- Convergence Rates for Projective Splitting. (arXiv:1806.03920v3 [math.OC] UPDATED)
- A note on the equivalence of operator splitting methods. (arXiv:1806.03353v1 [math.OC])
- Adaptive MCMC via Combining Local Samplers. (arXiv:1806.03816v6 [cs.LG] UPDATED)
- Decentralize and Randomize: Faster Algorithm for Wasserstein Barycenters. (arXiv:1806.03915v3 [math.OC] UPDATED)
- Noise-based control of opinion dynamics. (arXiv:1806.03781v3 [physics.soc-ph] UPDATED)
- Exponential convergence of adaptive importance sampling estimators for Markov chain expectations. (arXiv:1806.03029v2 [math.PR] UPDATED)
- Lightweight Stochastic Optimization for Minimizing Finite Sums with Infinite Data. (arXiv:1806.02927v1 [cs.LG])
- A Stein variational Newton method. (arXiv:1806.03085v2 [stat.ML] UPDATED)
- Scalable Natural Gradient Langevin Dynamics in Practice. (arXiv:1806.02855v1 [cs.LG])
- Variational Implicit Processes. (arXiv:1806.02390v2 [stat.ML] UPDATED)
- Simulating the stochastic dynamics and cascade failure of power networks. (arXiv:1806.02420v1 [physics.soc-ph])
- Pinned, locked, pushed, and pulled traveling waves in structured environments. (arXiv:1806.02480v2 [cond-mat.stat-mech] UPDATED)
- Stein Variational Gradient Descent Without Gradient. (arXiv:1806.02775v1 [stat.ML])
- Variational Implicit Processes. (arXiv:1806.02390v2 [stat.ML] UPDATED)
- Towards Riemannian Accelerated Gradient Methods. (arXiv:1806.02812v1 [math.OC])
- Pathwise Derivatives for Multivariate Distributions. (arXiv:1806.01856v2 [stat.ML] UPDATED)
- Pathwise Derivatives Beyond the Reparameterization Trick. (arXiv:1806.01851v2 [stat.ML] UPDATED)
- AdaGrad stepsizes: Sharp convergence over nonconvex landscapes. (arXiv:1806.01811v8 [stat.ML] UPDATED)
- Mode-Coupling Theory of the Glass Transition: A Primer. (arXiv:1806.01369v1 [cond-mat.stat-mech])
- Stochastic Gradient/Mirror Descent: Minimax Optimality and Implicit Regularization. (arXiv:1806.00952v4 [cs.LG] UPDATED)
- An Alternative to EM for Gaussian Mixture Models: Batch and Stochastic Riemannian Optimization. (arXiv:1706.03267v1 [stat.ML] CROSS LISTED)
- Mining gold from implicit models to improve likelihood-free inference. (arXiv:1805.12244v4 [stat.ML] UPDATED)
- A Mean Field View of the Landscape of Two-Layers Neural Networks. (arXiv:1804.06561v2 [stat.ML] UPDATED)
- On Acceleration with Noise-Corrupted Gradients. (arXiv:1805.12591v3 [math.OC] UPDATED)
- Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance. (arXiv:1805.11897v1 [stat.ML])
- A Unified Particle-Optimization Framework for Scalable Bayesian Sampling. (arXiv:1805.11659v2 [stat.ML] UPDATED)
- Inexact Stochastic Mirror Descent for two-stage nonlinear stochastic programs. (arXiv:1805.11732v3 [math.OC] UPDATED)
- Stochastic Zeroth-order Optimization via Variance Reduction method. (arXiv:1805.11811v3 [stat.ML] UPDATED)
- Hamiltonian Variational Auto-Encoder. (arXiv:1805.11328v2 [cs.LG] UPDATED)
- Wasserstein Variational Inference. (arXiv:1805.11284v2 [stat.ML] UPDATED)
- Differentiable Particle Filters: End-to-End Learning with Algorithmic Priors. (arXiv:1805.11122v2 [cs.LG] UPDATED)
- A parallel implementation of the covariance matrix adaptation evolution strategy. (arXiv:1805.11201v1 [cs.NE])
- Kernel embedding of maps for sequential Bayesian inference: The variational mapping particle filter. (arXiv:1805.11380v1 [stat.ML])
- Optimal transportation between unequal dimensions. (arXiv:1805.11187v2 [math.AP] UPDATED)
- Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization. (arXiv:1805.10367v2 [cs.LG] UPDATED)
- Ergodic Inference: Accelerate Convergence by Optimisation. (arXiv:1805.10377v4 [cs.LG] UPDATED)
- A proximal minimization algorithm for structured nonconvex and nonsmooth problems. (arXiv:1805.11056v2 [math.OC] UPDATED)
- Robust Accelerated Gradient Methods for Smooth Strongly Convex Functions. (arXiv:1805.10579v4 [math.OC] UPDATED)
- Double Quantization for Communication-Efficient Distributed Optimization. (arXiv:1805.10111v4 [math.OC] UPDATED)
- Inverse Rational Control: Inferring What You Think from How You Forage. (arXiv:1805.09864v4 [cs.LG] UPDATED)
- Maximizing acquisition functions for Bayesian optimization. (arXiv:1805.10196v2 [stat.ML] UPDATED)
- Inexact proximal $\epsilon$-subgradient methods for composite convex optimization problems. (arXiv:1805.10120v2 [math.OC] UPDATED)
- Implicit Autoencoders. (arXiv:1805.09804v2 [cs.LG] UPDATED)
- Primal-Dual Wasserstein GAN. (arXiv:1805.09575v1 [stat.ML])
- Adaptive Stochastic Gradient Langevin Dynamics: Taming Convergence and Saddle Point Escape Time. (arXiv:1805.09416v1 [cs.LG])
- Deep Learning the Ising Model Near Criticality; Alan Morningstar, Roger G. Melko
- Approximate Newton-based statistical inference using only stochastic gradients. (arXiv:1805.08920v2 [cs.LG] UPDATED)
- Likelihood-free inference with emulator networks. (arXiv:1805.09294v2 [stat.ML] UPDATED)
- Dictionary Learning by Dynamical Neural Networks. (arXiv:1805.08952v1 [cs.LG])
- Step Size Matters in Deep Learning. (arXiv:1805.08890v2 [cs.LG] UPDATED)
- Langevin Markov Chain Monte Carlo with stochastic gradients. (arXiv:1805.08863v2 [stat.ME] UPDATED)
- Optimization, fast and slow: optimally switching between local and Bayesian optimization. (arXiv:1805.08610v1 [stat.ML])
- geomstats: a Python Package for Riemannian Geometry in Machine Learning. (arXiv:1805.08308v2 [cs.LG] UPDATED)
- Optimal transport natural gradient for statistical manifolds with continuous sample space. (arXiv:1805.08380v4 [math.OC] UPDATED)
- Gradient descent in hyperbolic space. (arXiv:1805.08207v2 [math.OC] UPDATED)
- Implicit Reparameterization Gradients. (arXiv:1805.08498v4 [cs.LG] UPDATED)
- Automatic Differentiation in Machine Learning: a Survey; Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind
- Online Structured Laplace Approximations For Overcoming Catastrophic Forgetting. (arXiv:1805.07810v1 [stat.ML])
- Bayesian posterior approximation via greedy particle optimization. (arXiv:1805.07912v3 [stat.ML] UPDATED)
- Nostalgic Adam: Weighting more of the past gradients when designing the adaptive learning rate. (arXiv:1805.07557v2 [cs.LG] UPDATED)
- The global optimum of shallow neural network is attained by ridgelet transform. (arXiv:1805.07517v3 [stat.ML] UPDATED)
- A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization. (arXiv:1805.08204v4 [math.OC] UPDATED)
- On the Convergence of Stochastic Gradient Descent with Adaptive Stepsizes. (arXiv:1805.08114v3 [stat.ML] UPDATED)
- Never look back - A modified EnKF method and its application to the training of neural networks without back propagation. (arXiv:1805.08034v2 [math.NA] UPDATED)
- Implicit Probabilistic Integrators for ODEs. (arXiv:1805.07970v3 [stat.ME] UPDATED)
- Construction of quasi-potentials for stochastic dynamical systems: an optimization approach. (arXiv:1805.07273v1 [math.OC])
- A geometric integration approach to smooth optimisation: Foundations of the discrete gradient method. (arXiv:1805.06444v4 [math.OC] UPDATED)
- On the Application of Danskin's Theorem to Derivative-Free Minimax Optimization. (arXiv:1805.06322v1 [math.OC])
- Perspective Maximum Likelihood-Type Estimation via Proximal Decomposition. (arXiv:1805.06098v2 [math.ST] UPDATED)
- Glassy nature of the hard phase in inference problems. (arXiv:1805.05857v4 [cond-mat.dis-nn] UPDATED)
- ADMM and Accelerated ADMM as Continuous Dynamical Systems. (arXiv:1805.06579v3 [math.OC] UPDATED)
- Mirror Descent Search and its Acceleration. (arXiv:1709.02535v2 [cs.LG] UPDATED)
- Marginal likelihoods in phylogenetics: a review of methods and applications. (arXiv:1805.04072v1 [q-bio.PE])
- Scaling limit of the Stein variational gradient descent: the mean field regime. (arXiv:1805.04035v3 [math.AP] UPDATED)
- Unbiased and Consistent Nested Sampling via Sequential Monte Carlo. (arXiv:1805.03924v5 [stat.CO] UPDATED)
- Subsampling Sequential Monte Carlo for Static Bayesian Models. (arXiv:1805.03317v3 [stat.CO] UPDATED)
- Differential Equations for Modeling Asynchronous Algorithms. (arXiv:1805.02991v1 [stat.ML])
- Statistical Inference and Exact Saddle Point Approximations. (arXiv:1805.02234v1 [math.ST])
- Stochastic Quasi-Gradient Methods: Variance Reduction via Jacobian Sketching. (arXiv:1805.02632v1 [math.OC])
- Implementation of Stochastic Quasi-Newton's Method in PyTorch. (arXiv:1805.02338v1 [cs.LG])
- Analysis of nonsmooth stochastic approximation: the differential inclusion approach. (arXiv:1805.01916v1 [math.OC])
- Sharp convergence rates for Langevin dynamics in the nonconvex setting. (arXiv:1805.01648v4 [stat.ML] UPDATED)
- Noise as a resource. (arXiv:1805.01800v1 [quant-ph])
- Alpha-Beta Divergence For Variational Inference. (arXiv:1805.01045v2 [stat.ML] UPDATED)
- Scalable Importance Tempering and Bayesian Variable Selection. (arXiv:1805.00541v2 [stat.CO] UPDATED)
- Direct Runge-Kutta Discretization Achieves Acceleration. (arXiv:1805.00521v5 [math.OC] UPDATED)
- Distributed Big-Data Optimization via Block-Iterative Convexification and Averaging. (arXiv:1805.00658v1 [cs.DC])
- Trainability and Accuracy of Neural Networks: An Interacting Particle System Approach. (arXiv:1805.00915v3 [stat.ML] UPDATED)
- Coupling and Convergence for Hamiltonian Monte Carlo. (arXiv:1805.00452v2 [math.PR] UPDATED)
- optimParallel: an R Package Providing Parallel Versions of the Gradient-Based Optimization Methods of optim(). (arXiv:1804.11058v1 [stat.CO])
- Gradient Sampling Methods for Nonsmooth Optimization. (arXiv:1804.11003v1 [math.OC])
- A Matrix Gaussian Distribution. (arXiv:1804.11010v3 [math.PR] UPDATED)
- Convergence and Concentration of Empirical Measures under Wasserstein Distance in Unbounded Functional Spaces. (arXiv:1804.10556v2 [math.ST] UPDATED)
- Sparse Inverse Problems Over Measures: Equivalence of the Conditional Gradient and Exchange Methods. (arXiv:1804.10243v4 [math.OC] UPDATED)
- A telescoping Bregmanian proximal gradient method without the global Lipschitz continuity assumption. (arXiv:1804.10273v4 [math.OC] UPDATED)
- The loss landscape of overparameterized neural networks. (arXiv:1804.10200v1 [cs.LG])
- On stochastic optimization methods for Monte Carlo least-squares problems. (arXiv:1804.10079v1 [math.OC])
- Convergence guarantees for a class of non-convex and non-smooth optimization problems. (arXiv:1804.09629v1 [stat.ML])
- Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization. (arXiv:1804.09554v2 [math.OC] UPDATED)
- Stability Properties of Systems of Linear Stochastic Differential Equations with Random Coefficients. (arXiv:1804.09349v2 [math.PR] UPDATED)
- An Introduction to Quantum Filtering. (arXiv:1804.09086v1 [quant-ph])
- Inertial, corrected, primal-dual proximal splitting. (arXiv:1804.08736v5 [math.OC] UPDATED)
- Dissipative numerical schemes on Riemannian manifolds with applications to gradient flows. (arXiv:1804.08104v3 [math.NA] UPDATED)
- Progress and open problems in evolutionary dynamics. (arXiv:1804.07720v1 [q-bio.PE])
- On the Location of the Minimizer of the Sum of Strongly Convex Functions. (arXiv:1804.07699v1 [math.OC])
- Operator limits of random matrices. (arXiv:1804.06953v1 [math.PR])
- On Large Lag Smoothing for Hidden Markov Models. (arXiv:1804.07117v1 [stat.ME])
- Distributed Simulation and Distributed Inference. (arXiv:1804.06952v3 [cs.DS] UPDATED)
- On the networked architecture of genotype spaces and its critical effects on molecular evolution. (arXiv:1804.06835v1 [q-bio.PE])
- Death and resurrection of a current by disorder, interaction or activity. (arXiv:1804.06780v1 [cond-mat.stat-mech])
- Monte Carlo sampling in diffusive dynamical systems. (arXiv:1804.06698v1 [cond-mat.stat-mech])
- Validating Bayesian Inference Algorithms with Simulation-Based Calibration. (arXiv:1804.06788v2 [stat.ME] UPDATED)
- The emergent integrated network structure of scientific research. (arXiv:1804.06434v1 [cs.SI])
- Walkman: A Communication-Efficient Random-Walk Algorithm for Decentralized Optimization. (arXiv:1804.06568v4 [math.OC] UPDATED)
- A Mean Field View of the Landscape of Two-Layers Neural Networks. (arXiv:1804.06561v2 [stat.ML] UPDATED)
- Robust Kalman Filtering: Asymptotic Analysis of the Least Favorable Model. (arXiv:1804.06321v1 [math.OC])
- Model-Free Linear Quadratic Control via Reduction to Expert Prediction. (arXiv:1804.06021v3 [cs.LG] UPDATED)
- Model-Free Information Extraction in Enriched Nonlinear Phase-Space. (arXiv:1804.05170v2 [cs.LG] UPDATED)
- Constant Step Size Stochastic Gradient Descent for Probabilistic Modeling. (arXiv:1804.05567v2 [stat.ML] UPDATED)
- A Variable Sample-size Stochastic Quasi-Newton Method for Smooth and Nonsmooth Stochastic Convex Optimization. (arXiv:1804.05368v5 [math.OC] UPDATED)
- On the Differentiability of the Solution to Convex Optimization Problems. (arXiv:1804.05098v3 [math.OC] UPDATED)
- Causal Inference via Kernel Deviance Measures. (arXiv:1804.04622v1 [cs.LG])
- Adafactor: Adaptive Learning Rates with Sublinear Memory Cost. (arXiv:1804.04235v1 [cs.LG])
- Multilevel Particle Filters for L\'evy-driven stochastic differential equations. (arXiv:1804.04444v2 [stat.CO] UPDATED)
- The Correlated Particle Hybrid Sampler for State Space Models. (arXiv:1804.04359v4 [stat.ME] UPDATED)
- Online convex optimization and no-regret learning: Algorithms, guarantees and applications. (arXiv:1804.04529v1 [cs.LG] CROSS LISTED)
- Merging joint distributions via causal model classes with low VC dimension. (arXiv:1804.03206v2 [math.ST] UPDATED)
- Derivative free optimization via repeated classification. (arXiv:1804.03761v1 [stat.ML])
- Cascade of transitions in molecular information theory. (arXiv:1804.03827v1 [cond-mat.stat-mech])
- Frank-Wolfe Splitting via Augmented Lagrangian Method. (arXiv:1804.03176v1 [math.OC])
- Thermodynamics and evolutionary biology through optimal control. (arXiv:1804.03309v1 [q-bio.PE])
- On the stability of Approximate Taylor methods for ODE and their relationship with Runge-Kutta schemes. (arXiv:1804.03627v1 [math.NA])
- Frank-Wolfe Splitting via Augmented Lagrangian Method. (arXiv:1804.03176v1 [math.OC])
- Distributed Non-Convex First-Order Optimization and Information Processing: Lower Complexity Bounds and Rate Optimal Algorithms. (arXiv:1804.02729v4 [math.OC] UPDATED)
- Complex energy landscapes in spiked-tensor and simple glassy models: ruggedness, arrangements of local minima and phase transitions. (arXiv:1804.02686v2 [cond-mat.dis-nn] UPDATED)
- Accelerating MCMC Algorithms. (arXiv:1804.02719v2 [stat.CO] UPDATED)
- Nonconvex Proximal Incremental Aggregated Gradient Method with Linear Convergence. (arXiv:1804.02571v1 [math.OC])
- An Accelerated Directional Derivative Method for Smooth Stochastic Convex Optimization. (arXiv:1804.02394v2 [math.OC] UPDATED)
- Adaptive Three Operator Splitting. (arXiv:1804.02339v3 [math.OC] UPDATED)
- Accelerated Optimization in the PDE Framework: Formulations for the Manifold of Diffeomorphisms. (arXiv:1804.02307v1 [math.OC])
- Asymptotic genealogies of interacting particle systems with an application to sequential Monte Carlo. (arXiv:1804.01811v1 [math.ST])
- Sliced-Wasserstein Autoencoder: An Embarrassingly Simple Generative Model. (arXiv:1804.01947v3 [cs.LG] UPDATED)
- Variational Rejection Sampling. (arXiv:1804.01712v1 [stat.ML])
- Probabilistic Contraction Analysis of Iterated Random Operators. (arXiv:1804.01195v6 [math.PR] UPDATED)
- Self-Organization and Artificial Life: A Review. (arXiv:1804.01144v1 [nlin.AO])
- A Constant Step Stochastic Douglas-Rachford Algorithm with Application to Non Separable Regularizations. (arXiv:1804.00934v1 [math.OC])
- Aggregated Momentum: Stability Through Passive Damping. (arXiv:1804.00325v3 [cs.LG] UPDATED)
- Understanding Autoencoders with Information Theoretic Concepts. (arXiv:1804.00057v3 [cs.LG] UPDATED)
- Learning to generate classifiers. (arXiv:1803.11373v1 [cs.LG])
- Notes on computational-to-statistical gaps: predictions using statistical physics. (arXiv:1803.11132v2 [stat.ML] UPDATED)
- Stochastic Gradient Hamiltonian Monte Carlo with Variance Reduction for Bayesian Inference. (arXiv:1803.11159v3 [cs.LG] UPDATED)
- On the Equivalence of Inexact Proximal ALM and ADMM for a Class of Convex Composite Programming. (arXiv:1803.10803v2 [math.OC] UPDATED)
- Inexact First-Order Primal-Dual Algorithms. (arXiv:1803.10576v3 [math.OC] UPDATED)
- On the Local Minima of the Empirical Risk. (arXiv:1803.09357v2 [cs.LG] UPDATED)
- A theory of the phenomenology of multipopulation genetic algorithm with an application to the Ising model. (arXiv:1803.09254v3 [cs.NE] UPDATED)
- Derivative-Free Optimization of Noisy Functions via Quasi-Newton Methods. (arXiv:1803.10173v2 [math.OC] UPDATED)
- Asynchronous Gradient-Push. (arXiv:1803.08950v3 [cs.MA] UPDATED)
- On Matching Pursuit and Coordinate Descent. (arXiv:1803.09539v7 [stat.ML] UPDATED)
- Stein Points. (arXiv:1803.10161v4 [stat.CO] UPDATED)
- A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training. (arXiv:1803.09082v1 [stat.ML])
- Finite Sample Complexity of Sequential Monte Carlo Estimators. (arXiv:1803.09365v3 [stat.CO] UPDATED)
- Wide consensus aggregation in the Wasserstein space. Application to location-scatter families
- Evolution of the Wasserstein distance between the marginals of two Markov processes
- Perturbation theory for Markov chains via Wasserstein distance
- Optimization of Smooth Functions with Noisy Observations: Local Minimax Rates. (arXiv:1803.08586v1 [stat.ML])
- Byzantine Stochastic Gradient Descent. (arXiv:1803.08917v1 [cs.LG])
- Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates. (arXiv:1803.08600v1 [math.NA])
- Information Geometry of the Gaussian Space. (arXiv:1803.08135v1 [math.PR])
- Monte Carlo Information Geometry: The dually flat case. (arXiv:1803.07225v1 [cs.LG])
- Distributed Zeroth Order Optimization Over Random Networks: A Kiefer-Wolfowitz Stochastic Approximation Approach. (arXiv:1803.07844v1 [math.OC])
- A Push-Pull Gradient Method for Distributed Optimization in Networks. (arXiv:1803.07588v3 [math.OC] UPDATED)
- Frank-Wolfe with Subsampling Oracle. (arXiv:1803.07348v1 [math.OC])
- Communication reduction in distributed optimization via estimation of the proximal operator. (arXiv:1803.07143v2 [math.OC] UPDATED)
- Monte Carlo Information Geometry: The dually flat case. (arXiv:1803.07225v1 [cs.LG])
- Fastest Rates for Stochastic Mirror Descent Methods. (arXiv:1803.07374v1 [math.OC])
- D$^2$: Decentralized Training over Decentralized Data. (arXiv:1803.07068v2 [cs.DC] UPDATED)
- Parameterizations for Ensemble Kalman Inversion. (arXiv:1709.01781v2 [math.NA] UPDATED)
- Sparse Regularization via Convex Analysis. (arXiv:1803.06765v1 [math.OC])
- (Parametrized) First Order Transport Equations: Realization of Optimally Stable Petrov-Galerkin Methods. (arXiv:1803.06925v3 [math.NA] UPDATED)
- Numerical Integration on Graphs: where to sample and how to weigh. (arXiv:1803.06989v1 [math.ST])
- Projective Splitting with Forward Steps: Asynchronous and Block-Iterative Operator Splitting. (arXiv:1803.07043v7 [math.OC] UPDATED)
- Communication Compression for Decentralized Training. (arXiv:1803.06443v5 [cs.LG] UPDATED)
- Escaping Saddles with Stochastic Gradients. (arXiv:1803.05999v2 [cs.LG] UPDATED)
- Natural gradient via optimal transport. (arXiv:1803.07033v5 [math.OC] UPDATED)
- Gradients on Sets. (arXiv:1803.06243v1 [math.OC])
- On the insufficiency of existing momentum schemes for Stochastic Optimization. (arXiv:1803.05591v2 [cs.LG] UPDATED)
- Escaping Saddles with Stochastic Gradients. (arXiv:1803.05999v1 [cs.LG])
- Nesting Probabilistic Programs. (arXiv:1803.06328v2 [stat.ML] UPDATED)
- Information Thermodynamics of Turing Patterns. (arXiv:1803.05378v1 [cond-mat.stat-mech])
- High Throughput Synchronous Distributed Stochastic Gradient Descent. (arXiv:1803.04209v1 [cs.DC] CROSS LISTED)
- Irreproducibility; Nothing is More Predictable. (arXiv:1803.04481v1 [stat.AP])
- Tutorial on dynamic average consensus: the problem, its applications, and the algorithms. (arXiv:1803.04628v1 [cs.SY])
- Bayesian Optimization for Dynamic Problems. (arXiv:1803.03432v1 [stat.ML])
- Exponentially concave functions and a new information geometry
- WNGrad: Learn the Learning Rate in Gradient Descent. (arXiv:1803.02865v2 [stat.ML] UPDATED)
- Proximal Activation of Smooth Functions in Splitting Algorithms for Convex Image Recovery. (arXiv:1803.02919v5 [math.OC] UPDATED)
- Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning. (arXiv:1803.01927v1 [cs.LG])
- A gradient method in a Hilbert space with an optimized inner product: achieving a Newton-like convergence. (arXiv:1803.02414v1 [math.NA])
- Thermodynamics of Restricted Boltzmann Machines and related learning dynamics. (arXiv:1803.01960v2 [cond-mat.dis-nn] UPDATED)
- Almost Sure Uniqueness of a Global Minimum Without Convexity. (arXiv:1803.02415v3 [econ.EM] UPDATED)
- Proximal Gradient Algorithms: Applications in Signal Processing. (arXiv:1803.01621v4 [eess.SP] UPDATED)
- A Primal-Dual Algorithm with Line Search for General Convex-Concave Saddle Point Problems. (arXiv:1803.01401v5 [math.OC] UPDATED)
- Inexact Successive Quadratic Approximation for Regularized Optimization. (arXiv:1803.01298v4 [math.OC] UPDATED)
- Re-examination of Bregman functions and new properties of their divergences. (arXiv:1803.00641v4 [math.OC] UPDATED)
- Scalable Bayesian uncertainty quantification in imaging inverse problems via convex optimization. (arXiv:1803.00889v2 [stat.ME] UPDATED)
- Computational Optimal Transport. (arXiv:1803.00567v4 [stat.ML] UPDATED)
- Volatility and arbitrage
- Distance Measure Machines. (arXiv:1803.00250v3 [cs.LG] UPDATED)
- The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects. (arXiv:1803.00195v5 [stat.ML] UPDATED)
- A Simple Nearly-Optimal Restart Scheme For Speeding-Up First Order Methods. (arXiv:1803.00151v2 [math.OC] UPDATED)
- The Difficulty of Monte Carlo Approximation of Multivariate Monotone Functions. (arXiv:1803.00099v1 [math.NA])
- Limits on Inferring the Past. (arXiv:1802.10420v2 [stat.OT] UPDATED)
- Parametrized Accelerated Methods Free of Condition Number. (arXiv:1802.10235v1 [math.OC])
- Mirrored Langevin Dynamics. (arXiv:1802.10174v5 [cs.LG] UPDATED)
- Accelerating Asynchronous Algorithms for Convex Optimization by Momentum Compensation. (arXiv:1802.09747v1 [math.OC])
- VR-SGD: A Simple Stochastic Variance Reduction Method for Machine Learning. (arXiv:1802.09932v2 [cs.LG] UPDATED)
- Generalizing Parallel Replica Dynamics: Trajectory Fragments, Asynchronous Computing, and PDMPs. (arXiv:1802.09444v3 [math.NA] UPDATED)
- Steady and Stable: Numerical Investigations of Nonlinear Partial Differential Equations. (arXiv:1802.08785v1 [math.NA])
- Dimension-free Information Concentration via Exp-Concavity. (arXiv:1802.09301v1 [cs.LG])
- An Accelerated Method for Derivative-Free Smooth Stochastic Convex Optimization. (arXiv:1802.09022v3 [math.OC] UPDATED)
- Analysis of Langevin Monte Carlo via convex optimization. (arXiv:1802.09188v2 [stat.CO] UPDATED)
- Averaging Stochastic Gradient Descent on Riemannian Manifolds. (arXiv:1802.09128v2 [cs.LG] UPDATED)
- A Walk with SGD. (arXiv:1802.08770v4 [stat.ML] UPDATED)
- Langevin Monte Carlo and JKO splitting. (arXiv:1802.08671v2 [stat.CO] UPDATED)
- Accelerate iterated filtering. (arXiv:1802.08613v1 [stat.ME])
- Characterizing Implicit Bias in Terms of Optimization Geometry. (arXiv:1802.08246v3 [stat.ML] UPDATED)
- Projection-Free Online Optimization with Stochastic Gradient: From Convexity to Submodularity. (arXiv:1802.08183v4 [stat.ML] UPDATED)
- Iterate averaging as regularization for stochastic gradient descent. (arXiv:1802.08009v1 [cs.LG])
- Sampling as optimization in the space of measures: The Langevin dynamics as a composite optimization problem. (arXiv:1802.08089v2 [math.OC] UPDATED)
- Exact formulas for the normalizing constants of Wishart distributions for graphical models
- Generalization in Machine Learning via Analytical Learning Theory. (arXiv:1802.07426v3 [stat.ML] UPDATED)
- Bregman Parallel Direction Method of Multipliers for Distributed Optimization via Mirror Averaging. (arXiv:1802.06835v3 [math.OC] UPDATED)
- Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization. (arXiv:1802.06903v3 [stat.ML] UPDATED)
- Zeroth-Order Online Alternating Direction Method of Multipliers: Convergence Analysis and Applications. (arXiv:1710.07804v2 [stat.ML] UPDATED)
- An Alternative View: When Does SGD Escape Local Minima?. (arXiv:1802.06175v2 [cs.LG] UPDATED)
- Spurious Valleys in Two-layer Neural Network Optimization Landscapes. (arXiv:1802.06384v4 [math.OC] UPDATED)
- One-dimensional System Arising in Stochastic Gradient Descent. (arXiv:1802.06760v1 [math.PR])
- Convergence of Online Mirror Descent. (arXiv:1802.06357v2 [cs.LG] UPDATED)
- Distributed Stochastic Optimization via Adaptive SGD. (arXiv:1802.05811v3 [stat.ML] UPDATED)
- Information Theory: A Tutorial Introduction. (arXiv:1802.05968v3 [cs.IT] UPDATED)
- Revisiting Normalized Gradient Descent: Fast Evasion of Saddle Points. (arXiv:1711.05224v3 [math.OC] UPDATED)
- Optimal Transport: Fast Probabilistic Approximation with Exact Solvers. (arXiv:1802.05570v4 [stat.CO] UPDATED)
- On the Theory of Variance Reduction for Stochastic Gradient Monte Carlo. (arXiv:1802.05431v1 [stat.ML])
- How Much Data Do You Need? An Operational, Pre-Asymptotic Metric for Fat-tailedness. (arXiv:1802.05495v3 [stat.ME] UPDATED)
- A Diffusion Approximation Theory of Momentum SGD in Nonconvex Optimization. (arXiv:1802.05155v5 [cs.LG] UPDATED)
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator. (arXiv:1802.05098v3 [cs.LG] CROSS LISTED)
- Geometry-Based Data Generation. (arXiv:1802.04927v4 [cs.LG] UPDATED)
- State Space Gaussian Processes with Non-Gaussian Likelihood. (arXiv:1802.04846v5 [stat.ML] UPDATED)
- Uncertainty Quantification for Online Learning and Stochastic Approximation via Hierarchical Incremental Gradient Descent. (arXiv:1802.04876v2 [stat.ML] UPDATED)
- Logarithmic Regret for Online Gradient Descent Beyond Strong Convexity. (arXiv:1802.04623v2 [cs.LG] UPDATED)
- signSGD: Compressed Optimisation for Non-Convex Problems. (arXiv:1802.04434v3 [cs.LG] UPDATED)
- Stochastic Variance-Reduced Cubic Regularized Newton Method. (arXiv:1802.04796v1 [cs.LG])
- Computational Optimal Transport: Complexity by Accelerated Gradient Descent Is Better Than by Sinkhorn's Algorithm. (arXiv:1802.04367v2 [cs.DS] UPDATED)
- Online Variance Reduction for Stochastic Optimization. (arXiv:1802.04715v3 [stat.ML] UPDATED)
- Fast Global Convergence via Landscape of Empirical Loss. (arXiv:1802.04617v1 [stat.ML])
- A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization. (arXiv:1802.04477v4 [math.OC] UPDATED)
- Bouncy Hybrid Sampler as a Unifying Device. (arXiv:1802.04366v2 [stat.CO] UPDATED)
- A Fast Proximal Point Method for Computing Exact Wasserstein Distance. (arXiv:1802.04307v3 [stat.ML] UPDATED)
- Stochastic quasi-Newton with adaptive step lengths for large-scale problems. (arXiv:1802.04310v1 [stat.ML])
- Convergence Analysis of Alternating Projection Method for Nonconvex Sets. (arXiv:1802.03889v2 [math.OC] UPDATED)
- Sparse Random Matrices have Simple Spectrum. (arXiv:1802.03662v2 [math.PR] UPDATED)
- Differentiable Dynamic Programming for Structured Prediction and Attention. (arXiv:1802.03676v2 [stat.ML] UPDATED)
- Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE). (arXiv:1802.03420v1 [physics.chem-ph])
- State Representation Learning for Control: An Overview. (arXiv:1802.04181v2 [cs.AI] UPDATED)
- Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control. (arXiv:1802.04063v2 [cs.LG] UPDATED)
- Katyusha X: Practical Momentum Method for Stochastic Sum-of-Nonconvex Optimization. (arXiv:1802.03866v1 [cs.LG])
- Communication-Computation Efficient Gradient Coding. (arXiv:1802.03475v1 [stat.ML])
- Randomized Block Cubic Newton Method. (arXiv:1802.04084v2 [math.OC] UPDATED)
- Martingale Characterizations of Risk-Averse Stochastic Optimization Problems. (arXiv:1802.03639v2 [math.OC] UPDATED)
- Stochastic Spectral and Conjugate Descent Methods. (arXiv:1802.03703v1 [math.OC])
- SGD and Hogwild! Convergence Without the Bounded Gradients Assumption. (arXiv:1802.03801v2 [math.OC] UPDATED)
- On the Latent Space of Wasserstein Auto-Encoders. (arXiv:1802.03761v1 [stat.ML])
- Spectral Filtering for General Linear Dynamical Systems. (arXiv:1802.03981v1 [cs.LG])
- On Symplectic Optimization. (arXiv:1802.03653v2 [stat.CO] UPDATED)
- Black-box Variational Inference for Stochastic Differential Equations. (arXiv:1802.03335v3 [stat.CO] UPDATED)
- Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches. (arXiv:1802.03133v2 [cs.CV] UPDATED)
- Mini-Batch Stochastic ADMMs for Nonconvex Nonsmooth Optimization. (arXiv:1802.03284v3 [math.OC] UPDATED)
- Online Learning: A Comprehensive Survey. (arXiv:1802.02871v2 [cs.LG] UPDATED)
- Learning Sparse Wavelet Representations. (arXiv:1802.02961v1 [cs.LG])
- Gradient conjugate priors and multi-layer neural networks. (arXiv:1802.02643v3 [math.ST] UPDATED)
- Improved Oracle Complexity of Variance Reduced Methods for Nonsmooth Convex Stochastic Composition Optimization. (arXiv:1802.02339v7 [math.OC] UPDATED)
- The sum of log-normal variates in geometric Brownian motion. (arXiv:1802.02939v1 [cond-mat.stat-mech])
- Monotone Operator Theory in Convex Optimization. (arXiv:1802.02694v3 [math.OC] UPDATED)
- Stochastic subgradient method converges at the rate $O(k^{-1/4})$ on weakly convex functions. (arXiv:1802.02988v3 [math.OC] UPDATED)
- Relax-and-split method for nonsmooth nonconvex problems. (arXiv:1802.02654v2 [math.OC] UPDATED)
- Recent Advances in Neural Program Synthesis. (arXiv:1802.02353v1 [cs.AI])
- Algorithm implementation and numerical analysis for the two-dimensional tempered fractional Laplacian. (arXiv:1802.02349v1 [math.NA])
- Improved Oracle Complexity of Variance Reduced Methods for Nonsmooth Convex Stochastic Composition Optimization. (arXiv:1802.02339v7 [math.OC] UPDATED)
- Training Generative Adversarial Networks via Primal-Dual Subgradient Methods: A Lagrangian Perspective on GAN. (arXiv:1802.01765v1 [cs.LG])
- Dynamics of Wealth Inequality. (arXiv:1802.01991v2 [physics.soc-ph] UPDATED)
- Volatility options in rough volatility models. (arXiv:1802.01641v2 [q-fin.PR] UPDATED)
- Lossless Brownian information engine. (arXiv:1802.01868v1 [cond-mat.stat-mech])
- To understand deep learning we need to understand kernel learning. (arXiv:1802.01396v3 [stat.ML] UPDATED)
- Parameter and Uncertainty Estimation for Dynamical Systems Using Surrogate Stochastic Processes. (arXiv:1802.00852v1 [stat.ME])
- Memory Fusion Network for Multi-view Sequential Learning. (arXiv:1802.00927v1 [cs.LG])
- Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity. (arXiv:1802.01504v2 [math.OC] UPDATED)
- Stochastic control and non-equilibrium thermodynamics: fundamental limits. (arXiv:1802.01271v2 [cond-mat.stat-mech] UPDATED)
- Regional Complexity Analysis of Algorithms for Nonconvex Smooth Optimization. (arXiv:1802.01062v2 [math.OC] UPDATED)
- Stochastic control and non-equilibrium thermodynamics: fundamental limits. (arXiv:1802.01271v1 [cond-mat.stat-mech])
- The Matrix Calculus You Need For Deep Learning. (arXiv:1802.01528v3 [cs.LG] UPDATED)
- Analysis of Fast Alternating Minimization for Structured Dictionary Learning. (arXiv:1802.00518v1 [cs.LG])
- Parametrized measure models
- A Simple Adaptive Step-size Choice for Iterative Optimization Methods. (arXiv:1802.00339v2 [math.OC] UPDATED)
- Probabilistic Recurrent State-Space Models. (arXiv:1801.10395v2 [stat.ML] UPDATED)
- What Is the Fractional Laplacian?. (arXiv:1801.09767v3 [math.NA] UPDATED)
- Strong error analysis for stochastic gradient descent optimization algorithms. (arXiv:1801.09324v1 [math.NA])
- Wasserstein Riemannian Geometry of Positive Definite Matrices. (arXiv:1801.09269v4 [math.ST] UPDATED)
- Algorithmic Linearly Constrained Gaussian Processes. (arXiv:1801.09197v3 [stat.ML] UPDATED)
- Adaptive Scan Gibbs Sampler for Large Scale Inference Problems. (arXiv:1801.09144v1 [stat.ML])
- Gradient descent revisited via an adaptive online learning rate. (arXiv:1801.09136v2 [stat.ML] UPDATED)
- A Review of Multiple Try MCMC algorithms for Signal Processing. (arXiv:1801.09065v1 [stat.CO])
- On Quasi-Newton Forward--Backward Splitting: Proximal Calculus and Convergence. (arXiv:1801.08691v2 [math.OC] UPDATED)
- Smoothing Algorithms for Computing the Projection onto a Minkowski Sum of Convex Sets. (arXiv:1801.08285v1 [math.OC])
- Optimal Monte Carlo integration on closed manifolds. (arXiv:1707.04723v5 [math.NA] CROSS LISTED)
- Importance sampling for partially observed temporal epidemic models. (arXiv:1801.08244v1 [q-bio.PE])
- Global Identifiability of Differential Models. (arXiv:1801.08112v5 [math.CA] UPDATED)
- Gaussian variational approximation for high-dimensional state space models. (arXiv:1801.07873v3 [stat.ME] UPDATED)
- A Proximal Approach for a Class of Matrix Optimization Problems. (arXiv:1801.07452v1 [math.OC])
- Non-ergodic Complexity of Convex Proximal Inertial Gradient Descents. (arXiv:1801.07389v3 [math.OC] UPDATED)
- Yet Another Convex Sets Subtraction with Application in Nondifferentiable Optimization. (arXiv:1801.06946v2 [math.OC] UPDATED)
- Douglas--Rachford Splitting and ADMM for Pathological Convex Optimization. (arXiv:1801.06618v3 [math.OC] UPDATED)
- Probabilistic Tools for the Analysis of Randomized Optimization Heuristics. (arXiv:1801.06733v6 [cs.DS] UPDATED)
- Information is not a thermodynamic resource. (arXiv:1801.05237v2 [quant-ph] UPDATED)
- On the Iteration Complexity Analysis of Stochastic Primal-Dual Hybrid Gradient Approach with High Probability. (arXiv:1801.06934v2 [cs.LG] UPDATED)
- Improving the particle filter in high dimensions using conjugate artificial process noise. (arXiv:1801.07000v2 [stat.CO] UPDATED)
- Upgrading from Gaussian Processes to Student's-T Processes. (arXiv:1801.06147v1 [stat.ML])
- Deep Learning: An Introduction for Applied Mathematicians. (arXiv:1801.05894v1 [math.HO])
- On the Limited Communication Analysis and Design for Decentralized Estimation. (arXiv:1801.05849v1 [cs.SY])
- When Does Stochastic Gradient Algorithm Work Well?. (arXiv:1801.06159v2 [stat.ML] UPDATED)
- On the Proximal Gradient Algorithm with Alternated Inertia. (arXiv:1801.05589v1 [math.OC])
- A Bayesian Conjugate Gradient Method. (arXiv:1801.05242v3 [stat.ME] UPDATED)
- Robust and Scalable Bayes via a Median of Subset Posterior Measures; Stanislav Minsker, Sanvesh Srivastava, Lizhen Lin, David B. Dunson
- Asynchronous Stochastic Variational Inference. (arXiv:1801.04289v1 [stat.ML])
- A dimensional acceleration of gradient descent-like methods, using persistent random walkers. (arXiv:1801.04532v2 [physics.comp-ph] UPDATED)
- Convexification of a 3-D coefficient inverse scattering problem. (arXiv:1801.04404v1 [math.NA])
- Generalized Conditional Gradient for Sparse Estimation; Yaoliang Yu, Xinhua Zhang, Dale Schuurmans
- Following the Leader and Fast Rates in Online Linear Prediction: Curved Constraint Sets and Other Regularities; Ruitong Huang, Tor Lattimore, András György, Csaba Szepesvári
- Communication Optimality Trade-offs For Distributed Estimation. (arXiv:1801.04050v1 [math.OC])
- Bayesian Quadrature for Multiple Related Integrals. (arXiv:1801.04153v7 [stat.CO] UPDATED)
- Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution. (arXiv:1801.04016v1 [cs.LG])
- Model-Based Action Exploration for Learning Dynamic Motion Skills. (arXiv:1801.03954v2 [cs.AI] UPDATED)
- Towards Arbitrary Noise Augmentation - Deep Learning for Sampling from Arbitrary Probability Distributions. (arXiv:1801.04211v2 [cs.LG] UPDATED)
- Noisy Expectation-Maximization: Applications and Generalizations. (arXiv:1801.04053v1 [stat.ML])
- Fast iterative solvers for an optimal transport problem. (arXiv:1801.04172v1 [math.NA])
- Using probabilistic programs as proposals. (arXiv:1801.03612v2 [cs.AI] UPDATED)
- Stochastic Gradient Monomial Gamma Sampler. (arXiv:1706.01498v2 [stat.ML] CROSS LISTED)
- Improved asynchronous parallel optimization analysis for stochastic incremental methods. (arXiv:1801.03749v3 [math.OC] UPDATED)
- Using probabilistic programs as proposals. (arXiv:1801.03612v2 [cs.AI] UPDATED)
- Non-stationary Douglas-Rachford and alternating direction method of multipliers: adaptive stepsizes and convergence. (arXiv:1801.03765v2 [math.OC] UPDATED)
- Improved asynchronous parallel optimization analysis for stochastic incremental methods. (arXiv:1801.03749v3 [math.OC] UPDATED)
- A Formalization of Kant's Second Formulation of the Categorical Imperative. (arXiv:1801.03160v3 [cs.AI] UPDATED)
- Convergence Analysis of Gradient Descent Algorithms with Proportional Updates. (arXiv:1801.03137v1 [cs.LG])
- Measure-valued spline curves: an optimal transport viewpoint. (arXiv:1801.03186v1 [math.OC])
- Nonconvex Lagrangian-Based Optimization: Monitoring Schemes and Global Convergence. (arXiv:1801.03013v1 [math.OC])
- How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD. (arXiv:1801.02982v3 [cs.LG] UPDATED)
- Log-concave sampling: Metropolis-Hastings algorithms are fast. (arXiv:1801.02309v4 [stat.ML] UPDATED)
- Convergence rates of proximal gradient methods via the convex conjugate. (arXiv:1801.02509v2 [math.OC] UPDATED)
- Effective strong convergence of the proximal point algorithm in CAT(0) spaces. (arXiv:1801.02179v2 [math.OC] UPDATED)
- The proximal alternating direction method of multipliers in the nonconvex setting: convergence analysis and rates. (arXiv:1801.01994v2 [math.OC] UPDATED)
- Constructing Metropolis-Hastings proposals using damped BFGS updates. (arXiv:1801.01243v2 [stat.CO] UPDATED)
- Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning. (arXiv:1712.06567v3 [cs.NE] UPDATED)
- Monte Carlo integration with a growing number of control variates. (arXiv:1801.01797v4 [math.ST] UPDATED)
- Probabilistic max-plus schemes for solving Hamilton-Jacobi-Bellman equations. (arXiv:1801.01780v1 [math.OC])
- Convergence rates of Forward--Douglas--Rachford splitting method. (arXiv:1801.01088v1 [math.OC])
- Probabilistic supervised learning. (arXiv:1801.00753v3 [stat.ML] UPDATED)
- Parameter estimation with a class of outer probability measures. (arXiv:1801.00569v4 [stat.ME] UPDATED)
- ZOOpt: Toolbox for Derivative-Free Optimization. (arXiv:1801.00329v3 [cs.LG] UPDATED)
- Molecular enhanced sampling with autoencoders: On-the-fly collective variable discovery and accelerated free energy landscape exploration. (arXiv:1801.00203v2 [physics.bio-ph] UPDATED)
- Thermodynamics of non-Markovian reservoirs and heat engines. (arXiv:1801.00744v2 [quant-ph] UPDATED)
- Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications. (arXiv:1712.10279v2 [math.OC] UPDATED)
Saved in 2017
- Visualizing the Loss Landscape of Neural Nets. (arXiv:1712.09913v3 [cs.LG] UPDATED)
- Deep learning for universal linear embeddings of nonlinear dynamics. (arXiv:1712.09707v2 [math.DS] UPDATED)
- Entropy-SGD optimizes the prior of a PAC-Bayes bound: Generalization properties of Entropy-SGD and data-dependent priors. (arXiv:1712.09376v3 [stat.ML] UPDATED)
- A single potential governing convergence of conjugate gradient, accelerated gradient and geometric descent. (arXiv:1712.09498v2 [math.OC] UPDATED)
- Entropy balance and Information processing in bipartite and non-bipartite composite systems. (arXiv:1712.09715v2 [cond-mat.stat-mech] UPDATED)
- Momentum and Stochastic Momentum for Stochastic Gradient, Newton, Proximal Point and Subspace Descent Methods. (arXiv:1712.09677v2 [math.OC] UPDATED)
- On Statistical Optimality of Variational Bayes. (arXiv:1712.08983v1 [math.ST])
- Lectures on Randomized Numerical Linear Algebra. (arXiv:1712.08880v1 [cs.DS])
- On the nonequilibrium entropy of large and small systems. (arXiv:1712.08961v3 [cond-mat.stat-mech] UPDATED)
- A Random Block-Coordinate Douglas-Rachford Splitting Method with Low Computational Complexity for Binary Logistic Regression. (arXiv:1712.09131v1 [math.OC])
- Distributed Coupled Multi-Agent Stochastic Optimization. (arXiv:1712.08817v3 [math.OC] UPDATED)
- True Asymptotic Natural Gradient Optimization. (arXiv:1712.08449v1 [stat.ML])
- Differential geometry and stochastic dynamics with deep learning numerics. (arXiv:1712.08364v1 [cs.CG])
- Geometrical Insights for Implicit Generative Modeling. (arXiv:1712.07822v3 [stat.ML] UPDATED)
- A graph theoretic interpretation of the mean first passage times. (arXiv:math/0701359v4 [math.PR] UPDATED)
- Introduction to Random Matrices - Theory and Practice. (arXiv:1712.07903v1 [math-ph])
- Non-convex Optimization for Machine Learning. (arXiv:1712.07897v1 [stat.ML])
- On Wasserstein Reinforcement Learning and the Fokker-Planck equation. (arXiv:1712.07185v1 [cs.LG])
- Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems over Large Graphs. (arXiv:1712.07027v1 [math.OC])
- Statistical Inference for the Population Landscape via Moment Adjusted Stochastic Gradients. (arXiv:1712.07519v2 [stat.ML] UPDATED)
- ADINE: An Adaptive Momentum Method for Stochastic Gradient Descent. (arXiv:1712.07424v1 [stat.ML])
- The Power of Interpolation: Understanding the Effectiveness of SGD in Modern Over-parametrized Learning. (arXiv:1712.06559v3 [cs.LG] UPDATED)
- Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems over Large Graphs. (arXiv:1712.07027v1 [math.OC])
- Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents. (arXiv:1712.06560v3 [cs.AI] UPDATED)
- Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients. (arXiv:1712.06563v3 [cs.NE] UPDATED)
- ES Is More Than Just a Traditional Finite-Difference Approximator. (arXiv:1712.06568v3 [cs.NE] UPDATED)
- The Power of Interpolation: Understanding the Effectiveness of SGD in Modern Over-parametrized Learning. (arXiv:1712.06559v3 [cs.LG] UPDATED)
- Misspecified Nonconvex Statistical Optimization for Phase Retrieval. (arXiv:1712.06245v1 [stat.ML])
- Structured Optimal Transport. (arXiv:1712.06199v1 [stat.ML])
- How well does your sampler really work?. (arXiv:1712.06006v1 [stat.ML])
- Third-order Smoothness Helps: Even Faster Stochastic Optimization Algorithms for Finding Local Minima. (arXiv:1712.06585v1 [math.OC])
- Distributed SMC-PHD Fusion for Partial, Arithmetic Average Consensus. (arXiv:1712.06128v1 [cs.SY])
- Universal Intermediate Gradient Method for Convex Problems with Inexact Oracle. (arXiv:1712.06036v2 [math.OC] UPDATED)
- The proximal point method revisited. (arXiv:1712.06038v1 [math.OC])
- Continious-time Importance Sampling: Monte Carlo Methods which Avoid Time-discretisation Error. (arXiv:1712.06201v1 [stat.ME])
- Ergodicity of some classes of cellular automata subject to noise. (arXiv:1712.05500v3 [math.PR] UPDATED)
- Stein's Method for Stationary Distributions of Markov Chains and Application to Ising Models. (arXiv:1712.05743v3 [math.PR] UPDATED)
- Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice. (arXiv:1712.05654v2 [stat.ML] UPDATED)
- Stochastic Particle Gradient Descent for Infinite Ensembles. (arXiv:1712.05438v1 [stat.ML])
- Statistical Inference for SPDEs: an overview. (arXiv:1712.05445v1 [math.PR])
- Random forward models and log-likelihoods in Bayesian inverse problems. (arXiv:1712.05717v5 [math.ST] UPDATED)
- Comparing consensus Monte Carlo strategies for distributed Bayesian computation
- Asymptotic bias of stochastic gradient search
- Language: The missing selection pressure. (arXiv:1712.05005v1 [q-bio.PE])
- Equations of Evolutionary Dynamics in High Dimensions. (arXiv:1712.04774v1 [q-bio.PE])
- Sixty years of percolation. (arXiv:1712.04651v1 [math.PR])
- "Active-set complexity" of proximal gradient: How long does it take to find the sparsity pattern?. (arXiv:1712.03577v2 [math.OC] UPDATED)
- Stochastic thermodynamic interpretation of information geometry. (arXiv:1712.04311v6 [cond-mat.stat-mech] UPDATED)
- Convergence Rates for Deterministic and Stochastic Subgradient Methods Without Lipschitz Continuity. (arXiv:1712.04104v3 [math.OC] UPDATED)
- Shape optimization in laminar flow with a label-guided variational autoencoder. (arXiv:1712.03599v1 [cs.CE])
- Linear regression over the max-plus semiring: algorithms and applications. (arXiv:1712.03499v1 [math.NA])
- On the numerical solution of non-linear first order ordinary differential equation systems. (arXiv:1712.03552v1 [math.NA])
- Variational auto-encoding of protein sequences. (arXiv:1712.03346v1 [q-bio.QM])
- Saving Gradient and Negative Curvature Computations: Finding Local Minima More Efficiently. (arXiv:1712.03950v1 [cs.LG])
- Continuous-discrete smoothing of diffusions. (arXiv:1712.03807v4 [stat.CO] UPDATED)
- Saving Gradient and Negative Curvature Computations: Finding Local Minima More Efficiently. (arXiv:1712.03950v1 [cs.LG])
- Assumed Density Filtering Q-learning. (arXiv:1712.03333v4 [cs.LG] UPDATED)
- Neumann Optimizer: A Practical Optimization Algorithm for Deep Neural Networks. (arXiv:1712.03298v1 [cs.LG])
- A primer on noise-induced transitions in applied dynamical systems. (arXiv:1712.03785v2 [math.AP] UPDATED)
- Logarithmic divergences from optimal transport and R\'enyi geometry. (arXiv:1712.03610v3 [math.PR] UPDATED)
- AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training. (arXiv:1712.02679v1 [cs.LG])
- Noisy Natural Gradient as Variational Inference. (arXiv:1712.02390v2 [cs.LG] UPDATED)
- Structure-Adaptive, Variance-Reduced, and Accelerated Stochastic Optimization. (arXiv:1712.03156v2 [math.OC] UPDATED)
- Coordinate Descent with Bandit Sampling. (arXiv:1712.03010v2 [cs.LG] UPDATED)
- On Adaptive Estimation for Dynamic Bernoulli Bandits. (arXiv:1712.03134v2 [stat.ML] UPDATED)
- Iterated filtering methods for Markov process epidemic models. (arXiv:1712.03058v3 [stat.ME] UPDATED)
- Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training. (arXiv:1712.01887v3 [cs.CV] UPDATED)
- Adaptive Submodularity: Theory and Applications in Active Learning and Stochastic Optimization. (arXiv:1003.3967v5 [cs.LG] UPDATED)
- Differentially Private Variational Dropout. (arXiv:1712.02629v3 [stat.ML] UPDATED)
- The Approximate Duality Gap Technique: A Unified Theory of First-Order Methods. (arXiv:1712.02485v3 [math.OC] UPDATED)
- Exact Renormalization Groups as a form of Entropic Dynamics. (arXiv:1712.02267v1 [cond-mat.stat-mech])
- Optimizing Human Learning. (arXiv:1712.01856v2 [stat.ML] UPDATED)
- Deep linear neural networks with arbitrary loss: All local minima are global. (arXiv:1712.01473v2 [cs.LG] UPDATED)
- Integrative biological simulation praxis: Considerations from physics, philosophy, and data/model curation practices. (arXiv:1712.01417v1 [q-bio.QM])
- Differentially Private Dropout. (arXiv:1712.01665v1 [stat.ML])
- Hessian eigenvalue distribution in a random Gaussian landscape. (arXiv:1712.01282v1 [hep-th])
- Connection between subdifferentials and codifferentials. Constructing the continuous codifferentials. I. (arXiv:1712.01346v3 [math.CA] UPDATED)
- A variational derivation of a class of BFGS-like methods. (arXiv:1712.00680v3 [math.NA] UPDATED)
- Inferring agent objectives at different scales of a complex adaptive system. (arXiv:1712.01137v1 [q-fin.TR])
- Natural Langevin Dynamics for Neural Networks. (arXiv:1712.01076v1 [stat.ML])
- Vprop: Variational Inference using RMSprop. (arXiv:1712.01038v1 [stat.ML])
- Comment: A brief survey of the current state of play for Bayesian computation in data science at Big-Data scale. (arXiv:1712.00849v2 [stat.CO] UPDATED)
- A Pliable Lasso. (arXiv:1712.00484v4 [stat.ME] UPDATED)
- Drift Analysis. (arXiv:1712.00964v2 [cs.NE] UPDATED)
- NEON+: Accelerated Gradient Methods for Extracting Negative Curvature for Non-Convex Optimization. (arXiv:1712.01033v2 [math.OC] UPDATED)
- Inertial Proximal Incremental Aggregated Gradient Method. (arXiv:1712.00984v2 [math.OC] UPDATED)
- Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima. (arXiv:1712.00779v2 [cs.LG] UPDATED)
- Rapid Bayesian Inference of Global Network Statistics Using Random Walks. (arXiv:1712.00804v2 [physics.soc-ph] UPDATED)
- Convolutional Phase Retrieval via Gradient Descent. (arXiv:1712.00716v3 [stat.CO] UPDATED)
- The reparameterization trick for acquisition functions. (arXiv:1712.00424v1 [stat.ML])
- Optimization Methods for Inverse Problems. (arXiv:1712.00154v1 [math.OC])
- Adaptive fast gradient method in stochastic optimization tasks. (arXiv:1712.00062v1 [math.OC])
- The Wright--Fisher model for class--dependent fitness landscapes. (arXiv:1712.00279v1 [math.PR])
- Optimal Algorithms for Distributed Optimization. (arXiv:1712.00232v1 [math.OC])
- Sample-based Population Observers. (arXiv:1711.11095v1 [math.OC])
- On reducing the communication cost of the diffusion LMS algorithm. (arXiv:1711.11423v2 [stat.ML] UPDATED)
- State Space LSTM Models with Particle MCMC Inference. (arXiv:1711.11179v1 [cs.LG])
- TensorFlow Distributions. (arXiv:1711.10604v1 [cs.PL])
- Stochastic Approximation on Riemannian manifolds. (arXiv:1711.10754v1 [math.OC])
- Particle Optimization in Stochastic Gradient MCMC. (arXiv:1711.10927v1 [stat.ML])
- Learning nonlinear state-space models using smooth particle-filter-based likelihood approximations. (arXiv:1711.10765v1 [stat.CO])
- Julian Ernst Besag, 26 March 1945 -- 6 August 2010, a biographical memoir. (arXiv:1711.10262v2 [stat.OT] UPDATED)
- Backprop as Functor: A compositional perspective on supervised learning. (arXiv:1711.10455v3 [math.CT] UPDATED)
- Accelerated Gradient Descent Escapes Saddle Points Faster than Gradient Descent. (arXiv:1711.10456v1 [cs.LG])
- On the convergence rate of the scaled proximal decomposition on the graph of a maximal monotone operator (SPDG) algorithm. (arXiv:1711.09959v2 [math.OC] UPDATED)
- On One-Dimensional Riccati Diffusions. (arXiv:1711.10065v3 [math.PR] UPDATED)
- The collapse of ecosystem engineer populations. (arXiv:1711.09424v1 [q-bio.PE])
- Generalizing Hamiltonian Monte Carlo with Neural Networks. (arXiv:1711.09268v3 [stat.ML] UPDATED)
- An abstract proximal point algorithm. (arXiv:1711.09455v2 [math.OC] UPDATED)
- Analysis of the Gradient Method with an Armijo-Wolfe Line Search on a Class of Nonsmooth Convex Functions. (arXiv:1711.08517v2 [math.OC] UPDATED)
- Central limit theorems for entropy-regularized optimal transport on finite spaces and statistical applications. (arXiv:1711.08947v3 [math.ST] UPDATED)
- Variational Encoding of Complex Dynamics. (arXiv:1711.08576v2 [stat.ML] UPDATED)
- Contracting Nonlinear Observers: Convex Optimization and Learning from Data. (arXiv:1711.08135v1 [cs.SY])
- Distributed Kalman Filter in a Network of Linear Dynamical Systems. (arXiv:1711.07625v1 [cs.SY])
- Large deviations. (arXiv:1711.07571v1 [cond-mat.stat-mech])
- The Pontryagin Maximum Principle in the Wasserstein Space. (arXiv:1711.07667v5 [math.OC] UPDATED)
- Unbiased Simulation for Optimizing Stochastic Function Compositions. (arXiv:1711.07564v1 [math.OC])
- Decentralized High-Dimensional Bayesian Optimization with Factor Graphs. (arXiv:1711.07033v3 [stat.ML] UPDATED)
- Techniques for proving Asynchronous Convergence results for Markov Chain Monte Carlo methods. (arXiv:1711.06719v5 [stat.ML] UPDATED)
- Strict Local Martingales and Optimal Investment in a Black-Scholes Model with a Bubble. (arXiv:1711.06679v1 [q-fin.MF])
- Proximal Gradient Method with Extrapolation and Line Search for a Class of Nonconvex and Nonsmooth Problems. (arXiv:1711.06831v4 [math.OC] UPDATED)
- New convergence analysis of a primal-dual algorithm with large stepsizes. (arXiv:1711.06785v2 [math.OC] UPDATED)
- A note on Hadamard fractional differential equations with varying coefficients and their applications in probability. (arXiv:1711.07016v1 [math.PR])
- Informed proposals for local MCMC in discrete spaces. (arXiv:1711.07424v1 [stat.CO])
- Approaching nonsmooth nonconvex minimization through second order proximal-gradient dynamical systems. (arXiv:1711.06570v1 [math.OC])
- Fast Simulation of Hyperplane-Truncated Multivariate Normal Distributions
- Bootstrapped synthetic likelihood. (arXiv:1711.05825v2 [stat.CO] UPDATED)
- Hindsight policy gradients. (arXiv:1711.06006v3 [cs.LG] UPDATED)
- Random gradient extrapolation for distributed and stochastic optimization. (arXiv:1711.05762v1 [math.OC])
- Distributed Stochastic Variance Reduced Gradient Methods by Sampling Extra Data with Replacement; Jason D. Lee, Qihang Lin, Tengyu Ma, Tianbao Yang
- Second-Order Stochastic Optimization for Machine Learning in Linear Time; Naman Agarwal, Brian Bullins, Elad Hazan
- A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization; Shun Zheng, Jialei Wang, Fen Xia, Wei Xu, Tong Zhang
- Efficient Sampling from Time-Varying Log-Concave Distributions; Hariharan Narayanan, Alexander Rakhlin
- An Accelerated Communication-Efficient Primal-Dual Optimization Framework for Structured Machine Learning. (arXiv:1711.05305v1 [math.OC])
- Uncertainty quantification for radio interferometric imaging: I. proximal MCMC methods. (arXiv:1711.04818v2 [astro-ph.IM] UPDATED)
- Preconditioned proximal point methods and notions of partial subregularity. (arXiv:1711.05123v5 [math.OC] UPDATED)
- Sharp non-asymptotic Concentration Inequalities for the Approximation of the Invariant Measure of a Diffusion. (arXiv:1711.05620v2 [math.PR] UPDATED)
- Geometric integrators and the Hamiltonian Monte Carlo method. (arXiv:1711.05337v1 [math.PR])
- Relations between Heat Exchange and R\'{e}nyi Divergences. (arXiv:1711.05383v1 [quant-ph])
- Sobolev GAN. (arXiv:1711.04894v1 [cs.LG])
- Parameter Estimation in Finite Mixture Models by Regularized Optimal Transport: A Unified Framework for Hard and Soft Clustering. (arXiv:1711.04366v1 [cs.LG])
- Extremely Large Minibatch SGD: Training ResNet-50 on ImageNet in 15 Minutes. (arXiv:1711.04325v1 [cs.DC])
- Experimental realization of Feynman's ratchet. (arXiv:1711.04968v1 [cond-mat.stat-mech])
- Scalable Peaceman-Rachford Splitting Method with Proximal Terms. (arXiv:1711.04955v2 [stat.ML] UPDATED)
- Message Passing Stein Variational Gradient Descent. (arXiv:1711.04425v3 [stat.ML] UPDATED)
- Asynchronous Schemes for Stochastic and Misspecified Potential Games and Nonconvex Optimization. (arXiv:1711.03963v3 [math.OC] UPDATED)
- Black was right: Price is within a factor 2 of Value. (arXiv:1711.04717v2 [q-fin.PM] UPDATED)
- Selection strategies for randomly distributed replicators. (arXiv:1711.04350v1 [q-bio.PE])
- Circularly-Coupled Markov Chain Sampling. (arXiv:1711.04399v1 [stat.CO])
- How fragile are information cascades?. (arXiv:1711.04024v2 [math.PR] UPDATED)
- Variance Reduced methods for Non-convex Composition Optimization. (arXiv:1711.04416v1 [stat.ML])
- Adaptive FISTA for Non-convex Optimization. (arXiv:1711.04343v4 [math.OC] UPDATED)
- Interpolation and Extrapolation of Toeplitz Matrices via Optimal Mass Transport. (arXiv:1711.03890v2 [eess.SP] UPDATED)
- Alternating minimization for dictionary learning: Local Convergence Guarantees. (arXiv:1711.03634v4 [stat.ML] UPDATED)
- Temperature in and out of equilibrium: a review of concepts, tools and attempts. (arXiv:1711.03770v1 [cond-mat.stat-mech])
- Accelerated Method for Stochastic Composition Optimization with Nonsmooth Regularization. (arXiv:1711.03937v2 [cs.LG] UPDATED)
- Selecting Representative Examples for Program Synthesis. (arXiv:1711.03243v3 [cs.AI] UPDATED)
- Smooth Primal-Dual Coordinate Descent Algorithms for Nonsmooth Convex Optimization. (arXiv:1711.03439v1 [math.OC])
- Stochastic Cubic Regularization for Fast Nonconvex Optimization. (arXiv:1711.02838v2 [cs.LG] UPDATED)
- Fast gradient descent method for convex optimization problems with an oracle that generates a $(\delta,L)$-model of a function in a requested point. (arXiv:1711.02747v5 [math.OC] UPDATED)
- Multilevel Monte Carlo for Smoothing via Transport Methods. (arXiv:1711.02836v2 [stat.ME] UPDATED)
- Tangent: Automatic Differentiation Using Source Code Transformation in Python. (arXiv:1711.02712v1 [cs.MS])
- Large-Scale Optimal Transport and Mapping Estimation. (arXiv:1711.02283v2 [stat.ML] UPDATED)
- Adaptive Bayesian Sampling with Monte Carlo EM. (arXiv:1711.02159v1 [cs.LG])
- Analysis and Optimization of Population Annealing. (arXiv:1711.02146v2 [cond-mat.stat-mech] UPDATED)
- Exponential Discrete Gradient Schemes for Stochastic Differential Equations. (arXiv:1711.02522v1 [math.NA])
- Convex Optimization with Unbounded Nonconvex Oracles using Simulated Annealing. (arXiv:1711.02621v2 [cs.DS] UPDATED)
- Safe Adaptive Importance Sampling. (arXiv:1711.02637v1 [cs.LG])
- Optimal transport maps for distribution preserving operations on latent spaces of Generative Models. (arXiv:1711.01970v2 [cs.LG] UPDATED)
- Approximating Partition Functions in Constant Time. (arXiv:1711.01655v2 [cs.LG] UPDATED)
- Wasserstein Auto-Encoders. (arXiv:1711.01558v4 [stat.ML] UPDATED)
- AdaBatch: Efficient Gradient Aggregation Rules for Sequential and Parallel Stochastic Gradient Methods. (arXiv:1711.01761v1 [cs.LG])
- Overrelaxed Sinkhorn-Knopp Algorithm for Regularized Optimal Transport. (arXiv:1711.01851v2 [math.NA] UPDATED)
- Fisher-Rao Metric, Geometry, and Complexity of Neural Networks. (arXiv:1711.01530v2 [cs.LG] UPDATED)
- First-order Stochastic Algorithms for Escaping From Saddle Points in Almost Linear Time. (arXiv:1711.01944v3 [math.OC] UPDATED)
- Proximal Alternating Penalty Algorithms for Constrained Convex Optimization. (arXiv:1711.01367v3 [math.OC] UPDATED)
- Proximal-Like Incremental Aggregated Gradient Method with Linear Convergence under Bregman Distance Growth Conditions. (arXiv:1711.01136v3 [math.OC] UPDATED)
- Learning Linear Dynamical Systems via Spectral Filtering. (arXiv:1711.00946v1 [cs.LG])
- Approximation of Functions over Manifolds: A Moving Least-Squares Approach. (arXiv:1711.00765v4 [stat.ML] UPDATED)
- Learning to Represent Programs with Graphs. (arXiv:1711.00740v3 [cs.LG] UPDATED)
- Fast Information-theoretic Bayesian Optimisation. (arXiv:1711.00673v5 [stat.ML] UPDATED)
- Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm. (arXiv:1710.11622v3 [cs.LG] UPDATED)
- Understanding GANs: the LQG Setting. (arXiv:1710.10793v2 [stat.ML] UPDATED)
- DGM: A deep learning algorithm for solving partial differential equations. (arXiv:1708.07469v5 [q-fin.MF] UPDATED)
- Adaptive Sampling Strategies for Stochastic Optimization. (arXiv:1710.11258v1 [math.OC])
- Coarse-Graining Open Markov Processes. (arXiv:1710.11343v4 [math-ph] UPDATED)
- Universal gradient descent. (arXiv:1711.00394v29 [math.OC] UPDATED)
- Backpropagation through the Void: Optimizing control variates for black-box gradient estimation. (arXiv:1711.00123v3 [cs.LG] UPDATED)
- Fixing a Broken ELBO. (arXiv:1711.00464v3 [cs.LG] UPDATED)
- Deep Neural Networks as Gaussian Processes. (arXiv:1711.00165v3 [stat.ML] UPDATED)
- SGDLibrary: A MATLAB library for stochastic gradient descent algorithms. (arXiv:1710.10951v2 [cs.MS] UPDATED)
- Stochastic Zeroth-order Optimization in High Dimensions. (arXiv:1710.10551v2 [stat.ML] UPDATED)
- The Implicit Bias of Gradient Descent on Separable Data. (arXiv:1710.10345v5 [stat.ML] UPDATED)
- Convex duality in nonlinear optimal transport. (arXiv:1710.10981v1 [math.PR])
- An introduction to random matrix theory. (arXiv:1710.10792v1 [math.PR])
- A Derivative-Free Gauss-Newton Method. (arXiv:1710.11005v1 [math.OC])
- Lower Bounds for Higher-Order Convex Optimization. (arXiv:1710.10329v1 [math.OC])
- Stochastic variance reduced multiplicative update for nonnegative matrix factorization. (arXiv:1710.10781v1 [cs.NA])
- Stochastic gradient descent performs variational inference, converges to limit cycles for deep networks. (arXiv:1710.11029v2 [cs.LG] UPDATED)
- Hot new directions for quasi-Monte Carlo research in step with applications. (arXiv:1710.09905v1 [math.NA])
- Gradient Sparsification for Communication-Efficient Distributed Optimization. (arXiv:1710.09854v1 [cs.LG])
- A Central Limit Theorem for Wasserstein type distances between two different laws. (arXiv:1710.09763v2 [math.ST] UPDATED)
- Directional Metropolis-Hastings. (arXiv:1710.09759v1 [stat.CO])
- Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior. (arXiv:1710.09553v2 [cs.LG] UPDATED)
- Duality-free Methods for Stochastic Composition Optimization. (arXiv:1710.09554v1 [stat.ML])
- PDE-Net: Learning PDEs from Data. (arXiv:1710.09668v2 [math.NA] UPDATED)
- Stochastic Non-convex Optimization with Strong High Probability Second-order Convergence. (arXiv:1710.09447v2 [math.OC] UPDATED)
- Kernel-based collocation methods for Zakai equations. (arXiv:1710.09090v8 [math.NA] UPDATED)
- Block Coordinate Descent Only Converge to Minimizers. (arXiv:1710.09047v1 [math.OC])
- Curvature-aided Incremental Aggregated Gradient Method. (arXiv:1710.08936v1 [stat.ML])
- Fixation and absorption in a fluctuating environment. (arXiv:1710.08807v1 [q-bio.PE])
- Nesterov's Acceleration For Approximate Newton. (arXiv:1710.08496v1 [cs.LG])
- Auto-Differentiating Linear Algebra. (arXiv:1710.08717v5 [cs.MS] UPDATED)
- Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step. (arXiv:1710.08446v3 [stat.ML] UPDATED)
- Avoiding Communication in Proximal Methods for Convex Optimization Problems. (arXiv:1710.08883v1 [cs.DC])
- Gradient flows, second order gradient systems and convexity. (arXiv:1710.07858v2 [math.OC] UPDATED)
- Stochastic Backward Euler: An Implicit Gradient Descent Algorithm for $k$-means Clustering. (arXiv:1710.07746v3 [math.OC] UPDATED)
- bridgesampling: An R Package for Estimating Normalizing Constants. (arXiv:1710.08162v3 [stat.CO] UPDATED)
- Optimal Rates for Learning with Nystr\"om Stochastic Gradient Methods. (arXiv:1710.07797v1 [stat.ML])
- Zeroth-Order Online Alternating Direction Method of Multipliers: Convergence Analysis and Applications. (arXiv:1710.07804v2 [stat.ML] UPDATED)
- Localization for MCMC: sampling high-dimensional posterior distributions with local structure. (arXiv:1710.07747v7 [stat.ME] UPDATED)
- Nonlinear Filtering for Periodic, Time-Varying Parameter Estimation. (arXiv:1710.07978v1 [q-bio.QM])
- Accelerating Stochastic Composition Optimization; Mengdi Wang, Ji Liu, Ethan X. Fang
- Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression; Aymeric Dieuleveut, Nicolas Flammarion, Francis Bach
- Tracking the gradients using the Hessian: A new look at variance reducing stochastic methods. (arXiv:1710.07462v3 [math.OC] UPDATED)
- First-order Methods Almost Always Avoid Saddle Points. (arXiv:1710.07406v1 [stat.ML])
- Convergence Analysis of the Frank-Wolfe Algorithm and Its Generalization in Banach Spaces. (arXiv:1710.07367v1 [math.OC])
- A regularity structure for rough volatility. (arXiv:1710.07481v1 [q-fin.PR])
- Time Averages of Markov Processes and Applications to Two-Timescale Problems. (arXiv:1710.07447v2 [math.PR] UPDATED)
- Asynchronous Decentralized Parallel Stochastic Gradient Descent. (arXiv:1710.06952v3 [math.OC] UPDATED)
- Variational Inference based on Robust Divergences. (arXiv:1710.06595v2 [stat.ML] UPDATED)
- A Bayesian Perspective on Generalization and Stochastic Gradient Descent. (arXiv:1710.06451v3 [cs.LG] UPDATED)
- Convergence diagnostics for stochastic gradient descent with constant step size. (arXiv:1710.06382v2 [stat.ML] UPDATED)
- Incremental Subgradient Methods for Minimizing The Sum of Quasi-convex Functions. (arXiv:1710.06073v1 [math.OC])
- A successive difference-of-convex approximation method for a class of nonconvex nonsmooth optimization problems. (arXiv:1710.05778v2 [math.OC] UPDATED)
- The Tamed Unadjusted Langevin Algorithm. (arXiv:1710.05559v3 [stat.ME] UPDATED)
- Semi-independent resampling for particle filtering. (arXiv:1710.05407v1 [stat.CO])
- Robust Decentralized Learning Using ADMM with Unreliable Agents. (arXiv:1710.05241v3 [cs.LG] UPDATED)
- A Geometric View of Optimal Transportation and Generative Model. (arXiv:1710.05488v2 [cs.LG] UPDATED)
- Generalization in Deep Learning. (arXiv:1710.05468v9 [stat.ML] UPDATED)
- Dropout as a Low-Rank Regularizer for Matrix Factorization. (arXiv:1710.05092v1 [cs.LG])
- Information loss under coarse-graining: a geometric approach. (arXiv:1710.05787v2 [cond-mat.stat-mech] UPDATED)
- Entropy Production and Information Flow for Markov Diffusions with Filtering. (arXiv:1710.05553v1 [math-ph])
- Non-Euclidean Conditional Expectation and Filtering. (arXiv:1710.05829v3 [q-fin.MF] UPDATED)
- On stochastic and deterministic quasi-Newton methods for non-Strongly convex optimization: Asymptotic convergence and rate analysis. (arXiv:1710.05509v3 [math.OC] UPDATED)
- Second-Order Methods with Cubic Regularization Under Inexact Information. (arXiv:1710.05782v1 [math.OC])
- Accelerated Block Coordinate Proximal Gradients with Applications in High Dimensional Statistics. (arXiv:1710.05338v7 [math.OC] UPDATED)
- Well-posedness of Bayesian inverse problems in quasi-Banach spaces with stable priors. (arXiv:1710.05610v1 [math.PR])
- Potential Conditional Mutual Information: Estimators, Properties and Applications. (arXiv:1710.05012v1 [cs.IT])
- Parsimonious Adaptive Rejection Sampling. (arXiv:1710.04948v1 [stat.CO])
- A Robust Accelerated Optimization Algorithm for Strongly Convex Functions. (arXiv:1710.04753v1 [math.OC])
- Particle Filtering for Stochastic Navier-Stokes Signal Observed with Linear Additive Noise. (arXiv:1710.04586v2 [stat.CO] UPDATED)
- Marginal sequential Monte Carlo for doubly intractable models. (arXiv:1710.04382v1 [stat.CO])
- Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem. (arXiv:1710.04273v4 [math.PR] UPDATED)
- A probabilistic view on the deterministic mutation-selection equation: dynamics, equilibria, and ancestry via individual lines of descent. (arXiv:1710.04573v1 [math.PR])
- Dynamics, numerical analysis, and some geometry. (arXiv:1710.03946v1 [math.NA])
- Beyond Log-concavity: Provable Guarantees for Sampling Multi-modal Distributions using Simulated Tempering Langevin Monte Carlo. (arXiv:1710.02736v2 [cs.LG] UPDATED)
- Notions of optimal transport theory and how to implement them on a computer. (arXiv:1710.02634v1 [math.AP])
- SGD for robot motion? The effectiveness of stochastic optimization on a new benchmark for biped locomotion tasks. (arXiv:1710.03029v1 [cs.RO])
- Wasserstein and total variation distance between marginals of L\'evy processes. (arXiv:1710.02715v2 [math.PR] UPDATED)
- Heat and work in Markovian quantum master equations: concepts, fluctuation theorems, and computations. (arXiv:1710.02311v2 [cond-mat.stat-mech] UPDATED)
- A survey of Algorithms and Analysis for Adaptive Online Learning; H. Brendan McMahan
- Optimal Rates for Multi-pass Stochastic Gradient Methods; Junhong Lin, Lorenzo Rosasco
- Scientific progress despite irreproducibility: A seeming paradox. (arXiv:1710.01946v1 [stat.OT])
- Parameter Uncertainty in the Kalman-Bucy Filter. (arXiv:1710.02046v2 [math.PR] UPDATED)
- On Kelly Betting: Some Limitations. (arXiv:1710.01787v1 [math.OC])
- Kelly Betting Can Be Too Conservative. (arXiv:1710.01786v1 [q-fin.PM])
- User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient. (arXiv:1710.00095v3 [math.ST] UPDATED)
- sgmcmc: An R Package for Stochastic Gradient Markov Chain Monte Carlo. (arXiv:1710.00578v3 [stat.CO] UPDATED)
- Accelerated Directional Search with non-Euclidean prox-structure. (arXiv:1710.00162v4 [math.OC] UPDATED)
- Gradient Flows in Filtering and Fisher-Rao Geometry. (arXiv:1710.00064v1 [math.OC])
- Efficient Preconditioning for Noisy Separable NMFs by Successive Projection Based Low-Rank Approximations. (arXiv:1710.00387v1 [cs.NA])
- Discriminating between two models based on Bregman divergence in small samples. (arXiv:1709.10505v1 [stat.ME])
- A generalization of the Jensen divergence: The chord gap divergence. (arXiv:1709.10498v2 [cs.LG] UPDATED)
- Information Geometry Connecting Wasserstein Distance and Kullback-Leibler Divergence via the Entropy-Relaxed Transportation Problem. (arXiv:1709.10219v1 [math.OC])
- Emergent failures and cascades in power grids: a statistical physics perspective. (arXiv:1709.10166v1 [physics.soc-ph])
- Variational Particle Approximations; Ardavan Saeedi, Tejas D. Kulkarni, Vikash K. Mansinghka, Samuel J. Gershman
- Stochastic Primal-Dual Coordinate Method for Regularized Empirical Risk Minimization; Yuchen Zhang, Lin Xiao
- Multilevel Sequential${}^2$ Monte Carlo for Bayesian Inverse Problems. (arXiv:1709.09763v2 [stat.CO] UPDATED)
- Particle rolling MCMC with double-block sampling. (arXiv:1709.09280v5 [stat.CO] UPDATED)
- Interacting particle filters for simultaneous state and parameter estimation. (arXiv:1709.09199v1 [math.NA])
- On the regularization of Wasserstein GANs. (arXiv:1709.08894v2 [stat.ML] UPDATED)
- Stochastic Nonconvex Optimization with Large Minibatches. (arXiv:1709.08728v4 [cs.LG] UPDATED)
- Glass-Box Program Synthesis: A Machine Learning Approach. (arXiv:1709.08669v1 [cs.LG])
- Bayesian Filtering for ODEs with Bounded Derivatives. (arXiv:1709.08471v1 [cs.NA])
- The Consciousness Prior. (arXiv:1709.08568v2 [cs.LG] UPDATED)
- GP-SUM. Gaussian Processes Filtering of non-Gaussian Beliefs. (arXiv:1709.08120v3 [cs.RO] UPDATED)
- On Noisy Negative Curvature Descent: Competing with Gradient Descent for Faster Non-convex Optimization. (arXiv:1709.08571v2 [math.OC] UPDATED)
- Self-tuned mirror descent schemes for smooth and nonsmooth high-dimensional stochastic optimization. (arXiv:1709.08308v2 [math.OC] UPDATED)
- General Bayesian Updating and the Loss-Likelihood Bootstrap. (arXiv:1709.07616v2 [stat.ME] UPDATED)
- Asymptotic analysis of covariance parameter estimation for Gaussian processes in the misspecified case
- Uniform ergodicity of the iterated conditional SMC and geometric ergodicity of particle Gibbs samplers
- Neural Optimizer Search with Reinforcement Learning. (arXiv:1709.07417v2 [cs.AI] UPDATED)
- On Nesting Monte Carlo Estimators. (arXiv:1709.06181v4 [stat.CO] UPDATED)
- AI Programmer: Autonomously Creating Software Programs Using Genetic Algorithms. (arXiv:1709.05703v1 [cs.AI])
- Efficient Statistically Accurate Algorithms for the Fokker-Planck Equation in Large Dimensions. (arXiv:1709.05562v1 [stat.ME])
- A convergent relaxation of the Douglas-Rachford algorithm. (arXiv:1709.05984v1 [math.OC])
- Douglas-Rachford splitting and ADMM for nonconvex optimization: tight convergence results. (arXiv:1709.05747v4 [math.OC] UPDATED)
- Machine learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations. (arXiv:1709.05963v1 [math.NA])
- Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix. (arXiv:1709.05069v1 [cs.LG])
- Normalized Direction-preserving Adam. (arXiv:1709.04546v2 [cs.LG] UPDATED)
- A statistical mechanics perspective for protein folding from $q$-state Potts model. (arXiv:1709.04813v3 [cond-mat.stat-mech] UPDATED)
- Trait evolution with jumps: illusionary normality. (arXiv:1709.04702v1 [q-bio.PE])
- Monte-Carlo Algorithms for Forward Feynman-Kac type representation for semilinear nonconservative Partial Differential Equations. (arXiv:1709.04777v1 [math.PR])
- The Impact of Local Geometry and Batch Size on Stochastic Gradient Descent for Nonconvex Problems. (arXiv:1709.04718v2 [math.OC] UPDATED)
- On proximal mappings with Young functions in uniformly convex Banach spaces. (arXiv:1709.04700v3 [math.FA] UPDATED)
- A Rewriting System for Convex Optimization Problems. (arXiv:1709.04494v2 [math.OC] UPDATED)
- A convergence framework for inexact nonconvex and nonsmooth algorithms and its applications to several iterations. (arXiv:1709.04072v6 [math.OC] UPDATED)
- Particle Filters and Data Assimilation. (arXiv:1709.04196v1 [stat.CO])
- Recursive Exponential Weighting for Online Non-convex Optimization. (arXiv:1709.04136v1 [cs.LG])
- Alternating minimization and alternating descent over nonconvex sets. (arXiv:1709.04451v3 [math.OC] UPDATED)
- Linear Stochastic Approximation: Constant Step-Size and Iterate Averaging. (arXiv:1709.04073v1 [cs.LG])
- A first-order splitting method for solving a large-scale composite convex optimization problem. (arXiv:1709.03962v5 [math.OC] UPDATED)
- Lower Bound for Randomized First Order Convex Optimization. (arXiv:1709.03594v2 [math.OC] UPDATED)
- A Simple Analysis for Exp-concave Empirical Minimization with Arbitrary Convex Regularizer. (arXiv:1709.02909v1 [stat.ML])
- Towards information optimal simulation of partial differential equations. (arXiv:1709.02859v2 [stat.ME] UPDATED)
- A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds. (arXiv:1709.02726v1 [cs.LG] CROSS LISTED)
- Global Convergence of Arbitrary-Block Gradient Methods for Generalized Polyak-{\L}ojasiewicz Functions. (arXiv:1709.03014v1 [math.OC])
- Data Assimilation in the Geosciences - An overview on methods, issues and perspectives. (arXiv:1709.02798v3 [physics.ao-ph] UPDATED)
- Entropic Determinants. (arXiv:1709.02702v1 [stat.ML])
- Convex Hulls of Random Walks in Higher Dimensions: A Large Deviation Study. (arXiv:1709.02638v1 [cond-mat.stat-mech])
- Covariances, Robustness, and Variational Bayes. (arXiv:1709.02536v3 [stat.ME] UPDATED)
- Information Theory and the Length Distribution of all Discrete Systems. (arXiv:1709.01712v1 [q-bio.OT])
- Neither pulled nor pushed: Genetic drift and front wandering uncover a new class of reaction-diffusion waves. (arXiv:1709.01601v1 [q-bio.PE])
- Adaptive restart of accelerated gradient methods under local quadratic growth condition. (arXiv:1709.02300v1 [math.OC])
- Hamiltonian Flow Simulation of Rare Events. (arXiv:1709.01303v1 [stat.CO])
- A Convergence Analysis for A Class of Practical Variance-Reduction Stochastic Gradient MCMC. (arXiv:1709.01180v1 [stat.ML])
- Stochastic Gradient Descent: Going As Fast As Possible But Not Faster. (arXiv:1709.01427v1 [stat.ML])
- A Generic Approach for Escaping Saddle points. (arXiv:1709.01434v1 [cs.LG])
- A Mathematical Framework for Resilience: Dynamics, Strategies, Shocks and Acceptable Paths. (arXiv:1709.01389v2 [math.OC] UPDATED)
- A Unification and Generalization of Exact Distributed First Order Methods. (arXiv:1709.01317v2 [cs.IT] UPDATED)
- On the Suboptimality of Proximal Gradient Descent for $\ell^{0}$ Sparse Approximation. (arXiv:1709.01230v1 [math.OC])
- Unbiased approximations of products of expectations. (arXiv:1709.01002v1 [stat.CO])
- Number of hidden states needed to physically implement a given conditional distribution. (arXiv:1709.00765v1 [cond-mat.stat-mech])
- Iteratively Linearized Reweighted Alternating Direction Method of Multipliers for a Class of Nonconvex Problems. (arXiv:1709.00483v5 [cs.NA] UPDATED)
- The Averaged Kaczmarz Iteration for Solving Inverse Problems. (arXiv:1709.00742v2 [math.NA] UPDATED)
- A State-Space Approach to Dynamic Nonnegative Matrix Factorization. (arXiv:1709.00025v1 [cs.LG])
- Asymptotic Bias of Stochastic Gradient Search. (arXiv:1709.00291v1 [math.ST])
- Towards physical principles of biological evolution. (arXiv:1709.00284v1 [q-bio.OT])
- Measure differential equations. (arXiv:1708.09738v1 [math.OC])
- Ergodic behaviour of a Douglas-Rachford operator away from the origin. (arXiv:1708.09068v1 [math.OC])
- Block-Simultaneous Direction Method of Multipliers: A proximal primal-dual splitting algorithm for nonconvex problems with multiple constraints. (arXiv:1708.09066v1 [math.OC])
- Controlled Sequential Monte Carlo. (arXiv:1708.08396v3 [stat.CO] UPDATED)
- An equilibrium-conserving taxation scheme for income from capital. (arXiv:1708.08275v1 [q-fin.EC])
- An inexact subsampled proximal Newton-type method for large-scale machine learning. (arXiv:1708.08552v1 [cs.LG])
- Inference on high-dimensional implicit dynamic models using a guided intermediate resampling filter. (arXiv:1708.08543v4 [stat.ME] UPDATED)
- The stabilizing effect of volatility in financial markets. (arXiv:1708.08695v1 [q-fin.ST])
- Natasha 2: Faster Non-Convex Optimization Than SGD. (arXiv:1708.08694v4 [math.OC] UPDATED)
- Evolution of a Fluctuating Population in a Randomly Switching Environment. (arXiv:1708.08841v4 [q-bio.PE] UPDATED)
- The statistical physics of active matter: from self-catalytic colloids to living cells. (arXiv:1708.08652v3 [cond-mat.soft] UPDATED)
- An inexact subsampled proximal Newton-type method for large-scale machine learning. (arXiv:1708.08552v1 [cs.LG])
- A Conservation Law Method in Optimization. (arXiv:1708.08035v3 [math.OC] UPDATED)
- Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study. (arXiv:1708.07827v2 [math.OC] UPDATED)
- A New Use of Douglas-Rachford Splitting and ADMM for Identifying Infeasible, Unbounded, and Pathological Conic Programs. (arXiv:1706.02374v2 [math.OC] CROSS LISTED)
- Nudging the particle filter. (arXiv:1708.07801v4 [stat.CO] UPDATED)
- Delayed Sampling and Automatic Rao-Blackwellization of Probabilistic Programs. (arXiv:1708.07787v2 [stat.ML] UPDATED)
- DGM: A deep learning algorithm for solving partial differential equations. (arXiv:1708.07469v5 [q-fin.MF] UPDATED)
- Mixing time estimation in reversible Markov chains from a single sample path. (arXiv:1708.07367v1 [math.ST])
- The prior can generally only be understood in the context of the likelihood. (arXiv:1708.07487v2 [stat.ME] UPDATED)
- Divergence, Entropy, Information: An Opinionated Introduction to Information Theory. (arXiv:1708.07459v2 [cs.IT] UPDATED)
- Decentralized Computation of Effective Resistances and Acceleration of Consensus Algorithms. (arXiv:1708.07190v1 [math.OC])
- Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information. (arXiv:1708.07164v4 [math.OC] UPDATED)
- Linear convergence of inexact descent method and inexact proximal gradient algorithms for lower-order regularization problems. (arXiv:1708.07010v1 [math.OC])
- On Relationship between Primal-Dual Method of Multipliers and Kalman Filter. (arXiv:1708.06881v1 [math.OC])
- Proximal-Proximal-Gradient Method. (arXiv:1708.06908v2 [math.OC] UPDATED)
- Learning Combinations of Sigmoids Through Gradient Estimation. (arXiv:1708.06678v2 [stat.ML] UPDATED)
- Sequential Monte Carlo algorithms for a class of outer measures. (arXiv:1708.06489v2 [stat.ME] UPDATED)
- A Deterministic Nonsmooth Frank Wolfe Algorithm with Coreset Guarantees. (arXiv:1708.06714v1 [math.OC])
- A brief tutorial on transformation based Markov Chain Monte Carlo and optimal scaling of the additive transformation
- A general framework for Vecchia approximations of Gaussian processes. (arXiv:1708.06302v5 [stat.ME] UPDATED)
- Meta-Learning MCMC Proposals. (arXiv:1708.06040v5 [cs.AI] UPDATED)
- The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization. (arXiv:1708.05715v3 [stat.ML] UPDATED)
- Sparsity Within and Across Overlapping Groups. (arXiv:1708.06166v1 [math.OC])
- Least Sparsity of $p$-norm based Optimization Problems with $p > 1$. (arXiv:1708.06055v1 [math.OC])
- Stochastic Primal-Dual Proximal ExtraGradient Descent for Compositely Regularized Optimization. (arXiv:1708.05978v4 [cs.LG] UPDATED)
- A Brief Survey of Deep Reinforcement Learning. (arXiv:1708.05866v2 [cs.LG] CROSS LISTED)
- Renormalization group theory for percolation in time-varying networks. (arXiv:1708.05704v3 [cond-mat.stat-mech] UPDATED)
- Innovation rather than improvement: a solvable high-dimensional model highlights the limitations of scalar fitness. (arXiv:1708.05453v3 [q-bio.PE] UPDATED)
- More Iterations per Second, Same Quality -- Why Asynchronous Algorithms may Drastically Outperform Traditional Ones. (arXiv:1708.05136v1 [math.OC])
- Pseudo-extended Markov chain Monte Carlo. (arXiv:1708.05239v3 [stat.ME] UPDATED)
- Quantifying hidden order out of equilibrium. (arXiv:1708.04993v3 [cond-mat.soft] UPDATED)
- Non-convex Conditional Gradient Sliding. (arXiv:1708.04783v1 [math.OC])
- Linear Hypothesis Testing in Dense High-Dimensional Linear Models. (arXiv:1610.02987v2 [stat.ME] CROSS LISTED)
- The Trimmed Lasso: Sparsity and Robustness. (arXiv:1708.04527v1 [stat.ME])
- The auxiliary region method: A hybrid method for coupling a PDE to Brownian-based dynamics for reaction-diffusion systems. (arXiv:1708.04457v1 [q-bio.QM])
- Efficient sequential Monte Carlo algorithms for integrated population models. (arXiv:1708.04221v1 [stat.ME])
- A probabilistic scheme for joint parameter estimation and state prediction in complex dynamical systems. (arXiv:1708.03730v2 [stat.CO] UPDATED)
- Bayesian inference with information content model check for Langevin equations. (arXiv:1708.03664v2 [cond-mat.stat-mech] UPDATED)
- Nonsmooth Analysis and Optimization. (arXiv:1708.04180v3 [math.OC] UPDATED)
- Online Convex Optimization with Stochastic Constraints. (arXiv:1708.03741v1 [math.OC])
- Gradient Methods for Submodular Maximization. (arXiv:1708.03949v2 [cs.LG] UPDATED)
- Unbiased Markov chain Monte Carlo with couplings. (arXiv:1708.03625v5 [stat.ME] UPDATED)
- Properties of the Promotion Markov Chain on Linear Extensions. (arXiv:1708.03633v1 [math.CO])
- Central limit theorems and bootstrap in high dimensions
- On the existence of sure profits via flash strategies. (arXiv:1708.03099v4 [q-fin.TR] UPDATED)
- Non-stationary Stochastic Optimization under $L_{p,q}$-Variation Measures. (arXiv:1708.03020v3 [stat.ML] UPDATED)
- Communication-Free Parallel Supervised Topic Models. (arXiv:1708.03052v1 [cs.LG])
- Convergence of Unregularized Online Learning Algorithms. (arXiv:1708.02939v1 [cs.LG])
- A Self-Correcting Variable-Metric Algorithm Framework for Nonsmooth Optimization. (arXiv:1708.02552v5 [math.OC] UPDATED)
- Turbocharging Monte Carlo pricing for the rough Bergomi model. (arXiv:1708.02563v3 [q-fin.CP] UPDATED)
- Stein's method for multivariate Brownian approximations of sums under dependence. (arXiv:1708.02521v4 [math.PR] UPDATED)
- Stochastic Optimization with Bandit Sampling. (arXiv:1708.02544v2 [cs.LG] UPDATED)
- Convergence analysis of Riemannian Gauss-Newton methods and its connection with the geometric condition number. (arXiv:1708.02488v1 [math.NA])
- Delayed acceptance ABC-SMC. (arXiv:1708.02230v2 [stat.CO] UPDATED)
- Boosting Variational Inference: an Optimization Perspective. (arXiv:1708.01733v2 [cs.LG] UPDATED)
- Model Misspecification in ABC: Consequences and Diagnostics. (arXiv:1708.01974v4 [math.ST] UPDATED)
- Adaptive Regularized Newton Method for Riemannian Optimization. (arXiv:1708.02016v1 [math.OC])
- Wasserstein Dictionary Learning: Optimal Transport-based unsupervised non-linear dictionary learning. (arXiv:1708.01955v3 [stat.ML] UPDATED)
- STARDATA: A StarCraft AI Research Dataset. (arXiv:1708.02139v1 [cs.AI])
- Dimension-free Wasserstein contraction of nonlinear filters. (arXiv:1708.01582v5 [math.ST] UPDATED)
- Mean Estimation from Adaptive One-bit Measurements. (arXiv:1708.00952v2 [math.ST] UPDATED)
- The Vlasov-Fokker-Planck equation in non-convex landscapes: convergence to equilibrium. (arXiv:1708.00840v1 [math.AP])
- Importance sampling large deviations in nonequilibrium steady states. I. (arXiv:1708.00459v2 [cond-mat.stat-mech] UPDATED)
- Complexity Results for MCMC derived from Quantitative Bounds. (arXiv:1708.00829v6 [stat.CO] UPDATED)
- Hidden Physics Models: Machine Learning of Nonlinear Partial Differential Equations. (arXiv:1708.00588v2 [cs.AI] UPDATED)
- Estimation of the covariance structure of heavy-tailed distributions. (arXiv:1708.00502v3 [math.ST] UPDATED)
- Mini-batch stochastic gradient descent with dynamic sample sizes. (arXiv:1708.00555v1 [math.OC])
- The duality structure gradient descent algorithm: analysis and applications to neural networks. (arXiv:1708.00523v7 [cs.LG] UPDATED)
- An Inexact Regularized Newton Framework with a Worst-Case Iteration Complexity of $\mathcal{O}(\epsilon^{-3/2})$ for Nonconvex Optimization. (arXiv:1708.00475v4 [math.OC] UPDATED)
- Using Program Induction to Interpret Transition System Dynamics. (arXiv:1708.00376v1 [cs.AI])
- Discrete probabilistic and algebraic dynamics: a stochastic commutative Gelfand-Naimark Theorem. (arXiv:1708.00091v2 [math.FA] UPDATED)
- The derivation of Markov processes that violate detailed balance. (arXiv:1708.00184v3 [cond-mat.stat-mech] UPDATED)
- A Geometric Variational Approach to Bayesian Inference. (arXiv:1707.09714v2 [stat.ME] UPDATED)
- Mini-batch Tempered MCMC. (arXiv:1707.09705v8 [stat.CO] UPDATED)
- Simultaneous active parameter estimation and control using sampling-based Bayesian reinforcement learning. (arXiv:1707.09055v1 [cs.SY])
- Data-Driven Stochastic Robust Optimization: A General Computational Framework and Algorithm for Optimization under Uncertainty in the Big Data Era. (arXiv:1707.09198v4 [cs.LG] UPDATED)
- An application of proof mining to the proximal point algorithm in CAT(0) spaces. (arXiv:1707.09169v1 [math.OC])
- Convergence of first-order methods via the convex conjugate. (arXiv:1707.09084v1 [math.OC])
- Curvature and transport inequalities for Markov chains in discrete spaces
- Importance sampling for metastable and multiscale dynamical systems. (arXiv:1707.08868v1 [math.PR])
- Ergodic Theorems for discrete Markov chains. (arXiv:1707.08827v2 [math.PR] UPDATED)
- Testing for instability in covariance structures
- Inference in Ising models
- Proper scoring rules and Bregman divergence
- Unifying Framework for Accelerated Randomized Methods in Convex Optimization. (arXiv:1707.08486v2 [math.OC] UPDATED)
- Stochastic Subsampling for Factorizing Huge Matrices. (arXiv:1701.05363v3 [stat.ML] UPDATED)
- Notes on optimal approximations for importance sampling. (arXiv:1707.08358v1 [cs.GR])
- On a decomposition formula for the proximal operator of the sum of two convex functions. (arXiv:1707.08509v2 [math.OC] UPDATED)
- Using deterministic approximations to accelerate SMC for posterior sampling. (arXiv:1707.07971v1 [stat.ME])
- A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points. (arXiv:1707.08028v3 [math.OC] UPDATED)
- Statistical properties and multifractality of Bitcoin. (arXiv:1707.07618v3 [q-fin.ST] UPDATED)
- Stability and instability in saddle point dynamics Part II: The subgradient method. (arXiv:1707.07351v2 [math.OC] UPDATED)
- Stability and instability in saddle point dynamics -- Part I. (arXiv:1707.07349v2 [math.OC] CROSS LISTED)
- A Statistical Perspective on Inverse and Inverse Regression Problems. (arXiv:1707.06852v1 [stat.ME])
- Limits of Predictions in Thermodynamic Systems: A Review. (arXiv:1707.06680v3 [cond-mat.stat-mech] UPDATED)
- Towards a scientific blockchain framework for reproducible data analysis. (arXiv:1707.06552v1 [cs.CY] CROSS LISTED)
- Cyclic Stochastic Optimization: Generalizations, Convergence, and Applications in Multi-Agent Systems. (arXiv:1707.06700v1 [math.OC])
- Proximal Policy Optimization Algorithms. (arXiv:1707.06347v2 [cs.LG] UPDATED)
- Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization. (arXiv:1707.06618v3 [stat.ML] UPDATED)
- Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization. (arXiv:1707.06468v3 [math.OC] UPDATED)
- Bridging the Gap between Constant Step Size Stochastic Gradient Descent and Markov Chains. (arXiv:1707.06386v2 [stat.ML] UPDATED)
- Codes on graphs: Models for elementary algebraic topology and statistical physics. (arXiv:1707.06621v1 [cs.IT])
- On Unlimited Sampling. (arXiv:1707.06340v2 [cs.IT] UPDATED)
- Density Estimation in Infinite Dimensional Exponential Families; Bharath Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Aapo Hyvärinen, Revant Kumar
- Probably approximate Bayesian computation: nonasymptotic convergence of ABC under misspecification. (arXiv:1707.05987v2 [math.ST] UPDATED)
- Generalization Bounds of SGLD for Non-convex Learning: Two Theoretical Viewpoints. (arXiv:1707.05947v1 [cs.LG])
- Acceleration and Averaging in Stochastic Mirror Descent Dynamics. (arXiv:1707.06219v1 [math.OC])
- Natural selection as coarsening. (arXiv:1707.05317v1 [q-bio.PE])
- Cayley Splitting for Second-Order Langevin Stochastic Partial Differential Equations. (arXiv:1707.05603v1 [math.PR])
- Second order stochastic differential models for financial markets. (arXiv:1707.05419v1 [q-fin.MF])
- Don't relax: early stopping for convex regularization. (arXiv:1707.05422v1 [math.OC])
- Nonlinear Programming Methods for Distributed Optimization. (arXiv:1707.04598v1 [math.OC])
- Optimal Monte Carlo integration on closed manifolds. (arXiv:1707.04723v5 [math.NA] UPDATED)
- Automatic Backward Differentiation for American Monte-Carlo Algorithms (Conditional Expectation). (arXiv:1707.04942v1 [q-fin.CP])
- A new look at the inverse Gaussian distribution. (arXiv:1707.04400v1 [stat.ME])
- Bayesian Optimization for Probabilistic Programs. (arXiv:1707.04314v1 [stat.ML])
- ADMM Based Privacy-preserving Decentralized Optimization. (arXiv:1707.04338v2 [math.OC] UPDATED)
- A short introduction to quasi-Monte Carlo option pricing. (arXiv:1707.04293v1 [q-fin.CP])
- Teaching renormalization, scaling, and universality with an example from quantum mechanics. (arXiv:1707.04388v3 [quant-ph] UPDATED)
- Particle Simulation of Fractional Diffusion Equations. (arXiv:1707.03871v1 [math.NA])
- Iterative Updating of Model Error for Bayesian Inversion. (arXiv:1707.04246v1 [stat.ME])
- Lyapunov Conditions for Differentiability of Markov Chain Expectations: the Absolutely Continuous Case. (arXiv:1707.03870v1 [math.PR])
- Underdamped Langevin MCMC: A non-asymptotic analysis. (arXiv:1707.03663v7 [stat.ML] UPDATED)
- Initialising Kernel Adaptive Filters via Probabilistic Inference. (arXiv:1707.03450v1 [stat.ML])
- Stochastic thermodynamics: From principles to the cost of precision. (arXiv:1707.03759v1 [cond-mat.stat-mech])
- Proximally Guided Stochastic Subgradient Method for Nonsmooth, Nonconvex Problems. (arXiv:1707.03505v5 [math.OC] UPDATED)
- Low-rank updates of matrix functions. (arXiv:1707.03045v1 [math.NA])
- Symmetrized importance samplers for stochastic differential equations. (arXiv:1707.02695v2 [math.NA] UPDATED)
- Solving high-dimensional partial differential equations using deep learning. (arXiv:1707.02568v3 [math.NA] UPDATED)
- Non-Gaussian Limit Theorem for Non-Linear Langevin Equations Driven by L\'evy Noise. (arXiv:1707.01958v2 [math.PR] UPDATED)
- Non-smooth Non-convex Bregman Minimization: Unification and new Algorithms. (arXiv:1707.02278v4 [math.OC] UPDATED)
- Convergence Analysis of Optimization Algorithms. (arXiv:1707.01647v1 [stat.ML])
- Particle MCMC with Poisson Resampling: Parallelization and Continuous Time Models. (arXiv:1707.01660v2 [stat.CO] UPDATED)
- Stochastic, Distributed and Federated Optimization for Machine Learning. (arXiv:1707.01155v1 [cs.LG])
- Particle rejuvenation of Rao-Blackwellized Sequential Monte Carlo smoothers for Conditionally Linear and Gaussian models. (arXiv:1707.01311v1 [stat.ME])
- Model compression as constrained optimization, with application to neural nets. Part I: general framework. (arXiv:1707.01209v1 [cs.LG])
- Gini estimation under infinite variance. (arXiv:1707.01370v4 [stat.ME] UPDATED)
- Robust Optimization for Non-Convex Objectives. (arXiv:1707.01047v1 [cs.LG])
- Normalizing constants of log-concave densities. (arXiv:1707.00460v2 [stat.ME] UPDATED)
- On Scalable Inference with Stochastic Gradient Descent. (arXiv:1707.00192v1 [stat.ML])
- Parle: parallelizing stochastic gradient descent. (arXiv:1707.00424v2 [cs.LG] UPDATED)
- Clock Monte Carlo methods. (arXiv:1706.10261v4 [cond-mat.stat-mech] UPDATED)
- Elementary epistemological features of machine intelligence. (arXiv:0812.0885v4 [cs.AI] UPDATED)
- Interpretability via Model Extraction. (arXiv:1706.09773v4 [cs.LG] UPDATED)
- Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes. (arXiv:1706.10239v2 [cs.LG] UPDATED)
- Online Convolutional Dictionary Learning. (arXiv:1706.09563v2 [cs.LG] UPDATED)
- A Fixed-Point of View on Gradient Methods for Big Data. (arXiv:1706.09880v4 [stat.ML] UPDATED)
- On the tightness of Gaussian concentration for convex functions. (arXiv:1706.09446v1 [math.PR])
- Asymptotic and finite-sample properties of estimators based on stochastic gradients
- Concentration of tempered posteriors and of their variational approximations. (arXiv:1706.09293v3 [math.ST] UPDATED)
- Informed Sub-Sampling MCMC: Approximate Bayesian Inference for Large Datasets. (arXiv:1706.08327v3 [stat.ME] UPDATED)
- Complexity of the Regularized Newton Method. (arXiv:1706.08483v1 [math.OC])
- Behavior of Accelerated Gradient Methods Near Critical Points of Nonconvex Functions. (arXiv:1706.07993v3 [math.OC] UPDATED)
- A Unified Analysis of Stochastic Optimization Methods Using Jump System Theory and Quadratic Constraints. (arXiv:1706.08141v1 [stat.ML])
- Asymptotics of ABC. (arXiv:1706.07712v1 [stat.ME])
- Shannon Entropy Reinterpreted. (arXiv:1706.07735v2 [cond-mat.stat-mech] UPDATED)
- The energy landscape of a simple neural network. (arXiv:1706.07101v1 [stat.ML])
- Nonlinear Acceleration of Stochastic Algorithms. (arXiv:1706.07270v2 [math.OC] UPDATED)
- Interior-proximal primal-dual methods. (arXiv:1706.07067v2 [math.OC] UPDATED)
- Nonlinear probability. A theory with incompatible stochastic variables. (arXiv:1706.06770v1 [math.ST])
- Control Variates for Stochastic Gradient MCMC. (arXiv:1706.05439v2 [stat.CO] UPDATED)
- On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions. (arXiv:1706.06066v3 [stat.ML] UPDATED)
- On the Optimization Landscape of Tensor Decompositions. (arXiv:1706.05598v1 [cs.LG])
- Variants of RMSProp and Adagrad with Logarithmic Regret Bounds. (arXiv:1706.05507v2 [cs.LG] UPDATED)
- A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization. (arXiv:1706.06569v1 [cs.LG])
- First Order Methods beyond Convexity and Lipschitz Gradient Continuity with Applications to Quadratic Inverse Problems. (arXiv:1706.06461v1 [math.OC])
- Nonasymptotic convergence of stochastic proximal point algorithms for constrained convex optimization. (arXiv:1706.06297v1 [math.OC])
- Evaluating Noisy Optimisation Algorithms: First Hitting Time is Problematic. (arXiv:1706.05086v2 [cs.NE] UPDATED)
- Gradient Descent for Spiking Neural Networks. (arXiv:1706.04698v2 [q-bio.NC] UPDATED)
- Sequential quasi-Monte Carlo: Introduction for Non-Experts, Dimension Reduction, Application to Partly Observed Diffusion Processes. (arXiv:1706.05305v1 [stat.CO])
- Exact Simulation for Multivariate It\^o Diffusions. (arXiv:1706.05124v2 [math.PR] UPDATED)
- Proximal Backpropagation. (arXiv:1706.04638v3 [cs.LG] UPDATED)
- Reinforcement Learning under Model Mismatch. (arXiv:1706.04711v2 [cs.LG] UPDATED)
- Average of Recentered Parallel MCMC for Big Data. (arXiv:1706.04780v2 [stat.CO] UPDATED)
- Stochastic Gradient MCMC Methods for Hidden Markov Models. (arXiv:1706.04632v1 [stat.ML])
- Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations. (arXiv:1706.04702v1 [math.NA])
- Accelerated Extra-Gradient Descent: A Novel Accelerated First-Order Method. (arXiv:1706.04680v2 [math.OC] UPDATED)
- The loss surface of deep and wide neural networks. (arXiv:1704.08045v2 [cs.LG] CROSS LISTED)
- Online Learning for Structured Loss Spaces. (arXiv:1706.04125v2 [cs.LG] UPDATED)
- Iterated random functions and regularly varying tails. (arXiv:1706.03876v1 [math.PR])
- An Alternative to EM for Gaussian Mixture Models: Batch and Stochastic Riemannian Optimization. (arXiv:1706.03267v1 [stat.ML])
- Practical Gauss-Newton Optimisation for Deep Learning. (arXiv:1706.03662v2 [stat.ML] UPDATED)
- Fractional Langevin Monte Carlo: Exploring L\'{e}vy Driven Stochastic Differential Equations for Markov Chain Monte Carlo. (arXiv:1706.03649v1 [stat.CO] CROSS LISTED)
- On the Sampling Problem for Kernel Quadrature. (arXiv:1706.03369v1 [stat.ML])
- Climbing a shaky ladder: Better adaptive risk estimation. (arXiv:1706.02733v1 [cs.LG])
- Thermodynamics of Evolutionary Games. (arXiv:1706.03058v2 [q-bio.PE] UPDATED)
- The True Cost of Stochastic Gradient Langevin Dynamics. (arXiv:1706.02692v1 [stat.ME])
- Berry-Ess\'een bounds for parameter estimation of general Gaussian processes. (arXiv:1706.02420v1 [math.PR])
- The Generalized Cross Validation Filter. (arXiv:1706.02495v1 [stat.ML])
- Stochastic Global Optimization Algorithms: A Systematic Formal Approach. (arXiv:1706.02246v1 [cs.AI])
- Coupling and Decoupling to bound an approximating Markov Chain. (arXiv:1706.02040v2 [math.PR] UPDATED)
- The Convergence of Markov chain Monte Carlo Methods: From the Metropolis method to Hamiltonian Monte Carlo. (arXiv:1706.01520v2 [stat.ME] UPDATED)
- Stochastic Gradient Monomial Gamma Sampler. (arXiv:1706.01498v2 [stat.ML] CROSS LISTED)
- Limitations on Variance-Reduction and Acceleration Schemes for Finite Sum Optimization. (arXiv:1706.01686v2 [math.OC] UPDATED)
- Multivariate initial sequence estimators in Markov chain Monte Carlo. (arXiv:1706.00853v1 [stat.ME])
- The Emergence of Consensus: A Primer. (arXiv:1704.07767v3 [physics.soc-ph] UPDATED)
- Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes. (arXiv:1706.01418v2 [stat.ML] UPDATED)
- Fast approximate Bayesian inference for stable differential equation models. (arXiv:1706.00689v2 [stat.CO] UPDATED)
- Deep Learning: A Bayesian Perspective. (arXiv:1706.00473v4 [stat.ML] UPDATED)
- Learning to Compute Word Embeddings On the Fly. (arXiv:1706.00286v3 [cs.LG] UPDATED)
- Learning Generative Models with Sinkhorn Divergences. (arXiv:1706.00292v3 [stat.ML] UPDATED)
- Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning. (arXiv:1706.00241v1 [cs.LG])
- Variational Sequential Monte Carlo. (arXiv:1705.11140v2 [stat.ML] UPDATED)
- Distributed SAGA: Maintaining linear convergence rate with limited communication. (arXiv:1705.10405v1 [math.OC])
- The Numerics of GANs. (arXiv:1705.10461v3 [cs.LG] UPDATED)
- The Cramer Distance as a Solution to Biased Wasserstein Gradients. (arXiv:1705.10743v1 [cs.AI])
- Gradient Descent Can Take Exponential Time to Escape Saddle Points. (arXiv:1705.10412v2 [math.OC] UPDATED)
- Diagonal Rescaling For Neural Networks. (arXiv:1705.09319v1 [cs.LG])
- Approximate and Stochastic Greedy Optimization. (arXiv:1705.09396v1 [math.OC])
- Convergence of the Population Dynamics algorithm in the Wasserstein metric. (arXiv:1705.09747v2 [math.PR] UPDATED)
- Numerical low-rank approximation of matrix differential equations. (arXiv:1705.10175v2 [math.NA] UPDATED)
- A Generalized Accelerated Composite Gradient Method: Uniting Nesterov's Fast Gradient Method and FISTA. (arXiv:1705.10266v2 [math.OC] UPDATED)
- Market Crashes as Critical Phenomena? Explanation, Idealization, and Universality in Econophysics. (arXiv:1704.02392v1 [physics.hist-ph] CROSS LISTED)
- Can Decentralized Algorithms Outperform Centralized Algorithms? A Case Study for Decentralized Parallel Stochastic Gradient Descent. (arXiv:1705.09056v5 [math.OC] UPDATED)
- Filtering Variational Objectives. (arXiv:1705.09279v3 [cs.LG] UPDATED)
- Convergence of Langevin MCMC in KL-divergence. (arXiv:1705.09048v2 [stat.ML] UPDATED)
- Sampling from a log-concave distribution with compact support with proximal Langevin Monte Carlo. (arXiv:1705.08964v1 [stat.ME])
- Discovery of statistical equivalence classes using computer algebra. (arXiv:1705.09457v1 [math.ST])
- Latent Geometry and Memorization in Generative Models. (arXiv:1705.09303v1 [cs.LG])
- Finite-length analysis on tail probability for Markov chain and application to simple hypothesis testing
- Interpreting Blackbox Models via Model Extraction. (arXiv:1705.08504v6 [cs.LG] UPDATED)
- Stochastic Sequential Neural Networks with Structured Inference. (arXiv:1705.08695v1 [cs.LG])
- Causal Effect Inference with Deep Latent-Variable Models. (arXiv:1705.08821v2 [stat.ML] UPDATED)
- Train longer, generalize better: closing the generalization gap in large batch training of neural networks. (arXiv:1705.08741v2 [stat.ML] UPDATED)
- Bayesian Compression for Deep Learning. (arXiv:1705.08665v4 [stat.ML] UPDATED)
- A.Ya. Khintchine's Work in Probability Theory. (arXiv:1705.08744v3 [math.HO] UPDATED)
- Poincar\'e Embeddings for Learning Hierarchical Representations. (arXiv:1705.08039v2 [cs.AI] UPDATED)
- Parallel Stochastic Gradient Descent with Sound Combiners. (arXiv:1705.08030v1 [cs.LG])
- The Bayesian update: variational formulations and gradient flows. (arXiv:1705.07382v2 [math.ST] UPDATED)
- From optimal transport to generative modeling: the VEGAN cookbook. (arXiv:1705.07642v1 [stat.ML])
- Reducing Reparameterization Gradient Variance. (arXiv:1705.07880v1 [stat.ML])
- Nestrov's Acceleration For Second Order Method. (arXiv:1705.07171v2 [cs.LG] UPDATED)
- Two-temperature logistic regression based on the Tsallis divergence. (arXiv:1705.07210v2 [cs.LG] UPDATED)
- On the diffusion approximation of nonconvex stochastic gradient descent. (arXiv:1705.07562v2 [stat.ML] UPDATED)
- Information-theoretic analysis of generalization capability of learning algorithms. (arXiv:1705.07809v2 [cs.LG] UPDATED)
- Nonparametric Online Regression while Learning the Metric. (arXiv:1705.07853v2 [cs.LG] UPDATED)
- Statistical inference using SGD. (arXiv:1705.07477v2 [cs.LG] UPDATED)
- Stochastic Recursive Gradient Algorithm for Nonconvex Optimization. (arXiv:1705.07261v1 [stat.ML])
- Spatial Variational Auto-Encoding via Matrix-Variate Normal Distributions. (arXiv:1705.06821v2 [cs.LG] UPDATED)
- Disordered statistical physics in low dimensions: extremes, glass transition, and localization. (arXiv:1705.06896v1 [cond-mat.dis-nn])
- Gradient Estimators for Implicit Models. (arXiv:1705.07107v5 [stat.ML] UPDATED)
- Scalable Variational Inference for Dynamical Systems. (arXiv:1705.07079v2 [stat.ML] UPDATED)
- The Landscape of Deep Learning Algorithms. (arXiv:1705.07038v2 [stat.ML] UPDATED)
- Lecture Notes on the Statistical Mechanics of Disordered Systems. (arXiv:1705.07072v1 [cond-mat.stat-mech])
- Information Geometry Approach to Parameter Estimation in Hidden Markov Models. (arXiv:1705.06040v3 [math.ST] UPDATED)
- Online estimation of the geometric median in Hilbert spaces: Nonasymptotic confidence balls
- A Bayesian Filtering Algorithm for Gaussian Mixture Models. (arXiv:1705.05495v2 [stat.ML] UPDATED)
- Learning Probabilistic Programs Using Backpropagation. (arXiv:1705.05396v1 [cs.LG])
- Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent. (arXiv:1705.05491v2 [cs.DC] UPDATED)
- The Incremental Multiresolution Matrix Factorization Algorithm. (arXiv:1705.05804v1 [cs.CV])
- High-dimensional Lipschitz functions are typically flat
- Total variation distance between stochastic polynomials and invariance principles. (arXiv:1705.05194v1 [math.PR])
- A rational analysis of curiosity. (arXiv:1705.04351v2 [cs.AI] UPDATED)
- Molecular Generation with Recurrent Neural Networks (RNNs). (arXiv:1705.04612v2 [cs.LG] UPDATED)
- Learning ReLUs via Gradient Descent. (arXiv:1705.04591v2 [cs.LG] UPDATED)
- Exponential Ergodicity of the Bouncy Particle Sampler. (arXiv:1705.04579v2 [stat.CO] UPDATED)
- Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization. (arXiv:1705.04925v1 [cs.LG])
- Inference for Differential Equation Models using Relaxation via Dynamical Systems. (arXiv:1705.04436v1 [stat.ME])
- Learning ReLUs via Gradient Descent. (arXiv:1705.04591v2 [cs.LG] UPDATED)
- Dynamic Models of Wasserstein-1-Type Unbalanced Transport. (arXiv:1705.04535v2 [math.OC] UPDATED)
- The Euler scheme for stochastic differential equations with discontinuous drift coefficient: A numerical study of the convergence rate. (arXiv:1705.04562v2 [math.NA] UPDATED)
- Optimal Monte Carlo Methods for $L^2$-Approximation. (arXiv:1705.04567v3 [math.NA] UPDATED)
- Nonnegative Matrix Factorization with Transform Learning. (arXiv:1705.04193v2 [cs.LG] UPDATED)
- Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems. (arXiv:1705.04146v3 [cs.AI] UPDATED)
- A critical analysis of resampling strategies for the regularized particle filter. (arXiv:1705.04219v1 [stat.CO])
- What Can We Learn from Noise? -- Mesoscopic Nonequilibrium Statistical Physics --. (arXiv:1705.04201v1 [cond-mat.mes-hall])
- Spectral gap estimates in mean field spin glasses. (arXiv:1705.04243v3 [math.PR] UPDATED)
- From Least Squares to Signal Processing and Particle Filtering. (arXiv:1705.04141v1 [stat.ME])
- Introductory Lectures on Stochastic Population Systems. (arXiv:1705.03781v1 [math.PR])
- On Stein operators for discrete approximations
- A generalized divergence for statistical inference
- Multilevel Richardson–Romberg extrapolation
- The geometric foundations of Hamiltonian Monte Carlo
- Large-sample approximations for variance-covariance matrices of high-dimensional time series
- Non-negative Matrix Factorization via Archetypal Analysis. (arXiv:1705.02994v1 [stat.ML])
- Geometric GAN. (arXiv:1705.02894v2 [stat.ML] UPDATED)
- Nonasymptotic estimation and support recovery for high dimensional sparse covariance matrices. (arXiv:1705.02679v3 [stat.ME] UPDATED)
- Geometry and Dynamics for Markov Chain Monte Carlo. (arXiv:1705.02891v1 [stat.CO])
- "Convex Until Proven Guilty": Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions. (arXiv:1705.02766v1 [math.OC])
- Stable Architectures for Deep Neural Networks. (arXiv:1705.03341v3 [cs.LG] UPDATED)
- Geometry of Optimization and Implicit Regularization in Deep Learning. (arXiv:1705.03071v1 [cs.LG])
- Fluctuations of the Empirical Measure of Freezing Markov Chains. (arXiv:1705.02121v1 [math.PR])
- Group invariance principles for causal generative models. (arXiv:1705.02212v1 [stat.ML])
- A Bayesian Stochastic Approximation Method. (arXiv:1705.02069v1 [stat.ME])
- Matrix Completion via Factorizing Polynomials. (arXiv:1705.02047v3 [stat.ML] UPDATED)
- Parallel Stochastic Newton Method. (arXiv:1705.02005v1 [math.NA])
- Solving Spin Glasses with Optimized Trees of Clustered Spins. (arXiv:1705.02075v2 [cond-mat.stat-mech] UPDATED)
- Characterization of nonequilibrium states of trapped Bose-Einstein condensates. (arXiv:1705.01768v2 [cond-mat.quant-gas] UPDATED)
- MapReduce Particle Filtering with Exact Resampling and Deterministic Runtime. (arXiv:1705.01660v1 [stat.CO])
- Stochastic gene expression conditioned on large deviations. (arXiv:1704.03863v1 [cond-mat.stat-mech] CROSS LISTED)
- Nonlinear Kalman Filtering with Divergence Minimization. (arXiv:1705.00722v1 [math.OC])
- Importance-sampling computation of statistical properties of coupled oscillators. (arXiv:1705.01068v2 [nlin.CD] UPDATED)
- Thermodynamic cost and benefit of memory. (arXiv:1705.00612v3 [cond-mat.stat-mech] UPDATED)
- Langevin diffusions on the torus: estimation and applications. (arXiv:1705.00296v4 [stat.ME] UPDATED)
- Using Perturbed Underdamped Langevin Dynamics to Efficiently Sample from Probability Distributions. (arXiv:1705.00170v1 [math.PR])
- On the convergence of Hamiltonian Monte Carlo. (arXiv:1705.00166v2 [stat.CO] UPDATED)
- Nonequilibrium Thermodynamics of Restricted Boltzmann Machines. (arXiv:1704.08724v1 [cond-mat.stat-mech])
- Failures of Gradient-Based Deep Learning. (arXiv:1703.07950v2 [cs.LG] CROSS LISTED)
- The loss surface of deep and wide neural networks. (arXiv:1704.08045v2 [cs.LG] UPDATED)
- Accelerating Stochastic Gradient Descent For Least Squares Regression. (arXiv:1704.08227v2 [stat.ML] UPDATED)
- High-Dimensional Function Approximation: Breaking the Curse with Monte Carlo Methods. (arXiv:1704.08213v1 [math.NA])
- Misspecified Linear Bandits. (arXiv:1704.06880v1 [cs.LG])
- Ensemble Kalman methods for high-dimensional hierarchical dynamic space-time models. (arXiv:1704.06988v2 [stat.ME] UPDATED)
- Advanced Multilevel Monte Carlo Methods. (arXiv:1704.07272v1 [stat.CO])
- Time-Varying Convex Optimization via Time-Varying Averaged Operators. (arXiv:1704.07338v3 [math.OC] UPDATED)
- A decentralized proximal-gradient method with network independent step-sizes and separated convergence rates. (arXiv:1704.07807v2 [math.OC] UPDATED)
- Stein Variational Gradient Descent as Gradient Flow. (arXiv:1704.07520v2 [stat.ML] UPDATED)
- Optimal Transport Filtering with Particle Reweighing in Finance. (arXiv:1704.07698v1 [math.NA])
- Mutual Information, Neural Networks and the Renormalization Group. (arXiv:1704.06279v2 [cond-mat.dis-nn] UPDATED)
- Entropy-SGD: Biasing Gradient Descent Into Wide Valleys. (arXiv:1611.01838v5 [cs.LG] CROSS LISTED)
- How to measure heat in stochastic systems. (arXiv:1704.06566v2 [cond-mat.stat-mech] UPDATED)
- Importance Sampled Stochastic Optimization for Variational Inference. (arXiv:1704.05786v2 [stat.ML] UPDATED)
- The proximal point algorithm in geodesic spaces with curvature bounded above. (arXiv:1704.05721v1 [math.FA])
- Probabilistic programs for inferring the goals of autonomous agents. (arXiv:1704.04977v2 [cs.AI] UPDATED)
- Further and stronger analogy between sampling and optimization: Langevin Monte Carlo and gradient descent. (arXiv:1704.04752v2 [math.ST] UPDATED)
- Deep Relaxation: partial differential equations for optimizing deep neural networks. (arXiv:1704.04932v2 [cs.LG] UPDATED)
- Metropolis Sampling. (arXiv:1704.04629v1 [stat.ME])
- Computations of optimal transport distance with Fisher information regularization. (arXiv:1704.04605v2 [math.NA] UPDATED)
- Model Uncertainty, Recalibration, and the Emergence of Delta-Vega Hedging. (arXiv:1704.04524v1 [q-fin.MF])
- Structure and Randomness of Continuous-Time Discrete-Event Processes. (arXiv:1704.04707v1 [cond-mat.stat-mech])
- Stochastic Gradient Descent as Approximate Bayesian Inference. (arXiv:1704.04289v2 [stat.ML] UPDATED)
- On Generalized Bellman Equations and Temporal-Difference Learning. (arXiv:1704.04463v2 [cs.LG] UPDATED)
- ZigZag: A new approach to adaptive online learning. (arXiv:1704.04010v1 [cs.LG])
- Solving ill-posed inverse problems using iterative deep neural networks. (arXiv:1704.04058v1 [math.OC])
- A Note on the Birkhoff Ergodic Theorem. (arXiv:1704.03681v1 [math.PR])
- The Fundamental Theorem of Perfect Simulation. (arXiv:1704.03561v1 [math.PR])
- Information-Geometric Optimization Algorithms: A Unifying Picture via Invariance Principles; Yann Ollivier, Ludovic Arnold, Anne Auger, Nikolaus Hansen
- Parametric Gaussian Process Regression for Big Data. (arXiv:1704.03144v2 [stat.ML] UPDATED)
- Exponential stability of modified truncated EM method for stochastic differential equations. (arXiv:1704.03158v1 [math.PR])
- Reinterpreting Importance-Weighted Autoencoders. (arXiv:1704.02916v2 [stat.ML] UPDATED)
- Posterior Asymptotic Normality for an Individual Coordinate in High-dimensional Linear Regression. (arXiv:1704.02646v1 [math.ST])
- Evolving a Vector Space with any Generating Set. (arXiv:1704.02708v2 [cs.LG] UPDATED)
- Stein Variational Policy Gradient. (arXiv:1704.02399v1 [cs.LG])
- Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning. (arXiv:1704.02882v2 [cs.AI] UPDATED)
- Efficient SMC$^2$ schemes for stochastic kinetic models. (arXiv:1704.02791v2 [stat.CO] UPDATED)
- Bayesian Recurrent Neural Networks. (arXiv:1704.02798v4 [cs.LG] UPDATED)
- Group Importance Sampling for Particle Filtering and MCMC. (arXiv:1704.02771v4 [stat.CO] UPDATED)
- Three Skewed Matrix Variate Distributions. (arXiv:1704.02531v5 [stat.ME] UPDATED)
- Entropic Dynamics: Mechanics without Mechanism. (arXiv:1704.02663v2 [quant-ph] UPDATED)
- Evolution in Groups: A deeper look at synaptic cluster driven evolution of deep neural networks. (arXiv:1704.02081v1 [cs.NE])
- Recurrent Environment Simulators. (arXiv:1704.02254v2 [cs.AI] UPDATED)
- Generalized fractional Brownian motion. (arXiv:1704.02103v1 [math.PR])
- Nonnegative/binary matrix factorization with a D-Wave quantum annealer. (arXiv:1704.01605v1 [cs.LG])
- Maximum a Posteriori Joint State Path and Parameter Estimation in Stochastic Differential Equations. (arXiv:1704.01847v1 [math.ST])
- Exploring first-order phase transitions with population annealing. (arXiv:1704.01888v1 [physics.comp-ph])
- A stochastic molecular scheme for an artificial cell to infer its environment from partial observations. (arXiv:1704.01733v1 [q-bio.MN])
- Optimal transport and integer partitions. (arXiv:1704.01666v1 [math.NT])
- Accelerated Stochastic Quasi-Newton Optimization on Riemann Manifolds. (arXiv:1704.01700v3 [math.OC] UPDATED)
- Heat fluctuations of Brownian oscillators in nonstationary processes: fluctuation theorem and condensation transition. (arXiv:1704.01739v1 [cond-mat.stat-mech])
- Two modified proximal point algorithms in geodesic spaces with curvature bounded above. (arXiv:1704.01360v2 [math.FA] UPDATED)
- On the construction of probabilistic Newton-type algorithms. (arXiv:1704.01382v1 [stat.ML])
- Smoothing and filtering with a class of outer measures. (arXiv:1704.01233v2 [stat.ME] UPDATED)
- Entropy production for complex Langevin equations. (arXiv:1704.01566v2 [cond-mat.stat-mech] UPDATED)
- Stochastic L-BFGS: Improved Convergence Rates and Practical Acceleration Strategies. (arXiv:1704.00116v3 [math.OC] UPDATED)
- Iterated stochastic processes : simulation and relationship with high order partial differential equations. (arXiv:1704.00173v2 [math.PR] UPDATED)
- Robust Student's t based Stochastic Cubature Filter for Nonlinear Systems with Heavy-tailed Process and Measurement Noises. (arXiv:1704.00040v1 [stat.AP])
- Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature. (arXiv:1704.00060v2 [stat.ML] UPDATED)
- Gradient Flows in Uncertainty Propagation and Filtering of Linear Gaussian Systems. (arXiv:1704.00102v1 [math.OC])
- How Wave - Wavelet Trading Wins and "Beats" the Market. (arXiv:1704.00383v1 [q-fin.TR])
- Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data. (arXiv:1703.11008v2 [cs.LG] UPDATED)
- A note on the generalized heat content for L\'evy processes. (arXiv:1703.10790v2 [math.PR] UPDATED)
- Catalyst Acceleration for Gradient-Based Non-Convex Optimization. (arXiv:1703.10993v3 [stat.ML] UPDATED)
- Reliable Decision Support using Counterfactual Models. (arXiv:1703.10651v4 [stat.ML] UPDATED)
- Factorization tricks for LSTM networks. (arXiv:1703.10722v3 [cs.CL] UPDATED)
- Proper Bayes and Minimax Predictive Densities for a Matrix-variate Normal Distribution. (arXiv:1703.10393v2 [math.ST] UPDATED)
- Investigating inequality: a Langevin approach. (arXiv:1703.10360v1 [cond-mat.stat-mech])
- Probabilistic Line Searches for Stochastic Optimization. (arXiv:1703.10034v2 [cs.LG] UPDATED)
- A Course in Interacting Particle Systems. (arXiv:1703.10007v4 [math.PR] UPDATED)
- Early Stopping without a Validation Set. (arXiv:1703.09580v3 [cs.LG] UPDATED)
- Radial Subgradient Method. (arXiv:1703.09280v2 [math.OC] UPDATED)
- Parameter estimation for fractional Ornstein-Uhlenbeck processes of general Hurst parameter. (arXiv:1703.09372v1 [math.PR])
- A Unified Formulation and Fast Accelerated Proximal Gradient Method for Classification; Naoki Ito, Akiko Takeda, Kim-Chuan Toh
- On Perturbed Proximal Gradient Algorithms; Yves F. Atchadé, Gersende Fort, Eric Moulines
- Automatic Differentiation Variational Inference; Alp Kucukelbir, Dustin Tran, Rajesh Ranganath, Andrew Gelman, David M. Blei
- Smolyak's algorithm: A powerful black box for the acceleration of scientific computations. (arXiv:1703.08872v3 [math.NA] UPDATED)
- Finite Mixtures of Skewed Matrix Variate Distributions. (arXiv:1703.08882v3 [stat.ME] UPDATED)
- Linear Thompson Sampling Revisited. (arXiv:1611.06534v3 [stat.ML] UPDATED)
- Uncertainty quantification in graph-based classification of high dimensional data. (arXiv:1703.08816v2 [cs.LG] UPDATED)
- Gradient Method With Inexact Oracle for Composite Non-Convex Optimization. (arXiv:1703.09180v1 [math.OC])
- Stochastic Methods for Composite and Weakly Convex Optimization Problems. (arXiv:1703.08570v3 [math.OC] UPDATED)
- Thompson Sampling for Linear-Quadratic Control Problems. (arXiv:1703.08972v1 [stat.ML])
- Full likelihood inference for max-stable data. (arXiv:1703.08665v2 [stat.ME] UPDATED)
- How AD Can Help Solve Differential-Algebraic Equations. (arXiv:1703.08914v1 [math.NA])
- Structure Preserving Model Reduction of Parametric Hamiltonian Systems. (arXiv:1703.08345v1 [math.NA])
- A note on symmetries in the path integral formulation of the Langevin dynamics. (arXiv:1703.08192v4 [hep-th] UPDATED)
- A Nonconvex Splitting Method for Symmetric Nonnegative Matrix Factorization: Convergence Analysis and Optimality. (arXiv:1703.08267v1 [math.OC])
- Stochastic Calculus with respect to Gaussian Processes: Part I. (arXiv:1703.08393v2 [math.PR] UPDATED)
- Perspective: Energy Landscapes for Machine Learning. (arXiv:1703.07915v1 [stat.ML])
- A probability inequality for sums of independent Banach space valued random variables. (arXiv:1703.07868v1 [math.PR])
- How to avoid the curse of dimensionality: scalability of particle filters with and without importance weights. (arXiv:1703.07879v2 [math.OC] UPDATED)
- Learning to Generate Samples from Noise through Infusion Training. (arXiv:1703.06975v1 [stat.ML])
- REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models. (arXiv:1703.07370v4 [cs.LG] UPDATED)
- Deep Learning for Explicitly Modeling Optimization Landscapes. (arXiv:1703.07394v1 [cs.NE])
- Overcoming model simplifications when quantifying predictive uncertainty. (arXiv:1703.07198v1 [stat.ML])
- Targeting Bayes factors with direct-path non-equilibrium thermodynamic integration. (arXiv:1703.07305v1 [stat.ME])
- QMDP-Net: Deep Learning for Planning under Partial Observability. (arXiv:1703.06692v3 [cs.AI] UPDATED)
- Sequential Monte Carlo Methods in the nimble R Package. (arXiv:1703.06206v3 [stat.CO] UPDATED)
- Derivation of the Boltzmann Equation for Financial Brownian Motion: Direct Observation of the Collective Motion of High-Frequency Traders. (arXiv:1703.06739v3 [q-fin.TR] UPDATED)
- An Introduction to Large Deviations and Equilibrium Statistical Mechanics for Turbulent Flows. (arXiv:1703.06779v1 [cond-mat.stat-mech])
- Inference via low-dimensional couplings. (arXiv:1703.06131v4 [stat.ME] UPDATED)
- Particle Value Functions. (arXiv:1703.05820v1 [cs.LG])
- Behavior of the Wasserstein distance between the empirical and the marginal distributions of stationary $\alpha$-dependent sequences
- Girsanov reweighting for path ensembles and Markov state models. (arXiv:1703.05498v1 [cond-mat.stat-mech])
- Testing and non-linear preconditioning of the proximal point method. (arXiv:1703.05705v3 [math.OC] UPDATED)
- Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning. (arXiv:1703.05376v5 [cs.AI] UPDATED)
- Generalized Self-Concordant Functions: A Recipe for Newton-Type Methods. (arXiv:1703.04599v3 [math.OC] UPDATED)
- Adaptive Restart of the Optimized Gradient Method for Convex Optimization. (arXiv:1703.04641v2 [math.OC] UPDATED)
- Sharp Minima Can Generalize For Deep Nets. (arXiv:1703.04933v2 [cs.LG] UPDATED)
- Understanding Black-box Predictions via Influence Functions. (arXiv:1703.04730v3 [stat.ML] UPDATED)
- Student-t Process Quadratures for Filtering of Non-Linear Systems with Heavy-Tailed Noise. (arXiv:1703.05189v2 [stat.ME] UPDATED)
- Online Learning for Distribution-Free Prediction. (arXiv:1703.05060v1 [cs.LG])
- A Short Note on Almost Sure Convergence of Bayes Factors in the General Set-Up. (arXiv:1703.04956v5 [math.ST] UPDATED)
- Learned Optimizers that Scale and Generalize. (arXiv:1703.04813v4 [cs.LG] UPDATED)
- Multilevel Sequential Monte Carlo with Dimension-Independent Likelihood-Informed Proposals. (arXiv:1703.04866v1 [stat.CO])
- Online Learning Rate Adaptation with Hypergradient Descent. (arXiv:1703.04782v3 [cs.LG] UPDATED)
- A Noninformative Prior on a Space of Distribution Functions. (arXiv:1703.04661v3 [math.ST] UPDATED)
- APP-Hom Method for Box Constrained Quadratic Programming. (arXiv:1703.05001v2 [math.OC] UPDATED)
- Riemannian stochastic quasi-Newton algorithm with variance reduction and its convergence analysis. (arXiv:1703.04890v1 [cs.LG])
- Stochastic Lotka-Volterra food chains. (arXiv:1703.04809v1 [math.PR])
- A proximal point algorithm revisited and extended. (arXiv:1703.04051v1 [math.OC])
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning. (arXiv:1703.03864v2 [stat.ML] UPDATED)
- Langevin Dynamics with Continuous Tempering for Training Deep Neural Networks. (arXiv:1703.04379v4 [cs.LG] UPDATED)
- Strong convergence rates of probabilistic integrators for ordinary differential equations. (arXiv:1703.03680v6 [math.NA] UPDATED)
- Learning Gradient Descent: Better Generalization and Longer Horizons. (arXiv:1703.03633v3 [cs.LG] UPDATED)
- Online Learning with Abstention. (arXiv:1703.03478v3 [cs.LG] UPDATED)
- Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations. (arXiv:1703.03717v2 [cs.LG] UPDATED)
- Couplings and quantitative contraction rates for Langevin dynamics. (arXiv:1703.01617v2 [math.PR] UPDATED)
- Characterizing the impact of model error in hydrogeologic time series recovery inverse problems. (arXiv:1703.03090v1 [math.SP])
- Interpretable Structure-Evolving LSTM. (arXiv:1703.03055v1 [cs.CV])
- Random matrices and the New York City subway system. (arXiv:1703.02537v1 [physics.soc-ph])
- Byzantine-Tolerant Machine Learning. (arXiv:1703.02757v1 [cs.DC])
- Global optimization of Lipschitz functions. (arXiv:1703.02628v3 [stat.ML] UPDATED)
- Deep Robust Kalman Filter. (arXiv:1703.02310v1 [cs.AI])
- Revisiting stochastic off-policy action-value gradients. (arXiv:1703.02102v2 [stat.ML] UPDATED)
- Unsupervised learning of phase transitions: from principal component analysis to variational autoencoders. (arXiv:1703.02435v2 [cond-mat.stat-mech] UPDATED)
- Online Multilinear Dictionary Learning. (arXiv:1703.02492v5 [cs.LG] UPDATED)
- Faster Coordinate Descent via Adaptive Importance Sampling. (arXiv:1703.02518v1 [cs.LG])
- Robust Bayesian Filtering and Smoothing Using Student's t Distribution. (arXiv:1703.02428v1 [stat.ME])
- Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo. (arXiv:1703.02419v1 [stat.CO])
- Applications of neural networks to the studies of phase transitions of two-dimensional Potts models. (arXiv:1703.02369v2 [cond-mat.dis-nn] UPDATED)
- Estimating the Marginal Likelihood Using the Arithmetic Mean Identity
- Large-Scale Evolution of Image Classifiers. (arXiv:1703.01041v2 [cs.NE] UPDATED)
- When is selfish routing bad? The price of anarchy in light and heavy traffic. (arXiv:1703.00927v2 [cs.GT] UPDATED)
- Forward and Reverse Gradient-Based Hyperparameter Optimization. (arXiv:1703.01785v3 [stat.ML] UPDATED)
- Online Sequential Monte Carlo smoother for partially observed stochastic differential equations. (arXiv:1703.01776v1 [stat.ME])
- A Matrix Variate Skew-t Distribution. (arXiv:1703.01364v3 [stat.ME] UPDATED)
- On the convex Poincar\'e inequality and weak transportation inequalities. (arXiv:1703.01765v2 [math.PR] UPDATED)
- Implied Filtering Densities on Volatility's Hidden State. (arXiv:1203.6631v5 [q-fin.PR] UPDATED)
- Swarm behavior of traders with different subjective predictions in the Market. (arXiv:1703.01291v1 [q-fin.TR])
- Data-Dependent Stability of Stochastic Gradient Descent. (arXiv:1703.01678v4 [cs.LG] UPDATED)
- Sharp bounds for population recovery. (arXiv:1703.01474v1 [cs.DS])
- Newton-like dynamics associated to nonconvex optimization problems. (arXiv:1703.01339v1 [math.OC])
- Control Interpretations for First-Order Optimization Methods. (arXiv:1703.01670v1 [cs.SY])
- Online EM Algorithm for Latent Data Models. (arXiv:0712.4273v4 [stat.CO] UPDATED)
- SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient. (arXiv:1703.00102v2 [stat.ML] UPDATED)
- Linearly constrained Gaussian processes. (arXiv:1703.00787v2 [stat.ML] UPDATED)
- Introduction to Nonnegative Matrix Factorization. (arXiv:1703.00663v1 [math.OC])
- Algorithmic stability and hypothesis complexity. (arXiv:1702.08712v2 [stat.ML] UPDATED)
- Convergence rate of a simulated annealing algorithm with noisy observations. (arXiv:1703.00329v1 [stat.ML])
- Online EM Algorithm for Latent Data Models. (arXiv:0712.4273v4 [stat.CO] UPDATED)
- Online Natural Gradient as a Kalman Filter. (arXiv:1703.00209v3 [stat.ML] UPDATED)
- SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient. (arXiv:1703.00102v2 [stat.ML] UPDATED)
- Wright-Fisher diffusion bridges. (arXiv:1703.00208v3 [math.PR] UPDATED)
- Hierarchical Implicit Models and Likelihood-Free Variational Inference. (arXiv:1702.08896v3 [stat.ML] UPDATED)
- Algorithmic stability and hypothesis complexity. (arXiv:1702.08712v2 [stat.ML] UPDATED)
- Monte Carlo on manifolds: sampling densities and integrating functions. (arXiv:1702.08446v1 [math.NA])
- Convergence Analysis of the Ensemble Kalman Filter for Inverse Problems: the Noisy Case. (arXiv:1702.07894v1 [math.NA])
- Linear Convergence of the Proximal Incremental Aggregated Gradient Method under Quadratic Growth Condition. (arXiv:1702.08166v1 [math.OC])
- A Universal Ordinary Differential Equation. (arXiv:1702.08328v6 [math.CA] UPDATED)
- The Ensemble Kalman Filter: A Signal Processing Perspective. (arXiv:1702.08061v1 [stat.ME])
- Kalman Filter and its Modern Extensions for the Continuous-time Nonlinear Filtering Problem. (arXiv:1702.07241v3 [math.OC] UPDATED)
- The Stochastic complexity of spin models: Are pairwise models really simple?. (arXiv:1702.07549v3 [cond-mat.dis-nn] UPDATED)
- Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning. (arXiv:1702.07464v3 [cs.CR] UPDATED)
- Stochastic Newton and Quasi-Newton Methods for Large Linear Least-squares Problems. (arXiv:1702.07367v1 [math.NA])
- Particle Filters for Partially-Observed Boolean Dynamical Systems. (arXiv:1702.07269v1 [stat.ME])
- Kalman Filter and its Modern Extensions for the Continuous-time Nonlinear Filtering Problem. (arXiv:1702.07241v1 [math.OC])
- Fast rates for online learning in Linearly Solvable Markov Decision Processes. (arXiv:1702.06341v2 [cs.LG] UPDATED)
- Stochastic Composite Least-Squares Regression with convergence rate O(1/n). (arXiv:1702.06429v1 [math.OC])
- Stochastic Composite Least-Squares Regression with convergence rate O(1/n). (arXiv:1702.06429v1 [math.OC])
- What we learn from the learning rate. (arXiv:1702.06041v2 [cond-mat.stat-mech] UPDATED)
- A Hitting Time Analysis of Stochastic Gradient Langevin Dynamics. (arXiv:1702.05575v3 [cs.LG] UPDATED)
- Riemannian stochastic variance reduced gradient algorithm with retraction and vector transport. (arXiv:1702.05594v3 [cs.LG] UPDATED)
- Learning to Use Learners' Advice. (arXiv:1702.04825v2 [cs.LG] UPDATED)
- Unbiased Online Recurrent Optimization. (arXiv:1702.05043v3 [cs.NE] UPDATED)
- Generative Temporal Models with Memory. (arXiv:1702.04649v2 [cs.LG] UPDATED)
- Convergence rates for nonequilibrium Langevin dynamics. (arXiv:1702.03685v2 [math-ph] UPDATED)
- An inexact iterative Bregman method for optimal control problems. (arXiv:1702.04547v2 [math.OC] UPDATED)
- Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis. (arXiv:1702.03849v3 [cs.LG] UPDATED)
- Convergence rates for nonequilibrium Langevin dynamics. (arXiv:1702.03685v2 [math-ph] UPDATED)
- Multilevel Monte Carlo in Approximate Bayesian Computation. (arXiv:1702.03628v1 [stat.ME])
- Information and estimation in Fokker-Planck channels. (arXiv:1702.03656v1 [cs.IT])
- Modern Monte Carlo Variants for Uncertainty Quantification in Neutron Transport. (arXiv:1702.03561v1 [math.NA])
- Bayesian Probabilistic Numerical Methods. (arXiv:1702.03673v1 [stat.ME])
- Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis. (arXiv:1702.03849v3 [cs.LG] UPDATED)
- Analysis of a nonlinear importance sampling scheme for Bayesian parameter estimation in state-space models. (arXiv:1702.03146v1 [stat.CO])
- Deep Learning with Dynamic Computation Graphs. (arXiv:1702.02181v2 [cs.NE] UPDATED)
- Deep Generalized Canonical Correlation Analysis. (arXiv:1702.02519v2 [cs.LG] UPDATED)
- Deep Kernelized Autoencoders. (arXiv:1702.02526v1 [stat.ML])
- Rough volatility: evidence from option prices. (arXiv:1702.02777v1 [q-fin.ST])
- Predicting Pairwise Relations with Neural Similarity Encoders. (arXiv:1702.01824v2 [stat.ML] UPDATED)
- Deep learning and the Schr\"odinger equation. (arXiv:1702.01361v3 [cond-mat.mtrl-sci] UPDATED)
- Making matrices better: Geometry and topology of polar and singular value decomposition. (arXiv:1702.02131v1 [math.RA])
- Optimal Scaling of the MALA algorithm with Irreversible Proposals for Gaussian targets. (arXiv:1702.01777v3 [stat.ME] UPDATED)
- Learning of state-space models with highly informative observations: a tempered Sequential Monte Carlo solution. (arXiv:1702.01618v2 [stat.CO] UPDATED)
- Statistical inference for misspecified ergodic L\'evy driven stochastic differential equation models. (arXiv:1702.00908v3 [math.ST] UPDATED)
- Expansion of the Kullback-Leibler Divergence, and a new class of information metrics. (arXiv:1702.00033v1 [cs.IT] CROSS LISTED)
- The Computer Science and Physics of Community Detection: Landscapes, Phase Transitions, and Hardness. (arXiv:1702.00467v3 [cs.CC] UPDATED)
- Empirical entropy, minimax regret and minimax risk
- IQN: An Incremental Quasi-Newton Method with Local Superlinear Convergence Rate. (arXiv:1702.00709v2 [math.OC] UPDATED)
- Statistics with Set-Valued Functions: Applications to Inverse Approximate Optimization. (arXiv:1702.00708v2 [math.OC] UPDATED)
- Convergence Results for Neural Networks via Electrodynamics. (arXiv:1702.00458v5 [cs.DS] UPDATED)
- On SGD's Failure in Practice: Characterizing and Overcoming Stalling. (arXiv:1702.00317v2 [stat.ML] UPDATED)
- Chentsov's theorem for exponential families. (arXiv:1701.08895v2 [math.ST] UPDATED)
- On the geometry of Bayesian inference. (arXiv:1701.08994v2 [stat.ME] UPDATED)
- CommAI: Evaluating the first steps towards a useful general AI. (arXiv:1701.08954v2 [cs.LG] UPDATED)
- Reinforcement Learning Algorithm Selection. (arXiv:1701.08810v3 [stat.ML] UPDATED)
- Linear Convergence of SVRG in Statistical Estimation. (arXiv:1611.01957v3 [stat.ML] UPDATED)
- The origin of life seen from the point of view of non-equilibrium statistical mechanics. (arXiv:1701.08388v1 [nlin.AO])
- Deep Recurrent Neural Network for Protein Function Prediction from Sequence. (arXiv:1701.08318v1 [q-bio.QM])
- The Causal Frame Problem: An Algorithmic Perspective. (arXiv:1701.08100v1 [cs.AI])
- Wasserstein GAN. (arXiv:1701.07875v3 [stat.ML] UPDATED)
- The Price of Differential Privacy For Online Learning. (arXiv:1701.07953v2 [cs.LG] UPDATED)
- Laplace's method in Bayesian inverse problems. (arXiv:1701.07989v2 [math.PR] UPDATED)
- Geometric Ergodicity of the multivariate COGARCH(1,1) Process. (arXiv:1701.07859v4 [math.PR] UPDATED)
- Deep Reinforcement Learning: An Overview. (arXiv:1701.07274v6 [cs.LG] UPDATED)
- Stratified Splitting for Efficient Monte Carlo Integration. (arXiv:1701.07535v2 [math.ST] UPDATED)
- Fast initial conditions for Glauber dynamics. (arXiv:1701.06042v1 [math.PR])
- Geometric Optimal Control and Applications to Aerospace. (arXiv:1701.06203v1 [math.OC])
- Learning Policies for Markov Decision Processes from Data. (arXiv:1701.05954v1 [math.OC])
- dna2vec: Consistent vector representations of variable-length k-mers. (arXiv:1701.06279v1 [q-bio.QM])
- Importance Sampling of Rare Events in Chaotic Systems. (arXiv:1701.06265v1 [nlin.CD])
- Generalized and hybrid Metropolis-Hastings overdamped Langevin algorithms. (arXiv:1701.05833v1 [math.PR])
- Bayesian Static Parameter Estimation for Partially Observed Diffusions via Multilevel Monte Carlo. (arXiv:1701.05892v1 [stat.CO])
- Git Blame Who?: Stylistic Authorship Attribution of Small, Incomplete Source Code Fragments. (arXiv:1701.05681v3 [cs.LG] UPDATED)
- Highly Efficient Hierarchical Online Nonlinear Regression Using Second Order Methods. (arXiv:1701.05053v1 [cs.LG])
- Converting Cascade-Correlation Neural Nets into Probabilistic Generative Models. (arXiv:1701.05004v1 [q-bio.NC])
- A Large Deviation Inequality for $\beta$-mixing Time Series and its Applications to the Functional Kernel Regression Model. (arXiv:1701.05380v3 [math.ST] UPDATED)
- Efficient Implementation Of Newton-Raphson Methods For Sequential Data Prediction. (arXiv:1701.05378v1 [cs.DS])
- Reasoning in Non-Probabilistic Uncertainty: Logic Programming and Neural-Symbolic Computing as Examples. (arXiv:1701.05226v2 [cs.AI] UPDATED)
- Lipschitz Properties for Deep Convolutional Networks. (arXiv:1701.05217v1 [cs.LG])
- Stochastic Subsampling for Factorizing Huge Matrices. (arXiv:1701.05363v3 [stat.ML] UPDATED)
- On parameter estimation with the Wasserstein distance. (arXiv:1701.05146v3 [stat.ME] UPDATED)
- Multilayer Perceptron Algebra. (arXiv:1701.04968v1 [stat.ML])
- Deep Learning for Computational Chemistry. (arXiv:1701.04503v1 [stat.ML])
- Bayesian Inference and Model Assessment for Spatial Point Patterns Using Posterior Predictive Samples
- Identifying polymer states by machine learning. (arXiv:1701.04390v1 [cond-mat.soft])
- Piecewise Deterministic Markov Processes for Scalable Monte Carlo on Restricted Domains. (arXiv:1701.04244v3 [stat.ME] UPDATED)
- Consistency and Asymptotic Normality of Stochastic Euler Schemes for Ordinary Differential Equations. (arXiv:1609.06880v1 [math.PR] CROSS LISTED)
- Fundamental Properties of Process Distances. (arXiv:1701.03955v3 [math.OC] UPDATED)
- Probabilistic Numerical Methods for PDE-constrained Bayesian Inverse Problems. (arXiv:1701.04006v1 [stat.ME])
- Fast Bayesian Intensity Estimation for the Permanental Process. (arXiv:1701.03535v3 [stat.ME] UPDATED)
- Deep Probabilistic Programming. (arXiv:1701.03757v2 [stat.ML] UPDATED)
- Perishability of Data: Dynamic Pricing under Varying-Coefficient Models. (arXiv:1701.03537v2 [cs.GT] UPDATED)
- An Ergodic Theorem for Fleming-Viot Models in Random Environments. (arXiv:1701.03224v1 [math.PR])
- Price dynamics on a risk-averse market with asymmetric information. (arXiv:1701.03341v1 [math.OC])
- Stein's method for dynamical systems. (arXiv:1701.02966v1 [math.PR])
- Identifying Best Interventions through Online Importance Sampling. (arXiv:1701.02789v3 [stat.ML] UPDATED)
- Towards Smart Proof Search for Isabelle. (arXiv:1701.03037v1 [cs.AI])
- A Convenient Category for Higher-Order Probability Theory. (arXiv:1701.02547v4 [cs.PL] UPDATED)
- The Curie-Weiss model with complex temperature: phase transitions. (arXiv:1701.02375v2 [math.PR] UPDATED)
- A Conceptual Introduction to Hamiltonian Monte Carlo. (arXiv:1701.02434v2 [stat.ME] UPDATED)
- Reinforcement Learning via Recurrent Convolutional Neural Networks. (arXiv:1701.02392v1 [cs.LG])
- Machine Learning of Linear Differential Equations using Gaussian Processes. (arXiv:1701.02440v1 [cs.LG])
- Stoic Ethics for Artificial Agents. (arXiv:1701.02388v2 [cs.AI] UPDATED)
- A Convex Optimization Approach to Discrete Optimal Control. (arXiv:1701.02414v2 [math.OC] UPDATED)
- Feedback Particle Filter on Matrix Lie Groups. (arXiv:1701.02416v1 [math.OC])
- A Controlled Particle Filter for Global Optimization. (arXiv:1701.02413v1 [math.OC])
- Universality for eigenvalue algorithms on sample covariance matrices. (arXiv:1701.01896v1 [math.NA])
- An Inexact Inverse Power Method for Numerical Analysis of Stochastic Dynamic Systems. (arXiv:1701.02830v1 [math.NA])
- Information theory, predictability, and the emergence of complex life. (arXiv:1701.02389v1 [q-bio.PE])
- Towards parallelizable sampling-based Nonlinear Model Predictive Control. (arXiv:1701.02660v1 [cs.SY])
- Noise Stability is computable and low dimensional. (arXiv:1701.01483v2 [math.PR] UPDATED)
- Smoothing with Couplings of Conditional Particle Filters. (arXiv:1701.02002v3 [stat.ME] UPDATED)
- Coupled Compound Poisson Factorization. (arXiv:1701.02058v1 [cs.LG])
- Does universal controllability of physical systems prohibit thermodynamic cycles?. (arXiv:1701.01591v2 [cond-mat.stat-mech] UPDATED)
- Methods to locate Saddle Points in Complex Landscapes. (arXiv:1701.01241v1 [cond-mat.dis-nn])
- A Rough Path Perspective on Renormalization. (arXiv:1701.01152v2 [math.PR] UPDATED)
- Summary statistics from training images as prior information in probabilistic inversion. (arXiv:1701.01376v1 [physics.geo-ph])
- Generating Focussed Molecule Libraries for Drug Discovery with Recurrent Neural Networks. (arXiv:1701.01329v1 [cs.NE])
- Rational Decision-Making Under Uncertainty: Observed Betting Patterns on a Biased Coin. (arXiv:1701.01427v1 [q-fin.GN])
- Partially Recursive Acceptance Rejection. (arXiv:1701.00821v1 [cs.DS])
- Bayesian Computation for Log-Gaussian Cox Processes--A Comparative Analysis of Methods. (arXiv:1701.00857v1 [stat.CO])
- Collapsing of dimensionality. (arXiv:1701.00831v1 [cs.LG])
- Private Incremental Regression. (arXiv:1701.01093v1 [cs.DS])
- Numerically stable online estimation of variance in particle filters. (arXiv:1701.01001v1 [stat.ME])
- Is microcanonical ensemble stable?. (arXiv:1701.00720v1 [cond-mat.stat-mech])
- Bet-hedging against demographic fluctuations. (arXiv:1701.00523v3 [q-bio.PE] UPDATED)
- When the map is better than the territory. (arXiv:1612.09592v1 [cs.IT])
- Stochastic Variance-reduced Gradient Descent for Low-rank Matrix Recovery from Linear Measurements. (arXiv:1701.00481v2 [stat.ML] UPDATED)
- Total Variation Denoising via the Moreau Envelope. (arXiv:1701.00439v1 [math.OC])
- NIPS 2016 Tutorial: Generative Adversarial Networks. (arXiv:1701.00160v4 [cs.LG] UPDATED)
Saved in 2016
- Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization. (arXiv:1612.09296v3 [cs.LG] UPDATED)
- Efficient Inexact Proximal Gradient Algorithm for Nonconvex Problems. (arXiv:1612.09069v3 [math.OC] UPDATED)
- A Basic Recurrent Neural Network Model. (arXiv:1612.09022v1 [cs.NE])
- Conditional Central Limit Theorems for Gaussian Projections. (arXiv:1612.09252v2 [cs.IT] UPDATED)
- The interplay between system identification and machine learning. (arXiv:1612.09158v1 [cs.SY])
- Efficient iterative policy optimization. (arXiv:1612.08967v1 [cs.AI])
- The Predictron: End-To-End Learning and Planning. (arXiv:1612.08810v3 [cs.LG] UPDATED)
- Is Lipschitz Continuity Preserved under Sampled-Data Discretization?. (arXiv:1612.08469v2 [cs.SY] UPDATED)
- Couplings, gradient estimates and logarithmic Sobolev inequality for Langevin bridges. (arXiv:1612.08546v2 [math.PR] UPDATED)
- A Generalized Population Dynamics Model of a City and an Algorithm for Engineering Regime Shifts. (arXiv:1612.08338v1 [physics.soc-ph])
- Speculation and Power Law. (arXiv:1612.08705v1 [q-fin.ST])
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model. (arXiv:1612.07837v2 [cs.SD] UPDATED)
- Generalized Curie-Weiss Model and Quadratic Pressure in Ergodic Theory. (arXiv:1612.07909v2 [math.DS] UPDATED)
- A State Space Approach for Piecewise-Linear Recurrent Neural Networks for Reconstructing Nonlinear Dynamics from Neural Measurements. (arXiv:1612.07846v1 [q-bio.NC])
- RLScore: Regularized Least-Squares Learners; Tapio Pahikkala, Antti Airola
- Newton-Stein Method: An Optimization Method for GLMs via Stein's Lemma; Murat A. Erdogdu
- Four lectures on probabilistic methods for data science. (arXiv:1612.06661v2 [math.PR] UPDATED)
- Box constrained $\ell_1$ optimization in random linear systems -- asymptotics. (arXiv:1612.06835v1 [math.PR])
- Box constrained $\ell_1$ optimization in random linear systems -- finite dimensions. (arXiv:1612.06839v1 [math.OC])
- Loss is its own Reward: Self-Supervision for Reinforcement Learning. (arXiv:1612.07307v2 [cs.LG] UPDATED)
- Quasi-Newton Methods: Superlinear Convergence Without Line Searches for Self-Concordant Functions. (arXiv:1612.06965v3 [math.OC] UPDATED)
- Multilevel Monte Carlo and Improved Timestepping Methods in Atmospheric Dispersion Modelling. (arXiv:1612.07717v1 [math.NA])
- Sampling normalizing constants in high dimensions using inhomogeneous diffusions. (arXiv:1612.07583v2 [math.ST] UPDATED)
- Efficient Bayesian computation by proximal Markov chain Monte Carlo: when Langevin meets Moreau. (arXiv:1612.07471v1 [stat.CO])
- Sparse estimation in Ising Model via penalized Monte Carlo methods. (arXiv:1612.07497v2 [stat.ME] UPDATED)
- How to Train Your Deep Neural Network with Dictionary Learning. (arXiv:1612.07454v1 [cs.LG])
- Stacking machine learning classifiers to identify Higgs bosons at the LHC. (arXiv:1612.07725v3 [hep-ph] UPDATED)
- Multivariate approximation in total variation, II: discrete normal approximation. (arXiv:1612.07519v1 [math.PR])
- Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning. (arXiv:1612.07548v1 [cs.AI])
- Partial $\ell_1$ optimization in random linear systems -- phase transitions and large deviations. (arXiv:1612.07435v1 [math.OC])
- Partial $\ell_1$ optimization in random linear systems -- finite dimensions. (arXiv:1612.07436v1 [math.OC])
- Shot-Noise Processes in Finance. (arXiv:1612.06616v1 [math.PR])
- Sixty Years of Moments for Random Matrices. (arXiv:1612.06725v1 [math-ph])
- Leverage and Uncertainty. (arXiv:1612.07194v1 [q-fin.RM])
- Correlations and forecast of death tolls in the Syrian conflict. (arXiv:1612.06746v1 [physics.soc-ph])
- Sequential Monte Carlo with transformations. (arXiv:1612.06468v3 [stat.CO] UPDATED)
- Four lectures on probabilistic methods for data science. (arXiv:1612.06661v2 [math.PR] UPDATED)
- On stochastic calculus with respect to q-Brownian motion. (arXiv:1612.05757v4 [math.PR] UPDATED)
- Combinatorial Levy processes. (arXiv:1612.05746v1 [math.PR])
- A representation-theoretic approach to the calculation of evolutionary distance in bacteria. (arXiv:1612.06035v1 [q-bio.PE])
- Optimal Investment under Information Driven Contagious Distress. (arXiv:1612.06133v1 [q-fin.PM])
- Inexact Proximal Gradient Methods for Non-convex and Non-smooth Optimization. (arXiv:1612.06003v2 [cs.LG] UPDATED)
- Reinforcement Learning Using Quantum Boltzmann Machines. (arXiv:1612.05695v3 [quant-ph] UPDATED)
- The Blockchain: A Gentle Four Page Introduction. (arXiv:1612.06244v1 [q-fin.GN])
- Causal Learning via Manifold Regularization. (arXiv:1612.05678v4 [stat.ML] UPDATED)
- An extended Perona-Malik model based on probabilistic models. (arXiv:1612.06176v1 [cs.CV])
- Mutual information for fitting deep nonlinear models. (arXiv:1612.05708v1 [math.OC])
- Coupling Adaptive Batch Sizes with Learning Rates. (arXiv:1612.05086v2 [cs.LG] UPDATED)
- Graphical RNN Models. (arXiv:1612.05054v1 [cs.NE])
- Bayesian Optimization for Machine Learning : A Practical Guidebook. (arXiv:1612.04858v1 [cs.LG])
- Coupling Adaptive Batch Sizes with Learning Rates. (arXiv:1612.05086v2 [cs.LG] UPDATED)
- Maximum Likelihood Estimation in Markov Regime-Switching Models with Covariate-Dependent Transition Probabilities. (arXiv:1612.04932v3 [math.ST] UPDATED)
- Projected Regression Methods for Inverting Fredholm Integrals: Formalism and Application to Analytical Continuation. (arXiv:1612.04895v1 [cond-mat.str-el])
- Matrix Dirichlet processes. (arXiv:1612.04472v2 [math.PR] UPDATED)
- SVD-based Kalman Filter Derivative Computation. (arXiv:1612.04777v1 [cs.SY])
- Central limit theorems of a recursive stochastic algorithm with applications to adaptive designs
- Lotka–Volterra with randomly fluctuating environments or “how switching between beneficial environments can make survival harder”
- Parameter Estimation Under Model Uncertainties by Iterative Covariance Approximation. (arXiv:1612.04059v2 [math.ST] UPDATED)
- Monte Carlo Structured SVI for Two-Level Non-Conjugate Models. (arXiv:1612.03957v3 [stat.ML] UPDATED)
- Probabilistic Bisection Converges Almost as Quickly as Stochastic Approximation. (arXiv:1612.03964v1 [math.PR])
- A numerical scheme for the compressible low-Mach number regime of ideal fluid dynamics. (arXiv:1612.03910v1 [math.NA])
- When multiplicative noise stymies control. (arXiv:1612.03239v1 [cs.SY])
- DeepMind Lab. (arXiv:1612.03801v2 [cs.AI] UPDATED)
- Anytime Monte Carlo. (arXiv:1612.03319v3 [stat.CO] UPDATED)
- Models as Approximations II: A Model-Free Theory of Parametric Regression. (arXiv:1612.03257v2 [math.ST] UPDATED)
- Testing Ising Models. (arXiv:1612.03147v6 [cs.DS] UPDATED)
- Phase transitions in Restricted Boltzmann Machines with generic priors. (arXiv:1612.03132v2 [cond-mat.dis-nn] UPDATED)
- Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks. (arXiv:1612.02879v2 [cs.LG] UPDATED)
- The Why of the Applicability of Statistical Physics to Economics. (arXiv:physics/0609088v3 [physics.soc-ph] UPDATED)
- Measuring the non-asymptotic convergence of sequential Monte Carlo samplers using probabilistic programming. (arXiv:1612.02161v2 [cs.AI] UPDATED)
- Random dynamical systems, rough paths and rough flows. (arXiv:1612.01955v1 [math.PR])
- Critical scaling in hidden state inference for linear Langevin dynamics. (arXiv:1612.01976v2 [cond-mat.dis-nn] UPDATED)
- Structured Filtering. (arXiv:1612.00762v1 [quant-ph])
- Learning with Hierarchical Gaussian Kernels. (arXiv:1612.00824v1 [stat.ML])
- Low rank approximate solutions to large-scale differential matrix Riccati equations. (arXiv:1612.00499v1 [math.NA])
- Summary - TerpreT: A Probabilistic Programming Language for Program Induction. (arXiv:1612.00817v1 [cs.LG])
- Probabilistic Neural Programs. (arXiv:1612.00712v1 [cs.NE])
- Asynchronous Stochastic Gradient MCMC with Elastic Coupling. (arXiv:1612.00767v2 [stat.ML] UPDATED)
- On the instability and degeneracy of deep learning models. (arXiv:1612.01159v3 [math.ST] UPDATED)
- On the Pitfalls of Nested Monte Carlo. (arXiv:1612.00951v1 [stat.CO])
- Universality of the SIS prevalence in networks. (arXiv:1612.01386v1 [physics.soc-ph])
- Three lectures on statistical mechanics. (arXiv:1612.00863v1 [cond-mat.stat-mech])
- A Matrix Splitting Perspective on Planning with Options. (arXiv:1612.00916v2 [cs.AI] UPDATED)
- Two Methods For Wild Variational Inference. (arXiv:1612.00081v2 [stat.ML] UPDATED)
- Tuning the Scheduling of Distributed Stochastic Gradient Descent with Bayesian Optimization. (arXiv:1612.00383v1 [stat.ML])
- Quantum Machine Learning. (arXiv:1611.09347v2 [quant-ph] UPDATED)
- Accelerated Gradient Temporal Difference Learning. (arXiv:1611.09328v2 [cs.AI] UPDATED)
- Robust Variational Inference. (arXiv:1611.09226v1 [cs.LG])
- Should I use TensorFlow. (arXiv:1611.08903v1 [cs.LG])
- Rough paths in idealized financial markets. (arXiv:1005.0279v3 [q-fin.GN] UPDATED)
- Embedded Bandits for Large-Scale Black-Box Optimization. (arXiv:1611.08773v1 [cs.AI])
- A new primal-dual algorithm for minimizing the sum of three functions with a linear operator. (arXiv:1611.09805v4 [math.OC] UPDATED)
- Probabilistic map-matching using particle filters. (arXiv:1611.09706v1 [stat.ML])
- Subsampled online matrix factorization with convergence guarantees. (arXiv:1611.10041v1 [math.OC])
- Complex-valued Gaussian Process Regression for Time Series Analysis. (arXiv:1611.10073v2 [stat.ML] UPDATED)
- Stochastic Thermodynamics of Learning. (arXiv:1611.09428v1 [cond-mat.stat-mech])
- Probabilizing Parking Functions. (arXiv:1611.09821v1 [math.PR])
- Ergodicity and Accuracy of Optimal Particle Filters for Bayesian Data Assimilation. (arXiv:1611.08761v2 [math.PR] UPDATED)
- Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling. (arXiv:1611.08034v2 [cs.CL] UPDATED)
- On the linear quadratic problem for systems with time reversed Markov jump parameters and the duality with filtering of Markov jump linear systems. (arXiv:1611.07558v1 [cs.SY])
- Principled Option Learning in Markov Decision Processes. (arXiv:1609.05524v3 [cs.LG] UPDATED)
- Programs as Black-Box Explanations. (arXiv:1611.07579v1 [stat.ML])
- Variational Intrinsic Control. (arXiv:1611.07507v1 [cs.LG])
- Interpretable Recurrent Neural Networks Using Sequential Sparse Recovery. (arXiv:1611.07252v1 [stat.ML])
- Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond. (arXiv:1611.07476v2 [cs.LG] UPDATED)
- The Recycling Gibbs Sampler for Efficient Learning. (arXiv:1611.07056v2 [stat.CO] UPDATED)
- Interpreting Finite Automata for Sequential Data. (arXiv:1611.07100v2 [stat.ML] UPDATED)
- Scalable Approximations for Generalized Linear Problems. (arXiv:1611.06686v1 [stat.ML])
- Variational Boosting: Iteratively Refining Posterior Approximations. (arXiv:1611.06585v2 [stat.ML] UPDATED)
- Learning to reinforcement learn. (arXiv:1611.05763v3 [cs.LG] UPDATED)
- Empirical risk minimization and complexity of dynamical models. (arXiv:1611.06173v2 [math.ST] UPDATED)
- On a goodness of fit test for the Cauchy distribution. (arXiv:1611.06129v1 [math.ST])
- Controlling energy landscapes with correlations between minima. (arXiv:1611.06127v1 [cond-mat.dis-nn])
- Brownian yet non-Gaussian diffusion: from superstatistics to subordination of diffusing diffusivities. (arXiv:1611.06202v2 [cond-mat.stat-mech] UPDATED)
- Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models. (arXiv:1611.05934v1 [stat.ML])
- Contributed Discussion to Bayesian Solution Uncertainty Quantification for Differential Equations. (arXiv:1611.05843v1 [math.ST])
- A smooth transition from Wishart to GOE. (arXiv:1611.05838v1 [math.ST])
- Learning to reinforcement learn. (arXiv:1611.05763v3 [cs.LG] UPDATED)
- Bayesian inference for multivariate extreme value distributions. (arXiv:1611.05602v2 [stat.ME] UPDATED)
- $e$PCA: High Dimensional Exponential Family PCA. (arXiv:1611.05550v2 [stat.ME] UPDATED)
- Boosting Variational Inference. (arXiv:1611.05559v2 [stat.ML] UPDATED)
- Statistical mechanics of the inverse Ising problem and the optimal objective function. (arXiv:1611.04281v1 [cond-mat.dis-nn] CROSS LISTED)
- Stochastic Gradient Descent in Continuous Time. (arXiv:1611.05545v4 [math.PR] UPDATED)
- Kalman-Takens filtering in the presence of dynamical noise. (arXiv:1611.05414v1 [physics.data-an])
- A traffic model with an absorbing-state phase transition. (arXiv:1611.05307v1 [cond-mat.stat-mech])
- Ergodicity of one-dimensional systems coupled to the logistic thermostat. (arXiv:1611.05090v1 [cond-mat.stat-mech])
- Information transport in classical statistical systems. (arXiv:1611.04820v4 [cond-mat.stat-mech] UPDATED)
- Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy. (arXiv:1611.04488v6 [stat.ML] UPDATED)
- Statistical mechanics of the inverse Ising problem and the optimal objective function. (arXiv:1611.04281v4 [cond-mat.dis-nn] UPDATED)
- Exactly solvable model for a velocity jump observed in crack propagation in viscoelastic solids. (arXiv:1611.04269v2 [cond-mat.soft] UPDATED)
- Turbulence as a problem in non-equilibrium statistical mechanics. (arXiv:1611.02778v1 [physics.flu-dyn] CROSS LISTED)
- Gray Box Identification of State-Space Models Using Difference of Convex Programming. (arXiv:1611.04359v1 [cs.SY])
- Learning to Learn without Gradient Descent by Gradient Descent. (arXiv:1611.03824v6 [stat.ML] UPDATED)
- Linear predictors for nonlinear dynamical systems: Koopman operator meets model predictive control. (arXiv:1611.03537v3 [math.OC] UPDATED)
- Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data. (arXiv:1610.08123v4 [cs.LG] UPDATED)
- Low Data Drug Discovery with One-shot Learning. (arXiv:1611.03199v1 [cs.LG])
- Statistical mechanics of stochastic growth phenomena. (arXiv:1611.03247v2 [cond-mat.stat-mech] UPDATED)
- A Note on Random Walks with Absorbing barriers and Sequential Monte Carlo Methods. (arXiv:1611.03177v1 [stat.CO])
- What is dimension?. (arXiv:1611.03048v1 [cond-mat.stat-mech])
- RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning. (arXiv:1611.02779v2 [cs.AI] UPDATED)
- Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control. (arXiv:1611.02796v9 [cs.LG] UPDATED)
- Recursive Regression with Neural Networks: Approximating the HJI PDE Solution. (arXiv:1611.02739v4 [cs.LG] UPDATED)
- Causal optimal transport and its links to enlargement of filtrations and continuous-time stochastic optimization. (arXiv:1611.02610v2 [math.PR] UPDATED)
- Structure of the optimal path to a fluctuation. (arXiv:1611.02500v2 [cond-mat.stat-mech] UPDATED)
- Application of SIR epidemiological model: new trends. (arXiv:1611.02565v1 [physics.soc-ph])
- Gradients of Counterfactuals. (arXiv:1611.02639v2 [cs.LG] UPDATED)
- Learning a Static Analyzer from Data. (arXiv:1611.01752v2 [cs.PL] UPDATED)
- Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model. (arXiv:1611.02181v1 [stat.ML])
- Robustly representing uncertainty in deep neural networks through sampling. (arXiv:1611.01639v7 [cs.LG] CROSS LISTED)
- DeepCoder: Learning to Write Programs. (arXiv:1611.01989v2 [cs.LG] UPDATED)
- Quasi-Recurrent Neural Networks. (arXiv:1611.01576v2 [cs.NE] UPDATED)
- A Gaussian small deviation inequality for convex functions. (arXiv:1611.01723v2 [math.PR] UPDATED)
- PGQ: Combining policy gradient and Q-learning. (arXiv:1611.01626v2 [cs.LG] UPDATED)
- Learning to superoptimize programs. (arXiv:1611.01787v3 [cs.LG] UPDATED)
- Entropy-SGD: Biasing Gradient Descent Into Wide Valleys. (arXiv:1611.01838v5 [cs.LG] UPDATED)
- Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic. (arXiv:1611.02247v3 [cs.LG] UPDATED)
- Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning. (arXiv:1611.01722v2 [stat.ML] UPDATED)
- Neural Architecture Search with Reinforcement Learning. (arXiv:1611.01578v2 [cs.LG] UPDATED)
- Dependence and Relevance: A probabilistic view. (arXiv:1611.02126v1 [cs.AI])
- Necessary and sufficient conditions for $\mathbb{Z}_2$-symmetry-breaking phase transitions. (arXiv:1106.3870v8 [cond-mat.stat-mech] UPDATED)
- Model Uncertainty Stochastic Mean-Field Control. (arXiv:1611.01385v9 [math.OC] UPDATED)
- Reparameterization trick for discrete variables. (arXiv:1611.01239v1 [stat.ML])
- Deep Information Propagation. (arXiv:1611.01232v2 [stat.ML] UPDATED)
- Information Dropout: Learning Optimal Representations Through Noisy Computation. (arXiv:1611.01353v3 [stat.ML] UPDATED)
- A Central Limit Theorem for Fleming-Viot Particle Systems with Soft Killing. (arXiv:1611.00515v2 [math.PR] UPDATED)
- Computing proximal points of convex functions with inexact subgradients. (arXiv:1611.00724v1 [math.OC])
- Why and When Can Deep -- but Not Shallow -- Networks Avoid the Curse of Dimensionality: a Review. (arXiv:1611.00740v5 [cs.LG] UPDATED)
- Collaborative Recurrent Autoencoder: Recommend while Learning to Fill in the Blanks. (arXiv:1611.00454v1 [cs.LG])
- Pricing Bounds for VIX Derivatives via Least Squares Monte Carlo. (arXiv:1611.00464v1 [q-fin.CP])
- Turbulent compressible fluid: renormalization group analysis, scaling regimes, and anomalous scaling of advected scalar fields. (arXiv:1611.00327v2 [cond-mat.stat-mech] UPDATED)
- Variational Inference via $\chi$-Upper Bound Minimization. (arXiv:1611.00328v4 [stat.ML] UPDATED)
- Critical Amount of Resource in Containing Catastrophic Epidemics. (arXiv:1611.00212v1 [physics.soc-ph])
- Stochastic Variational Deep Kernel Learning. (arXiv:1611.00336v2 [stat.ML] UPDATED)
- Surpassing Gradient Descent Provably: A Cyclic Incremental Method with Linear Convergence Rate. (arXiv:1611.00347v2 [math.OC] UPDATED)
- Online Maximum Likelihood Estimation of the Parameters of Partially Observed Diffusion Processes. (arXiv:1611.00170v4 [math.OC] UPDATED)
- PCA meets RG. (arXiv:1610.09733v1 [physics.bio-ph])
- Detecting heteroskedasticity in nonparametric regression using weighted empirical processes. (arXiv:1610.09139v3 [stat.ME] UPDATED)
- Edward: A library for probabilistic modeling, inference, and criticism. (arXiv:1610.09787v3 [stat.CO] UPDATED)
- Trajectory stratification of stochastic dynamics. (arXiv:1610.09426v2 [cond-mat.stat-mech] UPDATED)
- Model Criticism for Bayesian Causal Inference. (arXiv:1610.09037v1 [stat.ME])
- Toward Implicit Sample Noise Modeling: Deviation-driven Matrix Factorization. (arXiv:1610.09274v1 [cs.LG])
- Operator Variational Inference. (arXiv:1610.09033v3 [stat.ML] UPDATED)
- Professor Forcing: A New Algorithm for Training Recurrent Networks. (arXiv:1610.09038v1 [stat.ML])
- Learning Scalable Deep Kernels with Recurrent Structure. (arXiv:1610.08936v3 [cs.LG] UPDATED)
- On embedded hidden Markov models and particle Markov chain Monte Carlo methods. (arXiv:1610.08962v1 [stat.CO])
- Statistical Inference for Model Parameters in Stochastic Gradient Descent. (arXiv:1610.08637v4 [stat.ML] UPDATED)
- Fast and Reliable Parameter Estimation from Nonlinear Observations. (arXiv:1610.07108v1 [stat.ML])
- Probabilistic Linear Multistep Methods. (arXiv:1610.08417v1 [math.NA])
- Universality of Bayesian mixture predictors. (arXiv:1610.08249v2 [math.ST] UPDATED)
- Universality of Bayesian mixture predictors. (arXiv:1610.08249v2 [math.ST] UPDATED)
- Recurrent switching linear dynamical systems. (arXiv:1610.08466v1 [stat.ML])
- Things Bayes can't do. (arXiv:1610.08239v2 [cs.LG] UPDATED)
- Online Bayesian phylogenetic inference: theoretical foundations via Sequential Monte Carlo. (arXiv:1610.08148v1 [q-bio.PE])
- Cleaning large correlation matrices: tools from random matrix theory. (arXiv:1610.08104v1 [cond-mat.stat-mech])
- On the Convergence of Stochastic Gradient MCMC Algorithms with High-Order Integrators. (arXiv:1610.06665v1 [stat.ML])
- Robust Markowitz mean-variance portfolio selection under ambiguous covariance matrix *. (arXiv:1610.06805v2 [q-fin.PM] UPDATED)
- A Projected Gradient and Constraint Linearization Method for Nonlinear Model Predictive Control. (arXiv:1610.06834v1 [math.OC])
- Stochastic Gradient MCMC with Stale Gradients. (arXiv:1610.06664v1 [stat.ML])
- H-theorem and Thermodynamics for generalized entropies that depend only on the probability. (arXiv:1610.06596v1 [cond-mat.stat-mech])
- A probabilistic model for the numerical solution of initial value problems. (arXiv:1610.05261v1 [math.NA])
- A Variational Approach to Path Estimation and Parameter Inference of Hidden Diffusion Processes; Tobias Sutter, Arnab Ganguly, Heinz Koeppl
- Interacting Brownian dynamics in a nonequilibrium particle bath. (arXiv:1610.06477v2 [cond-mat.stat-mech] UPDATED)
- The argmin process of random walks and L\'evy processes. (arXiv:1610.05869v2 [math.PR] UPDATED)
- Big Batch SGD: Automated Inference using Adaptive Batch Sizes. (arXiv:1610.05792v4 [cs.LG] UPDATED)
- Learning Quadrotor Dynamics Using Neural Network for Flight Control. (arXiv:1610.05863v1 [cs.SY])
- Learning to Learn Neural Networks. (arXiv:1610.06072v1 [cs.LG])
- Perfect sampling for nonhomogeneous Markov chains and hidden Markov models
- Establishing some order amongst exact approximations of MCMCs
- Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms. (arXiv:1610.05683v3 [stat.ML] UPDATED)
- Langevin Diffusion for Population Based Sampling with an Application in Bayesian Inference for Pharmacodynamics. (arXiv:1610.05660v2 [stat.CO] UPDATED)
- Proximal Algorithms and Temporal Differences for Large Linear Systems: Extrapolation, Approximation, and Simulation. (arXiv:1610.05427v4 [cs.NA] UPDATED)
- Markov Chain Truncation for Doubly-Intractable Inference. (arXiv:1610.05672v2 [stat.ML] UPDATED)
- Black-box Importance Sampling. (arXiv:1610.05247v1 [stat.ML])
- On the Stability of Kalman-Bucy Diffusion Processes. (arXiv:1610.04686v7 [math.OC] UPDATED)
- Automatic numerical differentiation by maximum likelihood estimation of state-space model. (arXiv:1610.04397v1 [stat.ME])
- Optimal approximation of SDEs on submanifolds: the Ito-vector and Ito-jet projections. (arXiv:1610.03887v2 [math.PR] UPDATED)
- MI-Sim: A MATLAB Package for the Numerical Analysis of Microbial Ecological Interactions. (arXiv:1610.03786v1 [q-bio.QM])
- Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification. (arXiv:1610.03774v4 [stat.ML] UPDATED)
- Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model. (arXiv:1610.03518v1 [cs.RO])
- Quantifying the accuracy of approximate diffusions and Markov chains. (arXiv:1605.06420v4 [math.ST] UPDATED)
- Dynamic patterns of overexploitation in fisheries. (arXiv:1610.03653v1 [q-bio.PE])
- Localization in High-Dimensional Monte Carlo Filtering. (arXiv:1610.03701v2 [stat.CO] UPDATED)
- On Origins of Bubbles. (arXiv:1610.03769v2 [q-fin.RM] UPDATED)
- Learning in Implicit Generative Models. (arXiv:1610.03483v4 [stat.ML] UPDATED)
- A note on the multiplicative gamma process. (arXiv:1610.03408v2 [stat.ME] UPDATED)
- Volatility Smile as Relativistic Effect. (arXiv:1610.02456v4 [q-fin.MF] UPDATED)
- The Generalized Reparameterization Gradient. (arXiv:1610.02287v3 [stat.ML] UPDATED)
- A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights; Weijie Su, Stephen Boyd, Emmanuel J. Candès
- Exploration of the (Non-)Asymptotic Bias and Variance of Stochastic Gradient Langevin Dynamics; Sebastian J. Vollmer, Konstantinos C. Zygalakis, Yee Whye Teh
- A law of the iterated logarithm for directed last passage percolation. (arXiv:1610.01828v1 [math.PR])
- Factor Models for Matrix-Valued High-Dimensional Time Series. (arXiv:1610.01889v2 [stat.ME] UPDATED)
- The argmin process of random walks, Brownian motion and L\'evy processes. (arXiv:1610.01524v2 [math.PR] UPDATED)
- $\ell_1$ Regularized Gradient Temporal-Difference Learning. (arXiv:1610.01476v1 [cs.AI])
- Perspective Functions: Proximal Calculus and Applications in High-Dimensional Statistics. (arXiv:1610.01478v3 [math.OC] UPDATED)
- A Non-generative Framework and Convex Relaxations for Unsupervised Learning. (arXiv:1610.01132v3 [cs.LG] UPDATED)
- Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite-Sum Structure. (arXiv:1610.00970v6 [stat.ML] UPDATED)
- A SMART Stochastic Algorithm for Nonconvex Optimization with Applications to Robust Machine Learning. (arXiv:1610.01101v2 [stat.ML] UPDATED)
- Large deviation principle in one-dimensional dynamics. (arXiv:1610.00822v2 [math.DS] UPDATED)
- Tuning of MCMC with Langevin, Hamiltonian, and other stochastic autoregressive proposals. (arXiv:1610.00781v1 [stat.CO])
- An Inexact Variable Metric Proximal Point Algorithm for Generic Quasi-Newton Acceleration. (arXiv:1610.00960v4 [stat.ML] UPDATED)
- Approaching nonsmooth nonconvex optimization problems through first order dynamical systems with hidden acceleration and Hessian driven damping terms. (arXiv:1610.00911v1 [math.OC])
- Square-root algorithms for maximum correntropy estimation of linear discrete-time systems in presence of non-Gaussian noise. (arXiv:1610.00257v1 [cs.SY])
- Quantifying Urban Traffic Anomalies. (arXiv:1610.00579v1 [cs.LG])
- Flint Water Crisis: Data-Driven Risk Assessment Via Residential Water Testing. (arXiv:1610.00580v1 [cs.LG])
- QInfer: Statistical inference software for quantum applications. (arXiv:1610.00336v2 [quant-ph] UPDATED)
- Penalized Ensemble Kalman Filters for High Dimensional Non-linear Systems. (arXiv:1610.00195v3 [stat.AP] UPDATED)
- Path Integral Guided Policy Search. (arXiv:1610.00529v2 [cs.RO] UPDATED)
- Volatility Inference and Return Dependencies in Stochastic Volatility Models. (arXiv:1610.00312v1 [q-fin.MF])
- Random walks in cooling random environments. (arXiv:1610.00641v2 [math.PR] UPDATED)
- Superprocesses on ultradistributions. (arXiv:1610.00481v1 [math.PR])
- Phase transitions in distributed control systems with multiplicative noise. (arXiv:1610.00653v2 [cond-mat.stat-mech] UPDATED)
- One-dimensional long-range percolation: a numerical study. (arXiv:1610.00200v1 [cond-mat.stat-mech])
- The Statistical Mechanics of Human Weight Change. (arXiv:1610.00656v1 [physics.soc-ph])
- A Primer on Coordinate Descent Algorithms. (arXiv:1610.00040v2 [math.OC] UPDATED)
- Structured Inference Networks for Nonlinear State Space Models. (arXiv:1609.09869v2 [stat.ML] UPDATED)
- Nonasymptotic analysis of adaptive and annealed Feynman-Kac particle models. (arXiv:1209.5654v2 [math.PR] UPDATED)
- A Cross Entropy based Stochastic Approximation Algorithm for Reinforcement Learning with Linear Function Approximation. (arXiv:1609.09449v1 [cs.SY])
- Nonnegative autoencoder with simplified random neural network. (arXiv:1609.08151v2 [cs.LG] UPDATED)
- Exact and Inexact Subsampled Newton Methods for Optimization. (arXiv:1609.08502v1 [math.OC])
- Fully Bayesian Estimation and Variable Selection in Partially Linear Wavelet Models. (arXiv:1609.07233v1 [stat.ME])
- An unbiased Monte Carlo estimator for derivatives. Application to CIR. (arXiv:1609.07431v3 [math.PR] UPDATED)
- Decoupled Asynchronous Proximal Stochastic Gradient Descent with Variance Reduction. (arXiv:1609.06804v2 [cs.LG] UPDATED)
- Hawkes Processes with Stochastic Excitations. (arXiv:1609.06831v1 [cs.LG])
- Exact Sampling from Determinantal Point Processes. (arXiv:1609.06840v2 [cs.LG] UPDATED)
- (Bandit) Convex Optimization with Biased Noisy Gradient Oracles. (arXiv:1609.07087v2 [cs.LG] UPDATED)
- Generalized Kalman Smoothing: Modeling and Algorithms. (arXiv:1609.06369v2 [math.OC] UPDATED)
- Optimal and scalable methods to approximate the solutions of large-scale Bayesian problems: Theory and application to atmospheric inversions and data assimilation. (arXiv:1609.06431v1 [physics.data-an])
- Multilevel Monte Carlo for Scalable Bayesian Computations. (arXiv:1609.06144v1 [stat.ML])
- Alternating Optimisation and Quadrature for Robust Control. (arXiv:1605.07496v3 [cs.LG] UPDATED)
- Fast nonnegative least squares through flexible Krylov subspaces. (arXiv:1511.06269v1 [math.NA])
- Lp and almost sure rates of convergence of averaged stochastic gradient algorithms: locally strongly convex objective. (arXiv:1609.05479v3 [math.ST] UPDATED)
- Stochastic Matrix Factorization. (arXiv:1609.05772v1 [stat.ML])
- Principled Option Learning in Markov Decision Processes. (arXiv:1609.05524v3 [cs.LG] UPDATED)
- SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. (arXiv:1609.05473v6 [cs.LG] UPDATED)
- Asymptotic properties of maximum likelihood estimator for the growth rate for a jump-type CIR process based on continuous time observations. (arXiv:1609.05865v4 [math.ST] UPDATED)
- From quantum mechanics to finance: Microfoundations for jumps, spikes and high volatility phases in diffusion price processes. (arXiv:1609.05286v2 [q-fin.TR] UPDATED)
- Improving landscape inference by integrating heterogeneous data in the inverse Ising problem. (arXiv:1609.05692v2 [q-bio.QM] UPDATED)
- Analysis and optimization of weighted ensemble sampling. (arXiv:1609.05887v1 [math.NA])
- Randomized dual proximal gradient for large-scale distributed optimization. (arXiv:1609.05713v1 [math.OC])
- Learning Low-Complexity Autoregressive Models via Proximal Alternating Minimization. (arXiv:1609.05341v2 [math.OC] UPDATED)
- Iteration-complexity of gradient, subgradient and proximal point methods on Riemannian manifolds. (arXiv:1609.04869v1 [math.NA])
- A Differentiable Alternative to the Lasso Penalty. (arXiv:1609.04985v1 [stat.ME])
- Gradient Descent Learns Linear Dynamical Systems. (arXiv:1609.05191v2 [cs.LG] UPDATED)
- Linear-quadratic optimal control under non-Markovian switching. (arXiv:1609.04977v1 [math.OC])
- On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima. (arXiv:1609.04836v2 [cs.LG] UPDATED)
- An overview of gradient descent optimization algorithms. (arXiv:1609.04747v2 [cs.LG] UPDATED)
- Formalizing Neurath's Ship: Approximate Algorithms for Online Causal Learning. (arXiv:1609.04212v3 [cs.LG] UPDATED)
- Relativistic Monte Carlo. (arXiv:1609.04388v1 [stat.ML])
- Gray-box inference for structured Gaussian process models. (arXiv:1609.04289v1 [stat.ML])
- Private Topic Modeling. (arXiv:1609.04120v3 [stat.ML] UPDATED)
- Tsallis Regularized Optimal Transport and Ecological Inference. (arXiv:1609.04495v1 [cs.LG])
- Bayesian Reinforcement Learning: A Survey. (arXiv:1609.04436v1 [cs.AI])
- Entropy and efficiency of the ETF market. (arXiv:1609.04199v1 [q-fin.ST])
- Geometric Ergodicity of Gibbs Samplers in Bayesian Penalized Regression Models. (arXiv:1609.04057v2 [math.ST] UPDATED)
- Stochastic Heavy Ball. (arXiv:1609.04228v2 [math.ST] UPDATED)
- A New Architecture for Optimization Modeling Frameworks. (arXiv:1609.03488v2 [math.OC] UPDATED)
- CompAdaGrad: A Compressed, Complementary, Computationally-Efficient Adaptive Gradient Method. (arXiv:1609.03319v2 [cs.LG] UPDATED)
- High-Dimension, Low Sample Size Asymptotics of Canonical Correlation Analysis. (arXiv:1609.02992v2 [math.ST] UPDATED)
- Puzzles in modern biology. I. Male sterility, failure reveals design. (arXiv:1609.02956v1 [q-bio.PE])
- Puzzles in modern biology. II. Language, cancer and the recursive processes of evolutionary innovation. (arXiv:1609.02959v1 [q-bio.PE])
- Comment on "Why does deep and cheap learning work so well?" [arXiv:1608.08225]. (arXiv:1609.03541v1 [cond-mat.dis-nn])
- Nonparametric risk bounds for time-series forecasting. (arXiv:1212.0463v2 [math.ST] UPDATED)
- The Solution to Science's Replication Crisis. (arXiv:1609.03223v1 [q-fin.EC])
- Energy-based Generative Adversarial Network. (arXiv:1609.03126v4 [cs.LG] UPDATED)
- An unexpected encounter with Cauchy and Lévy
- Estimation in nonlinear regression with Harris recurrent Markov chains
- On the Relationship between Online Gaussian Process Regression and Kernel Least Mean Squares Algorithms. (arXiv:1609.03164v1 [stat.ML])
- Quasi-stationary Monte Carlo and the ScaLE Algorithm. (arXiv:1609.03436v3 [stat.ME] UPDATED)
- Less than a Single Pass: Stochastically Controlled Stochastic Gradient Method. (arXiv:1609.03261v3 [math.OC] UPDATED)
- Importance sampling of heavy-tailed iterated random functions. (arXiv:1609.03182v1 [math.PR])
- WaveNet: A Generative Model for Raw Audio. (arXiv:1609.03499v2 [cs.SD] UPDATED)
- Predicting the future relevance of research institutions - The winning solution of the KDD Cup 2016. (arXiv:1609.02728v1 [cs.LG])
- Importance sampling type estimators based on approximate marginal MCMC. (arXiv:1609.02541v6 [stat.CO] UPDATED)
- Discrete Variational Autoencoders. (arXiv:1609.02200v2 [stat.ML] UPDATED)
- An improved uncertainty decoding scheme with weighted samples for DNN-HMM hybrid systems. (arXiv:1609.02082v1 [cs.LG])
- Random matrices meet machine learning: a large dimensional analysis of LS-SVM. (arXiv:1609.02020v2 [stat.ML] UPDATED)
- The more you test, the more you find: Smallest P-values become increasingly enriched with real findings as more tests are conducted. (arXiv:1609.01788v1 [q-bio.GN])
- Tractable Bayesian variable selection: beyond normality. (arXiv:1609.01708v3 [stat.ME] UPDATED)
- Hierarchical Multiscale Recurrent Neural Networks. (arXiv:1609.01704v7 [cs.LG] UPDATED)
- Robust Particle Filter by Dynamic Averaging of Multiple Noise Models. (arXiv:1609.01336v2 [stat.ME] UPDATED)
- Multivariate Dependence Beyond Shannon Information. (arXiv:1609.01233v2 [cs.IT] UPDATED)
- Multivariate Mixed Tempered Stable Distribution. (arXiv:1609.00926v3 [q-fin.ST] UPDATED)
- Stochastic Bouncy Particle Sampler. (arXiv:1609.00770v3 [stat.CO] UPDATED)
- A Unified Convergence Analysis of the Multiplicative Update Algorithm for Regularized Nonnegative Matrix Factorization. (arXiv:1609.00951v3 [math.OC] UPDATED)
- The maximizing set of the asymptotic normalized log-likelihood for partially observed Markov chains
- State Estimation for Piecewise Affine State-Space Models. (arXiv:1609.00365v1 [cs.SY])
- Optimal State Estimation with Measurements Corrupted by Laplace Noise. (arXiv:1609.00115v1 [math.OC])
- Minimizing Quadratic Functions in Constant Time. (arXiv:1608.07179v1 [cs.LG] CROSS LISTED)
- Ten Steps of EM Suffice for Mixtures of Two Gaussians. (arXiv:1609.00368v5 [stat.ML] UPDATED)
- One-Minute Derivation of The Conjugate Gradient Algorithm. (arXiv:1608.08691v1 [cs.DS])
- Incremental Nonlinear System Identification and Adaptive Particle Filtering Using Gaussian Process. (arXiv:1608.08362v1 [stat.ML])
- On Concentration Properties of Partially Observed Chaotic Systems. (arXiv:1608.08348v2 [math.ST] UPDATED)
- Non-stationary phase of the MALA algorithm. (arXiv:1608.08379v2 [math.PR] UPDATED)
- Multilevel ensemble Kalman filtering for spatially extended models. (arXiv:1608.08558v1 [math.NA])
- Better stability with measurement errors. (arXiv:1608.08461v1 [cond-mat.soft])
- Iterative Mechanisms for Electricity Markets. (arXiv:1608.08987v3 [math.OC] UPDATED)
- Why does deep and cheap learning work so well?. (arXiv:1608.08225v4 [cond-mat.dis-nn] UPDATED)
- Importance Sampling and Necessary Sample Size: an Information Theory Approach. (arXiv:1608.08814v1 [stat.CO])
- Online state and parameter estimation in Dynamic Generalised Linear Models. (arXiv:1608.08666v1 [stat.CO])
- Approximation of Continuous-Time Infinite-Horizon Optimal Control Problems Arising in Model Predictive Control - Supplementary Notes. (arXiv:1608.08823v1 [cs.SY])
- Reconstructing parameters of spreading models from partial observations. (arXiv:1608.08698v1 [cs.SI])
- Statistical physics of vaccination. (arXiv:1608.09010v3 [physics.soc-ph] UPDATED)
- Why does deep and cheap learning work so well?. (arXiv:1608.08225v4 [cond-mat.dis-nn] UPDATED)
- On the fastest finite Markov processes. (arXiv:1608.07958v1 [math.PR])
- Collaborative Filtering with Recurrent Neural Networks. (arXiv:1608.07400v2 [cs.IR] UPDATED)
- Skew-t Filter and Smoother with Improved Covariance Matrix Approximation. (arXiv:1608.07435v1 [cs.SY])
- Posterior consistency for partially observed Markov models. (arXiv:1608.06851v2 [math.ST] UPDATED)
- Towards Bayesian Deep Learning: A Framework and Some Existing Methods. (arXiv:1608.06884v2 [stat.ML] UPDATED)
- Identifying stochastic oscillations in single cell live imaging time series using Gaussian processes. (arXiv:1608.06476v1 [q-bio.QM])
- The discriminative Kalman filter for nonlinear and non-Gaussian sequential Bayesian filtering. (arXiv:1608.06622v2 [stat.ML] UPDATED)
- LFADS - Latent Factor Analysis via Dynamical Systems. (arXiv:1608.06315v1 [cs.LG])
- Approximation and inference methods for stochastic biochemical kinetics - a tutorial review. (arXiv:1608.06582v2 [q-bio.QM] UPDATED)
- Decoupled Neural Interfaces using Synthetic Gradients. (arXiv:1608.05343v2 [cs.LG] UPDATED)
- Time scales and species coexistence in chaotic flows. (arXiv:1608.05166v1 [q-bio.PE])
- Parameter Learning for Log-supermodular Distributions. (arXiv:1608.05258v1 [stat.ML])
- Nonuniform Berry-Esseen bounds for martingales with applications to statistical estimation. (arXiv:1608.05217v1 [math.PR])
- The multi-level Monte Carlo method for simulations of turbulent flows. (arXiv:1608.05338v1 [stat.CO])
- Filling the gaps smoothly. (arXiv:1608.05145v1 [q-fin.CP])
- Developing a statistically powerful measure for phylogenetic tree inference using phylogenetic identities and Markov invariants. (arXiv:1608.04761v1 [q-bio.QM])
- Sustainable theory of a logistic model - Fisher Information approach. (arXiv:1608.04987v1 [q-bio.PE])
- Lecture Notes on Spectral Graph Methods. (arXiv:1608.04845v1 [cs.DS])
- Linear inverse problems for Markov processes and their regularisation. (arXiv:1608.04918v2 [math.PR] UPDATED)
- Noise and Function. (arXiv:1608.04824v1 [physics.bio-ph])
- Reinforcement Learning algorithms for regret minimization in structured Markov Decision Processes. (arXiv:1608.04929v1 [cs.LG])
- Variational Gaussian Process Auto-Encoder for Ordinal Prediction of Facial Action Units. (arXiv:1608.04664v2 [stat.ML] UPDATED)
- Estimation of the parameters of the Ornstein-Uhlenbeck's stochastic process. (arXiv:1608.04507v3 [math.ST] UPDATED)
- Optimal importance sampling for L\'evy Processes. (arXiv:1608.04621v1 [q-fin.RM])
- Online Nonnegative Matrix Factorization with General Divergences. (arXiv:1608.00075v2 [stat.ML] CROSS LISTED)
- Fully Parallel Particle Learning for GPGPUs and Other Parallel Devices. (arXiv:1212.1639v2 [stat.CO] UPDATED)
- Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-\L{}ojasiewicz Condition. (arXiv:1608.04636v4 [cs.LG] UPDATED)
- Lecture Notes on Randomized Linear Algebra. (arXiv:1608.04481v1 [cs.DS])
- SGDR: Stochastic Gradient Descent with Warm Restarts. (arXiv:1608.03983v5 [cs.LG] UPDATED)
- Stacked Approximated Regression Machine: A Simple Deep Learning Approach. (arXiv:1608.04062v2 [cs.LG] UPDATED)
- Approach of complexity in nature: Entropic nonuniqueness. (arXiv:1608.03599v1 [cond-mat.stat-mech])
- Demographic noise can reverse the direction of deterministic selection. (arXiv:1608.03471v1 [q-bio.PE])
- Geometric-Algebra Adaptive Filters. (arXiv:1608.03450v2 [math.OC] UPDATED)
- Towards Representation Learning with Tractable Probabilistic Models. (arXiv:1608.02341v1 [cs.LG] CROSS LISTED)
- Some Contributions to Sequential Monte Carlo Methods for Option Pricing. (arXiv:1608.03352v1 [stat.CO])
- Revisiting Causality Inference in Memory-less Transition Networks. (arXiv:1608.02658v3 [stat.ML] UPDATED)
- Revisiting Sub-sampled Newton Methods. (arXiv:1608.02875v1 [math.OC])
- Another example of duality between game-theoretic and measure-theoretic probability. (arXiv:1608.02706v1 [q-fin.MF])
- Joint distributions of partial and global maxima of a Brownian Bridge. (arXiv:1608.02161v1 [cond-mat.stat-mech])
- Rare events in stochastic populations under bursty reproduction. (arXiv:1608.02083v1 [cond-mat.stat-mech])
- Interdependency of Transmission and Distribution Pricing. (arXiv:1608.02316v1 [cs.SY])
- A branching random walk among disasters. (arXiv:1608.02440v4 [math.PR] UPDATED)
- Transition probabilities for degenerate diffusions arising in population genetics. (arXiv:1608.02119v2 [math.AP] UPDATED)
- Towards Representation Learning with Tractable Probabilistic Models. (arXiv:1608.02341v1 [cs.LG])
- Robust High-Dimensional Linear Regression. (arXiv:1608.02257v2 [cs.LG] UPDATED)
- Online Adaptation of Deep Architectures with Reinforcement Learning. (arXiv:1608.02292v1 [cs.LG])
- Serendipity and strategy in rapid innovation. (arXiv:1608.01900v4 [physics.soc-ph] UPDATED)
- R\'enyi divergence and the central limit theorem. (arXiv:1608.01805v1 [math.PR])
- Iterative importance sampling algorithms for parameter estimation. (arXiv:1608.01958v2 [math.NA] UPDATED)
- A central limit theorem for a new statistic on permutations. (arXiv:1608.01666v4 [math.PR] UPDATED)
- Recurrence and transience of contractive autoregressive processes and related Markov chains. (arXiv:1608.01394v2 [math.PR] UPDATED)
- Stochastic thermodynamics based on incomplete information: Generalized Jarzynski equality with measurement errors with or without feedback. (arXiv:1608.01574v3 [cond-mat.stat-mech] UPDATED)
- Learning Online Alignments with Continuous Rewards Policy Gradient. (arXiv:1608.01281v1 [cs.LG])
- Hard Threshold Least Mean Squares Algorithm. (arXiv:1608.01128v1 [cs.SY])
- Exponential Family Embeddings. (arXiv:1608.00778v2 [stat.ML] UPDATED)
- Can we trust the bootstrap in high-dimension?. (arXiv:1608.00696v1 [stat.ME])
- Oracle Inequalities for High-dimensional Prediction. (arXiv:1608.00624v2 [math.ST] UPDATED)
- Online Nonnegative Matrix Factorization with General Divergences. (arXiv:1608.00075v2 [stat.ML] UPDATED)
- Particle representations for stochastic partial differential equations with boundary conditions. (arXiv:1607.08909v2 [math.PR] UPDATED)
- Particle Filtering with Invertible Particle Flow. (arXiv:1607.08799v5 [stat.ME] UPDATED)
- Inverse problem for multi-body interaction of nonlinear waves. (arXiv:1607.08549v2 [cond-mat.dis-nn] UPDATED)
- Free boundary problems in PDEs and particle systems. (arXiv:1607.08124v1 [math.PR])
- Algorithmic statistics: forty years later. (arXiv:1607.08077v3 [cs.CC] UPDATED)
- The Ornstein-Uhlenbeck process with migration: evolution with interactions. (arXiv:1607.07970v1 [q-bio.PE])
- Lecture notes on stochastic models in systems biology. (arXiv:1607.07806v1 [q-bio.QM])
- Disease Mapping with Generative Models. (arXiv:1607.07002v1 [stat.ME])
- Statistical mechanics for complex systems: On the structure of $q$-triplets. (arXiv:1607.07097v1 [cond-mat.stat-mech])
- Averaged vs. quenched large deviations and entropy for random walk in a dynamic random environment. (arXiv:1607.07000v1 [math.PR])
- Asymptotic Properties of Approximate Bayesian Computation. (arXiv:1607.06903v4 [stat.ME] UPDATED)
- Introduction to Stochastic Differential Equations (SDEs) for Finance. (arXiv:1504.05309v14 [q-fin.MF] UPDATED)
- Metastability in an open quantum Ising model. (arXiv:1607.06780v1 [cond-mat.stat-mech])
- An approximation method for the optimization of $p$-th moment of $\mathbb{R}^n$-valued random variable. (arXiv:1607.06557v1 [math.OC])
- Joining and splitting models with Markov melding. (arXiv:1607.06779v3 [stat.ME] UPDATED)
- Sample Variance in Free Probability. (arXiv:1607.06586v4 [math.OA] UPDATED)
- Statistical Entropy of Open Quantum Systems. (arXiv:1607.05800v3 [cond-mat.stat-mech] UPDATED)
- Multidimensional Dynamic Pricing for Welfare Maximization. (arXiv:1607.05397v3 [cs.DS] UPDATED)
- The Kalman Filter: a didactical overview. (arXiv:1607.05590v1 [cs.SY])
- Uncertainty Quantification for PDEs with Anisotropic Random Diffusion. (arXiv:1607.05584v1 [math.NA])
- On the estimation of the mean of a random vector. (arXiv:1607.05421v1 [math.ST])
- Variational calculus for diffusions. (arXiv:1607.05488v3 [math.PR] UPDATED)
- Invariants of Fokker-Planck equations. (arXiv:1607.05531v2 [cond-mat.stat-mech] UPDATED)
- Scaling of Information in Turbulence. (arXiv:1607.05511v2 [cond-mat.stat-mech] UPDATED)
- Inferring solutions of differential equations using noisy multi-fidelity data. (arXiv:1607.04805v1 [cs.LG])
- Stochastic Recursive Inclusions with Non-Additive Iterate-Dependent Markov Noise. (arXiv:1607.04735v1 [cs.SY])
- False confidence, non-additive beliefs, and valid statistical inference. (arXiv:1607.05051v3 [math.ST] UPDATED)
- Piecewise convexity of artificial neural networks. (arXiv:1607.04917v2 [cs.LG] UPDATED)
- A form of multivariate Pareto distribution with applications to financial risk measurement. (arXiv:1607.04737v1 [q-fin.RM])
- A note on dynamical models on random graphs and Fokker-Planck equations. (arXiv:1607.05224v2 [math.PR] UPDATED)
- Asynchronous Parallel Algorithms for Nonconvex Optimization. (arXiv:1607.04818v3 [math.OC] UPDATED)
- A note on the asymptotic normality of sums of extreme values. (arXiv:1607.04848v1 [stat.ME])
- Imitation Learning with Recurrent Neural Networks. (arXiv:1607.05241v1 [cs.CL])
- Guided Policy Search as Approximate Mirror Descent. (arXiv:1607.04614v1 [cs.LG])
- Optimally-Weighted Herding is Bayesian Quadrature. (arXiv:1204.1664v3 [stat.ML] UPDATED)
- Parametric PDEs: Sparse or Low-Rank Approximations?. (arXiv:1607.04444v1 [math.NA])
- Lotka-Volterra with randomly fluctuating environments: a full description. (arXiv:1607.04395v1 [math.PR])
- Generalized hybrid iterative methods for large-scale Bayesian inverse problems. (arXiv:1607.03943v2 [math.NA] UPDATED)
- Cluster Sampling Filters for Non-Gaussian Data Assimilation. (arXiv:1607.03592v1 [stat.CO])
- San Francisco Crime Classification. (arXiv:1607.03626v1 [cs.LG])
- Kernel Density Estimation for Dynamical Systems. (arXiv:1607.03792v1 [stat.ML])
- Analysis and Probability on Infinite-Dimensional Spaces. (arXiv:1607.03591v2 [math.PR] UPDATED)
- Probabilistic solvers for partial differential equations. (arXiv:1607.03526v1 [math.PR])
- From Dependence to Causation. (arXiv:1607.03300v1 [stat.ML])
- Populations can be essential in tracking dynamic optima. (arXiv:1607.03317v1 [cs.NE])
- Recurrent Highway Networks. (arXiv:1607.03474v5 [cs.LG] UPDATED)
- Multilevel Picard iterations for solving smooth semilinear parabolic heat equations. (arXiv:1607.03295v4 [math.NA] UPDATED)
- A calculus proof of the Cram\'er-Wold theorem. (arXiv:1607.03206v2 [math.PR] UPDATED)
- Learning in Quantum Control: High-Dimensional Global Optimization for Noisy Quantum Dynamics. (arXiv:1607.03428v3 [cs.LG] UPDATED)
- Geometric inference for general high-dimensional linear inverse problems
- Information geometry approach to parameter estimation in Markov chains
- Nonparametric stochastic approximation with large step-sizes
- How to use the functional empirical process for deriving asymptotic laws for functions of the sample. (arXiv:1607.02745v3 [stat.ME] UPDATED)
- Parallel local approximation MCMC for expensive models. (arXiv:1607.02788v2 [stat.CO] UPDATED)
- Improving Population Monte Carlo: Alternative Weighting and Resampling Schemes. (arXiv:1607.02758v1 [stat.CO])
- Magnetic Hamiltonian Monte Carlo. (arXiv:1607.02738v2 [stat.ML] UPDATED)
- Pseudo-Marginal Hamiltonian Monte Carlo. (arXiv:1607.02516v2 [stat.ME] UPDATED)
- Kernel-based methods for bandit convex optimization. (arXiv:1607.03084v1 [cs.LG] CROSS LISTED)
- Proximal Quasi-Newton Methods for Convex Optimization. (arXiv:1607.03081v1 [cs.NA])
- Exponential ergodicity for a class of non-Markovian stochastic processes. (arXiv:1607.02252v1 [math.PR])
- Log-Linear RNNs: Towards Recurrent Neural Networks with Flexible Prior Knowledge. (arXiv:1607.02467v2 [cs.AI] UPDATED)
- Bayesian inverse problems with $l_1$ priors: a Randomize-then-Optimize approach. (arXiv:1607.01904v2 [stat.CO] UPDATED)
- Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent. (arXiv:1607.01981v2 [stat.ML] UPDATED)
- Generalizations of the Sherman-Morrison-Woodbury formula involving the Schur complement. (arXiv:1607.01579v1 [math.NA])
- Derivative pricing as a transport problem: MPDATA solutions to Black-Scholes-type equations. (arXiv:1607.01751v5 [q-fin.CP] UPDATED)
- Machine Learning for Antimicrobial Resistance. (arXiv:1607.01224v1 [stat.ML])
- Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization. (arXiv:1607.01231v4 [math.OC] UPDATED)
- Estimation of anthracnose dynamics by nonlinear filtering. (arXiv:1607.00674v1 [stat.AP])
- Why is Posterior Sampling Better than Optimism for Reinforcement Learning?. (arXiv:1607.00215v3 [stat.ML] UPDATED)
- Parameter Estimation via Conditional Expectation --- A Bayesian Inversion. (arXiv:1606.09440v1 [math.PR])
- Proximity Operators of Discrete Information Divergences. (arXiv:1606.09552v2 [cs.IT] UPDATED)
- Dimension-Free Iteration Complexity of Finite Sum Optimization Problems. (arXiv:1606.09333v1 [math.OC])
- A Model Explanation System: Latest Updates and Extensions. (arXiv:1606.09517v1 [stat.ML])
- Performance of Ensemble Kalman filters in large dimensions. (arXiv:1606.09321v2 [math.PR] UPDATED)
- Decision making via semi-supervised machine learning techniques. (arXiv:1606.09022v1 [cs.LG])
- Variational Geometric Approach to Generalized Differential and Fenchel Conjugate Calculi in Convex Analysis. (arXiv:1606.08749v3 [math.OC] UPDATED)
- Approximate Smoothing and Parameter Estimation in High-Dimensional State-Space Models. (arXiv:1606.08650v4 [stat.CO] UPDATED)
- Dynamic dependence networks: Financial time series forecasting and portfolio decisions (with discussion). (arXiv:1606.08339v1 [stat.ME])
- Probabilistic Forecasting and Simulation of Electricity Markets via Online Dictionary Learning. (arXiv:1606.07855v1 [stat.AP])
- An agent behavior based model for diffusion price processes with application to phase transition and oscillations. (arXiv:1606.08269v1 [math.PR])
- On the Stability and the Exponential Concentration of Extended Kalman-Bucy filters. (arXiv:1606.08251v2 [math.PR] UPDATED)
- Framework for state and unknown input estimation of linear time-varying systems. (arXiv:1606.08090v1 [cs.SY])
- Simultaneous Mode, Input and State Estimation for Switched Linear Stochastic Systems. (arXiv:1606.08323v1 [math.OC])
- Prior knowledge and Markov parameters of linear time-invariant models. (arXiv:1606.08422v1 [cs.SY])
- Optimal adaptation for early stopping in statistical inverse problems. (arXiv:1606.07702v2 [math.ST] UPDATED)
- Probabilistic Models for Integration Error in the Assessment of Functional Cardiac Models. (arXiv:1606.06841v5 [stat.ME] UPDATED)
- Ancestral Causal Inference. (arXiv:1606.07035v3 [cs.LG] UPDATED)
- Stochastic Runge-Kutta Software Package for Stochastic Differential Equations. (arXiv:1606.06604v1 [physics.comp-ph])
- Geometric MCMC for Infinite-Dimensional Inverse Problems. (arXiv:1606.06351v2 [stat.CO] UPDATED)
- Distribution-Dependent SDEs for Landau Type Equations. (arXiv:1606.05843v3 [math.PR] UPDATED)
- On the prediction loss of the lasso in the partially labeled setting. (arXiv:1606.06179v2 [math.ST] UPDATED)
- Primal-dual extragradient methods for nonlinear nonsmooth PDE-constrained optimization. (arXiv:1606.06219v3 [math.OC] UPDATED)
- Stochastic MPC with Offline Uncertainty Sampling. (arXiv:1606.06056v1 [cs.SY])
- Deviations from universality in the fluctuation behavior of a heterogeneous complex system reveal intrinsic properties of components: The case of the international currency market. (arXiv:1606.06111v2 [q-fin.ST] UPDATED)
- Vibrato and automatic differentiation for high order derivatives and sensitivities of financial options. (arXiv:1606.06143v1 [q-fin.CP])
- Scalable Information Inequalities for Uncertainty Quantification. (arXiv:1605.04184v1 [cs.IT] CROSS LISTED)
- Rare event computation in deterministic chaotic systems using genealogical particle analysis. (arXiv:1511.02703v2 [cond-mat.stat-mech] UPDATED)
- Tutorial on Variational Autoencoders. (arXiv:1606.05908v3 [stat.ML] UPDATED)
- An Empirical Comparison of Sampling Quality Metrics: A Case Study for Bayesian Nonnegative Matrix Factorization. (arXiv:1606.06250v1 [cs.LG])
- Structured Stochastic Linear Bandits. (arXiv:1606.05693v1 [stat.ML])
- On spurious regressions with trending variables. (arXiv:1606.05049v1 [stat.ME])
- Unsupervised Risk Estimation Using Only Conditional Independence Structure. (arXiv:1606.05313v1 [cs.LG])
- Inversion Copulas from Nonlinear State Space Models. (arXiv:1606.05022v2 [stat.ME] UPDATED)
- Statistical Mechanics of Avalanches. (arXiv:1606.05066v2 [cond-mat.stat-mech] UPDATED)
- Learning Optimal Interventions. (arXiv:1606.05027v2 [stat.ML] UPDATED)
- Optimization Methods for Large-Scale Machine Learning. (arXiv:1606.04838v3 [stat.ML] UPDATED)
- Safe Exploration in Finite Markov Decision Processes with Gaussian Processes. (arXiv:1606.04753v2 [cs.LG] UPDATED)
- Understanding Probabilistic Sparse Gaussian Process Approximations. (arXiv:1606.04820v2 [stat.ML] UPDATED)
- Optimization Methods for Large-Scale Machine Learning. (arXiv:1606.04838v3 [stat.ML] UPDATED)
- Beyond universality in random matrix theory
- Approximations of stochastic partial differential equations
- Recursive nonlinear-system identification using latent variables. (arXiv:1606.04366v3 [stat.ML] UPDATED)
- Learning to learn by gradient descent by gradient descent. (arXiv:1606.04474v2 [cs.NE] UPDATED)
- Self-similar aftershock rates. (arXiv:1606.03958v1 [cond-mat.stat-mech])
- The Sound of Silence: equilibrium filtering and optimal censoring in financial markets. (arXiv:1606.04039v1 [q-fin.MF])
- Non parametric estimation for random walks in random environment. (arXiv:1606.03848v1 [math.ST])
- Robust Probabilistic Modeling with Bayesian Data Reweighting. (arXiv:1606.03860v3 [stat.ML] UPDATED)
- Deep Directed Generative Models with Energy-Based Probability Estimation. (arXiv:1606.03439v1 [cs.LG])
- Extended Gauss-Newton and ADMM-Gauss-Newton Algorithms for Low-Rank Matrix Optimization. (arXiv:1606.03358v3 [math.OC] UPDATED)
- Causal Bandits: Learning Good Interventions via Causal Inference. (arXiv:1606.03203v1 [stat.ML])
- Conditional Extreme Value Models: Fallacies and Pitfalls. (arXiv:1606.02927v2 [math.PR] UPDATED)
- On Projected Stochastic Gradient Descent Algorithm with Weighted Averaging for Least Squares Regression. (arXiv:1606.03000v1 [cs.IT])
- Learning Thermodynamics with Boltzmann Machines. (arXiv:1606.02718v1 [cond-mat.stat-mech])
- Expectile Matrix Factorization for Skewed Data Analysis. (arXiv:1606.01984v3 [stat.ML] UPDATED)
- The Generalized Langevin Equation and the Parameterization from Data. (arXiv:1606.02596v1 [physics.comp-ph])
- A Locally Adaptive Normal Distribution. (arXiv:1606.02518v3 [stat.ML] UPDATED)
- Classifying Options for Deep Reinforcement Learning. (arXiv:1604.08153v3 [cs.LG] UPDATED)
- Specific Differential Entropy Rate Estimation for Continuous-Valued Time Series. (arXiv:1606.02615v1 [cs.LG])
- A statistical inference course based on p-values. (arXiv:1606.02352v1 [stat.OT])
- Probabilistic counterparts of nonlinear parabolic PDE systems. (arXiv:1606.02525v1 [math.PR])
- Deep Successor Reinforcement Learning. (arXiv:1606.02396v1 [stat.ML])
- Reducing the error of Monte Carlo Algorithms by Learning Control Variates. (arXiv:1606.02261v1 [stat.ML])
- Minimum-Information LQG Control - Part II: Retentive Controllers. (arXiv:1606.01947v1 [cs.SY])
- Minimum-Information LQG Control - Part I: Memoryless Controllers. (arXiv:1606.01946v1 [cs.SY])
- Towards a Neural Statistician. (arXiv:1606.02185v2 [stat.ML] UPDATED)
- Bayesian Policy Gradient and Actor-Critic Algorithms; Mohammad Ghavamzadeh, Yaakov Engel, Michal Valko
- Regularizing Bayesian Predictive Regressions. (arXiv:1606.01701v4 [stat.ME] UPDATED)
- Density estimation using Real NVP. (arXiv:1605.08803v3 [cs.LG] UPDATED)
- Envelope Functions: Unifications and Further Properties. (arXiv:1606.01327v3 [math.OC] UPDATED)
- The exact information-based complexity of smooth convex minimization. (arXiv:1606.01424v1 [math.OC])
- Relaxation of the EM Algorithm via Quantum Annealing. (arXiv:1606.01484v1 [stat.ML])
- Learning to Optimize. (arXiv:1606.01885v1 [cs.LG])
- End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning. (arXiv:1606.01269v1 [cs.CL])
- Provable non-convex projected gradient descent for a class of constrained matrix optimization problems. (arXiv:1606.01316v1 [stat.ML])
- Penalized Barycenters in the Wasserstein Space. (arXiv:1606.01025v5 [math.ST] UPDATED)
- Gaussian Processes for Music Audio Modelling and Content Analysis. (arXiv:1606.01039v2 [stat.ML] UPDATED)
- Smooth Imitation Learning for Online Sequence Prediction. (arXiv:1606.00968v1 [cs.LG])
- Distributed stochastic optimization via matrix exponential learning. (arXiv:1606.01190v1 [cs.IT])
- On Coupling Particle Filter Trajectories. (arXiv:1606.01016v2 [stat.CO] UPDATED)
- Coupling of Particle Filters. (arXiv:1606.01156v2 [stat.ME] UPDATED)
- Global warming: Temperature estimation in annealers. (arXiv:1606.00919v4 [quant-ph] UPDATED)
- Detecting Serial Dependence in Binomial Time Series II: Observation Driven Models. (arXiv:1606.00984v1 [math.ST])
- Testing for Serial Dependence in Binomial Time Series I: Parameter Driven Models. (arXiv:1606.00983v1 [math.ST])
- Optimality Conditions for Inventory Control. (arXiv:1606.00957v1 [math.OC])
- Property-driven State-Space Coarsening for Continuous Time Markov Chains. (arXiv:1606.01111v1 [cs.SY])
- When Does a Boltzmannian Equilibrium Exist?. (arXiv:1606.01202v1 [cond-mat.stat-mech])
- Distributed Cooperative Decision-Making in Multiarmed Bandits: Frequentist and Bayesian Algorithms. (arXiv:1606.00911v3 [cs.SY] UPDATED)
- Preconditioned Iterative Solves in Model Reduction of Second Order Linear Dynamical Systems. (arXiv:1606.01216v1 [math.NA])
- Quantifying the probable approximation error of probabilistic inference programs. (arXiv:1606.00068v1 [cs.AI])
- ctsmr - Continuous Time Stochastic Modeling in R. (arXiv:1606.00242v1 [stat.CO])
- Convergence of the gradient method for ill-posed problems. (arXiv:1606.00274v1 [math.NA])
- The regularized tau estimator: A robust and efficient solution to ill-posed linear inverse problems. (arXiv:1606.00812v1 [stat.ME])
- High Dimensional Multivariate Regression and Precision Matrix Estimation via Nonconvex Optimization. (arXiv:1606.00832v1 [stat.ML])
- Variance-Reduced Proximal Stochastic Gradient Descent for Non-convex Composite optimization. (arXiv:1606.00602v2 [stat.ML] UPDATED)
- Uncertainty and filtering of hidden Markov models in discrete time. (arXiv:1606.00229v4 [stat.ME] UPDATED)
- Discovering Phase Transitions with Unsupervised Learning. (arXiv:1606.00318v2 [cond-mat.stat-mech] UPDATED)
- Mutations as Levy flights. (arXiv:1605.09697v1 [q-bio.PE])
- Extreme Stochastic Variational Inference: Distributed and Asynchronous. (arXiv:1605.09499v9 [stat.ML] UPDATED)
- Parallel Markov Chain Monte Carlo via Spectral Clustering. (arXiv:1605.09454v1 [stat.ME])
- Minding the Gaps for Block Frank-Wolfe Optimization of Structured SVMs. (arXiv:1605.09346v1 [cs.LG] CROSS LISTED)
- Kernel Mean Embedding of Distributions: A Review and Beyond. (arXiv:1605.09522v4 [stat.ML] UPDATED)
- Practical Kernel-Based Reinforcement Learning; André M.S. Barreto, Doina Precup, Joelle Pineau
- Optimal Rates for Multi-pass Stochastic Gradient Methods. (arXiv:1605.08882v3 [cs.LG] UPDATED)
- Density estimation using Real NVP. (arXiv:1605.08803v3 [cs.LG] UPDATED)
- Model-Free Imitation Learning with Policy Optimization. (arXiv:1605.08478v1 [cs.LG])
- Stochastic Optimization for Large-scale Optimal Transport. (arXiv:1605.08527v1 [math.OC])
- Linear dynamical neural population models through nonlinear embeddings. (arXiv:1605.08454v2 [q-bio.NC] UPDATED)
- Maximum information entropy principle and the interpretation of probabilities in statistical mechanics - a short review. (arXiv:1605.08703v1 [cond-mat.stat-mech])
- Merging MCMC Subposteriors through Gaussian-Process Approximations. (arXiv:1605.08576v2 [stat.CO] UPDATED)
- On the two-filter approximations of marginal smoothing distributions in general state space models. (arXiv:1605.08534v1 [math.ST])
- Nonlinear Stochastic Dynamics of Complex Systems, II: Potential of Entropic Force in Markov Systems with Nonequilibrium Steady State, Generalized Gibbs Function and Criticality. (arXiv:1605.08071v2 [cond-mat.stat-mech] UPDATED)
- Nonlinear Stochastic Dynamics of Complex Systems, I: A Chemical Reaction Kinetic Perspective with Mesoscopic Nonequilibrium Thermodynamics. (arXiv:1605.08070v1 [cond-mat.stat-mech])
- Provable Efficient Online Matrix Completion via Non-convex Stochastic Gradient Descent. (arXiv:1605.08370v1 [cs.LG])
- Foreign exchange risk premia: from traditional to state-space analyses. (arXiv:1605.08025v1 [q-fin.EC])
- Fast Algorithms for Robust PCA via Gradient Descent. (arXiv:1605.07784v2 [cs.IT] UPDATED)
- How priors of initial hyperparameters affect Gaussian process regression models. (arXiv:1605.07906v2 [stat.ML] UPDATED)
- Asymptotically exact inference in differentiable generative models. (arXiv:1605.07826v4 [stat.CO] UPDATED)
- Many physical laws are ridge functions. (arXiv:1605.07974v1 [math.NA])
- AIMS:Average Information Matrix Splitting. (arXiv:1605.07646v4 [stat.ME] UPDATED)
- Probabilistic Meshless Methods for Partial Differential Equations and Bayesian Inverse Problems. (arXiv:1605.07811v1 [stat.ME])
- Recursive Sampling for the Nystr\"om Method. (arXiv:1605.07583v5 [cs.LG] UPDATED)
- Convergence guarantees for kernel-based quadrature rules in misspecified settings. (arXiv:1605.07254v2 [stat.ML] UPDATED)
- Alternating Optimisation and Quadrature for Robust Control. (arXiv:1605.07496v3 [cs.LG] UPDATED)
- Sequential Neural Models with Stochastic Layers. (arXiv:1605.07571v2 [stat.ML] UPDATED)
- Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks. (arXiv:1605.07127v3 [stat.ML] UPDATED)
- Collaborative Filtering with Side Information: a Gaussian Process Perspective. (arXiv:1605.07025v3 [stat.ML] UPDATED)
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning. (arXiv:1605.06676v2 [cs.AI] UPDATED)
- Fast Stochastic Methods for Nonsmooth Nonconvex Optimization. (arXiv:1605.06900v1 [math.OC])
- DynaNewton - Accelerating Newton's Method for Machine Learning. (arXiv:1605.06561v1 [cs.LG])
- An Information Criterion for Inferring Coupling in Distributed Dynamical Systems. (arXiv:1605.06931v3 [cs.LG] UPDATED)
- Harnessing Smoothness to Accelerate Distributed Optimization. (arXiv:1605.07112v2 [math.OC] UPDATED)
- Likelihood Gradient Evaluation Using Square-Root Covariance Filters. (arXiv:1605.06654v1 [cs.SY])
- An Information Criterion for Inferring Coupling in Distributed Dynamical Systems. (arXiv:1605.06931v3 [cs.LG] UPDATED)
- Nonnegative Matrix Factorization Requires Irrationality. (arXiv:1605.06848v1 [cs.CC])
- On Restricted Nonnegative Matrix Factorization. (arXiv:1605.07061v1 [cs.FL])
- Fast Stochastic Methods for Nonsmooth Nonconvex Optimization. (arXiv:1605.06900v1 [math.OC])
- Extremes and Recurrence in Dynamical Systems. (arXiv:1605.07006v1 [math.DS])
- Query-Efficient Imitation Learning for End-to-End Autonomous Driving. (arXiv:1605.06450v1 [cs.LG])
- Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data. (arXiv:1605.06432v1 [stat.ML])
- Constrained LQ problem with a random jump and application to portfolio selection. (arXiv:1605.05825v1 [math.OC])
- Gaussian Approximations of Small Noise Diffusions in Kullback-Leibler Divergence. (arXiv:1605.05878v1 [math.PR])
- Jump Diffusion and {\alpha}-Stable Techniques for the Markov Switching Approach to Financial Time Series. (arXiv:1605.05893v1 [stat.AP])
- Consensus+Innovations Distributed Kalman Filter with Optimized Gains. (arXiv:1605.06096v2 [cs.IT] UPDATED)
- Well-posed Bayesian inverse problems and heavy-tailed stable Banach space priors. (arXiv:1605.05898v1 [math.PR])
- On the Optimal Linear Convergence Rate of a Generalized Proximal Point Algorithm. (arXiv:1605.05474v1 [math.OC])
- Computing the variance of a conditional expectation via non-nested Monte Carlo. (arXiv:1605.05454v2 [stat.CO] UPDATED)
- Metropolis-Hastings algorithms with autoregressive proposals, and a few examples. (arXiv:1605.05441v2 [stat.CO] UPDATED)
- Online Algorithms For Parameter Mean And Variance Estimation In Dynamic Regression Models. (arXiv:1605.05697v1 [stat.ML])
- Sequential design of experiments for estimating percentiles of black-box functions. (arXiv:1605.05524v2 [math.ST] UPDATED)
- A Distributed Quaternion Kalman Filter With Applications to Fly-by-Wire Systems. (arXiv:1605.05588v2 [cs.SY] UPDATED)
- Localizing the Ensemble Kalman Particle Filter. (arXiv:1605.05476v2 [stat.AP] UPDATED)
- Orthogonal symmetric non-negative matrix factorization under the stochastic block model. (arXiv:1605.05349v1 [stat.ML])
- A Harris process to model stochastic volatility. (arXiv:1605.05382v1 [stat.AP])
- Large complex correlated Wishart matrices: Fluctuations and asymptotic independence at the edges
- A Preconditioned Low-Rank Projection Method with a Rank-Reduction Scheme for Stochastic Partial Differential Equations. (arXiv:1605.05297v1 [math.NA])
- Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding. (arXiv:1605.05278v2 [stat.ME] UPDATED)
- Computational issues and numerical experiments for Linear Multistep Method Particle Filtering. (arXiv:1605.05042v1 [cs.NA])
- Moderate deviation principles for stochastic differential equations with jumps
- Minimax Lower Bounds for Kronecker-Structured Dictionary Learning. (arXiv:1605.05284v1 [cs.IT])
- Multilevel Particle Filters: Normalizing Constant Estimation. (arXiv:1605.04963v1 [stat.CO])
- Simple, Scalable and Accurate Posterior Interval Estimation. (arXiv:1605.04029v2 [stat.CO] UPDATED)
- Alternating optimization method based on nonnegative matrix factorizations for deep neural networks. (arXiv:1605.04639v1 [cs.LG])
- Intermittency for branching random walk in Pareto environment
- Barzilai-Borwein Step Size for Stochastic Gradient Descent. (arXiv:1605.04131v2 [math.OC] UPDATED)
- Extreme-Value Statistics of Fractional Brownian Motion Bridges. (arXiv:1605.04132v1 [cond-mat.stat-mech])
- Stochastic differential equations related to random matrix theory. (arXiv:1605.04417v1 [math.PR])
- Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient. (arXiv:1605.04638v1 [cs.LG])
- Generalized Linear Models for Aggregated Data. (arXiv:1605.04466v1 [stat.ML])
- On stepwise regression. (arXiv:1605.04542v1 [math.ST])
- Ordinary Differential Equation Methods For Markov Decision Processes and Application to Kullback-Leibler Control Cost. (arXiv:1605.04591v1 [math.OC])
- Reducing the Model Order of Deep Neural Networks Using Information Theory. (arXiv:1605.04859v1 [cs.LG])
- On the Iteration Complexity of Oblivious First-Order Optimization Algorithms. (arXiv:1605.03529v1 [math.OC])
- A variational approach to stochastic minimization of convex functionals. (arXiv:1605.03289v1 [math.OC])
- Model Selection Principles in Misspecified Models. (arXiv:1005.5483v2 [math.ST] UPDATED)
- Active Uncertainty Calibration in Bayesian ODE Solvers. (arXiv:1605.03364v3 [cs.NA] UPDATED)
- Theano: A Python framework for fast computation of mathematical expressions. (arXiv:1605.02688v1 [cs.SC])
- Inference of High-dimensional Autoregressive Generalized Linear Models. (arXiv:1605.02693v2 [stat.ML] UPDATED)
- Clustering Time Series and the Surprising Robustness of HMMs. (arXiv:1605.02531v2 [cs.IT] UPDATED)
- Stochastic Portfolio Theory: A Machine Learning Perspective. (arXiv:1605.02654v1 [q-fin.PM])
- On the Convergence of A Family of Robust Losses for Stochastic Gradient Descent. (arXiv:1605.01623v1 [cs.LG])
- High-dimensional Bayesian inference via the Unadjusted Langevin Algorithm. (arXiv:1605.01559v4 [math.ST] UPDATED)
- Fractional Brownian motion, the Matern process, and stochastic modeling of turbulent dispersion. (arXiv:1605.01684v3 [stat.ME] UPDATED)
- Matrix-Variate Regressions and Envelope Models. (arXiv:1605.01485v2 [stat.ME] UPDATED)
- On the evaluation of uncertainties for state estimation with the Kalman filter. (arXiv:1605.01235v1 [cs.SY])
- A Bayesian Approach to Policy Recognition and State Representation Learning. (arXiv:1605.01278v4 [stat.ML] UPDATED)
- Multilevel Monte Carlo methods for the approximation of invariant measures of stochastic differential equations. (arXiv:1605.01384v4 [math.NA] UPDATED)
- Decentralized Dynamic Discriminative Dictionary Learning. (arXiv:1605.01107v1 [stat.ML])
- Estimating an Inverse Gamma distribution. (arXiv:1605.01019v2 [stat.ME] UPDATED)
- Factor Models for Cancer Signatures. (arXiv:1604.08743v4 [q-bio.GN] UPDATED)
- Dictionary Learning for Massive Matrix Factorization. (arXiv:1605.00937v2 [stat.ML] UPDATED)
- Blackbox: A procedure for parallel optimization of expensive black-box functions. (arXiv:1605.00998v1 [cs.MS])
- Decentralized Quasi-Newton Methods. (arXiv:1605.00933v1 [math.OC])
- Convergence in H\"older norms with applications to Monte Carlo methods in infinite dimensions. (arXiv:1605.00856v3 [math.NA] UPDATED)
- A unified convergence bound for conjugate gradient and accelerated gradient. (arXiv:1605.00320v1 [math.OC])
- Gradient Descent Only Converges to Minimizers: Non-Isolated Critical Points and Invariant Regions. (arXiv:1605.00405v2 [math.DS] UPDATED)
- Why have asset price properties changed so little in 200 years. (arXiv:1605.00634v1 [q-fin.GN])
- Particle Smoothing for Hidden Diffusion Processes: Adaptive Path Integral Smoother. (arXiv:1605.00278v2 [cs.LG] UPDATED)
- A new structural stochastic volatility model of asset pricing and its stylized facts. (arXiv:1604.08824v1 [q-fin.EC])
- Optimal Transport vs. Fisher-Rao distance between Copulas for Clustering Multivariate Time Series. (arXiv:1604.08634v2 [stat.ML] UPDATED)
- Parameter estimation in a subcritical percolation model with colouring. (arXiv:1604.08908v1 [math.ST])
- Factor Models for Cancer Signatures. (arXiv:1604.08743v1 [q-bio.GN])
- Ergodicity in randomly perturbed quantum systems. (arXiv:1604.08518v1 [quant-ph])
- Explaining the Prevalence, Scaling and Variance of Urban Phenomena. (arXiv:1604.07876v2 [physics.soc-ph] UPDATED)
- Noisy Optimization: Fast Convergence Rates with Comparison-Based Algorithms. (arXiv:1604.08459v1 [math.OC])
- Adaptive Incremental Mixture Markov chain Monte Carlo. (arXiv:1604.08016v2 [stat.ME] UPDATED)
- On the Surprising Explanatory Power of Higher Realized Moments in Practice. (arXiv:1604.07969v1 [stat.AP])
- Brownian Motion on Spaces with Varying Dimension. (arXiv:1604.07870v1 [math.PR])
- Approximating a Diffusion by a Hidden Markov Model. (arXiv:0906.0259v2 [math.PR] UPDATED)
- Hybrid Monte Carlo with Chaotic Mixing. (arXiv:1604.07343v1 [physics.data-an])
- Universality for the Toda algorithm to compute the largest eigenvalue of a random matrix. (arXiv:1604.07384v2 [math.PR] UPDATED)
- A direct approach to the stable distributions. (arXiv:1604.07350v1 [math.PR])
- Entropy and credit risk in highly correlated markets. (arXiv:1604.07042v1 [q-fin.RM])
- Benchmarking Deep Reinforcement Learning for Continuous Control. (arXiv:1604.06778v3 [cs.LG] UPDATED)
- A Class of Nonconvex Penalties Preserving Overall Convexity in Optimization-Based Mean Filtering. (arXiv:1604.06589v1 [cs.IT])
- Gaussian approximations for transition paths in Brownian dynamics. (arXiv:1604.06594v3 [math.PR] UPDATED)
- Robust and Sparse Regression via $\gamma$-divergence. (arXiv:1604.06637v3 [stat.ME] UPDATED)
- Parameter Estimation of Gaussian Stationary Processes using the Generalized Method of Moments. (arXiv:1604.06511v2 [math.ST] UPDATED)
- Thompson Sampling is Asymptotically Optimal in General Environments. (arXiv:1602.07905v2 [cs.LG] UPDATED)
- A Closer Look at Adaptive Regret; Dmitry Adamskiy, Wouter M. Koolen, Alexey Chernov, Vladimir Vovk
- Gradients Weights improve Regression and Classification; Samory Kpotufe, Abdeslam Boularias, Thomas Schultz, Kyoungok Kim
- Variational Inference for Latent Variables and Uncertain Inputs in Gaussian Processes; Andreas C. Damianou, Michalis K. Titsias, Neil D. Lawrence
- BayesPy: Variational Bayesian Inference in Python; Jaakko Luttinen
- Optimal control under uncertainty and Bayesian parameters adjustments. (arXiv:1604.06340v2 [math.PR] UPDATED)
- Dynamic matrix factorization with social influence. (arXiv:1604.06194v1 [stat.ML])
- Sequential Monte Carlo Smoothing with Parameter Estimation. (arXiv:1604.05658v1 [stat.CO])
- Proximal Distance Algorithms: Theory and Examples. (arXiv:1604.05694v3 [math.OC] UPDATED)
- Multi-level methods and approximating distribution functions. (arXiv:1604.05102v1 [q-bio.QM])
- Potentially Predictive Variance Reducing Subsample Locations in Local Gaussian Process Regression. (arXiv:1604.04980v2 [stat.ME] UPDATED)
- Examples of computational approaches to accommodate randomness in elliptic PDEs. (arXiv:1604.05061v1 [math.NA])
- Examples of computational approaches to accommodate randomness in elliptic PDEs. (arXiv:1604.05061v1 [math.NA])
- The Numerical Approximation of Nonlinear Functionals and Functional Differential Equations. (arXiv:1604.05250v3 [math.NA] UPDATED)
- Chained Gaussian Processes. (arXiv:1604.05263v1 [stat.ML])
- Identifying global optimality for dictionary learning. (arXiv:1604.04942v4 [stat.ML] UPDATED)
- Characteristic Function of Time-Inhomogeneous L\'evy-Driven Ornstein-Uhlenbeck Processes. (arXiv:1604.05117v2 [math.PR] UPDATED)
- A stochastic coordinate descent inertial primal-dual algorithm for large-scale composite optimization. (arXiv:1604.04845v1 [math.OC])
- Efficient primal-dual fixed point algorithm with dynamic stepsize for convex problems with applications to imaging restoration. (arXiv:1604.04852v1 [math.OC])
- A stochastic coordinate descent splitting primal-dual fixed point algorithm and applications to large-scale composite optimization. (arXiv:1604.04282v1 [math.OC])
- Bayesian linear regression with Student-t assumptions. (arXiv:1604.04434v1 [cs.LG])
- Evidence of Self-Organization in Time Series of Capital Markets. (arXiv:1604.03996v2 [q-fin.ST] UPDATED)
- Ergodicity: a historical perspective. Equilibrium and Nonequilibrium. (arXiv:1604.04239v2 [cond-mat.stat-mech] UPDATED)
- Optimal Rates For Regularization Of Statistical Inverse Learning Problems. (arXiv:1604.04054v1 [stat.ML])
- Variational Bayesian Inference of Line Spectra. (arXiv:1604.03744v2 [cs.IT] UPDATED)
- Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics. (arXiv:1604.03912v1 [cs.AI])
- Algorithms for stochastic optimization with functional or expectation constraints. (arXiv:1604.03887v8 [math.OC] UPDATED)
- Asynchronous Stochastic Gradient Descent with Variance Reduction for Non-Convex Optimization. (arXiv:1604.03584v4 [cs.LG] UPDATED)
- Distributed Gradient Descent in Bacterial Food Search. (arXiv:1604.03052v1 [q-bio.QM])
- The Matrix Generalized Inverse Gaussian Distribution: Properties and Applications. (arXiv:1604.03463v2 [stat.ML] UPDATED)
- Equivariant adjusted least squares estimator in two-line fitting model. (arXiv:1604.02928v1 [stat.ME])
- Explicit computations for some Markov modulated counting processes. (arXiv:1604.02617v1 [math.PR])
- Grid Based Nonlinear Filtering Revisited: Recursive Estimation & Asymptotic Optimality. (arXiv:1604.02631v1 [math.ST])
- Well-posed Bayesian Inverse Problems: beyond Gaussian priors. (arXiv:1604.02575v1 [math.PR])
- Online Nonnegative Matrix Factorization with Outliers. (arXiv:1604.02634v2 [stat.ML] UPDATED)
- Online Nonnegative Matrix Factorization with Outliers. (arXiv:1604.02634v2 [stat.ML] UPDATED)
- Liu-type Negative Binomial Regression: A Comparison of Recent Estimators and Applications. (arXiv:1604.02335v1 [stat.ME])
- Inference in partially identified models with many moment inequalities using Lasso. (arXiv:1604.02309v4 [math.ST] UPDATED)
- A Unified Framework for Sparse Non-Negative Least Squares using Multiplicative Updates and the Non-Negative Matrix Factorization Problem. (arXiv:1604.02181v6 [stat.ML] UPDATED)
- An Adaptive Resample-Move Algorithm for Estimating Normalizing Constants. (arXiv:1604.01972v2 [stat.ML] UPDATED)
- Relativistic Quantum Finance. (arXiv:1604.01447v1 [q-fin.MF])
- Online EM for Functional Data. (arXiv:1604.00570v1 [stat.ME])
- Uniform convergence over time of a nested particle filtering scheme for recursive parameter estimation in state--space Markov models. (arXiv:1603.09005v1 [stat.CO])
- Towards Practical Bayesian Parameter and State Estimation. (arXiv:1603.08988v1 [cs.AI])
- Barrier Functionals for the Analysis of Complex Systems: An Optimization-Based Approach. (arXiv:1603.08716v1 [math.OC])
- Exact statistics of record increments of random walks and L\'evy flights. (arXiv:1603.08368v2 [cond-mat.stat-mech] UPDATED)
- The block-Poisson estimator for optimally tuned exact subsampling MCMC. (arXiv:1603.08232v6 [stat.CO] UPDATED)
- A Short Note on P-Value Hacking. (arXiv:1603.07532v4 [stat.AP] UPDATED)
- Optimal transport via a Monge-Amp\`ere optimization problem. (arXiv:1603.07435v1 [math.NA])
- Central limit theorem for a class of globally correlated random variables. (arXiv:1603.07314v1 [cond-mat.stat-mech])
- Meaningful timescales from Monte Carlo simulations of molecular systems with hard-core interactions. (arXiv:1603.07015v2 [physics.comp-ph] UPDATED)
- Stochastic thermodynamics in the quantum regime: From quantum measurement to quantum trajectories. (arXiv:1603.07266v2 [quant-ph] UPDATED)
- Resampling: an improvement of Importance Sampling in varying population size models. (arXiv:1603.07237v1 [math.ST])
- Optimal stopping under model uncertainty: Randomized stopping times approach
- On the convergence of adaptive sequential Monte Carlo methods
- Trading-off variance and complexity in stochastic gradient descent. (arXiv:1603.06861v1 [stat.ML])
- Risk-Constrained Kelly Gambling. (arXiv:1603.06183v1 [q-fin.PM])
- Weighted sampling without replacement. (arXiv:1603.06556v1 [math.PR])
- Linear Dimensionality Reduction: Survey, Insights, and Generalizations. (arXiv:1406.0873v2 [stat.ML] UPDATED)
- Statistically validated network of portfolio overlaps and systemic risk. (arXiv:1603.05914v2 [q-fin.RM] UPDATED)
- Stratified Monte Carlo simulation of Markov chains. (arXiv:1603.06386v1 [math.ST])
- Forward and Inverse Uncertainty Quantification using Multilevel Monte Carlo Algorithms for an Elliptic Nonlocal Equation. (arXiv:1603.06381v1 [stat.CO])
- Skew-t inference with improved covariance matrix approximation. (arXiv:1603.06216v1 [cs.SY])
- Non-standard conditionally specified models for non-ignorable missing data. (arXiv:1603.06045v1 [stat.ME])
- Do Deep Convolutional Nets Really Need to be Deep and Convolutional?. (arXiv:1603.05691v4 [stat.ML] UPDATED)
- Convexity of a stochastic control functional related to importance sampling of It\^o diffusions. (arXiv:1603.05900v1 [math.OC])
- Estimating multivariate latent-structure models
- Optimal Black-Box Reductions Between Optimization Objectives. (arXiv:1603.05642v3 [math.OC] UPDATED)
- Particle-based Gaussian process optimization for input design in nonlinear dynamical models. (arXiv:1603.05445v1 [math.OC])
- A flexible state space model for learning nonlinear dynamical systems. (arXiv:1603.05486v1 [stat.CO])
- Scaled stochastic gradient descent for low-rank matrix completion. (arXiv:1603.04989v2 [cs.LG] UPDATED)
- Markov Chain Monte Carlo confidence intervals
- Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors. (arXiv:1603.04733v5 [stat.ML] UPDATED)
- Particle Gaussian Mixture (PGM) Filters. (arXiv:1603.04510v1 [cs.SY])
- Asymptotic Optimal Strategy for Portfolio Optimization in a Slowly Varying Stochastic Environment. (arXiv:1603.03538v2 [q-fin.MF] UPDATED)
- Convergence Rates for a Class of Estimators Based on Stein's Method. (arXiv:1603.03220v5 [math.ST] UPDATED)
- Functional Autoregression for Sparsely Sampled Data. (arXiv:1603.02982v3 [stat.ME] UPDATED)
- Improved adaptive Multilevel Monte Carlo and applications to finance. (arXiv:1603.02959v2 [math.PR] UPDATED)
- Inference and rare event simulation for stopped Markov processes via reverse-time sequential Monte Carlo. (arXiv:1603.02834v1 [stat.CO])
- Interacting Default Intensity with Hidden Markov Process. (arXiv:1603.02902v1 [q-fin.CP])
- Concentration inequalities with exchangeable pairs (Ph.D. thesis). (arXiv:math/0507526v2 [math.PR] UPDATED)
- Unbiased estimation of risk. (arXiv:1603.02615v4 [q-fin.RM] UPDATED)
- Posterior Consistency for Gaussian Process Approximations of Bayesian Posterior Distributions. (arXiv:1603.02004v1 [math.NA])
- A Quantum Extended Kalman Filter. (arXiv:1603.01890v1 [quant-ph])
- Optimal dictionary for least squares representation. (arXiv:1603.02074v3 [cs.LG] UPDATED)
- Big is Fragile: An Attempt at Theorizing Scale. (arXiv:1603.01416v2 [q-fin.EC] UPDATED)
- Exact and Approximate Bayesian Inference for Low Integer-Valued Time Series Models with Intractable Likelihoods
- PLATO: Policy Learning using Adaptive Trajectory Optimization. (arXiv:1603.00622v4 [cs.LG] UPDATED)
- Multilevel Sequential Monte Carlo Samplers for Normalizing Constants. (arXiv:1603.01136v1 [stat.CO])
- Stochastic thermodynamics of resetting. (arXiv:1603.01141v1 [cond-mat.stat-mech])
- Overdispersed Black-Box Variational Inference. (arXiv:1603.01140v1 [stat.ML])
- A multipurpose information engine that can go beyond the Carnot limit. (arXiv:1603.01129v2 [cond-mat.stat-mech] UPDATED)
- Automatic Differentiation Variational Inference. (arXiv:1603.00788v1 [stat.ML])
- The Arrow of Time in Multivariate Time Series. (arXiv:1603.00784v1 [math.ST])
- Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization. (arXiv:1603.00570v3 [cs.LG] UPDATED)
- An Introduction to L\'evy and Feller Processes. Advanced Courses in Mathematics - CRM Barcelona 2014. (arXiv:1603.00251v2 [math.PR] UPDATED)
- Parameter Estimation for the Langevin Equation with Stationary-Increment Gaussian Noise. (arXiv:1603.00390v1 [math.PR])
- Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization. (arXiv:1603.00448v3 [cs.LG] UPDATED)
- Why does Monte Carlo fail to work properly in high-dimensional optimization problems?. (arXiv:1603.00311v2 [math.OC] UPDATED)
- Flexible online multivariate regression with variational Bayes and the matrix-variate Dirichlet process. (arXiv:1602.08849v1 [stat.CO])
- Easy Monotonic Policy Iteration. (arXiv:1602.09118v1 [cs.LG])
- Adaptive Horizon Model Predictive Control. (arXiv:1602.08619v1 [math.OC])
- Statistical models for dynamics in extreme value processes. (arXiv:1602.08974v1 [stat.AP])
- Interference-Normalised Least Mean Square Algorithm. (arXiv:1602.08116v1 [cs.SY])
- Bias-variance trade-off in portfolio optimization under Expected Shortfall with $\ell_2$ regularization. (arXiv:1602.08297v2 [q-fin.PM] UPDATED)
- On short-term traffic flow forecasting and its reliability. (arXiv:1602.08355v1 [stat.AP])
- Efficient Bayesian Inference for Multivariate Factor Stochastic Volatility Models. (arXiv:1602.08154v3 [stat.CO] UPDATED)
- Reinforcement Learning of POMDPs using Spectral Methods. (arXiv:1602.07764v2 [cs.AI] UPDATED)
- Necessity is the mother of invention. The role of collective sensing in group formation. (arXiv:1602.06737v1 [q-bio.PE])
- Variational inference for Monte Carlo objectives. (arXiv:1602.06725v2 [cs.LG] UPDATED)
- Inference Networks for Sequential Monte Carlo in Graphical Models. (arXiv:1602.06701v2 [stat.ML] UPDATED)
- Error bounds, quadratic growth, and linear convergence of proximal methods. (arXiv:1602.06661v2 [math.OC] UPDATED)
- Conservation laws and symmetries in stochastic thermodynamics. (arXiv:1602.06555v2 [cond-mat.stat-mech] UPDATED)
- Scaling up Dynamic Topic Models. (arXiv:1602.06049v1 [stat.ML])
- Noise Fit, Estimation Error and a Sharpe Information Criterion. (arXiv:1602.06186v5 [q-fin.ST] UPDATED)
- Sampling latent states for high-dimensional non-linear state space models with the embedded HMM method. (arXiv:1602.06030v2 [stat.CO] UPDATED)
- A Poisson process model for Monte Carlo. (arXiv:1602.05986v2 [stat.CO] UPDATED)
- Pathways towards instability in financial networks. (arXiv:1602.05883v2 [q-fin.RM] UPDATED)
- Continuity equation for probability as a requirement of inference over paths. (arXiv:1602.05447v1 [cond-mat.stat-mech])
- Auxiliary Deep Generative Models. (arXiv:1602.05473v4 [stat.ML] UPDATED)
- Patterns of Scalable Bayesian Inference. (arXiv:1602.05221v2 [stat.ML] UPDATED)
- Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression. (arXiv:1602.05419v2 [math.OC] UPDATED)
- Parallel Bayesian Global Optimization of Expensive Functions. (arXiv:1602.05149v4 [stat.ML] UPDATED)
- Gradient Descent Converges to Minimizers. (arXiv:1602.04915v2 [stat.ML] UPDATED)
- Interacting Particle Markov Chain Monte Carlo. (arXiv:1602.05128v3 [stat.CO] UPDATED)
- An introduction to sampling via measure transport. (arXiv:1602.05023v1 [stat.CO])
- Option Pricing in Markets with Unknown Stochastic Dynamics. (arXiv:1602.04848v2 [q-fin.MF] UPDATED)
- Black-box optimization with a politician. (arXiv:1602.04847v1 [math.OC])
- Integral Inequalities in Thermodynamics. (arXiv:1602.04495v3 [math.CA] UPDATED)
- Quasi-Monte Carlo and Multilevel Monte Carlo Methods for Computing Posterior Expectations in Elliptic Inverse Problems. (arXiv:1602.04704v1 [math.NA])
- Non-Boltzmann Ensembles and Monte Carlo Simulation. (arXiv:1602.04631v3 [cond-mat.stat-mech] UPDATED)
- Method for generating two coupled Gaussian stochastic processes. (arXiv:1602.04697v1 [math.PR])
- Online Low-Rank Subspace Learning from Incomplete Data: A Bayesian View. (arXiv:1602.03670v2 [stat.ML] UPDATED)
- Entropy solutions for a traffic model with phase transitions. (arXiv:1602.03454v1 [math.NA])
- Stochastic Quasi-Newton Langevin Monte Carlo. (arXiv:1602.03442v2 [stat.ML] UPDATED)
- Graphical models for studying museum networks: the Abbonamento Musei Torino Piemonte. (arXiv:1602.03429v1 [stat.AP])
- Stratified Bayesian Optimization. (arXiv:1602.02338v2 [cs.LG] UPDATED)
- A Variational Analysis of Stochastic Gradient Algorithms. (arXiv:1602.02666v1 [stat.ML])
- Data-Efficient Reinforcement Learning in Continuous-State POMDPs. (arXiv:1602.02523v1 [stat.ML])
- Importance Sampling for Minibatches. (arXiv:1602.02283v1 [cs.LG])
- A Tractable Fully Bayesian Method for the Stochastic Block Model. (arXiv:1602.02256v1 [cs.LG])
- Applications of the divergence theorem in Bayesian inference and MaxEnt. (arXiv:1602.02544v1 [cond-mat.stat-mech])
- Reducing Runtime by Recycling Samples. (arXiv:1602.02136v1 [cs.LG])
- Large deviations and concentration inequalities for the Ornstein-Uhlenbeck process without tears. (arXiv:1602.02092v2 [math.ST] UPDATED)
- The Spacey Random Walk: a Stochastic Process for Higher-order Data. (arXiv:1602.02102v2 [cs.NA] UPDATED)
- Asynchronous Methods for Deep Reinforcement Learning. (arXiv:1602.01783v2 [cs.LG] UPDATED)
- Modeling the relation between income and commuting distance. (arXiv:1602.01578v2 [physics.soc-ph] UPDATED)
- Thermodynamic aspects of information transfer in complex dynamical systems. (arXiv:1602.01693v2 [cond-mat.stat-mech] UPDATED)
- A Proximal Stochastic Quasi-Newton Algorithm. (arXiv:1602.00223v2 [cs.LG] UPDATED)
- DeepCare: A Deep Dynamic Memory Model for Predictive Medicine. (arXiv:1602.00357v2 [stat.ML] UPDATED)
- Greedy Deep Dictionary Learning. (arXiv:1602.00203v1 [cs.LG])
- System Identification through Online Sparse Gaussian Process Regression with Input Noise. (arXiv:1601.08068v3 [stat.ML] UPDATED)
- On asymptotic validity of naive inference with an approximate likelihood. (arXiv:1601.07911v2 [math.ST] UPDATED)
- Bayesian analysis of traffic flow on interstate I-55: The LWR model
- Modeling competition between two pharmaceutical drugs using innovation diffusion models
- Sequential Monte Carlo Filtering Estimation of Ebola Progression in West Africa. (arXiv:1601.07606v1 [stat.AP])
- Evolutionary stability implies asymptotic stability under multiplicative weights. (arXiv:1601.07267v2 [cs.GT] UPDATED)
- What is Information?. (arXiv:1601.06176v1 [nlin.AO])
- Efficient parameter inference in general hidden Markov models using the filter derivatives. (arXiv:1601.05568v2 [stat.CO] UPDATED)
- The Randomized Causation Coefficient; David Lopez-Paz, Krikamol Muandet, Benjamin Recht
- Linear Dimensionality Reduction: Survey, Insights, and Generalizations; John P. Cunningham, Zoubin Ghahramani
- Bounds on Tail Probabilities in Exponential families. (arXiv:1601.05179v2 [math.PR] UPDATED)
- Shadow price of information in discrete time stochastic optimization. (arXiv:1601.05202v1 [math.OC])
- Understanding Deep Convolutional Networks. (arXiv:1601.04920v1 [stat.ML])
- Stochastic control, entropic interpolation and gradient flows on Wasserstein product spaces. (arXiv:1601.04891v1 [math.PR])
- A continuous updating weighted least squares estimator of tail dependence in high dimensions. (arXiv:1601.04826v1 [stat.ME])
- Sub-Sampled Newton Methods II: Local Convergence Rates. (arXiv:1601.04738v3 [math.OC] UPDATED)
- Sub-Sampled Newton Methods I: Globally Convergent Algorithms. (arXiv:1601.04737v3 [math.OC] UPDATED)
- Variational analysis of inference from dynamical systems. (arXiv:1601.05033v4 [math.DS] UPDATED)
- Critical fragmentation properties of random drilling: How many random holes need to be drilled to collapse a wooden cube?. (arXiv:1601.03534v1 [cond-mat.stat-mech] CROSS LISTED)
- Irreversible simulated tempering. (arXiv:1601.04286v2 [cond-mat.stat-mech] UPDATED)
- An Analysis of Primal-Dual Algorithms for Discounted Markov Decision Processes. (arXiv:1601.04175v1 [math.OC])
- Statistical Mechanics of High-Dimensional Inference. (arXiv:1601.04650v2 [stat.ML] UPDATED)
- On-line Bayesian System Identification. (arXiv:1601.04251v1 [cs.SY])
- Fighting Uncertainty with Uncertainty: A Baby Step. (arXiv:1601.04043v8 [q-fin.GN] UPDATED)
- Faster Asynchronous SGD. (arXiv:1601.04033v1 [stat.ML])
- Proximal extrapolated gradient methods for variational inequalities. (arXiv:1601.04001v1 [math.OC])
- Inter-occurrence times and universal laws in finance, earthquakes and genomes. (arXiv:1601.03688v1 [cond-mat.stat-mech])
- Non-Parametric Causality Detection: An Application to Social Media and Financial Data. (arXiv:1601.03610v3 [stat.ME] UPDATED)
- Eigenvectors of random matrices: A survey. (arXiv:1601.03678v3 [math.PR] UPDATED)
- Generalization of Doob decomposition Theorem. (arXiv:1601.03574v1 [math.PR])
- Infomax strategies for an optimal balance between exploration and exploitation. (arXiv:1601.03073v1 [cs.LG])
- Unbiased Monte Carlo estimate of stochastic differential equations expectations. (arXiv:1601.03139v2 [math.PR] UPDATED)
- Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization. (arXiv:1506.08350v2 [cs.LG] UPDATED)
- Simulated Quantum Annealing Can Be Exponentially Faster than Classical Simulated Annealing. (arXiv:1601.03030v2 [quant-ph] UPDATED)
- On the entropy minimization problem in Statistical Mechanics. (arXiv:1601.02527v1 [math-ph])
- Irreversibility of financial time series: a graph-theoretical approach. (arXiv:1601.01980v1 [q-fin.ST])
- Geometry of Matrix Decompositions Seen Through Optimal Transport and Information Geometry. (arXiv:1601.01875v5 [math.DG] UPDATED)
- Fast Kronecker product kernel methods via generalized vec trick. (arXiv:1601.01507v3 [stat.ML] UPDATED)
- State Space representation of non-stationary Gaussian Processes. (arXiv:1601.01544v1 [cs.LG])
- Bayesian Inference for the Extremal Dependence. (arXiv:1601.01462v3 [stat.ME] UPDATED)
- An Oracle Inequality for Quasi-Bayesian Non-Negative Matrix Factorization. (arXiv:1601.01345v4 [stat.ML] UPDATED)
- Limits to causal inference with state-space reconstruction for infectious disease. (arXiv:1601.00716v1 [q-bio.QM])
- Variational Inference: A Review for Statisticians. (arXiv:1601.00670v9 [stat.CO] UPDATED)
- No Stable Distributions in Finance, please!. (arXiv:1601.00566v2 [q-fin.ST] UPDATED)
Saved in 2015
- Model-based testing for space-time interaction using point processes: An application to psychiatric hospital admissions in an urban area. (arXiv:1512.09052v2 [stat.ME] UPDATED)
- Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling. (arXiv:1512.09103v3 [math.OC] UPDATED)
- A Stochastic Majorize-Minimize Subspace Algorithm for Online Penalized Least Squares Estimation. (arXiv:1512.08722v2 [math.OC] UPDATED)
- Recovery of periodicities hidden in heavy-tailed noise. (arXiv:1512.08732v3 [math.CA] UPDATED)
- A scalable quasi-Bayesian framework for Gaussian graphical models. (arXiv:1512.07934v1 [math.ST])
- Estimation of Kullback-Leibler losses for noisy recovery problems within the exponential family. (arXiv:1512.08191v3 [stat.AP] UPDATED)
- Inverse Reinforcement Learning via Deep Gaussian Process. (arXiv:1512.08065v4 [cs.LG] UPDATED)
- Bridging the Gap between Stochastic Gradient MCMC and Stochastic Optimization. (arXiv:1512.07962v3 [stat.ML] UPDATED)
- Implementing a Bayes Filter in a Neural Circuit: The Case of Unknown Stimulus Dynamics. (arXiv:1512.07839v4 [cs.LG] UPDATED)
- High-Order Stochastic Gradient Thermostats for Bayesian Learning of Deep Models. (arXiv:1512.07662v1 [stat.ML])
- Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural Networks. (arXiv:1512.07666v1 [stat.ML])
- Reinforcement Learning: Stochastic Approximation Algorithms for Markov Decision Processes. (arXiv:1512.07669v1 [math.OC])
- Desensitized Cubature Kalman Filter with Uncertain Parameter. (arXiv:1512.07675v1 [cs.SY])
- k-Means Clustering Is Matrix Factorization. (arXiv:1512.07548v1 [stat.ML])
- Risk Sensitive, Nonlinear Optimal Control: Iterative Linear Exponential-Quadratic Optimal Control with Gaussian Noise. (arXiv:1512.07173v1 [cs.SY])
- Stochastic Dual Ascent for Solving Linear Systems. (arXiv:1512.06890v2 [math.NA] UPDATED)
- Uniform bounds for Black--Scholes implied volatility. (arXiv:1512.06812v2 [q-fin.MF] UPDATED)
- Ornstein-Uhlenbeck diffusion of hermitian and non-hermitian matrices - unexpected links. (arXiv:1512.06599v2 [math-ph] UPDATED)
- Two Sample Covariances from a Trivariate Normal Distribution. (arXiv:1005.1183v2 [math.ST] UPDATED)
- Optimal Control of Conditional Value-at-Risk in Continuous Time. (arXiv:1512.05015v3 [math.OC] UPDATED)
- Bayesian model selection for linear regression. (arXiv:1512.04823v1 [math.ST])
- Stability with respect to initial conditions in V-norm for nonlinear filters with ergodic observations. (arXiv:1512.04834v1 [stat.CO])
- The LASSO risk for gaussian matrices. (arXiv:1008.2581v3 [math.ST] UPDATED)
- Model comparison with missing data using MCMC and importance sampling. (arXiv:1512.04743v1 [stat.CO])
- Second order dynamical systems associated to variational inequalities. (arXiv:1512.04702v3 [math.OC] UPDATED)
- Coupling stochastic EM and Approximate Bayesian Computation for parameter inference in state-space models. (arXiv:1512.04831v6 [stat.CO] UPDATED)
- Quantum assisted Gaussian process regression. (arXiv:1512.03929v1 [quant-ph])
- Preconditioned Stochastic Gradient Descent. (arXiv:1512.04202v3 [stat.ML] UPDATED)
- Coexistence in the face of uncertainty. (arXiv:1512.03970v1 [q-bio.PE])
- Derivation and Analysis of Simplified Filters for Complex Dynamical Systems. (arXiv:1512.03647v1 [math.PR])
- What the collapse of the ensemble Kalman filter tells us about particle filters. (arXiv:1512.03720v2 [math.NA] UPDATED)
- RSG: Beating Subgradient Method without Smoothness and Strong Convexity. (arXiv:1512.03107v14 [math.OC] UPDATED)
- Construction of ODE systems from time series data by a highly flexible modelling approach. (arXiv:1512.03357v1 [math.NA])
- On Computational Complexity Reduction Methods for Kalman Filter Extensions. (arXiv:1512.03077v5 [eess.SY] CROSS LISTED)
- Alternating Minimization, Proximal Minimization and Optimization Transfer Are Equivalent. (arXiv:1512.03034v1 [math.NA])
- Numerical Study of a Particle Method for Gradient Flows. (arXiv:1512.03029v1 [math.AP])
- Convergence of Entropic Schemes for Optimal Transport and Gradient Flows. (arXiv:1512.02783v1 [math.AP])
- Efficient Distributed SGD with Variance Reduction. (arXiv:1512.02970v3 [cs.LG] UPDATED)
- High-order ADI scheme for option pricing in stochastic volatility models. (arXiv:1512.02529v1 [q-fin.CP])
- Convergence of discrete-time Kalman filter estimate to continuous-time estimate for systems with unbounded observation. (arXiv:1512.02473v1 [math.OC])
- Sequential Markov Chain Monte Carlo for Bayesian Filtering with Massive Data. (arXiv:1512.02452v1 [stat.CO])
- Robust Inference with Variational Bayes. (arXiv:1512.02578v1 [stat.ME])
- Gibbs-type Indian buffet processes. (arXiv:1512.02543v2 [stat.ML] UPDATED)
- Filter Based Methods For Statistical Linear Inverse Problems. (arXiv:1512.01955v1 [math.ST])
- Stochastic Collapsed Variational Inference for Sequential Data. (arXiv:1512.01666v1 [stat.ML])
- Stochastic Collapsed Variational Inference for Hidden Markov Models. (arXiv:1512.01665v1 [stat.ML])
- Variance Reduction for Distributed Stochastic Gradient Descent. (arXiv:1512.01708v2 [cs.LG] UPDATED)
- Purely pathwise probability-free Ito integral. (arXiv:1512.01698v5 [q-fin.MF] UPDATED)
- Algorithmic independence of initial condition and dynamical law in thermodynamics and causal inference. (arXiv:1512.02057v1 [cond-mat.stat-mech])
- Quantifying knowledge with a new calculus for belief functions - a generalization of probability theory. (arXiv:1512.01249v1 [math.PR])
- A translation of "The characteristic function of a random phenomenon" by Bruno de Finetti. (arXiv:1512.01229v1 [math.ST])
- Computation of generalized matrix functions. (arXiv:1512.01446v1 [math.NA])
- Probabilistic Foundations of Statistical Mechanics: A Bayesian Approach. (arXiv:1512.01368v1 [cond-mat.stat-mech])
- Unbiased estimators and multilevel Monte Carlo. (arXiv:1512.01022v4 [stat.CO] UPDATED)
- Probabilistic Integration: A Role in Statistical Computation?. (arXiv:1512.00933v6 [stat.ML] UPDATED)
- Non-Markovian quantum processes: complete framework and efficient characterisation. (arXiv:1512.00589v3 [quant-ph] UPDATED)
- Kalman-based Stochastic Gradient Method with Stop Condition and Insensitivity to Conditioning. (arXiv:1512.01139v2 [math.OC] UPDATED)
- Bridges of Markov counting processes: quantitative estimates. (arXiv:1512.01180v1 [math.PR])
- Unbiased estimators and multilevel Monte Carlo. (arXiv:1512.01022v4 [stat.CO] UPDATED)
- Central-limit approach to risk-aware Markov decision processes. (arXiv:1512.00583v1 [math.OC])
- Evolution arrests invasions of cooperative populations. (arXiv:1512.00034v1 [q-bio.PE])
- Accelerated non-linear denoising filters. (arXiv:1512.00389v1 [cs.CV])
- Groups, Information Theory and Einstein's Likelihood Principle. (arXiv:1512.00089v2 [cond-mat.stat-mech] UPDATED)
- Gaussian and Robust Kronecker Product Covariance Estimation: Existence and Uniqueness. (arXiv:1512.00336v2 [stat.AP] UPDATED)
- Sparse preconditioning for model predictive control. (arXiv:1512.00375v1 [math.OC])
- Newton-Stein Method: An optimization method for GLMs via Stein's Lemma. (arXiv:1511.08895v1 [stat.ML])
- An Iteratively Reweighted Least Squares Algorithm for Sparse Regularization. (arXiv:1511.08970v1 [math.NA])
- Regularized EM Algorithms: A Unified Framework and Statistical Guarantees. (arXiv:1511.08551v2 [cs.LG] UPDATED)
- J. B. S. Haldane's Contribution to the Bayes Factor Hypothesis Test. (arXiv:1511.08180v4 [stat.OT] UPDATED)
- On the Poisson equation for Metropolis-Hastings chains. (arXiv:1511.07464v2 [math.PR] UPDATED)
- Discretisations of rough stochastic PDEs. (arXiv:1511.06937v2 [math.PR] UPDATED)
- Convergence Results for a Class of Time-Varying Simulated Annealing Algorithms. (arXiv:1511.07304v3 [math.PR] UPDATED)
- Black box variational inference for state space models. (arXiv:1511.07367v1 [stat.ML])
- Pipelined, Flexible Krylov Subspace Methods. (arXiv:1511.07226v1 [math.NA])
- Block stochastic gradient iteration for convex and nonconvex optimization. (arXiv:1408.2597v3 [math.OC] CROSS LISTED)
- Recurrent Gaussian Processes. (arXiv:1511.06644v6 [cs.LG] UPDATED)
- The Variational Gaussian Process. (arXiv:1511.06499v4 [stat.ML] UPDATED)
- Variance Reduction in SGD by Distributed Importance Sampling. (arXiv:1511.06481v7 [stat.ML] UPDATED)
- Variational Auto-encoded Deep Gaussian Processes. (arXiv:1511.06455v2 [cs.LG] UPDATED)
- Bayesian inference via rejection filtering. (arXiv:1511.06458v2 [cs.LG] UPDATED)
- Neural Network Matrix Factorization. (arXiv:1511.06443v2 [cs.LG] UPDATED)
- Stochastic modified equations and adaptive stochastic gradient algorithms. (arXiv:1511.06251v3 [cs.LG] UPDATED)
- The iterated auxiliary particle filter. (arXiv:1511.06286v2 [stat.CO] UPDATED)
- Importance Sampling: Intrinsic Dimension and Computational Cost. (arXiv:1511.06196v3 [stat.CO] UPDATED)
- Parameter inference with estimated covariance matrices. (arXiv:1511.05969v2 [astro-ph.CO] UPDATED)
- Deep factorisation of the stable process II; potentials and applications. (arXiv:1511.06356v4 [math.PR] UPDATED)
- Stochastic gradient method with accelerated stochastic dynamics. (arXiv:1511.06036v1 [stat.ML])
- Variance Reduced Stochastic Gradient Descent with Neighbors. (arXiv:1506.03662v4 [cs.LG] UPDATED)
- On the Global Linear Convergence of Frank-Wolfe Optimization Variants. (arXiv:1511.05932v1 [math.OC])
- Constructive stability and stabilizability of positive linear discrete-time switching systems. (arXiv:1511.05665v4 [math.OC] UPDATED)
- On the minimum of a conditioned Brownian bridge. (arXiv:1511.05735v2 [math.OC] UPDATED)
- Least squares estimation for the subcritical Heston model based on continuous time observations. (arXiv:1511.05948v3 [math.ST] UPDATED)
- Waves in a Spatial Queue: Stop-and-Go at Airport Security. (arXiv:1511.05140v1 [math.PR])
- Predictive Entropy Search for Multi-objective Bayesian Optimization. (arXiv:1511.05467v3 [stat.ML] UPDATED)
- Dynamical systems with multiplicative noise: Time-scale competition, delayed feedback and effective drifts. (arXiv:1511.05340v1 [cond-mat.stat-mech])
- On parameter identification in stochastic differential equations by penalized maximum likelihood. (arXiv:1404.0651v1 [stat.CO] CROSS LISTED)
- Applications of Realizations (aka Linearizations) to Free Probability. (arXiv:1511.05330v2 [math.OA] UPDATED)
- Bayesian Optimization with Dimension Scheduling: Application to Biological Systems. (arXiv:1511.05385v1 [stat.ML])
- Stochastic selection processes. (arXiv:1511.05390v1 [q-bio.PE])
- Dynamical models of task organization in social insect colonies. (arXiv:1511.04769v1 [q-bio.PE])
- The Correlated Pseudo-Marginal Method. (arXiv:1511.04992v4 [stat.CO] UPDATED)
- Deep Kalman Filters. (arXiv:1511.05121v2 [stat.ML] UPDATED)
- Model Space Priors for Objective Sparse Bayesian Regression. (arXiv:1511.04745v1 [stat.ME])
- Exact sampling of diffusions with a discontinuity in the drift. (arXiv:1511.04112v1 [stat.ME])
- Likelihood inference for incompletely observed stochastic processes: ignorability conditions. (arXiv:math/0507151v2 [math.ST] UPDATED)
- $k$-means: Fighting against Degeneracy in Sequential Monte Carlo with an Application to Tracking. (arXiv:1511.04157v1 [stat.ML])
- Mean Square Error bounds for parameter estimation under model misspecification. (arXiv:1511.03982v2 [math.ST] UPDATED)
- The Douglas-Rachford Algorithm for Weakly Convex Penalties. (arXiv:1511.03920v1 [math.OC])
- Instability and Information. (arXiv:1511.03732v1 [physics.soc-ph])
- Prediction uncertainty and optimal experimental design for learning dynamical systems. (arXiv:1511.03395v5 [stat.AP] UPDATED)
- Dimension of Marginals of Kronecker Product Models. (arXiv:1511.03570v1 [stat.ML])
- Sliced Wasserstein Kernels for Probability Distributions. (arXiv:1511.03198v1 [cs.LG])
- Black-box $\alpha$-divergence Minimization. (arXiv:1511.03243v3 [stat.ML] UPDATED)
- A brief account of the Ising and Ising-like models: Mean-field, effective-field and exact results. (arXiv:1511.03031v2 [cond-mat.stat-mech] UPDATED)
- Simulation of volatility modulated Volterra processes using hyperbolic stochastic partial differential equations
- On the exact and $\varepsilon$-strong simulation of (jump) diffusions
- Variance inequalities for quadratic forms with applications. (arXiv:1511.02723v1 [math.PR])
- Sandwiching the marginal likelihood using bidirectional Monte Carlo. (arXiv:1511.02543v1 [stat.ML])
- Statistical physics of inference: Thresholds and algorithms. (arXiv:1511.02476v5 [cond-mat.stat-mech] UPDATED)
- Speed learning on the fly. (arXiv:1511.02540v1 [math.OC])
- Deep Kernel Learning. (arXiv:1511.02222v1 [cs.LG])
- Streaming regularization parameter selection via stochastic gradient descent. (arXiv:1511.02187v3 [stat.ML] UPDATED)
- Barrier Frank-Wolfe for Marginal Inference. (arXiv:1511.02124v2 [stat.ML] UPDATED)
- Stop Wasting My Gradients: Practical SVRG. (arXiv:1511.01942v1 [cs.LG])
- Getting Started with Particle Metropolis-Hastings for Inference in Nonlinear Dynamical Models. (arXiv:1511.01707v8 [stat.CO] UPDATED)
- Regularization and Bayesian Learning in Dynamical Systems: Past, Present and Future. (arXiv:1511.01543v1 [cs.SY])
- Algorithm Portfolios for Noisy Optimization. (arXiv:1511.01277v1 [math.OC])
- The sample size required in importance sampling. (arXiv:1511.01437v3 [math.PR] UPDATED)
- Stochastic Particle Flow for Nonlinear High-Dimensional Filtering Problems. (arXiv:1511.01448v3 [stat.ME] UPDATED)
- Dictionary descent in optimization. (arXiv:1511.01304v1 [stat.ML])
- The elapsed time between two transient state observations for an absorbing Markov chain. (arXiv:1511.01067v1 [math.PR])
- Consistent Parameter Estimation for LASSO and Approximate Message Passing. (arXiv:1511.01017v2 [math.ST] UPDATED)
- Information Theory and Statistics: an overview. (arXiv:1511.00860v1 [math.ST])
- Optimal Gaussian approximations to the posterior for log-linear models with Diaconis-Ylvisaker priors. (arXiv:1511.00764v1 [stat.ME])
- Discrete time approximation of a COGARCH(p,q) model and its estimation. (arXiv:1511.00253v2 [math.ST] UPDATED)
- Gaussian Process Random Fields. (arXiv:1511.00054v1 [cs.LG])
- Inverse Problems in a Bayesian Setting. (arXiv:1511.00524v1 [math.PR])
- Limiting fitness distributions in evolutionary dynamics. (arXiv:1511.00296v2 [q-bio.PE] UPDATED)
- Conditional Value-at-Risk: Theory and Applications. (arXiv:1511.00140v1 [q-fin.RM])
- Martingale central-limit theorems for pivotal sampling. (arXiv:1510.08895v1 [math.ST])
- Mean Field Games with Ergodic cost for Discrete Time Markov Processes. (arXiv:1510.08968v1 [math.PR])
- Convergence Rate of Incremental Gradient and Newton Methods. (arXiv:1510.08562v3 [math.OC] UPDATED)
- Why Random Reshuffling Beats Stochastic Gradient Descent. (arXiv:1510.08560v4 [math.OC] UPDATED)
- Blitzkriging: Kronecker-structured Stochastic Gaussian Processes. (arXiv:1510.07965v2 [stat.ML] UPDATED)
- Online Learning with Gaussian Payoffs and Side Observations. (arXiv:1510.08108v1 [stat.ML])
- Beyond prediction: A framework for inference with variational approximations in mixture models. (arXiv:1510.08151v6 [stat.ME] UPDATED)
- Stochastic control for a class of nonlinear kernels and applications. (arXiv:1510.08439v2 [math.PR] UPDATED)
- On the Convergence of the Iterative Shrinkage/Thresholding Algorithm With a Weakly Convex Penalty. (arXiv:1510.07821v2 [math.OC] UPDATED)
- Pricing of high-dimensional options. (arXiv:1510.07221v1 [q-fin.MF])
- Law invariant risk measures and information divergences. (arXiv:1510.07030v2 [q-fin.RM] UPDATED)
- Quantum Techniques for Stochastic Mechanics. (arXiv:1209.3632v5 [quant-ph] UPDATED)
- On the shadow moments of apparently infinite-mean phenomena. (arXiv:1510.06731v2 [stat.AP] UPDATED)
- Optimization as Estimation with Gaussian Processes in Bandit Settings. (arXiv:1510.06423v4 [stat.ML] UPDATED)
- Scalable posterior approximations for large-scale Bayesian inverse problems via likelihood-informed parameter and state reduction. (arXiv:1510.06053v2 [stat.CO] UPDATED)
- Markov Processes linking Thermodynamics and Turbulence. (arXiv:1510.06281v1 [cond-mat.stat-mech])
- NYTRO: When Subsampling Meets Early Stopping. (arXiv:1510.05684v2 [stat.ML] UPDATED)
- Entropy and thinning of discrete random variables. (arXiv:1510.05390v3 [math.PR] UPDATED)
- Optimization for Gaussian Processes via Chaining. (arXiv:1510.05576v1 [stat.ML])
- A TV-Gaussian prior for infinite-dimensional Bayesian inverse problems and its numerical implementations. (arXiv:1510.05239v2 [stat.ME] UPDATED)
- Random matrices. (arXiv:1510.04430v2 [math-ph] UPDATED)
- Dual Control for Approximate Bayesian Reinforcement Learning. (arXiv:1510.03591v2 [stat.ML] UPDATED)
- Kernel Sequential Monte Carlo. (arXiv:1510.03105v4 [stat.CO] UPDATED)
- Pseudo-Marginal Slice Sampling. (arXiv:1510.02958v2 [stat.CO] UPDATED)
- Conditional Risk Minimization for Stochastic Processes. (arXiv:1510.02706v2 [stat.ML] UPDATED)
- Sequential Monte Carlo Methods for State and Parameter Estimation in Abruptly Changing Environments. (arXiv:1510.02604v1 [stat.CO])
- A nonlinear population Monte Carlo scheme for the Bayesian estimation of parameters of $\alpha$-stable distributions. (arXiv:1510.02702v1 [stat.ME])
- Empirical risk minimization for heavy-tailed losses
- An Optimal Transport Formulation of the Linear Feedback Particle Filter. (arXiv:1510.01948v1 [math.PR])
- What's in a ball? Constructing and characterizing uncertainty sets. (arXiv:1510.01675v1 [q-fin.RM])
- The Proximal Robbins-Monro Method. (arXiv:1510.00967v4 [math.ST] UPDATED)
- Nonlinear State Space Model Identification Using a Regularized Basis Function Expansion. (arXiv:1510.00563v1 [stat.CO])
- Critical Behavior and Universality Classes for an Algorithmic Phase Transition in Sparse Reconstruction. (arXiv:1509.08995v3 [cs.IT] UPDATED)
- An Introduction to Twisted Particle Filters and Parameter Estimation in Non-linear State-space Models. (arXiv:1509.09175v2 [stat.CO] UPDATED)
- Convergence of Stochastic Gradient Descent for PCA. (arXiv:1509.09002v2 [cs.LG] UPDATED)
- Asynchronous Gibbs Sampling. (arXiv:1509.08999v7 [stat.CO] UPDATED)
- Posterior Exploration based Sequential Monte Carlo for Global Optimization. (arXiv:1509.08870v3 [stat.CO] UPDATED)
- Tractable Fully Bayesian Inference via Convex Optimization and Optimal Transport Theory. (arXiv:1509.08582v1 [stat.ML])
- Asymptotic behavior of maximum likelihood estimators for a jump-type Heston model. (arXiv:1509.08869v4 [math.ST] UPDATED)
- Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning. (arXiv:1509.08731v1 [stat.ML])
- Blocking Strategies and Stability of Particle Gibbs Samplers. (arXiv:1509.08362v1 [math.ST])
- Adaptive sequential Monte Carlo for multiple changepoint analysis. (arXiv:1509.08442v1 [stat.AP])
- Unbiased Bayesian Inference for Population Markov Jump Processes via Random Truncations. (arXiv:1509.08327v2 [stat.ML] UPDATED)
- High-dimensional Time Series Prediction with Missing Values. (arXiv:1509.08333v3 [cs.LG] UPDATED)
- Bayesian sequential parameter estimation with a Laplace type approximation. (arXiv:1509.07900v1 [stat.ME])
- Optimal trading strategies - a time series approach. (arXiv:1509.07953v3 [q-fin.PM] UPDATED)
- Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix. (arXiv:1509.06121v1 [math.ST])
- Forward Backward Doubly Stochastic Differential Equations and the Optimal Filtering of Diffusion Processes. (arXiv:1509.06352v3 [math.PR] UPDATED)
- A sequential approach to calibrate ecosystem models with multiple time series data. (arXiv:1509.06123v1 [q-bio.QM])
- Steady state thermodynamics in population dynamics. (arXiv:1509.06448v1 [cond-mat.stat-mech])
- Expectile Asymptotics. (arXiv:1509.06866v2 [stat.ME] UPDATED)
- Market Making with Model Uncertainty. (arXiv:1509.07155v4 [q-fin.TR] UPDATED)
- Role of dimensionality in complex networks: Connection with nonextensive statistics. (arXiv:1509.07141v1 [cond-mat.stat-mech])
- Estimating standard errors for importance sampling estimators with multiple Markov chains. (arXiv:1509.06310v2 [math.ST] UPDATED)
- A Note on Parameter Estimation for Misspecified Regression Models with Heteroskedastic Errors. (arXiv:1509.05810v3 [stat.ME] UPDATED)
- Quasi-MLE for quadratic ARCH model with long memory. (arXiv:1509.06422v1 [math.ST])
- A Simulated Annealing Approach to Bayesian Inference. (arXiv:1509.05315v1 [stat.CO])
- Boosting Bayesian Parameter Inference of Nonlinear Stochastic Differential Equation Models by Hamiltonian Scale Separation. (arXiv:1509.05305v2 [cs.DS] UPDATED)
- Bayesian inference for spatio-temporal spike-and-slab priors. (arXiv:1509.04752v3 [stat.ML] UPDATED)
- Adapting the Number of Particles in Sequential Monte Carlo Methods through an Online Scheme for Convergence Assessment. (arXiv:1509.04879v2 [stat.CO] UPDATED)
- Correlations of correlations: Secondary autocorrelations in finite harmonic systems. (arXiv:1509.04359v3 [cond-mat.stat-mech] UPDATED)
- Dynamic Poisson Factorization. (arXiv:1509.04640v1 [cs.LG])
- Gaussian process surrogates for failure detection: a Bayesian experimental design approach. (arXiv:1509.04613v1 [stat.CO])
- Maximum Correntropy Kalman Filter. (arXiv:1509.04580v1 [stat.ML])
- Precise Phase Transition of Total Variation Minimization. (arXiv:1509.04376v1 [cs.IT])
- Scalable Bayesian shrinkage and uncertainty quantification for high-dimensional regression. (arXiv:1509.03697v2 [stat.ME] UPDATED)
- Compatible Value Gradients for Reinforcement Learning of Continuous Deep Policies. (arXiv:1509.03005v1 [cs.LG])
- Entropic fluctuations in Gaussian dynamical systems. (arXiv:1509.03244v1 [math-ph])
- Entropic CLT and phase transition in high-dimensional Wishart matrices. (arXiv:1509.03258v3 [math.PR] UPDATED)
- Coarse-to-Fine Sequential Monte Carlo for Probabilistic Programs. (arXiv:1509.02962v1 [cs.AI])
- Continuous control with deep reinforcement learning. (arXiv:1509.02971v6 [cs.LG] UPDATED)
- Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees. (arXiv:1509.03025v1 [math.ST])
- On Parameter Estimation of Hidden Telegraph Process. (arXiv:1509.02704v1 [math.ST])
- A Variational Bayesian State-Space Approach to Online Passive-Aggressive Regression. (arXiv:1509.02438v1 [stat.ML])
- Matrix Factorisation with Linear Filters. (arXiv:1509.02088v1 [stat.ML])
- HAMSI: A Parallel Incremental Optimization Algorithm Using Quadratic Approximations for Solving Partially Separable Problems. (arXiv:1509.01698v4 [stat.ML] UPDATED)
- Coordinate Descent Methods for Symmetric Nonnegative Matrix Factorization. (arXiv:1509.01404v2 [cs.NA] UPDATED)
- Statistical Inference for Partially Observed Markov Processes via the R Package pomp. (arXiv:1509.00503v2 [stat.ME] UPDATED)
- Heavy-tailed Independent Component Analysis. (arXiv:1509.00727v1 [cs.LG])
- Variance estimation in the particle filter. (arXiv:1509.00394v2 [stat.CO] UPDATED)
- A large deviations approach to limit theory for heavy-tailed time series. (arXiv:1509.00253v1 [math.ST])
- On recursive Bayesian predictive distributions. (arXiv:1508.07448v5 [stat.ME] UPDATED)
- Regularized Kernel Recursive Least Square Algoirthm. (arXiv:1508.07103v1 [cs.LG])
- On dimension reduction in Gaussian filters. (arXiv:1508.06452v3 [stat.CO] UPDATED)
- OCReP: An Optimally Conditioned Regularization for Pseudoinversion Based Neural Training. (arXiv:1508.06095v1 [cs.NE])
- Statistical look at reasons of involvement in wars. (arXiv:1508.06228v3 [stat.AP] UPDATED)
- Fast Asynchronous Parallel Stochastic Gradient Decent. (arXiv:1508.05711v1 [stat.ML])
- Population annealing: Theory and application in spin glasses. (arXiv:1508.05647v2 [cond-mat.dis-nn] UPDATED)
- Models of Markov processes with a random transition mechanism. (arXiv:1508.05598v2 [math.PR] UPDATED)
- Concentration of Measure Techniques and Applications. (arXiv:1508.05448v1 [math.PR])
- Adaptive Online Learning. (arXiv:1508.05170v2 [cs.LG] UPDATED)
- Parallel and Interacting Stochastic Approximation Annealing algorithms for global optimisation. (arXiv:1508.04876v1 [stat.CO])
- Calculating principal eigen-functions of non-negative integral kernels: particle approximations and applications. (arXiv:1202.6678v3 [stat.CO] UPDATED)
- State-space models' dirty little secrets: even simple linear Gaussian models can have estimation problems. (arXiv:1508.04325v3 [q-bio.QM] UPDATED)
- Non-Stationary Gaussian Process Regression with Hamiltonian Monte Carlo. (arXiv:1508.04319v1 [stat.ML])
- Optimal approximating Markov chains for Bayesian inference. (arXiv:1508.03387v3 [stat.CO] UPDATED)
- An adaptive independence sampler MCMC algorithm for infinite dimensional Bayesian inferences. (arXiv:1508.03283v2 [math.NA] UPDATED)
- Maximum Likelihood Estimation for Wishart processes. (arXiv:1508.03323v2 [math.ST] UPDATED)
- Bayesian Dropout. (arXiv:1508.02905v2 [stat.ML] UPDATED)
- De-biasing the Lasso: Optimal Sample Size for Gaussian Designs. (arXiv:1508.02757v3 [math.ST] UPDATED)
- Sequential Monte Carlo for fractional Stochastic Volatility Models. (arXiv:1508.02651v2 [stat.ME] UPDATED)
- The Third Way Of Probability & Statistics: Beyond Testing and Estimation To Importance, Relevance, and Skill. (arXiv:1508.02384v1 [stat.OT])
- The Bayesian Second Law of Thermodynamics. (arXiv:1508.02421v3 [cond-mat.stat-mech] UPDATED)
- Decision Making in the Arrow of Time. (arXiv:1508.02018v4 [cond-mat.stat-mech] UPDATED)
- A variational approach to path estimation and parameter inference of hidden diffusion processes. (arXiv:1508.00506v4 [math.OC] UPDATED)
- Adaptive Multiple Importance Sampling for Gaussian Processes. (arXiv:1508.01050v2 [stat.CO] UPDATED)
- Likelihood-free inference in high-dimensional models. (arXiv:1507.08612v2 [stat.ME] UPDATED)
- Nonlinear stability and ergodicity of ensemble based Kalman filters. (arXiv:1507.08307v1 [math.PR])
- A Gauss-Newton Method for Markov Decision Processes. (arXiv:1507.08271v4 [cs.AI] UPDATED)
- On particle Gibbs sampling. (arXiv:1304.1887v2 [stat.CO] UPDATED)
- An Analytically Tractable Bayesian Approximation to Optimal Point Process Filtering. (arXiv:1507.07813v1 [stat.ML])
- Inference in Ising Models. (arXiv:1507.07055v3 [math.ST] UPDATED)
- Consistent estimation of the filtering and marginal smoothing distributions in nonparametric hidden Markov models. (arXiv:1507.06510v1 [math.ST])
- Dynamic Matrix Factorization with Priors on Unknown Values. (arXiv:1507.06452v1 [stat.ML])
- Hessian corrections to the Metropolis Adjusted Langevin Algorithm. (arXiv:1507.06336v1 [stat.CO])
- Cox's Theorem and the Jaynesian Interpretation of Probability. (arXiv:1507.06597v3 [math.ST] UPDATED)
- Risk Quantification in Stochastic Simulation under Input Uncertainty. (arXiv:1507.06015v3 [q-fin.RM] UPDATED)
- Re-Weighted l_1 Dynamic Filtering for Time-Varying Sparse Signal Estimation. (arXiv:1208.0325v3 [math.ST] UPDATED)
- On Classical and Bayesian Asymptotics in State Space Stochastic Differential Equations. (arXiv:1507.06128v4 [math.ST] UPDATED)
- Dynamic Filtering of Time-Varying Sparse Signals via l1 Minimization. (arXiv:1507.06145v2 [math.ST] UPDATED)
- FastGP: An R Package for Gaussian Processes. (arXiv:1507.06055v1 [stat.CO])
- Long time behavior of Markov processes and beyond. (arXiv:1507.05801v1 [math.PR])
- Gradient Importance Sampling. (arXiv:1507.05781v1 [stat.ML])
- Fast Approximate Bayesian Computation for Estimating Parameters in Differential Equations. (arXiv:1507.05117v1 [stat.ML])
- Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm. (arXiv:1507.05021v3 [math.ST] UPDATED)
- Incremental Variational Inference for Latent Dirichlet Allocation. (arXiv:1507.05016v2 [stat.ML] UPDATED)
- Black-Box Policy Search with Probabilistic Programs. (arXiv:1507.04635v4 [stat.ML] UPDATED)
- On the Convergence of Stochastic Variational Inference in Bayesian Networks. (arXiv:1507.04505v1 [stat.ML])
- Approximate Maximum Likelihood Estimation. (arXiv:1507.04553v1 [stat.CO])
- Parallel MMF: a Multiresolution Approach to Matrix Computation. (arXiv:1507.04396v1 [cs.NA])
- Kernel Methods for Linear Discrete-Time Equations. (arXiv:1507.03111v2 [math.DS] UPDATED)
- A Review of Nonnegative Matrix Factorization Methods for Clustering. (arXiv:1507.03194v2 [stat.ML] UPDATED)
- Derivative-Free Estimation of the Score Vector and Observed Information Matrix with Application to State-Space Models. (arXiv:1304.5768v3 [stat.ME] UPDATED)
- Hawkes Processes. (arXiv:1507.02822v1 [math.PR])
- Sampling from a log-concave distribution with Projected Langevin Monte Carlo. (arXiv:1507.02564v1 [math.PR])
- Wasserstein Training of Boltzmann Machines. (arXiv:1507.01972v1 [stat.ML])
- Intersecting Faces: Non-negative Matrix Factorization With New Guarantees. (arXiv:1507.02189v1 [cs.LG])
- Rethinking LDA: moment matching for discrete ICA. (arXiv:1507.01784v2 [stat.ML] UPDATED)
- Fast sampling in a linear-Gaussian inverse problem. (arXiv:1507.01614v2 [stat.CO] UPDATED)
- Subspace-Sparse Representation. (arXiv:1507.01307v1 [stat.ML])
- Uncertainty Quantification Under Group Sparsity. (arXiv:1507.01296v4 [math.ST] UPDATED)
- Convex Factorization Machine for Regression. (arXiv:1507.01073v5 [stat.ML] UPDATED)
- A New Approach to Probabilistic Programming Inference. (arXiv:1507.00996v2 [stat.ML] UPDATED)
- Scalable Discrete Sampling as a Multi-Armed Bandit Problem. (arXiv:1506.09039v3 [stat.ML] UPDATED)
- Locally weighted Markov chain Monte Carlo. (arXiv:1506.08852v1 [stat.CO])
- Online Learning to Sample. (arXiv:1506.09016v2 [cs.LG] UPDATED)
- Update estimation of diffusion parameter observed at high frequency. (arXiv:1506.08521v1 [math.ST])
- Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization. (arXiv:1506.08272v5 [math.OC] UPDATED)
- Gaussian process hyper-parameter estimation using parallel asymptotically independent Markov sampling. (arXiv:1506.08010v4 [stat.CO] UPDATED)
- A review of some recent advances in causal inference. (arXiv:1506.07669v1 [stat.ME])
- Generalized Majorization-Minimization. (arXiv:1506.07613v3 [cs.CV] UPDATED)
- On Variance Reduction in Stochastic Gradient Descent and its Asynchronous Variants. (arXiv:1506.06840v2 [cs.LG] UPDATED)
- Bayesian optimisation for fast approximate inference in state-space models with intractable likelihoods. (arXiv:1506.06975v3 [stat.CO] UPDATED)
- Approximation method for discrete Markov decision models with a large state space. (arXiv:1506.06722v2 [stat.CO] UPDATED)
- The MCMC split sampler: A block Gibbs sampling scheme for latent Gaussian models. (arXiv:1506.06285v1 [stat.CO])
- Convergence of Sequential Quasi-Monte Carlo Smoothing Algorithms. (arXiv:1506.06117v1 [stat.CO])
- Expectation Particle Belief Propagation. (arXiv:1506.05934v1 [stat.CO])
- Topics in Markov chains: mixing and escape rate. (arXiv:1506.04850v1 [math.PR])
- Some stochastic models for structured populations : scaling limits and long time behavior. (arXiv:1506.04165v2 [math.PR] UPDATED)
- Exact simulation of max-stable processes. (arXiv:1506.04430v1 [stat.ME])
- Online Matrix Factorization via Broyden Updates. (arXiv:1506.04389v2 [stat.ML] UPDATED)
- MCMC for Variationally Sparse Gaussian Processes. (arXiv:1506.04000v1 [stat.ML])
- Sparse Partially Collapsed MCMC for Parallel Inference in Topic Models. (arXiv:1506.03784v3 [stat.ML] UPDATED)
- Optimization Monte Carlo: Efficient and Embarrassingly Parallel Likelihood-Free Inference. (arXiv:1506.03693v2 [cs.LG] UPDATED)
- Parallelizing MCMC with Random Partition Trees. (arXiv:1506.03164v2 [stat.ML] UPDATED)
- Automatic Variational Inference in Stan. (arXiv:1506.03431v2 [stat.ML] UPDATED)
- Neural Adaptive Sequential Monte Carlo. (arXiv:1506.03338v3 [cs.LG] UPDATED)
- Copula variational inference. (arXiv:1506.03159v2 [stat.ML] UPDATED)
- Parallel Markov Chain Monte Carlo for Non-Gaussian Posterior Distributions. (arXiv:1506.03162v1 [stat.ME])
- Provable Bayesian Inference via Particle Mirror Descent. (arXiv:1506.03101v3 [cs.LG] UPDATED)
- Variational consensus Monte Carlo. (arXiv:1506.03074v1 [stat.ML])
- Accelerated Stochastic Gradient Descent for Minimizing Finite Sums. (arXiv:1506.03016v2 [stat.ML] UPDATED)
- Frank-Wolfe Bayesian Quadrature: Probabilistic Integration with Theoretical Guarantees. (arXiv:1506.02681v3 [stat.ML] UPDATED)
- Predictive statistical mechanics and macroscopic time evolution: hydrodynamics and entropy production. (arXiv:1506.02625v4 [cond-mat.stat-mech] UPDATED)
- Predictive statistical mechanics and macroscopic time evolution. A model for closed Hamiltonian systems. (arXiv:1506.02622v3 [cond-mat.stat-mech] UPDATED)
- Computationally Efficient Bayesian Learning of Gaussian Process State Space Models. (arXiv:1506.02267v2 [stat.CO] UPDATED)
- No penalty no tears: Least squares in high-dimensional linear models. (arXiv:1506.02222v5 [stat.ME] UPDATED)
- Towards Automatic Model Comparison: An Adaptive Sequential Monte Carlo Approach. (arXiv:1303.3123v2 [stat.ME] UPDATED)
- Handy sufficient conditions for the convergence of the maximum likelihood estimator in observation-driven models. (arXiv:1506.01831v1 [math.ST])
- Parallel Stochastic Gradient Markov Chain Monte Carlo for Matrix Factorisation Models. (arXiv:1506.01418v2 [stat.ML] UPDATED)
- Transition from lognormal to chi-square superstatistics for financial time series. (arXiv:1506.01660v2 [q-fin.ST] UPDATED)
- Probabilistic Numerics and Uncertainty in Computations. (arXiv:1506.01326v1 [math.NA])
- Toward a generic representation of random variables for machine learning. (arXiv:1506.00976v2 [cs.LG] UPDATED)
- Inferring causal impact using Bayesian structural time-series models. (arXiv:1506.00356v1 [stat.AP])
- Towards automatic calibration of the number of state particles within the SMC$^2$ algorithm. (arXiv:1506.00570v1 [stat.CO])
- On particle Gibbs sampling
- Concentration inequalities for sampling without replacement
- Adaptive MCMC with online relabeling
- An autoregressive model leading to stable distributions. (arXiv:1505.06873v2 [math.PR] UPDATED)
- Adaptive Thermostats for Noisy Gradient Systems. (arXiv:1505.06889v2 [math.NA] UPDATED)
- A Moreau-Yosida approximation scheme for a class of high-dimensional posterior distributions. (arXiv:1505.07072v2 [math.ST] UPDATED)
- Markov Chain Monte Carlo confidence intervals. (arXiv:1209.0703v3 [math.ST] UPDATED)
- Particle ancestor sampling for near-degenerate or intractable state transition models. (arXiv:1505.06356v1 [stat.CO])
- Rao-Blackwellized particle smoothers for conditionally linear Gaussian models. (arXiv:1505.06357v1 [stat.CO])
- Three discussions of the paper "sequential quasi-Monte Carlo sampling", by M. Gerber and N. Chopin. (arXiv:1505.06473v2 [stat.CO] UPDATED)
- Learning the dependence structure of rare events: a non-asymptotic study. (arXiv:1505.06298v1 [math.ST])
- Record statistics for random walk bridges. (arXiv:1505.06053v2 [cond-mat.stat-mech] UPDATED)
- Extremes of multidimensional Gaussian processes. (arXiv:1006.0029v4 [math.PR] UPDATED)
- Weight Uncertainty in Neural Networks. (arXiv:1505.05424v2 [stat.ML] UPDATED)
- Accelerated Gibbs sampling of normal distributions using matrix splittings and polynomials. (arXiv:1505.03512v1 [stat.CO])
- An Asynchronous Mini-Batch Algorithm for Regularized Stochastic Optimization. (arXiv:1505.04824v1 [math.OC])
- Learning Exponential Families in High-Dimensions: Strong Convexity and Sparsity. (arXiv:0911.0054v2 [cs.LG] UPDATED)
- On the statistical properties and tail risk of violent conflicts. (arXiv:1505.04722v2 [stat.AP] UPDATED)
- Sequential Bayesian inference for implicit hidden Markov models and current limitations. (arXiv:1505.04321v1 [stat.ME])
- Efficient adaptive MCMC through precision estimation. (arXiv:1505.03908v2 [stat.CO] UPDATED)
- Portfolio optimization for heavy-tailed assets: Extreme Risk Index vs. Markowitz. (arXiv:1505.04045v1 [q-fin.PM])
- Towards stability and optimality in stochastic gradient descent. (arXiv:1505.02417v4 [stat.ME] UPDATED)
- Multiple Change Point Estimation in Stationary Ergodic Time Series. (arXiv:1203.1515v10 [stat.ML] UPDATED)
- Bayesian Structure Learning for Stationary Time Series. (arXiv:1505.03131v2 [stat.ME] UPDATED)
- On the Exact Simulation of (Jump) Diffusion Bridges. (arXiv:1505.03030v1 [stat.ME])
- On Markov chain Monte Carlo methods for tall data. (arXiv:1505.02827v1 [stat.ME])
- Fluctuations, stability and instability of a distributed particle filter with local exchange. (arXiv:1505.02390v2 [stat.ME] UPDATED)
- Multifractal to monofractal evolution of the London's street network. (arXiv:1505.02760v1 [physics.soc-ph])
- An unexpected encounter with Cauchy and L\'evy. (arXiv:1505.01957v2 [math.ST] UPDATED)
- Clustering of extreme events created by multiple correlated maxima. (arXiv:1505.01553v1 [math.DS])
- Scaling It Up: Stochastic Search Structure Learning in Graphical Models. (arXiv:1505.01687v1 [math.ST])
- Dirichlet Process Hidden Markov Multiple Change-point Model. (arXiv:1505.01665v1 [math.ST])
- Traffic Dynamic Instability. (arXiv:1505.01219v1 [cond-mat.stat-mech])
- Particle Gibbs algorithms for Markov jump processes. (arXiv:1505.01434v1 [stat.CO])
- An Introduction to Multilevel Monte Carlo for Option Valuation. (arXiv:1505.00965v1 [math.NA])
- Inference on the Sharpe ratio via the upsilon distribution. (arXiv:1505.00829v3 [q-fin.ST] UPDATED)
- Stick-Breaking Policy Learning in Dec-POMDPs. (arXiv:1505.00274v2 [cs.AI] UPDATED)
- A central limit theorem for temporally non-homogenous Markov chains with applications to dynamic programming. (arXiv:1505.00749v2 [math.PR] UPDATED)
- Generalized Mass Action Law and Thermodynamics of Nonlinear Markov Processes. (arXiv:1504.08317v1 [physics.chem-ph])
- Statistical Inference for Perturbed Multiscale Dynamical Systems. (arXiv:1504.07645v3 [math.PR] UPDATED)
- Market forecasting using Hidden Markov Models. (arXiv:1504.07829v2 [stat.ML] UPDATED)
- Fast Sampling for Bayesian Max-Margin Models. (arXiv:1504.07107v5 [stat.ML] UPDATED)
- Detecting Markov Random Fields Hidden in White Noise. (arXiv:1504.06984v2 [math.ST] UPDATED)
- Maximum a Posteriori Estimation by Search in Probabilistic Programs. (arXiv:1504.06848v1 [cs.AI])
- Thermodynamics of Error Correction. (arXiv:1504.06407v2 [q-bio.SC] UPDATED)
- Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning. (arXiv:1504.06043v2 [cs.SY] UPDATED)
- Why the nature needs 1/f-noise. (arXiv:1504.05859v2 [cond-mat.stat-mech] UPDATED)
- SMC-ABC methods for the estimation of stochastic simulation models of the limit order book. (arXiv:1504.05806v1 [q-fin.CP])
- Efficient Sequential Monte-Carlo Samplers for Bayesian Inference. (arXiv:1504.05753v1 [stat.CO])
- Noise Robust Online Inference for Linear Dynamic Systems. (arXiv:1504.05723v1 [stat.CO])
- Introduction to Stochastic Differential Equations (SDEs) for Finance. (arXiv:1504.05309v14 [q-fin.MF] UPDATED)
- 25 Years of Self-Organized Criticality: Concepts and Controversies. (arXiv:1504.04991v1 [cond-mat.stat-mech])
- Time-consistency of risk measures with GARCH volatilities and their estimation. (arXiv:1504.04774v2 [q-fin.RM] UPDATED)
- Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields. (arXiv:1504.04406v1 [stat.ML])
- Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting. (arXiv:1504.04407v2 [cs.LG] UPDATED)
- Introduction to regularity structures
- Forecasting trends with asset prices. (arXiv:1504.03934v2 [q-fin.ST] UPDATED)
- A note on one of the Markov chain Monte Carlo novice's questions. (arXiv:1504.03467v1 [stat.CO])
- Partially Observable Risk-Sensitive Markov Decision Processes. (arXiv:1504.03530v3 [math.PR] UPDATED)
- Exponential ergodicity for Markov processes with random switching. (arXiv:1303.6999v2 [math.PR] UPDATED)
- Pathwise versions of the Burkholder-Davis-Gundy inequality. (arXiv:1305.6188v2 [math.PR] UPDATED)
- Topics in Stochastic Portfolio Theory. (arXiv:1504.02988v1 [q-fin.MF])
- Streaming, Memory Limited Matrix Completion with Noise. (arXiv:1504.03156v1 [math.SP])
- An Overview on the Estimation of Large Covariance and Precision Matrices. (arXiv:1504.02995v2 [stat.ME] UPDATED)
- A closed-form approach to Bayesian inference in tree-structured graphical models. (arXiv:1504.02723v4 [stat.ML] UPDATED)
- Stochastic determination of matrix determinants. (arXiv:1504.02661v2 [physics.data-an] UPDATED)
- A proof of uniform convergence over time for a distributed particle filter. (arXiv:1504.01079v2 [stat.CO] UPDATED)
- Estimating the geometric median in Hilbert spaces with stochastic gradient algorithms: $L^{p}$ and almost sure rates of convergence. (arXiv:1504.02267v2 [math.ST] UPDATED)
- The Metropolis-Hastings algorithm. (arXiv:1504.01896v3 [stat.CO] UPDATED)
- Lasso and probabilistic inequalities for multivariate point processes. (arXiv:1208.0570v2 [math.ST] UPDATED)
- Practical Statistics for Particle Physicists. (arXiv:1504.00945v1 [stat.ME])
- Rare Events, Extremely Rare Events and Fluctuations in a Thermodynamic System. (arXiv:1504.00543v1 [cond-mat.stat-mech])
- Bayesian model comparison with un-normalised likelihoods. (arXiv:1504.00298v3 [stat.CO] UPDATED)
- Hidden Markov models for stochastic thermodynamics. (arXiv:1504.00293v2 [cond-mat.stat-mech] UPDATED)
- Two Timescale Stochastic Approximation with Controlled Markov noise and Off-policy temporal difference learning. (arXiv:1503.09105v14 [math.DS] UPDATED)
- Variational Bayes with Intractable Likelihood. (arXiv:1503.08621v2 [stat.ME] UPDATED)
- Sequential Monte Carlo with Adaptive Weights for Approximate Bayesian Computation. (arXiv:1503.07791v1 [stat.CO])
- Likelihood-free Model Choice. (arXiv:1503.07689v3 [stat.ME] UPDATED)
- Introduction to labeled island particle models and their asymptotic properties. (arXiv:1503.07316v1 [math.PR])
- Multilevel Sequential Monte Carlo Samplers. (arXiv:1503.07259v1 [stat.CO])
- Stability of Noisy Metropolis-Hastings. (arXiv:1503.07066v1 [stat.CO])
- High-dimensional inference in misspecified linear models. (arXiv:1503.06426v1 [stat.ME])
- Sequential Monte Carlo Methods for System Identification. (arXiv:1503.06058v3 [stat.CO] UPDATED)
- Probabilities of concurrent extremes. (arXiv:1503.05748v1 [math.ST])
- Tail index estimation, concentration and adaptivity. (arXiv:1503.05077v3 [math.ST] UPDATED)
- Interactive Restless Multi-armed Bandit Game and Swarm Intelligence Effect. (arXiv:1503.03964v1 [cs.AI])
- Random-walk in Beta-distributed random environment. (arXiv:1503.04117v5 [math.PR] UPDATED)
- Perturbation theory for Markov chains via Wasserstein distance. (arXiv:1503.04123v3 [stat.CO] UPDATED)
- Light and Widely Applicable MCMC: Approximate Bayesian Inference for Large Datasets. (arXiv:1503.04178v2 [stat.ME] UPDATED)
- L\'evy Processes For Finance: An Introduction In R. (arXiv:1503.03902v1 [stat.AP])
- Deep Unsupervised Learning using Nonequilibrium Thermodynamics. (arXiv:1503.03585v8 [cs.LG] UPDATED)
- Statistical inference for generalized Ornstein-Uhlenbeck processes. (arXiv:1503.03381v1 [stat.ME])
- Some people have all the luck. (arXiv:1503.02902v2 [math.PR] UPDATED)
- A piecewise deterministic model for a prey-predator community. (arXiv:1503.02492v4 [math.PR] UPDATED)
- Escaping From Saddle Points --- Online Stochastic Gradient for Tensor Decomposition. (arXiv:1503.02101v1 [cs.LG])
- Effective Langevin equations for constrained stochastic processes. (arXiv:1503.02639v2 [cond-mat.stat-mech] UPDATED)
- Large-Scale Distributed Bayesian Matrix Factorization using Stochastic Gradient MCMC. (arXiv:1503.01596v2 [cs.LG] UPDATED)
- Local Expectation Gradients for Doubly Stochastic Variational Inference. (arXiv:1503.01494v1 [stat.ML])
- Constructing Analytically Tractable Ensembles of Non-Stationary Covariances with an Application to Financial Data. (arXiv:1503.01584v2 [q-fin.ST] UPDATED)
- Sensitivity Analysis for Bayesian Hierarchical Models
- Dirichlet Process Hidden Markov Multiple Change-point Model
- A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights. (arXiv:1503.01243v2 [stat.ML] UPDATED)
- Quantifying Uncertainty in Stochastic Models with Parametric Variability. (arXiv:1503.01401v1 [stat.ML])
- Time-varying nonlinear regression models: Nonparametric estimation and model selection
- Detecting gradual changes in locally stationary processes
- How to speed up R code: an introduction. (arXiv:1503.00855v1 [stat.CO])
- Sequential Monte Carlo as Approximate Sampling: bounds, adaptive resampling via $\infty$-ESS, and an application to Particle Gibbs. (arXiv:1503.00966v2 [math.ST] UPDATED)
- Biased Online Parameter Inference for State-Space Models. (arXiv:1503.00266v1 [stat.CO])
- On estimation states of hidden markov models in condition of unknown transition matrix. (arXiv:1503.00167v2 [math.PR] UPDATED)
- Randomized Urn Models revisited using Stochastic Approximation. (arXiv:1101.2786v6 [math.PR] UPDATED)
- Correlation formulas for Markovian network processes in a random environment. (arXiv:1503.00153v1 [math.PR])
- Stochastic Dual Coordinate Ascent with Adaptive Probabilities. (arXiv:1502.08053v1 [math.OC])
- Measures of Systemic Risk. (arXiv:1502.07961v5 [q-fin.RM] UPDATED)
- Importance sampling in path space for diffusion processes with slow-fast variables. (arXiv:1502.07899v2 [math.PR] UPDATED)
- Dynamics of quasi-stationary systems: Finance as an example. (arXiv:1502.07522v1 [q-fin.ST])
- Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo. (arXiv:1502.07645v2 [stat.ML] UPDATED)
- Equilibrium in Misspecified Markov Decision Processes. (arXiv:1502.06901v2 [q-fin.EC] UPDATED)
- Markov Interacting Importance Samplers. (arXiv:1502.07039v2 [stat.CO] UPDATED)
- Iteratively reweighted adaptive lasso for conditional heteroscedastic time series with applications to AR-ARCH type processes. (arXiv:1502.06557v2 [stat.ME] UPDATED)
- MILJS : Brand New JavaScript Libraries for Matrix Calculation and Machine Learning. (arXiv:1502.06064v1 [stat.ML])
- Robust Utility Maximization with L\'evy Processes. (arXiv:1502.05920v2 [q-fin.MF] UPDATED)
- Extremes Control of Complex Systems With Applications to Social Network. (arXiv:1502.04985v1 [math.ST])
- An efficient particle-based online EM algorithm for general state-space models. (arXiv:1502.04822v2 [stat.CO] UPDATED)
- Are Discoveries Spurious? Distributions of Maximum Spurious Correlations and Their Applications. (arXiv:1502.04237v4 [math.ST] UPDATED)
- On the probability that all eigenvalues of Gaussian, Wishart, and double Wishart random matrices lie within an interval. (arXiv:1502.04189v2 [math.ST] UPDATED)
- Hawkes processes in finance. (arXiv:1502.04592v2 [q-fin.TR] UPDATED)
- Advanced Mean Field Theory of Restricted Boltzmann Machine. (arXiv:1502.00186v3 [cond-mat.stat-mech] UPDATED)
- Multifractality of jump diffusion processes. (arXiv:1502.03938v2 [math.PR] UPDATED)
- Policy Gradient for Coherent Risk Measures. (arXiv:1502.03919v2 [cs.AI] UPDATED)
- Solvable non-Markovian dynamic network. (arXiv:1502.04072v3 [math.PR] UPDATED)
- Nonlinear state space smoothing using the conditional particle filter. (arXiv:1502.03697v3 [stat.CO] UPDATED)
- Monte Carlo Planning method estimates planning horizons during interactive social exchange. (arXiv:1502.03696v7 [stat.ML] UPDATED)
- Quasi-Newton particle Metropolis-Hastings. (arXiv:1502.03656v2 [stat.CO] UPDATED)
- Newton-based maximum likelihood estimation in nonlinear state space models. (arXiv:1502.03655v2 [stat.CO] UPDATED)
- Quasi-Newton particle Metropolis-Hastings. (arXiv:1502.03656v2 [stat.CO] UPDATED)
- Transition densities of one-dimensional Levy processes. (arXiv:1502.02750v1 [math.PR])
- Nested Sequential Monte Carlo Methods. (arXiv:1502.02536v3 [stat.CO] UPDATED)
- Contextual Markov Decision Processes. (arXiv:1502.02259v1 [stat.ML])
- Algorithms for Finding Copulas Minimizing Convex Functions of Sums. (arXiv:1502.02130v3 [stat.CO] UPDATED)
- Entropy of convex functions on $R^d$. (arXiv:1502.01752v3 [math.ST] UPDATED)
- Collective periodicity in mean-field models of cooperative behavior. (arXiv:1502.01960v1 [math.PR])
- Glass Transition, Cooperativity and Interfaces. (arXiv:1502.01900v1 [cond-mat.dis-nn])
- Fixed points EM algorithm and nonnegative rank boundaries
- Bayesian computation: a perspective on the current state, and sampling backwards and forwards. (arXiv:1502.01148v3 [stat.CO] UPDATED)
- On certain functionals of the maximum of Brownian motion and their applications. (arXiv:1502.01218v1 [cond-mat.stat-mech])
- A large deviations principle for infinite-server queues in a random environment. (arXiv:1502.00885v1 [math.PR])
- Multiple Tipping Points and Optimal Repairing in Interacting Networks. (arXiv:1502.00244v2 [physics.soc-ph] UPDATED)
- Lectures on singular stochastic PDEs. (arXiv:1502.00157v2 [math.PR] UPDATED)
- Loop measures without transition probabilities. (arXiv:1502.00148v1 [math.PR])
- High dimensional matrix estimation with unknown variance of the noise. (arXiv:1112.3055v3 [math.ST] UPDATED)
- A nonlinear model for long memory conditional heteroscedasticity. (arXiv:1502.00095v2 [math.ST] UPDATED)
- Power-law correlations in finance-related Google searches, and their cross-correlations with volatility and traded volume: Evidence from the Dow Jones Industrial components. (arXiv:1502.00225v1 [q-fin.ST])
- Nonparametric change-point analysis of volatility. (arXiv:1502.00043v2 [math.ST] UPDATED)
- Convolution and convolution-root properties of long-tailed distributions. (arXiv:1501.07458v5 [math.PR] UPDATED)
- Human diffusion and city influence. (arXiv:1501.07788v2 [physics.soc-ph] UPDATED)
- Graphical Markov models for infinitely many variables. (arXiv:1501.07878v1 [math.PR])
- Adaptive step size selection for Hessian-based manifold Langevin samplers. (arXiv:1501.07454v3 [stat.CO] UPDATED)
- Portfolio Optimization under Shortfall Risk Constraint. (arXiv:1501.07480v4 [q-fin.MF] UPDATED)
- Forward-reverse EM algorithm for Markov chains: convergence and numerical analysis. (arXiv:1501.07091v1 [math.ST])
- A Probabilistic Least-Mean-Squares Filter. (arXiv:1501.06929v1 [stat.ML])
- Optimal strategies of investment in a linear stochastic model of market. (arXiv:1501.07124v1 [q-fin.PM])
- Transportation cost-information and concentration inequalities for bifurcating Markov chains. (arXiv:1501.06693v1 [math.PR])
- Estimation of parameters of SDE driven by fractional Brownian motion with polynomial drift. (arXiv:1501.06850v2 [math.PR] UPDATED)
- Particle Gibbs with Ancestor Sampling for Probabilistic Programs. (arXiv:1501.06769v5 [stat.ML] UPDATED)
- Spatio-temporal modelling of extreme storms. (arXiv:1501.06377v1 [stat.AP])
- Granger causality for state space models. (arXiv:1501.06502v2 [math.ST] UPDATED)
- AR(1) Latent Class Models for Longitudinal Count Data. (arXiv:1501.05961v1 [stat.ME])
- Output-Sensitive Adaptive Metropolis-Hastings for Probabilistic Programs. (arXiv:1501.05677v2 [cs.AI] UPDATED)
- Moment based estimation of supOU processes and a related stochastic volatility model. (arXiv:1305.1470v2 [math.PR] UPDATED)
- Lazier ABC. (arXiv:1501.05144v1 [stat.CO])
- Parameter Estimation for a partially observed Ornstein-Uhlenbeck process with long-memory noise. (arXiv:1501.04972v2 [math.PR] UPDATED)
- Parameter estimation for SDEs related to stationary Gaussian processes. (arXiv:1501.04970v1 [math.PR])
- Toward robust early-warning models: A horse race, ensembles and model uncertainty. (arXiv:1501.04682v3 [q-fin.ST] UPDATED)
- State Space Methods for Granger-Geweke Causality Measures. (arXiv:1501.04663v1 [math.ST])
- Lectures on integrable probability. (arXiv:1212.3351v2 [math.PR] UPDATED)
- An exact mapping between the Variational Renormalization Group and Deep Learning. (arXiv:1410.3831v1 [stat.ML])
- Fractionally integrated COGARCH processes. (arXiv:1501.03694v3 [math.ST] UPDATED)
- The asymptotic smile of a multiscaling stochastic volatility model. (arXiv:1501.03387v4 [math.PR] UPDATED)
- Extremes on river networks. (arXiv:1501.02663v2 [stat.ME] UPDATED)
- Controlled Markov Chains with AVaR Criteria for Unbounded Costs. (arXiv:1501.02518v4 [math.PR] UPDATED)
- Self-Financing Trading and the Ito-Doeblin Lemma. (arXiv:1501.02750v1 [q-fin.PR])
- Sequential Kernel Herding: Frank-Wolfe Optimization for Particle Filtering. (arXiv:1501.02056v2 [stat.ML] UPDATED)
- Shortfall Deviation Risk: An alternative to risk measurement. (arXiv:1501.02007v4 [q-fin.RM] UPDATED)
- Simulation of stochastic Volterra equations driven by space--time L\'evy noise. (arXiv:1501.01645v1 [math.PR])
- R Markdown. (arXiv:1501.01613v1 [stat.CO])
- An Introduction to Matrix Concentration Inequalities. (arXiv:1501.01571v1 [math.PR])
- On the survival probability of a random walk in random environment with killing. (arXiv:1501.01521v1 [math.PR])
- A Composite Risk Measure Framework for Decision Making under Uncertainty. (arXiv:1501.01126v1 [math.OC])
- Entropy-Based Financial Asset Pricing. (arXiv:1501.01155v1 [q-fin.PR])
- Effective interactions and large deviations in stochastic processes. (arXiv:1501.01154v1 [cond-mat.stat-mech])
- Detecting tail behavior: mean excess plots with confidence bounds. (arXiv:1501.00518v1 [math.ST])
- On large deviations for small noise It\^o processes. (arXiv:1212.3223v3 [math.PR] UPDATED)
- A law of large numbers for limit order books. (arXiv:1501.00843v1 [q-fin.MF])
- Detailed Derivations of Small-Variance Asymptotics for some Hierarchical Bayesian Nonparametric Models. (arXiv:1501.00052v1 [stat.ML])
- (Non-) asymptotic properties of Stochastic Gradient Langevin Dynamics. (arXiv:1501.00438v2 [stat.ME] UPDATED)
- Lifting -- A nonreversible Markov chain Monte Carlo Algorithm. (arXiv:1412.8762v4 [cond-mat.stat-mech] UPDATED)
- Collective Dynamics from Stochastic Thermodynamics. (arXiv:1501.00055v1 [cond-mat.stat-mech])
- Accurate and Conservative Estimates of MRF Log-likelihood using Reverse Annealing. (arXiv:1412.8566v1 [cs.LG])
Saved in 2014
- Information processing in living systems. (arXiv:1412.8752v1 [q-bio.QM])
- On Particle Methods for Parameter Estimation in State-Space Models. (arXiv:1412.8695v2 [stat.CO] UPDATED)
- High Dimensional Expectation-Maximization Algorithm: Statistical Optimization and Asymptotic Normality. (arXiv:1412.8729v2 [stat.ML] UPDATED)
- Parametrix construction of the transition probability density of the solution to an SDE driven by $\alpha$-stable noise. (arXiv:1412.8732v3 [math.PR] UPDATED)
- Why does Deep Learning work? - A perspective from Group Theory. (arXiv:1412.6621v3 [cs.LG] UPDATED)
- Markov-modulated Ornstein-Uhlenbeck processes. (arXiv:1412.7952v1 [math.PR])
- Imitation Dynamics with Payoff Shocks. (arXiv:1412.7842v1 [math.PR])
- Parametric Inference for Nonsynchronously Observed Diffusion Processes in the Presence of Market Microstructure Noise. (arXiv:1412.8173v3 [math.ST] UPDATED)
- Efficient particle-based online smoothing in general hidden Markov models: the PaRIS algorithm. (arXiv:1412.7550v1 [stat.CO])
- Concentration for matrix martingales in continuous time and microscopic activity of social networks. (arXiv:1412.7705v2 [math.PR] UPDATED)
- Information spreading in a large population of active transmitters and passive receivers. (arXiv:1412.7563v2 [math.PR] UPDATED)
- Optimal switching for pairs trading rule: a viscosity solutions approach. (arXiv:1412.7649v1 [q-fin.MF])
- Tail Risk Constraints and Maximum Entropy. (arXiv:1412.7647v1 [q-fin.RM])
- A new perspective on the fundamental theorem of asset pricing for large financial markets. (arXiv:1412.7562v3 [q-fin.MF] UPDATED)
- Some simple but challenging Markov processes. (arXiv:1412.7516v1 [math.PR])
- Max-stable processes and stationary systems of L\'evy particles. (arXiv:1412.7444v2 [math.PR] UPDATED)
- Large-scale empirical study on pairs trading for all possible pairs of stocks listed on the first section of the Tokyo Stock Exchange. (arXiv:1412.7269v2 [q-fin.TR] UPDATED)
- Particle Metropolis-adjusted Langevin algorithms. (arXiv:1412.7299v3 [stat.ME] UPDATED)
- Efficient strategy for the Markov chain Monte Carlo in high-dimension with heavy-tailed target probability distribution. (arXiv:1412.6231v1 [stat.ME])
- Adaptive Monte Carlo Maximum Likelihood. (arXiv:1412.6370v1 [stat.ME])
- From dependency to causality: a machine learning approach. (arXiv:1412.6285v1 [cs.LG])
- Nonlinear GARCH model and 1/f noise. (arXiv:1412.6244v2 [q-fin.ST] UPDATED)
- Large deviations for Generalized Polya Urns with arbitrary urn function. (arXiv:1412.5762v6 [math.PR] UPDATED)
- Topological properties of hierarchical networks. (arXiv:1412.5918v2 [cond-mat.dis-nn] UPDATED)
- A Comment on the Book "Continuous-Time Markov Chains" by W.J. Anderson. (arXiv:1412.5856v3 [math.PR] UPDATED)
- Poisson's equation in nonlinear filtering. (arXiv:1412.5845v1 [math.OC])
- Interacting growth processes and invariant percolation
- On Pareto theory of circulation of elites. (arXiv:1412.4695v1 [q-fin.GN])
- Analysis and control of pre-extinction dynamics in stochastic populations. (arXiv:1412.3857v1 [q-bio.PE] CROSS LISTED)
- Quenched central limit theorems for the Ising model on random graphs. (arXiv:1412.5081v1 [math.PR])
- Power Weighted Densities for Time Series Data. (arXiv:1412.4059v3 [stat.AP] UPDATED)
- Biips: Software for Bayesian Inference with Interacting Particle Systems. (arXiv:1412.3779v1 [stat.CO])
- Efficient penalty search for multiple changepoint problems. (arXiv:1412.3617v1 [stat.CO])
- A Stable Particle Filter in High-Dimensions. (arXiv:1412.3501v1 [stat.CO])
- Social contact processes and the partner model. (arXiv:1412.3349v2 [math.PR] UPDATED)
- Financial Time Series: Stylized Facts for the Mexican Stock Exchange Index Compared to Developed Markets. (arXiv:1412.3126v1 [q-fin.ST])
- Generalised Entropy MDPs and Minimax Regret. (arXiv:1412.3276v1 [cs.LG])
- Distributed Stochastic Approximation: Weak Convergence and Network Design. (arXiv:1412.3158v2 [stat.AP] UPDATED)
- Multilevel Monte Carlo for stochastic differential equations with small noise. (arXiv:1412.3039v2 [math.NA] UPDATED)
- On Distribution of Product of Stable Laws. (arXiv:1412.2809v1 [math.PR])
- Asymptotic analysis of covariance parameter estimation for Gaussian processes in the misspecified case. (arXiv:1412.1926v2 [math.ST] UPDATED)
- Nonstationary ETAS models for nonstandard earthquakes. (arXiv:1412.1922v1 [stat.AP])
- Statistical Physics of Adaptation. (arXiv:1412.1875v1 [physics.bio-ph])
- Skewness and kurtosis analysis for non-Gaussian distributions. (arXiv:1412.1293v1 [cond-mat.stat-mech])
- Lotka Volterra in fluctuating environment or "how switching between beneficial environments can make survival harder". (arXiv:1412.1107v3 [math.PR] UPDATED)
- Bernstein-von Mises Theorems for Functionals of Covariance Matrix. (arXiv:1412.0313v1 [math.ST])
- Interacting two-state Markov chains on undirected networks. (arXiv:1412.0700v2 [math.DS] UPDATED)
- Misspecified Recovery. (arXiv:1412.0042v3 [q-fin.MF] UPDATED)
- Improving predictability of time series using maximum entropy methods. (arXiv:1411.7805v1 [q-fin.RM])
- Unbiased Monte Carlo: posterior estimation for intractable/infinite-dimensional models. (arXiv:1411.7713v1 [stat.ME])
- Probability Theory without Bayes' Rule. (arXiv:1411.7920v2 [math.PR] UPDATED)
- Two examples of non strictly convex large deviations. (arXiv:1411.7256v3 [math.PR] UPDATED)
- How to Gamble If You're In a Hurry. (arXiv:1112.1645v2 [math.PR] UPDATED)
- Efficiently learning Ising models on arbitrary graphs. (arXiv:1411.6156v2 [cs.LG] UPDATED)
- Distributed Coordinate Descent for L1-regularized Logistic Regression. (arXiv:1411.6520v1 [stat.ML])
- Risk minimization and portfolio diversification. (arXiv:1411.6657v2 [q-fin.PM] UPDATED)
- Asymptotically Optimal Discrete Time Nonlinear Filters From Stochastically Convergent State Process Approximations. (arXiv:1411.6719v3 [math.ST] UPDATED)
- On percolation in Poisson graphs. (arXiv:1411.6688v1 [math.PR])
- Diversification versus specialization -- lessons from a noise driven linear dynamical system. (arXiv:1411.4756v1 [physics.soc-ph])
- Modelling of dependence in high-dimensional financial time series by cluster-derived canonical vines. (arXiv:1411.4970v1 [q-fin.ST])
- You Can Beat the Market: Estimating the Return on Investment for National Hockey League (NHL) Team Scouting using a Draft Value Pick Chart for the NHL. (arXiv:1411.5754v1 [stat.AP])
- A conditional strong large deviation result and a functional central limit theorem for the rate function. (arXiv:1411.5803v2 [math.PR] UPDATED)
- Butterfly resampling: asymptotics for particle filters with constrained interactions. (arXiv:1411.5876v1 [stat.ME])
- On a Nonparametric Change Point Detection Model in Markovian Regimes
- Large deviations of the realized (co-)volatility vector. (arXiv:1411.5159v1 [math.PR])
- A unifying framework for relaxations of the causal assumptions in Bell's theorem. (arXiv:1411.4648v1 [quant-ph])
- Filtering hidden Markov measures. (arXiv:1411.4944v1 [math.ST])
- Consistency of maximum likelihood estimation for some dynamical systems
- Stochastic Compositional Gradient Descent: Algorithms for Minimizing Compositions of Expected-Value Functions. (arXiv:1411.3803v1 [stat.ML])
- Autocorrelation type functions for big and dirty data series. (arXiv:1411.3904v2 [stat.ME] UPDATED)
- A Sharp First Order Analysis of Feynman-Kac Particle Models. (arXiv:1411.3800v1 [math.ST])
- Likelihood estimators for multivariate extremes. (arXiv:1411.3448v2 [stat.ME] UPDATED)
- Estimating causal structure using conditional DAG models. (arXiv:1411.2755v1 [stat.ME])
- Statistical inference for critical continuous state and continuous time branching processes with immigration. (arXiv:1411.2232v2 [math.ST] UPDATED)
- Modelling extremes using approximate Bayesian Computation. (arXiv:1411.1451v1 [stat.ME])
- Stochastic Variational Inference for Hidden Markov Models. (arXiv:1411.1670v1 [stat.ML])
- Parametric Sequential Causal Inference in Point Parametrization. (arXiv:1411.1194v2 [stat.ME] UPDATED)
- Stochastic Modelling with Randomised Markov Bridges. (arXiv:1411.1214v4 [math.PR] UPDATED)
- Random walks in a one-dimensional L\'evy random environment. (arXiv:1411.0586v2 [math.PR] UPDATED)
- Sequentially Constrained Monte Carlo. (arXiv:1410.8209v2 [stat.ME] UPDATED)
- Mean and variance estimation in high-dimensional heteroscedastic models with non-convex penalties. (arXiv:1410.7874v2 [math.ST] UPDATED)
- Almost Sure Asymptotic Stability for Regime-Switching Diffusions. (arXiv:1410.7643v1 [math.PR])
- State and Parameter Estimation of Partially Observed Linear Ordinary Differential Equations with Deterministic Optimal Control. (arXiv:1410.7558v1 [stat.ME])
- Parametric Estimation of Ordinary Differential Equations with Orthogonality Conditions. (arXiv:1410.7566v1 [stat.ME])
- A Tracking Approach to Parameter Estimation in Linear Ordinary Differential Equations. (arXiv:1410.7554v1 [stat.ME])
- Breakdown of statistical inference from some random experiments. (arXiv:1410.7424v4 [physics.data-an] UPDATED)
- Everything you wanted to know about Data Analysis and Fitting but were afraid to ask. (arXiv:1210.3781v3 [physics.data-an] UPDATED)
- On spectral distribution of high dimensional covariation matrices. (arXiv:1410.6764v1 [math.PR])
- About the posterior distribution in hidden Markov models with unknown number of states. (arXiv:1207.2064v2 [math.ST] UPDATED)
- Online and Stochastic Gradient Methods for Non-decomposable Loss Functions. (arXiv:1410.6776v1 [cs.LG])
- A Second Law for Open Markov Processes. (arXiv:1410.6531v3 [cond-mat.stat-mech] UPDATED)
- Markov Chain Monte Carlo and Variational Inference: Bridging the Gap. (arXiv:1410.6460v4 [stat.CO] UPDATED)
- Power-law models for infectious disease spread
- L\'evy walks. (arXiv:1410.5100v2 [cond-mat.stat-mech] UPDATED)
- Multicanonical MCMC for Sampling Rare Events. (arXiv:1305.3039v2 [cond-mat.stat-mech] UPDATED)
- Inference and Mixture Modeling with the Elliptical Gamma Distribution. (arXiv:1410.4812v2 [stat.CO] UPDATED)
- Limiting Statistics of the Largest and Smallest Eigenvalues in the Correlated Wishart Model. (arXiv:1410.4719v2 [math-ph] UPDATED)
- mS2GD: Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting. (arXiv:1410.4744v1 [cs.LG])
- Perfect sampling for nonhomogeneous Markov chains and hidden Markov models. (arXiv:1410.4462v2 [math.ST] UPDATED)
- Convergence properties of weighted particle islands with application to the double bootstrap algorithm. (arXiv:1410.4231v2 [stat.CO] UPDATED)
- Central Limit Theorem for Nonlinear Hawkes Processes. (arXiv:1204.1067v3 [math.PR] UPDATED)
- On Random Operator-Valued Matrices: Operator-Valued Semicircular Mixtures and Central Limit Theorem. (arXiv:1410.3500v1 [math.PR])
- A stochastic behavior analysis of stochastic restricted-gradient descent algorithm in reproducing kernel Hilbert spaces. (arXiv:1410.3595v1 [cs.LG])
- On a toy model of interacting neurons. (arXiv:1410.3263v2 [math.PR] UPDATED)
- Second derivative of the log-likelihood in the model given by a Levy driven stochastic differential equations. (arXiv:1410.2880v1 [math.PR])
- Coauthorship and Citation Networks for Statisticians. (arXiv:1410.2840v2 [stat.AP] UPDATED)
- Central Limit for the Product of Free Random Variables. (arXiv:1101.5220v3 [math.OA] UPDATED)
- Control functionals for Monte Carlo integration. (arXiv:1410.2392v5 [stat.ME] UPDATED)
- Bayesian tracking and parameter learning for non-linear multiple target tracking models. (arXiv:1410.2046v1 [stat.AP])
- Existence and uniqueness of the maximum likelihood estimator for models with a Kronecker product covariance structure. (arXiv:1410.2118v1 [math.ST])
- Sequential Monte Carlo Samplers for capital allocation under copula-dependent risk models. (arXiv:1410.1101v2 [stat.CO] UPDATED)
- BayesPy: Variational Bayesian Inference in Python. (arXiv:1410.0870v3 [stat.ML] UPDATED)
- Generalized Friendship Paradox: An Analytical Approach. (arXiv:1410.0586v1 [physics.soc-ph])
- Brownian motion and gambling: from ratchets to paradoxical games. (arXiv:1410.0485v1 [physics.soc-ph])
- Deep Directed Generative Autoencoders. (arXiv:1410.0630v1 [stat.ML])
- Linear State-Space Model with Time-Varying Dynamics. (arXiv:1410.0555v2 [stat.ML] UPDATED)
- Rejoinder of "Instrumental Variables: An Econometrician's Perspective". (arXiv:1410.0482v1 [stat.ME])
- Likelihood free inference for Markov processes: a comparison. (arXiv:1410.0524v1 [stat.CO])
- Think Globally, Act Globally: An Epidemiologist's Perspective on Instrumental Variable Estimation. (arXiv:1410.0477v1 [stat.ME])
- Rare event simulation for multiscale diffusions in random environments. (arXiv:1410.0386v3 [math.PR] UPDATED)
- About the posterior distribution in hidden Markov models with unknown number of states
- Convergence rate and concentration inequalities for Gibbs sampling in high dimension
- Conditional ergodicity in infinite dimension
- Combining Particle MCMC with Rao-Blackwellized Monte Carlo Data Association for Parameter Estimation in Multiple Target Tracking. (arXiv:1409.8502v2 [stat.ME] UPDATED)
- Approximate Bayesian Computation in State Space Models. (arXiv:1409.8363v1 [math.ST])
- Socio-economic inequalities: a statistical physics perspective. (arXiv:1409.8030v2 [physics.soc-ph] UPDATED)
- Beyond Maximum Likelihood: from Theory to Practice. (arXiv:1409.7458v1 [stat.ME])
- Identification of jump Markov linear models using particle filters. (arXiv:1409.7287v1 [stat.CO])
- Flexible modelling in statistics: past, present and future. (arXiv:1409.6219v1 [stat.ME])
- Optimal filtering and the dual process
- Particle-kernel estimation of the filter density in state-space models
- Large deviations for bootstrapped empirical measures
- A Comprehensive Method for Solving Finite-State Semi-Markov Processes. (arXiv:1212.1440v3 [stat.AP] UPDATED)
- How to read probability distributions as statements about process. (arXiv:1409.5196v3 [stat.OT] UPDATED)
- Generalised Fisher Matrices. (arXiv:1404.2854v2 [astro-ph.CO] CROSS LISTED)
- Statistical inference with probabilistic graphical models. (arXiv:1409.4928v1 [cs.LG])
- L\'evy flights with power-law absorption. (arXiv:1409.4453v2 [cond-mat.stat-mech] UPDATED)
- The Randomized Causation Coefficient. (arXiv:1409.4366v1 [stat.ML])
- Bayesian inference for Markov jump processes with informative observations. (arXiv:1409.4362v1 [stat.CO])
- Theory of Parallel Particle Filters for Hidden Markov Models. (arXiv:1409.4160v1 [math.ST])
- Estimating time-changes in noisy Lévy models
- Deterministic Mean-field Ensemble Kalman Filtering. (arXiv:1409.0628v5 [math.PR] UPDATED)
- Efficient Gaussian Sampling for Solving Large-Scale Inverse Problems using MCMC Methods. (arXiv:1409.0606v1 [stat.ME])
- Consistency and fluctuations for stochastic gradient Langevin dynamics. (arXiv:1409.0578v2 [stat.ML] UPDATED)
- How often should you clean your room?. (arXiv:1305.1984v2 [math.CO] UPDATED)
- Large deviations for weighted empirical measures arising in importance sampling. (arXiv:1210.2251v2 [math.PR] UPDATED)
- Parametric estimation of L\'evy processes. (arXiv:1409.0292v1 [math.ST])
- Spectral gaps for a Metropolis–Hastings algorithm in infinite dimensions
- Asymptotic and finite-sample properties of estimators based on stochastic gradients. (arXiv:1408.2923v6 [stat.ME] UPDATED)
- Maximum Likelihood Estimation for Stochastic Differential Equations Using Sequential Kriging-Based Optimization. (arXiv:1408.2441v1 [stat.ME])
- Probabilistic inverse reinforcement learning in unknown environments. (arXiv:1408.2067v1 [cs.LG])
- The Bayesian Approach To Inverse Problems. (arXiv:1302.6989v4 [math.PR] UPDATED)
- Non-parametric Stochastic Approximation with Large Step sizes. (arXiv:1408.0361v3 [math.ST] UPDATED)
- Randomization for Markov chains with applications to networks in a random environment. (arXiv:1407.8378v1 [math.PR])
- A simple scheme for the parallelization of particle filters and its application to the tracking of complex stochastic systems. (arXiv:1407.8071v2 [stat.CO] UPDATED)
- Branching random walk with a random environment in time. (arXiv:1407.7623v1 [math.PR])
- On the Convergence Rates of Some Adaptive Markov Chain Monte Carlo Algorithms. (arXiv:1207.6779v2 [math.PR] UPDATED)
- The filtering equations revisited. (arXiv:1407.6043v2 [math.PR] UPDATED)
- Perfect simulation using atomic regeneration with application to Sequential Monte Carlo. (arXiv:1407.5770v1 [stat.CO])
- Long-term stability of sequential Monte Carlo methods under verifiable conditions. (arXiv:1203.6898v2 [math.ST] UPDATED)
- Quantitative model-checking of controlled discrete-time Markov processes. (arXiv:1407.5449v1 [math.PR])
- Maximum likelihood estimation and Expectation-Maximization algorithm for controlled branching processes. (arXiv:1407.5341v3 [math.ST] UPDATED)
- Likelihood-free inference via classification. (arXiv:1407.4981v3 [stat.CO] UPDATED)
- Statistical Inference with Different Missing-data Mechanisms. (arXiv:1407.4971v1 [stat.ME])
- Parametric estimation of a one-dimensional ballistic random walk in a Markov environment. (arXiv:1407.4905v2 [math.ST] UPDATED)
- Parallel MCMC with Generalized Elliptical Slice Sampling; Robert Nishihara, Iain Murray, Ryan P. Adams
- Particle Gibbs with Ancestor Sampling; Fredrik Lindsten, Michael I. Jordan, Thomas B. Schön
- The inverse problem for rough controlled differential equations. (arXiv:1407.2768v3 [math.PR] UPDATED)
- From Boltzmann to random matrices and beyond. (arXiv:1405.1003v4 [math.HO] UPDATED)
- Mean-field stochastic differential equations and associated PDEs. (arXiv:1407.1215v1 [math.PR])
- Discrete-time probabilistic approximation of path-dependent stochastic control problems. (arXiv:1407.0499v1 [math.PR])
- Detailed balance and entanglement. (arXiv:1407.0520v2 [quant-ph] UPDATED)
- Estimation of nonlinear differential equation model for glucose–insulin dynamics in type I diabetic patients using generalized smoothing
- Estimation in the partially observed stochastic Morris–Lecar neuronal model with particle filter and stochastic approximation methods
- Maximum likelihood and pseudo score approaches for parametric time-to-event analysis with informative entry times
- Infinite Structured Hidden Semi-Markov Models. (arXiv:1407.0044v1 [stat.ME])
- SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives. (arXiv:1407.0202v3 [cs.LG] UPDATED)
- Asymptotic lower bounds in estimating jumps. (arXiv:1407.0241v1 [math.ST])
- Variational approach for spatial point process intensity estimation. (arXiv:1407.0249v1 [math.ST])
- Learning Laplacian Matrix in Smooth Graph Signal Representations. (arXiv:1406.7842v3 [cs.LG] UPDATED)
- Maximum Likelihood Estimation of Functionals of Discrete Distributions. (arXiv:1406.6959v7 [cs.IT] UPDATED)
- Extreme value statistics of correlated random variables. (arXiv:1406.6768v3 [cond-mat.stat-mech] UPDATED)
- Online learning in MDPs with side information. (arXiv:1406.6812v1 [cs.LG])
- Discrete-time probabilistic approximation of path-dependent stochastic control problems
- Long-term stability of sequential Monte Carlo methods under verifiable conditions
- Mixing time of the card-cyclic-to-random shuffle
- Limit theorems for the empirical distribution function of scaled increments of Itô semimartingales at high frequencies
- Data augmentation for models based on rejection sampling. (arXiv:1406.6652v2 [stat.CO] UPDATED)
- Causality Networks. (arXiv:1406.6651v1 [cs.LG])
- A scaled gradient projection method for Bayesian learning in dynamical systems. (arXiv:1406.6603v3 [math.NA] UPDATED)
- What we talk about when we talk about fields. (arXiv:1406.6371v1 [astro-ph.IM])
- Learning the ergodic decomposition. (arXiv:1406.6670v1 [math.ST])
- Forest resampling for distributed sequential Monte Carlo. (arXiv:1406.6010v1 [stat.CO])
- Asymptotic theory for density ridges. (arXiv:1406.5663v3 [stat.ME] UPDATED)
- Rows vs Columns for Linear Systems of Equations - Randomized Kaczmarz or Coordinate Descent?. (arXiv:1406.5295v1 [math.OC])
- Rate optimality of Random walk Metropolis algorithm in high-dimension with heavy-tailed target distribution. (arXiv:1406.5392v2 [stat.ME] UPDATED)
- Predictive Characterization of Mixtures of Markov Chains. (arXiv:1406.5421v3 [stat.ME] UPDATED)
- A remark on the rates of convergence for integrated volatility estimation in the presence of jumps
- Variational Gaussian Process State-Space Models. (arXiv:1406.4905v2 [cs.LG] UPDATED)
- Divide-and-Conquer with Sequential Monte Carlo. (arXiv:1406.4993v2 [stat.CO] UPDATED)
- Inferring causal structure: a quantum advantage. (arXiv:1406.5036v1 [quant-ph])
- Bayesian estimation of discretely observed multi-dimensional diffusion processes using guided proposals. (arXiv:1406.4704v3 [stat.CO] UPDATED)
- Estimation of Causal Invertible VARMA Models. (arXiv:1406.4584v1 [math.ST])
- On the properties of Laplace transform originating from one-sided L\'evy stable laws. (arXiv:1406.3802v2 [math-ph] UPDATED)
- Smoothed Gradients for Stochastic Variational Inference. (arXiv:1406.3650v2 [stat.ML] UPDATED)
- Fisher information and convergence to stable laws
- A robust, adaptive M-estimator for pointwise estimation in heteroscedastic regression
- High-dimensional covariance matrix estimation with missing observations
- Asymptotic lower bounds in estimating jumps
- Asymptotic properties of adaptive maximum likelihood estimators in latent variable models
- On the computation of the marginal likelihood. (arXiv:1306.1170v2 [math.ST] UPDATED)
- Stochastic Analysis Seminar on Filtering Theory. (arXiv:1406.1936v2 [q-fin.MF] UPDATED)
- Variational inference of latent state sequences using Recurrent Networks. (arXiv:1406.1655v2 [stat.ML] UPDATED)
- Statistics for Tail Processes of Markov Chains. (arXiv:1405.7721v2 [stat.ME] UPDATED)
- Lazy ABC. (arXiv:1405.7867v3 [stat.CO] UPDATED)
- Bayesian Analysis of the Functional-Coefficient Autoregressive Heteroscedastic Model
- Estimation in high dimensions: a geometric perspective. (arXiv:1405.5103v2 [math.ST] UPDATED)
- Maximum Likelihood for Dual Varieties. (arXiv:1405.5143v1 [math.ST])
- Convex Optimization: Algorithms and Complexity. (arXiv:1405.4980v2 [math.OC] UPDATED)
- Efficient estimation of integrated volatility in presence of infinite variation jumps
- Stochastic analysis for Poisson processes. (arXiv:1405.4416v1 [math.PR])
- Stochastic Volatility Filtering with Intractable Likelihoods. (arXiv:1405.4323v1 [stat.CO])
- A Review on asymptotic normality of sums of associated random variables. (arXiv:1405.4316v3 [stat.ME] UPDATED)
- Antithetic multilevel Monte Carlo estimation for multi-dimensional SDEs without L\'{e}vy area simulation. (arXiv:1202.6283v4 [q-fin.CP] CROSS LISTED)
- A geometric approach to archetypal analysis and non-negative matrix factorization. (arXiv:1405.4275v2 [stat.ME] UPDATED)
- Sequential Monte Carlo with Highly Informative Observations. (arXiv:1405.4081v2 [stat.CO] UPDATED)
- On the stability of sequential Monte Carlo methods in high dimensions
- Maximum likelihood estimation in the context of a sub-ballistic random walk in a parametric random environment. (arXiv:1405.2880v1 [math.PR])
- Statistical Causality from a Decision-Theoretic Perspective. (arXiv:1405.2292v1 [math.ST])
- Parameter estimation of a two-colored urn model class. (arXiv:1405.2322v2 [math.ST] UPDATED)
- Marginal integration for nonparametric causal inference. (arXiv:1405.1868v3 [stat.ME] UPDATED)
- Parallel resampling in the particle filter. (arXiv:1301.4019v3 [stat.CO] UPDATED)
- Estimating the transition matrix of a Markov chain observed at random times. (arXiv:1405.0384v1 [math.ST])
- Estimation of stable distribution parameters from a dependent sample. (arXiv:1405.0374v1 [math.ST])
- Ergodicity of Approximate MCMC Chains with Applications to Large Data Sets. (arXiv:1405.0182v2 [math.ST] UPDATED)
- Fast MLE Computation for the Dirichlet Multinomial. (arXiv:1405.0099v2 [stat.ML] UPDATED)
- Establishing some order amongst exact approximations of MCMCs. (arXiv:1404.6909v2 [stat.CO] UPDATED)
- The Use of a Single Pseudo-Sample in Approximate Bayesian Computation. (arXiv:1404.6298v5 [stat.CO] UPDATED)
- An Equivalence between the Lasso and Support Vector Machines. (arXiv:1303.1152v2 [cs.LG] UPDATED)
- On particle Gibbs Markov chain Monte Carlo models. (arXiv:1404.5733v5 [math.PR] UPDATED)
- Discrete Restricted Boltzmann Machines. (arXiv:1301.3529v4 [stat.ML] UPDATED)
- Recent advances and open challenges in percolation. (arXiv:1404.5325v1 [cond-mat.stat-mech])
- A comparison of nonlinear population Monte Carlo and particle Markov chain Monte Carlo algorithms for Bayesian inference in stochastic kinetic models. (arXiv:1404.5218v1 [stat.ME])
- Bias-correction of the maximum likelihood estimator for the $\alpha$-Brownian bridge. (arXiv:1404.4452v1 [math.ST])
- Simulation based sequential Monte Carlo methods for discretely observed Markov processes. (arXiv:1404.4185v1 [stat.CO])
- Nonparametric identification and maximum likelihood estimation for hidden Markov model. (arXiv:1404.4210v2 [math.ST] UPDATED)
- Speeding Up MCMC by Efficient Data Subsampling. (arXiv:1404.4178v6 [stat.ME] UPDATED)
- Defining a Trend for a Time Series Which Makes Use of the Intrinsic Time-Scale Decomposition. (arXiv:1404.3827v1 [physics.data-an])
- Limits of Sequences of Markov Chains. (arXiv:1404.3815v3 [math.LO] UPDATED)
- Quasi-stationary distributions for randomly perturbed dynamical systems. (arXiv:1101.3420v4 [math.PR] UPDATED)
- Noisy Optimization: Convergence with a Fixed Number of Resamplings. (arXiv:1404.2553v1 [math.OC])
- ECF identification of GARCH systems driven by L\'evy processes. (arXiv:1404.3046v1 [math.ST])
- Recursive ECF identification of linear systems driven by L\'evy processes. (arXiv:1404.3051v1 [math.ST])
- Kalman filter in quantum language. (arXiv:1404.2664v1 [math.ST])
- Non-asymptotic confidence intervals for MCMC in practice. (arXiv:1212.2016v6 [math.PR] UPDATED)
- The Brownian fan. (arXiv:1404.2928v1 [math.PR])
- Improved diffusion Monte Carlo. (arXiv:1207.2866v2 [math.PR] UPDATED)
- On the use of Markov chain Monte Carlo methods for the sampling of mixture models. (arXiv:1404.0880v1 [stat.CO])
- On parameter identification in stochastic differential equations by penalized maximum likelihood. (arXiv:1404.0651v1 [stat.CO])
- Irreversible Langevin samplers and variance reduction: a large deviation approach. (arXiv:1404.0105v4 [math.PR] UPDATED)
- An introduction to Monte Carlo methods. (arXiv:1404.0209v1 [cond-mat.stat-mech])
- A Stable Manifold MCMC Method for High Dimensions. (arXiv:1403.7711v1 [stat.ME])
- Prior-free probabilistic prediction of future observations. (arXiv:1403.7589v3 [stat.ME] UPDATED)
- Information-geometric Markov Chain Monte Carlo methods using Diffusions. (arXiv:1403.7957v3 [stat.CO] UPDATED)
- On a Poissonian Change-Point Model with Variable Jump Size. (arXiv:1403.7866v2 [math.ST] UPDATED)
- Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator. (arXiv:1210.1871v4 [stat.ME] UPDATED)
- Multi-scaling of moments in stochastic volatility models. (arXiv:1403.7387v1 [math.PR])
- A L\'evy-driven rainfall model with applications to futures pricing. (arXiv:1403.7406v2 [stat.ME] UPDATED)
- Asymmetric COGARCH processes. (arXiv:1403.7068v1 [math.ST])
- Moment Conditions for Convergence of Particle Filters with Unbounded Importance Weights. (arXiv:1403.6585v3 [math.DS] CROSS LISTED)
- Scalable Inference for Markov Processes with Intractable Likelihoods. (arXiv:1403.6886v2 [stat.CO] UPDATED)
- Convergence of Markovian Stochastic Approximation with discontinuous dynamics. (arXiv:1403.6803v2 [math.ST] UPDATED)
- Causality between time series. (arXiv:1403.6496v1 [stat.ME])
- Firefly Monte Carlo: Exact MCMC with Subsets of Data. (arXiv:1403.5693v1 [stat.ML])
- Noisy Monte Carlo: Convergence of Markov chains with approximate transition kernels. (arXiv:1403.5496v3 [stat.ME] UPDATED)
- Smoothing Dynamic Systems with State-Dependent Covariance Matrices. (arXiv:1211.4601v2 [math.OC] UPDATED)
- Transdimensional Transformation based Markov Chain Monte Carlo. (arXiv:1403.5207v5 [stat.CO] UPDATED)
- Accuracy of Maximum Likelihood Parameter Estimators for Heston volatility SDE. (arXiv:1403.4893v1 [math.PR])
- Criteria for transience and recurrence of regime-switching diffusion processes. (arXiv:1403.3135v2 [math.PR] UPDATED)
- Efficient maximum likelihood estimation for L\'{e}vy-driven Ornstein-Uhlenbeck processes. (arXiv:1403.2954v1 [math.ST])
- Bayesian dynamic financial networks with time-varying predictors. (arXiv:1403.2272v1 [stat.ME])
- Constraint-based Causal Discovery from Multiple Interventions over Overlapping Variable Sets. (arXiv:1403.2150v1 [stat.ML])
- Second order discretization of backward SDEs and simulation with the cubature method
- Markovian stochastic approximation with expanding projections. (arXiv:1111.5421v2 [math.PR] UPDATED)
- Estimating complex causal effects from incomplete observational data. (arXiv:1403.1124v2 [stat.ME] UPDATED)
- Convergence of switching diffusions. (arXiv:1403.0705v2 [math.PR] UPDATED)
- Convergence properties of pseudo-marginal Markov chain Monte Carlo algorithms. (arXiv:1210.1484v3 [math.PR] UPDATED)
- Approximate Integrated Likelihood via ABC methods. (arXiv:1403.0387v1 [stat.CO])
- Parameter estimation for the subcritical Heston model based on discrete time observations. (arXiv:1403.0527v2 [math.ST] UPDATED)
- A convolution method for numerical solution of backward stochastic differential equations. (arXiv:1304.1783v3 [math.PR] UPDATED)
- Compressible Generalized Hybrid Monte Carlo. (arXiv:1402.7107v1 [physics.comp-ph])
- Efficient maximum likelihood estimation for Lévy-driven Ornstein–Uhlenbeck processes
- Markov properties for mixed graphs
- Statistical convergence of Markov experiments to diffusion limits
- Markovian stochastic approximation with expanding projections
- Simple simulation of diffusion bridges with application to likelihood inference for diffusions
- Maximum likelihood characterization of distributions
- A central limit theorem for adaptive and interacting Markov chains
- Annealed Important Sampling for Models with Latent Variables. (arXiv:1402.6035v1 [stat.ME])
- Zero Variance Differential Geometric Markov Chain Monte Carlo Algorithms
- Variational Particle Approximations. (arXiv:1402.5715v3 [stat.ML] UPDATED)
- Markov Switching Component ARCH Model: Stability and Forecasting. (arXiv:1303.5525v2 [stat.ME] UPDATED)
- Classification with Sparse Overlapping Groups. (arXiv:1402.4512v2 [cs.LG] UPDATED)
- Twisted particle filters
- Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning. (arXiv:1402.4419v3 [math.OC] UPDATED)
- A convergence proof of the split Bregman method for regularized least-squares problems. (arXiv:1402.4371v1 [math.OC])
- Stochastic Gradient Hamiltonian Monte Carlo. (arXiv:1402.4102v2 [stat.ME] UPDATED)
- Fast Hamiltonian Monte Carlo Using GPU Computing. (arXiv:1402.4089v1 [stat.CO])
- Sequential Quasi-Monte Carlo. (arXiv:1402.4039v5 [stat.CO] UPDATED)
- Ergodicity and scaling limit of a constrained multivariate Hawkes process. (arXiv:1301.5007v2 [stat.AP] UPDATED)
- On Zeroth-Order Stochastic Convex Optimization via Random Walks. (arXiv:1402.2667v1 [cs.LG])
- On the long-time integration of stochastic gradient systems. (arXiv:1402.2797v1 [math.NA])
- On perturbed proximal gradient algorithms. (arXiv:1402.2365v4 [math.ST] UPDATED)
- Accelerating Asymptotically Exact MCMC for Computationally Intensive Models via Local Approximations. (arXiv:1402.1694v4 [stat.ME] UPDATED)
- Consistent inference of a general model using the pseudo-likelihood method. (arXiv:1402.1578v2 [cond-mat.dis-nn] UPDATED)
- Adaptive ABC model choice and geometric summary statistics for hidden Gibbs random fields. (arXiv:1402.1380v2 [math.ST] UPDATED)
- An Ensemble Kushner-Stratonovich (EnKS) Nonlinear Filter: Additive Particle Updates in Non-Iterative and Iterative Forms. (arXiv:1402.1253v1 [stat.ME])
- Markov bridges: SDE representation. (arXiv:1402.0822v4 [math.PR] UPDATED)
- Particle Metropolis adjusted Langevin algorithms for state space models. (arXiv:1402.0694v2 [stat.CO] UPDATED)
- Sequential Monte Carlo for Graphical Models. (arXiv:1402.0330v4 [stat.ME] UPDATED)
- Quantifying causal influences
- Approximations of Stochastic Partial Differential Equations. (arXiv:1401.7794v1 [math.PR])
- Quantifying causal influences. (arXiv:1203.6502v2 [math.ST] UPDATED)
- Parallel Optimisation of Bootstrapping in R. (arXiv:1401.6389v1 [stat.CO])
- The EM algorithm and the Laplace Approximation. (arXiv:1401.6276v1 [stat.ML])
- Phase Transitions in Nonlinear Filtering. (arXiv:1401.6450v2 [math.PR] UPDATED)
- Bayesian modeling and forecasting of 24-hour high-frequency volatility: A case study of the financial crisis. (arXiv:1211.2961v2 [stat.AP] UPDATED)
- A Spatio-Temporal Point Process Model for Ambulance Demand. (arXiv:1401.5547v2 [stat.AP] UPDATED)
- An attraction-repulsion point process model for respiratory syncytial virus infections. (arXiv:1401.5506v2 [stat.ME] UPDATED)
- The Why and How of Nonnegative Matrix Factorization. (arXiv:1401.5226v2 [stat.ML] UPDATED)
- Hydrodynamic limit for interacting neurons. (arXiv:1401.4264v2 [math.PR] UPDATED)
- Monte Carlo Simulation for Lasso-Type Problems by Estimator Augmentation. (arXiv:1401.4425v2 [stat.ME] UPDATED)
- Model selection of stochastic simulation algorithm based on generalized divergence measures. (arXiv:1401.5015v1 [stat.ME])
- Opinion Exchange Dynamics. (arXiv:1401.4770v2 [math.PR] UPDATED)
- Random Walk on Random Walks. (arXiv:1401.4498v2 [math.PR] UPDATED)
- Inference on Self-Exciting Jumps in Prices and Volatility using High Frequency Measures. (arXiv:1401.3911v3 [stat.AP] UPDATED)
- Information Geometry Approach to Parameter Estimation in Markov Chains. (arXiv:1401.3814v4 [math.ST] UPDATED)
- Multilevel Monte Carlo for the Feynman-Kac Formula for the Laplace Equation. (arXiv:1401.3891v1 [math.PR])
- Minimising MCMC variance via diffusion limits, with an application to simulated tempering. (arXiv:1401.3559v1 [math.PR])
- Long Time Results for a Weakly Interacting Particle System in Discrete Time. (arXiv:1401.3423v1 [math.PR])
- Stochastic Optimization with Importance Sampling. (arXiv:1401.2753v2 [stat.ML] UPDATED)
- Filtering the Maximum Likelihood for Multiscale Problems. (arXiv:1305.1918v4 [math.PR] UPDATED)
- Robust Large Scale Non-negative Matrix Factorization using Proximal Point Algorithm. (arXiv:1401.1842v1 [stat.ML])
- Penalized estimation in high-dimensional hidden Markov models with state-specific graphical models. (arXiv:1208.4989v2 [stat.ME] CROSS LISTED)
- A flexible Particle Markov chain Monte Carlo method. (arXiv:1401.1667v6 [stat.CO] UPDATED)
- Fast nonparametric clustering of structured time-series. (arXiv:1401.1605v2 [cs.LG] UPDATED)
- Accelerating ABC methods using Gaussian processes. (arXiv:1401.1436v2 [stat.CO] UPDATED)
- Measures of Causality in Complex Datasets with application to financial data. (arXiv:1401.1457v2 [q-fin.CP] UPDATED)
- Data Smashing. (arXiv:1401.0742v1 [cs.LG])
- Uniform ergodicity of the Particle Gibbs sampler. (arXiv:1401.0683v2 [math.ST] UPDATED)
- Particle Gibbs with Ancestor Sampling. (arXiv:1401.0604v1 [stat.CO])
- Study design in causal models. (arXiv:1211.2958v4 [stat.ME] UPDATED)
- Approximation of epidemic models by diffusion processes and their statistical inference. (arXiv:1305.3492v2 [stat.ME] UPDATED)
- Approximate Bayesian Computation for a Class of Time Series Models. (arXiv:1401.0265v1 [stat.CO])
- Efficient inference and simulation for elliptical Pareto processes. (arXiv:1401.0168v2 [stat.ME] UPDATED)
- Spatial patterns of competing random walkers. (arXiv:1401.0413v1 [q-bio.PE])
- Covariance and precision matrix estimation for high-dimensional time series
- Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses
Saved in 2013
- Processus al\'eatoires et applications. (arXiv:1312.7796v1 [math.HO])
- Martingales et calcul stochastique. (arXiv:1312.7799v1 [math.HO])
- Parallel Markov Chain Monte Carlo. (arXiv:1312.7479v1 [stat.CO])
- MLE's bias pathology motivates MCMLE. (arXiv:1312.7709v1 [math.ST])
- Bayesian prediction for stochastic processes. Theory and applications. (arXiv:1211.2300v2 [math.ST] UPDATED)
- Supervised learning of a regression model based on latent process. Application to the estimation of fuel cell life time. (arXiv:1312.7003v1 [stat.ML])
- Time series modeling by a regression approach based on a latent process. (arXiv:1312.6969v1 [stat.ME])
- Model-based clustering and segmentation of time series with changes in regime. (arXiv:1312.6967v1 [stat.ME])
- A regression model with a hidden logistic process for feature extraction from time series. (arXiv:1312.7001v1 [stat.ME])
- Model-based clustering with Hidden Markov Model regression for time series with regime changes. (arXiv:1312.7024v1 [stat.ML])
- Near-separable Non-negative Matrix Factorization with $\ell_1$- and Bregman Loss Functions. (arXiv:1312.7167v1 [stat.ML])
- Robust EM algorithm for model-based curve clustering. (arXiv:1312.7022v1 [stat.ME])
- System of interacting particles with Markovian switching. (arXiv:1312.6897v1 [math.PR])
- Meteor process on ${\mathbb Z}^d$. (arXiv:1312.6865v2 [math.PR] UPDATED)
- Penalized estimation in high-dimensional hidden Markov models with state-specific graphical models
- Exact Simulation of Non-stationary Reflected Brownian Motion. (arXiv:1312.6456v1 [math.PR])
- Uniform Ergodicity of the Iterated Conditional SMC and Geometric Ergodicity of Particle Gibbs samplers. (arXiv:1312.6432v2 [math.PR] UPDATED)
- Estimating time-changes in noisy L\'evy models. (arXiv:1312.5911v5 [math.ST] UPDATED)
- Accelerated, Parallel and Proximal Coordinate Descent. (arXiv:1312.5799v2 [math.OC] UPDATED)
- A shrinkage-thresholding Metropolis adjusted Langevin algorithm for Bayesian variable selection. (arXiv:1312.5658v3 [math.ST] UPDATED)
- Approximate continuous-discrete filters for the estimation of diffusion processes from partial and noisy observations. (arXiv:1212.3721v2 [math.OC] UPDATED)
- Approximate discrete-time schemes for the estimation of diffusion processes from complete observations. (arXiv:1212.1788v2 [math.OC] UPDATED)
- Parameter inference from hitting times for perturbed Brownian motion. (arXiv:1312.5207v1 [stat.ME])
- A general theory of particle filters in hidden Markov models and some applications. (arXiv:1312.5114v1 [math.ST])
- A general theory of particle filters in hidden Markov models and some applications
- Error bounds of MCMC for functions with unbounded stationary variance. (arXiv:1312.4344v2 [math.ST] UPDATED)
- Parallel MCMC via Weierstrass Sampler. (arXiv:1312.4605v1 [stat.CO])
- Statistical inference for exponential functionals of L\'evy processe. (arXiv:1312.4731v1 [stat.OT])
- Marked empirical processes for non-stationary time series. (arXiv:1312.3120v1 [math.ST])
- Rare-event Probability Estimation via Empirical Likelihood Maximization. (arXiv:1312.3027v1 [stat.CO])
- A path-integral approach to Bayesian inference for inverse problems using the semiclassical approximation. (arXiv:1312.2974v4 [physics.data-an] UPDATED)
- Stochastic volatility models with possible extremal clustering. (arXiv:1312.2780v1 [math.ST])
- Sequential Monte Carlo Inference of Mixed Membership Stochastic Blockmodels for Dynamic Social Networks. (arXiv:1312.2154v1 [cs.SI])
- Quenched Large Deviations for Multiscale Diffusion Processes in Random Environments. (arXiv:1312.1731v3 [math.PR] UPDATED)
- Semi-Stochastic Gradient Descent Methods. (arXiv:1312.1666v2 [stat.ML] UPDATED)
- Max-Min Distance Nonnegative Matrix Factorization. (arXiv:1312.1613v1 [stat.ML])
- A categorical foundation for Bayesian probability. (arXiv:1205.1488v3 [math.CT] UPDATED)
- Recursive maximum likelihood identification of jump Markov nonlinear systems. (arXiv:1312.0781v1 [stat.CO])
- Hamiltonian Monte Carlo for Hierarchical Models. (arXiv:1312.0906v1 [stat.ME])
- A penalized simulated maximum likelihood approach in parameter estimation for stochastic differential equations. (arXiv:1305.4390v4 [stat.ME] UPDATED)
- Stochastic Mechanistic Interaction. (arXiv:1311.6756v3 [stat.ME] UPDATED)
- Online inference in Markov modulated nonlinear dynamic systems: a Rao-Blackwellized particle filtering approach. (arXiv:1311.6486v1 [stat.CO])
- Estimation and approximation in multidimensional dynamics. (arXiv:1311.5727v1 [stat.ME])
- Stochastic neural field equations: A rigorous footing. (arXiv:1311.5446v1 [math.PR])
- Parameter Estimation in Hidden Markov Models with Intractable Likelihoods Using Sequential Monte Carlo. (arXiv:1311.4117v1 [stat.CO])
- Nonparametric Multi-group Membership Model for Dynamic Networks. (arXiv:1311.2079v1 [cs.SI])
- The Rate of Convergence for Approximate Bayesian Computation. (arXiv:1311.2038v3 [math.ST] UPDATED)
- Computation of expectations by Markov chain Monte Carlo methods. (arXiv:1311.1899v2 [math.ST] UPDATED)
- Scalable Recommendation with Poisson Factorization. (arXiv:1311.1704v3 [cs.IR] UPDATED)
- Multivariate stochastic volatility modelling using Wishart autoregressive processes. (arXiv:1311.0530v1 [q-fin.CP])
- Statistical Inference in Hidden Markov Models using $k$-segment Constraints. (arXiv:1311.1189v1 [stat.ME])
- Statistical pairwise interaction model of stock market. (arXiv:1206.4420v6 [q-fin.ST] UPDATED)
- Predicting trend reversals using market instantaneous state. (arXiv:1310.8169v5 [q-fin.ST] UPDATED)
- Integrable probability: From representation theory to Macdonald processes. (arXiv:1310.8007v4 [math.PR] UPDATED)
- Statistical inference on errorfully observed graphs. (arXiv:1211.3601v4 [stat.ML] UPDATED)
- Maximum likelihood estimation for small noise multiscale diffusions. (arXiv:1301.6413v4 [math.ST] UPDATED)
- Inference in nonstationary asymmetric GARCH models
- PPF - A Parallel Particle Filtering Library. (arXiv:1310.5045v2 [cs.DC] UPDATED)
- Piecewise Constant Sequential Importance Sampling for Fast Particle Filtering. (arXiv:1310.5541v3 [stat.CO] UPDATED)
- Slowed exclusion process: hydrodynamics, fluctuations and phase transitions. (arXiv:1310.5161v1 [math.PR])
- Fast MCMC sampling for Markov jump processes and extensions. (arXiv:1208.4818v3 [stat.CO] UPDATED)
- A Theoretical and Experimental Comparison of the EM and SEM Algorithm. (arXiv:1310.5034v2 [cs.LG] UPDATED)
- Series Expansion Approximations of Brownian Motion for Non-Linear Kalman Filtering of Diffusion Processes. (arXiv:1302.5324v3 [stat.CO] UPDATED)
- Inference, Sampling, and Learning in Copula Cumulative Distribution Networks. (arXiv:1310.4456v1 [stat.ML])
- Superposition of COGARCH processes. (arXiv:1305.2296v3 [math.PR] UPDATED)
- Statistical likelihood methods in finance. (arXiv:1310.4400v2 [math.PR] UPDATED)
- Finite Difference Schemes for Linear Stochastic Integro-Differential Equations. (arXiv:1310.4117v5 [math.PR] UPDATED)
- A Simulated Annealing Approach to Approximate Bayes Computations. (arXiv:1208.2157v2 [stat.CO] UPDATED)
- Homotopy Probability Theory II. (arXiv:1302.5325v4 [math.PR] UPDATED)
- Homotopy Probability Theory I. (arXiv:1302.3684v3 [math.PR] UPDATED)
- A primal-dual algorithm for BSDEs. (arXiv:1310.3694v2 [q-fin.CP] UPDATED)
- Stochastic analysis of biochemical reaction networks with absolute concentration robustness. (arXiv:1310.3761v2 [math.PR] UPDATED)
- Stochastic flows and an interface SDE on metric graphs. (arXiv:1310.3576v5 [math.PR] UPDATED)
- Factorial moments of point processes. (arXiv:1310.3531v1 [math.PR])
- Intrinsic noise and discrete-time processes. (arXiv:1306.0837v2 [cond-mat.stat-mech] UPDATED)
- Negative Binomial Process Count and Mixture Modeling. (arXiv:1209.3442v3 [stat.ME] UPDATED)
- Identifying Influential Entries in a Matrix. (arXiv:1310.3556v2 [cs.NA] UPDATED)
- A generalized Multiple-try Metropolis version of the Reversible Jump algorithm. (arXiv:1006.0621v2 [stat.ME] UPDATED)
- Fractional Poisson processes and their representation by infinite systems of ordinary differential equations. (arXiv:1310.3161v1 [math.CA])
- Optimal filtering and the dual process. (arXiv:1305.4571v4 [math.ST] UPDATED)
- Multi-level stochastic approximation algorithms. (arXiv:1310.2052v2 [math.PR] UPDATED)
- Sequential Monte Carlo Bandits. (arXiv:1310.1404v1 [stat.ML])
- Particle filters. (arXiv:1309.7807v1 [math.ST])
- Some things we've learned (about Markov chain Monte Carlo). (arXiv:1309.7754v1 [math.ST])
- On stochastic finite difference schemes. (arXiv:1309.7610v1 [math.PR])
- Gaussian Processes for Big Data. (arXiv:1309.6835v1 [cs.LG])
- Particle Efficient Importance Sampling. (arXiv:1309.6745v1 [stat.CO])
- Interacting particle systems as stochastic social dynamics. (arXiv:1309.6766v1 [math.ST])
- A Tricentenary history of the Law of Large Numbers. (arXiv:1309.6488v1 [math.ST])
- H\"ormander's theorem for stochastic partial differential equations. (arXiv:1309.5543v2 [math.PR] UPDATED)
- Hypoellipticity for filtering problems of partially observable diffusion processes. (arXiv:1309.5545v1 [math.PR])
- Asymptotic equivalence of jumps L\'evy processes and their discrete counterpart. (arXiv:1305.6725v2 [math.PR] UPDATED)
- Extremes and first passage times of correlated fBm's. (arXiv:1309.4981v2 [math.PR] UPDATED)
- Concentration of the Stationary Distribution on General Random Directed Graphs. (arXiv:1309.4811v1 [math.PR])
- Non-linear dependences in finance. (arXiv:1309.5073v1 [q-fin.ST])
- Twisted particle filters. (arXiv:1210.0220v4 [stat.CO] UPDATED)
- Diffusion of interacting particles in discrete geometries. (arXiv:1305.2095v2 [cond-mat.stat-mech] UPDATED)
- Sparsity Based Poisson Denoising with Dictionary Learning. (arXiv:1309.4306v3 [cs.CV] UPDATED)
- ecp: An R Package for Nonparametric Multiple Change Point Analysis of Multivariate Data. (arXiv:1309.3295v2 [stat.CO] UPDATED)
- Forward-Backward SDEs driven by L\'evy Processes and Application to Option Pricing. (arXiv:1203.5546v3 [math.PR] UPDATED)
- Metropolis-Hastings within Partially Collapsed Gibbs Samplers. (arXiv:1309.3217v2 [stat.CO] UPDATED)
- On the role of interaction in sequential Monte Carlo algorithms. (arXiv:1309.2918v3 [stat.CO] UPDATED)
- Stochastic processes with random contexts: a characterization, and adaptive estimators for the transition probabilities. (arXiv:1309.2819v2 [math.PR] UPDATED)
- The Kac Model Coupled to a Thermostat. (arXiv:1309.2715v2 [math-ph] UPDATED)
- Two limiting regimes of interacting Bessel processes. (arXiv:1309.2733v3 [math-ph] UPDATED)
- On the Prior and Posterior Distributions Used in Graphical Modelling
- $\varphi$-strong solutions and uniqueness of 1-dimensional stochastic differential equations. (arXiv:1309.1551v1 [math.PR])
- Convergent Stochastic Expectation Maximization algorithm with efficient sampling in high dimension. Application to deformable template model estimation. (arXiv:1207.5938v4 [math.ST] UPDATED)
- Statistics of weighted Poisson events and its applications. (arXiv:1309.1287v1 [physics.data-an])
- On the regularity of American options with regime-switching uncertainty. (arXiv:1309.1404v4 [math.PR] UPDATED)
- Exact Simulation of Wishart Multidimensional Stochastic Volatility Model. (arXiv:1309.0557v1 [q-fin.PR])
- Robust filtering: Correlated noise and multidimensional observation
- A computational framework for infinite-dimensional Bayesian inverse problems: Part II. Stochastic Newton MCMC with application to ice sheet flow inverse problems. (arXiv:1308.6221v2 [stat.ME] UPDATED)
- Semiparametric stochastic volatility modelling using penalized splines. (arXiv:1308.5836v3 [stat.ME] UPDATED)
- Adaptive Metropolis Algorithm Using Variational Bayesian Adaptive Kalman Filter. (arXiv:1308.5875v3 [math.ST] UPDATED)
- Modelling group dynamic animal movement. (arXiv:1308.5850v1 [q-bio.QM])
- Predicting the time at which a L\'evy process attains its ultimate supremum. (arXiv:1207.4736v3 [math.PR] UPDATED)
- Monte Carlo approximations of the Neumann problem. (arXiv:1203.4910v2 [math.PR] UPDATED)
- Approximations of a complex Brownian motion with processes constructed from a process with independent increments. (arXiv:1308.5854v1 [math.PR])
- Integration by Parts Formula and Applications for SDEs with L\'evy Noise. (arXiv:1308.5799v1 [math.PR])
- Large deviations of empirical neighborhood distribution in sparse random graphs. (arXiv:1308.5725v2 [math.PR] UPDATED)
- Top-down particle filtering for Bayesian decision trees. (arXiv:1303.0561v2 [stat.ML] UPDATED)
- Online and stochastic Douglas-Rachford splitting method for large scale machine learning. (arXiv:1308.4757v9 [cs.NA] UPDATED)
- Twisting the Alive Particle Filter. (arXiv:1308.4462v1 [stat.ME])
- Statistics, Causality and Bell's Theorem. (arXiv:1207.5103v6 [stat.AP] CROSS LISTED)
- Sequential Markov Chain Monte Carlo. (arXiv:1308.3861v1 [math.ST])
- Improved likelihood inference in generalized linear models. (arXiv:1308.3467v1 [stat.ME])
- Convergence of Gaussian quasi-likelihood random fields for ergodic L\'{e}vy driven SDE observed at high frequency. (arXiv:1308.2830v1 [math.ST])
- Maximum-likelihood estimation for diffusion processes via closed-form density expansions. (arXiv:1308.2764v1 [math.ST])
- A limit theorem for the sum of squared differences of an integrated Ito process with application to inverse scattering. (arXiv:1211.6413v3 [math.PR] UPDATED)
- Optimal transport from Lebesgue to Poisson. (arXiv:1012.3845v2 [math.PR] UPDATED)
- Malliavin calculus approach to statistical inference for Levy driven SDE's. (arXiv:1301.5141v2 [math.PR] UPDATED)
- Intervention in Ornstein-Uhlenbeck SDEs. (arXiv:1308.2152v1 [math.PR])
- Nested particle filters for online parameter estimation in discrete-time state-space Markov models. (arXiv:1308.1883v5 [stat.CO] UPDATED)
- Smooth densities of stochastic differential equations forced by degenerate stable type noises. (arXiv:1308.1124v5 [math.PR] UPDATED)
- Identification of Finite Dimensional Linear Systems Driven by Levy processes. (arXiv:1308.1211v2 [math.ST] UPDATED)
- EM algorithms for estimating the Bernstein copula. (arXiv:1301.2677v4 [stat.CO] UPDATED)
- Spatial Process Generation. (arXiv:1308.0399v1 [stat.CO])
- Nearly Gaussian random variables and application to meteorology. (arXiv:1308.0248v1 [math.PR])
- Nonparametric inference on Lévy measures and copulas
- Backward SPDEs with non-local in time and space boundary conditions. (arXiv:1211.1460v3 [math.PR] UPDATED)
- Integration with respect to L\'evy colored noise, with applications to SPDEs. (arXiv:1307.8426v1 [math.PR])
- A Fractional Generalization of the Poisson Processes and Some of its Properties. (arXiv:1307.8271v1 [math.ST])
- Poisson stochastic integration in Banach spaces. (arXiv:1307.7901v1 [math.PR])
- Counterfactual Reasoning and Learning Systems. (arXiv:1209.2355v5 [cs.LG] UPDATED)
- [1307.6127] Sequential Monte Carlo Methods for High-Dimensional Inverse Problems: A case study for the Navier-Stokes equations
- Integral representation of random variables with respect to Gaussian processes. (arXiv:1307.7559v7 [math.PR] UPDATED)
- Effect of sampling on the estimation of drift parameter of continuous time AR(1) processes. (arXiv:1307.6865v1 [math.ST])
- Streaming Variational Bayes. (arXiv:1307.6769v2 [stat.ML] UPDATED)
- GARCH-extended models: theoretical properties and applications. (arXiv:1307.6685v1 [math.ST])
- Properties of solutions of stochastic differential equations driven by the G-Brownian motion. (arXiv:1010.3158v2 [math.PR] UPDATED)
- Central limit theorem for a Stratonovich integral with Malliavin calculus. (arXiv:1105.4841v3 [math.PR] UPDATED)
- Ranking and mapping of universities and research-focused institutions worldwide based on highly-cited papers: A visualization of results from multi-level models. (arXiv:1212.0304v2 [cs.DL] UPDATED)
- A cubature based algorithm to solve decoupled McKean-Vlasov Forward Backward Stochastic Differential Equations. (arXiv:1307.6427v3 [math.PR] UPDATED)
- Non-uniform approximations for sums of discrete m-dependent random variables. (arXiv:1307.6296v1 [math.PR])
- Extreme gaps between eigenvalues of random matrices. (arXiv:1010.1294v3 [math.PR] UPDATED)
- Sequential Monte Carlo Methods for High-Dimensional Inverse Problems: A case study for the Navier-Stokes equations. (arXiv:1307.6127v1 [stat.CO])
- Variational estimators for the parameters of Gibbs point process models. (arXiv:1307.5971v1 [math.ST])
- Parameter estimation based on discrete observations of fractional Ornstein-Uhlenbeck process of the second kind. (arXiv:1304.2466v5 [math.PR] UPDATED)
- Contractive Markov systems II. (arXiv:math/0503633v15 [math.PR] UPDATED)
- Large deviations for stable like random walks on $\mathbb Z^d$ with applications to random walks on wreath products. (arXiv:1211.3013v2 [math.PR] UPDATED)
- Mimicking an It\^{o} process by a solution of a stochastic differential equation. (arXiv:1011.0111v3 [math.PR] UPDATED)
- Non-stationary Stochastic Optimization. (arXiv:1307.5449v2 [math.PR] UPDATED)
- Austerity in MCMC Land: Cutting the Metropolis-Hastings Budget. (arXiv:1304.5299v4 [cs.LG] UPDATED)
- Dimension-Independent MCMC Sampling for Inverse Problems with Non-Gaussian Priors. (arXiv:1302.2213v3 [math.ST] UPDATED)
- Jump-diffusion processes in random environments. (arXiv:1305.4129v2 [math.PR] UPDATED)
- Hitting time theorems for random matrices. (arXiv:1304.1779v2 [math.PR] UPDATED)
- A New Convex Relaxation for Tensor Completion. (arXiv:1307.4653v1 [cs.LG])
- Sparse Signal Recovery under Poisson Statistics. (arXiv:1307.4666v2 [math.ST] UPDATED)
- Stochastic integration for fractional Levy process and stochastic differential equation driven by fractional Levy noise. (arXiv:1307.4173v1 [math.PR])
- On signed measure valued solutions of stochastic evolution equations. (arXiv:1307.4024v1 [math.PR])
- MCMC Learning. (arXiv:1307.3617v2 [cs.LG] UPDATED)
- Diffusion maps for changing data. (arXiv:1209.0245v3 [math.CA] UPDATED)
- L\'evy processes, martingales, reversed martingales and orthogonal polynomials. (arXiv:1212.3121v5 [math.PR] UPDATED)
- On-line Bayesian parameter estimation in general non-linear state-space models: A tutorial and new results. (arXiv:1307.3490v1 [stat.CO])
- Iterative Scaling in Curved Exponential Families. (arXiv:1307.3282v2 [stat.CO] UPDATED)
- MAP Estimators and Their Consistency in Bayesian Nonparametric Inverse Problems. (arXiv:1303.4795v3 [math.PR] UPDATED)
- Importance sampling for jump processes and applications to finance. (arXiv:1307.2218v1 [math.PR])
- Mixed Gaussian processes: A filtering approach. (arXiv:1208.6253v7 [math.PR] UPDATED)
- Statistical Inference for Stochastic Differential Equations with Memory. (arXiv:1307.1164v1 [stat.ME])
- On the positive eigenvalues and eigenvectors of a non-negative matrix. (arXiv:1306.5116v2 [math.FA] UPDATED)
- The Kalman-Bucy Filter for Integrable L\'{e}vy Processes With Infinite Second Moment. (arXiv:1306.5103v5 [math.PR] UPDATED)
- Making SGD Efficient by Reducing Projections: Guaranteed Optimal Rate for Strongly Convex Optimization. (arXiv:1304.5504v1 [cs.LG])
- Parameter estimation for fractional birth and fractional death processes. (arXiv:1303.6690v1 [math.ST])
- Approximate Inference for Observation Driven Time Series Models with Intractable Likelihoods. (arXiv:1303.7318v1 [stat.CO])
- The Alive Particle Filter. (arXiv:1304.0151v1 [stat.CO])
- A primer on information theory, with applications to neuroscience. (arXiv:1304.2333v1 [cs.IT])
- Stochastic Recovery Of Sparse Signals From Random Measurements. (arXiv:1304.2058v1 [physics.data-an])
- On the particle Gibbs sampler. (arXiv:1304.1887v1 [stat.CO])
- A hierarchical time-splitting approach for solving finite-time optimal control problems. (arXiv:1304.2152v1 [math.OC])
- Sequential Randomized Algorithms for Convex Optimization in the Presence of Uncertainty. (arXiv:1304.2222v1 [cs.SY])
- Mean field spin glasses treated with PDE techniques. (arXiv:1304.2623v1 [cond-mat.dis-nn])
- Central limit theorems in linear dynamics. (arXiv:1304.2621v1 [math.FA])
- On a fractional binomial process. (arXiv:1303.6663v1 [math.PR])
- Dirichlet Heat Kernel Estimates for Subordinate Brownian Motions with Gaussian Components. (arXiv:1303.6626v1 [math.PR])
- Exponential ergodicity for Markov processes with random switching. (arXiv:1303.6999v1 [math.PR])
- Harmonic functions of general graph Laplacians. (arXiv:1303.7198v1 [math.MG])
- Stochastic control with rough paths. (arXiv:1303.7160v1 [math.PR])
- Coherence of countably many bets. (arXiv:1303.6981v1 [math.PR])
- Exponential ergodicity for Markov processes with random switching. (arXiv:1303.6999v1 [math.PR])
- Quasi-Stationary Distributions for Stochastic Approximation Algorithms with constat step size. (arXiv:1303.7081v1 [math.PR])
- Integration Theory for infinite dimensional volatility modulated Volterra processes. (arXiv:1303.7143v1 [math.PR])
- A direct method for solving optimal stopping problems for L\&#39;evy processes. (arXiv:1303.3465v1 [math.PR])
- Large deviations for interacting Bessel-like processes and applications to systemic risk. (arXiv:1303.3061v1 [math.PR])
- Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions. (arXiv:1303.3055v1 [cs.LG])
- Lyapunov spectrum of a relativistic stochastic flow in the Poincar\&#39;e group. (arXiv:1303.2028v1 [math.PR])
- Extended Fourier analysis of signals. (arXiv:1303.2033v2 [cs.DS] UPDATED)
- Growing Ising-like chain as a model of online emotional interactions. (arXiv:1303.2079v1 [physics.soc-ph])
- The linear stochastic order and directed inference for multivariate ordered distributions. (arXiv:1303.1927v1 [math.ST])
- A Cyclic Douglas-Rachford Iteration Scheme. (arXiv:1303.1859v4 [math.OC] UPDATED)
- Optimal switching control design for polynomial systems: an LMI approach. (arXiv:1303.1988v1 [math.OC])
- Optimization viewpoint on Kalman smoothing, with applications to robust and sparse estimation. (arXiv:1303.1993v2 [math.OC] UPDATED)
- Inference with Possibilistic Evidence. (arXiv:1303.1515v1 [cs.AI])
- Intercausal Reasoning with Uninstantiated Ancestor Nodes. (arXiv:1303.1492v1 [cs.AI])
- Using Causal Information and Local Measures to Learn Bayesian Networks. (arXiv:1303.1483v1 [cs.AI])
- Causality in Bayesian Belief Networks. (arXiv:1303.1454v1 [cs.AI])
- Rumor Spreading in Random Evolving Graphs. (arXiv:1302.3828v1 [cs.DM] CROSS LISTED)
- Percolation of finite clusters and infinite surfaces. (arXiv:1303.1657v1 [math.PR])
- Numerical Approximation of Stationary Distribution for SPDEs. (arXiv:1303.1600v1 [math.PR])
- Two cases of squares evolving by anisotropic diffusion. (arXiv:1303.1655v1 [math.AP])
- Statistical estimation of quadratic R\&#39;enyi entropy for a stationary m-dependent sequence. (arXiv:1303.1743v1 [math.ST])
- Sampling fractional Brownian motion in presence of absorption: a Markov Chain method. (arXiv:1303.1648v2 [cond-mat.stat-mech] UPDATED)
- Causal Modeling. (arXiv:1303.1471v1 [cs.AI])
- Importance sampling for weighted binary random matrices with specified margins. (arXiv:1301.3928v1 [stat.CO])
- Investment and Consumption with Regime-Switching Discount Rates. (arXiv:1303.1248v1 [q-fin.PM])
- Concepts and a case study for a flexible class of graphical Markov models. (arXiv:1303.1436v1 [stat.ME])
- Nonparametric functionals as generalized functions. (arXiv:1303.1435v1 [math.ST])
- A Novel Exact Representation of Stationary Colored Gaussian Processes (Fractional Differential Approach). (arXiv:1303.1327v1 [cond-mat.stat-mech])
- Analysis of Partially Observed Networks via Exponential-family Random Network Models. (arXiv:1303.1219v1 [stat.ME])
- A &quot;Gaussian&quot; for diffusion on the sphere. (arXiv:1303.1278v1 [cond-mat.stat-mech])
- Multivariable Feedback Particle Filter. (arXiv:1303.1205v1 [math.NA])
- Joint Probabilistic Data Association-Feedback Particle Filter for Multiple Target Tracking Applications. (arXiv:1303.1214v1 [math.NA])
- Causality in concurrent systems. (arXiv:1303.1384v1 [cs.DC])
- The Pricing of A Moving Barrier Option. (arXiv:1303.1296v1 [q-fin.PR])
- Stabilizing switching signals for switched linear systems. (arXiv:1303.1292v1 [cs.SY])
- Dynamics of influence processes on networks: Complete mean-field theory; the roles of response functions, connectivity, and synchrony; and applications to social contagion. (arXiv:1303.1414v1 [physics.soc-ph])
- Existence, Uniqueness and Removable Singularities for Nonlinear Partial Differential Equations in Geometry. (arXiv:1303.1117v1 [math.AP])
- Recursive Sparse Recovery in Large but Structured Noise - Part 2. (arXiv:1303.1144v1 [cs.IT])
- On the slowdown of random walk in random environment with bounded jumps. (arXiv:1303.1097v1 [math.PR])
- Some apriori estimates of G-BSDEs and the G-martingale representation for a special case. (arXiv:1303.0937v1 [math.PR])
- On Galerkin Approximations for the Zakai Equation with Diffusive and Point Process Observations. (arXiv:1303.0975v1 [math.NA])
- Random walk attachment graphs. (arXiv:1303.1052v1 [math.PR])
- Joint convergence along different subsequences of the signed cubic variation of fractional Brownian motion II. (arXiv:1303.0892v2 [math.PR] UPDATED)
- An Equivalence between the Lasso and Support Vector Machines. (arXiv:1303.1152v1 [cs.LG])
- Carnot process with a single particle. (arXiv:1303.0145v1 [cond-mat.stat-mech])
- Randomized Low-Memory Singular Value Projection. (arXiv:1303.0167v2 [math.OC] UPDATED)
- A stochastic diffusion process for the Dirichlet distribution. (arXiv:1303.0217v2 [math-ph] UPDATED)
- Linear PDEs and eigenvalue problems corresponding to ergodic stochastic optimization problems on compact manifolds. (arXiv:1303.0126v1 [math.OC])
- Strong Convergence of Euler Approximations of Stochastic Differential Equations with Delay under Local Lipschitz Condition. (arXiv:1303.0017v2 [math.PR] UPDATED)
- Relative fixed-width stopping rules for Markov chain Monte Carlo simulations. (arXiv:1303.0238v1 [math.ST])
- Nonlinear PDEs with modulated dispersion. (arXiv:1303.0822v1 [math.AP])
- On rough asymptotic behaviour of ruin probabilities in a general discrete risk model. (arXiv:1303.0522v1 [math.PR])
- Multiple points of the Brownian sheet in critical dimensions. (arXiv:1303.0403v2 [math.PR] UPDATED)
- Escaping from an Attractor: Importance Sampling and Rest Points I. (arXiv:1303.0450v1 [math.PR])
- A Sufficient Condition for Partial Ensemble Controllability of Bilinear Schr\&quot;odinger Equations with Bounded Coupling Terms. (arXiv:1303.0298v2 [math.OC] UPDATED)
- Switched linear differential systems and their stability. (arXiv:1303.0308v1 [math.OC])
- Environmental Time Series Interpolation Based on Spartan Random Processes. (arXiv:1303.0654v1 [stat.AP])
- Bayesian Compressed Regression. (arXiv:1303.0642v2 [stat.ML] UPDATED)
- Top-down particle filtering for Bayesian decision trees. (arXiv:1303.0561v1 [stat.ML])
- Bayesian learning of joint distributions of objects. (arXiv:1303.0449v1 [stat.ME])
- Learning Stable Multilevel Dictionaries for Sparse Representation of Images. (arXiv:1303.0448v1 [cs.CV])
- On Bayesian Nonparametric Continuous Time Series Models. (arXiv:1303.0439v1 [stat.ME])
- Matrix completion via max-norm constrained optimization. (arXiv:1303.0341v1 [cs.LG])
- Blowup as a driving mechanism of developed hydrodynamic turbulence. (arXiv:1303.0386v1 [physics.flu-dyn])
- Realizing stock market crashes: stochastic cusp catastrophe model of returns under the time-varying volatility. (arXiv:1302.7036v1 [q-fin.ST])
- Community Detection in Random Networks. (arXiv:1302.7099v1 [math.ST])
- Linear-noise approximations for stochastic processes with distributed delays. (arXiv:1302.7166v1 [cond-mat.stat-mech])
- An analytic multi-currency model with stochastic volatility and stochastic interest rates. (arXiv:1302.7246v2 [q-fin.PR] UPDATED)
- Randomly Trapped Random Walks. (arXiv:1302.7227v1 [math.PR])
- A strong law of large numbers for branching processes: almost sure spine events. (arXiv:1302.7199v1 [math.PR])
- Weak and strong no-arbitrage conditions for continuous financial markets. (arXiv:1302.7192v1 [q-fin.PR])
- Feedback Particle Filter. (arXiv:1302.6563v1 [math.NA])
- A Treatise on Stability of Autonomous and Non-autonomous Systems: Theory and Illustrative Practical Applications. (arXiv:1302.6259v1 [cs.IT])
- The Role of Information Diffusion in the Evolution of Social Networks. (arXiv:1302.6276v1 [cs.SI])
- Discrete Time Mean-Field Stochastic Linear-Quadratic Optimal Control Problems. (arXiv:1302.6416v1 [math.OC])
- The importance sampling technique for understanding rare events in Erd\H{o}s-R\&#39;enyi random graphs. (arXiv:1302.6551v1 [math.PR])
- Swing options in commodity markets: A multidimensional L\&#39;evy diffusion model. (arXiv:1302.6399v1 [q-fin.PR])
- Ensemble Sparse Models for Image Analysis. (arXiv:1302.6957v1 [cs.CV])
- Three Approaches to Probability Model Selection. (arXiv:1302.6838v1 [stat.ME])
- General Belief Measures. (arXiv:1302.6851v1 [cs.AI])
- A Probabilistic Calculus of Actions. (arXiv:1302.6835v1 [cs.AI])
- A Decision-Based View of Causality. (arXiv:1302.6816v1 [cs.AI])
- Efficient Estimation of the Value of Information in Monte Carlo Models. (arXiv:1302.6794v1 [cs.AI])
- Counterfactual Probabilities: Computational Methods, Bounds and Applications. (arXiv:1302.6784v1 [cs.AI])
- Laplace&#39;s Method Approximations for Probabilistic Inference in Belief Networks with Continuous Variables. (arXiv:1302.6782v1 [cs.AI])
- Arriving on time: estimating travel time distributions on large-scale road networks. (arXiv:1302.6617v1 [cs.LG])
- Combining Multiple Time Series Models Through A Robust Weighted Mechanism. (arXiv:1302.6595v1 [cs.AI])
- Online Learning for Time Series Prediction. (arXiv:1302.6927v1 [cs.LG])
- An Introductory Study on Time Series Modeling and Forecasting. (arXiv:1302.6613v1 [cs.LG])
- Adaptive Gibbs samplers and related MCMC methods. (arXiv:1101.5838v2 [stat.CO] CROSS LISTED)
- The Bayesian Approach To Inverse Problems. (arXiv:1302.6989v1 [math.PR])
- On the Exact and \epsilon-Strong Simulation of (Jump) Diffusions. (arXiv:1302.6964v1 [stat.ME])
- Forward Brownian Motion. (arXiv:1302.6958v4 [math.PR] UPDATED)
- Frozen percolation in two dimensions. (arXiv:1302.6727v1 [math.PR])
- Bernoulli and self-destructive percolation on non-amenable graphs. (arXiv:1302.6870v1 [math.PR])
- On the large deviation rate function for the empirical measures of reversible jump Markov processes. (arXiv:1302.6647v1 [math.PR])
- A functional central limit theorem for the partial sums of sorted i.i.d. random variables. (arXiv:1302.6926v1 [math.ST])
- Variable transformation to obtain geometric ergodicity in the random-walk Metropolis algorithm. (arXiv:1302.6741v1 [math.ST])
- Variational Algorithms for Marginal MAP. (arXiv:1302.6584v1 [stat.ML] CROSS LISTED)
- Studying complex tourism systems: a novel approach based on networks derived from a time series. (arXiv:1302.5909v1 [physics.soc-ph])
- Large cycles and a functional central limit theorem for generalized weighted random permutations. (arXiv:1302.5938v1 [math.PR])
- Renormalized powers of Ornstein-Uhlenbeck processes and well-posedness of stochastic Ginzburg-Landau equations. (arXiv:1302.5930v2 [math-ph] UPDATED)
- Integration by Parts Formula, Derivative Formula, and Transportation Inequalities for SDEs Driven by Fractional Brownian Motion. (arXiv:1302.5868v1 [math.PR])
- Non Monotone Stochastic Evolution Equations. (arXiv:1302.5969v2 [math.PR] UPDATED)
- Hitting Time Distribution for finite states Markov Chains. (arXiv:1302.5987v2 [math.PR] UPDATED)
- Conditional G-expectation in $\mathbb{L}^{p}$ and related It\^o&#39;s calculus. (arXiv:1302.6001v1 [math.PR])
- Pareto genealogies arising from a Poisson branching evolution model with selection. (arXiv:1302.6029v1 [math.PR])
- On the heat kernel and the Dirichlet form of Liouville Brownian Motion. (arXiv:1302.6050v1 [math.PR])
- Characterizing Branching Processes from Sampled Data. (arXiv:1302.5847v1 [stat.AP])
- Prediction by Random-Walk Perturbation. (arXiv:1302.5797v1 [cs.LG])
- Sparse Signal Estimation by Maximally Sparse Convex Optimization. (arXiv:1302.5729v1 [cs.LG])
- Image restoration using sparse approximations of spatially varying blur operators in the wavelet domain. (arXiv:1302.6105v1 [math.OC])
- Sequential Joint Detection and Estimation. (arXiv:1302.6058v2 [math.ST] UPDATED)
- On learning parametric-output HMMs. (arXiv:1302.6009v1 [cs.LG])
- From dynamical systems to renormalization. (arXiv:1302.6037v1 [math.DS])
- Design of Nonlinear State Observers for One-Sided Lipschitz Systems. (arXiv:1302.5867v1 [cs.SY])
- Time scales and structures of wave interaction. (arXiv:1302.5961v1 [physics.flu-dyn])
- The nonlinear heat equation on dense graphs and graph limits. (arXiv:1302.5804v1 [nlin.AO])
- Integration of PDEs by differential geometric means. (arXiv:1302.5562v1 [math.DG])
- Numerical Methods and Causality in Physics. (arXiv:1302.5601v1 [physics.comp-ph])
- Asymptotically liberating sequences of random unitary matrices. (arXiv:1302.5688v2 [math.PR] UPDATED)
- Large Deviations for Nonlocal Stochastic Neural Fields. (arXiv:1302.5616v1 [math.PR] CROSS LISTED)
- Convergence analysis of some multivariate Markov chains using stochastic monotonicity. (arXiv:1302.5606v1 [math.PR])
- Distributed Community Detection in Dynamic Graphs. (arXiv:1302.5607v1 [cs.SI])
- DCT and Eigenvectors of Covariance of 1st and 2nd order Discrete fractional Brownian motion. (arXiv:1302.5556v1 [stat.AP])
- Nonparametric Basis Pursuit via Sparse Kernel-based Learning. (arXiv:1302.5449v1 [cs.LG])
- Self-similar prior and wavelet bases for hidden incompressible turbulent motion. (arXiv:1302.5554v1 [stat.AP])
- Random Projections for Support Vector Machines. (arXiv:1211.6085v3 [cs.LG] UPDATED)
- Series Expansion Approximations of Brownian Motion for Non-Linear Kalman Filtering of Diffusion Processes. (arXiv:1302.5324v1 [stat.CO])
- Beno\^{i}t Mandelbrot and Fractional Brownian Motion. (arXiv:1302.5237v1 [stat.ME])
- Lookahead Strategies for Sequential Monte Carlo. (arXiv:1302.5206v1 [stat.ME])
- The exponential family in abstract information theory. (arXiv:1302.5205v1 [cs.IT])
- Heat Equation on the Cone and the Spectrum of the Spherical Laplacian. (arXiv:1301.6202v3 [math.SP] UPDATED)
- Identification of Finite Dimensional L\&#39;evy Systems in Financial Mathematics. (arXiv:1302.5221v1 [math.ST])
- The geometry of Lie algebroids and its applications to optimal control. (arXiv:1302.5212v2 [math.DG] UPDATED)
- Rational approximations of spectral densities based on the Alpha divergence. (arXiv:1302.5131v1 [math.OC])
- On the existence of solutions to nonlinear systems of higher order Poisson type. (arXiv:1302.5073v2 [math.AP] UPDATED)
- On a singular heat equation with dynamic boundary conditions. (arXiv:1302.5026v1 [math.AP])
- A theoretical framework for conducting multi-level studies of complex social systems with agent-based models and empirical data. (arXiv:1302.4774v1 [cs.MA])
- Dynamical transitions in Markovian exciton transport. (arXiv:1302.4909v1 [quant-ph])
- General position of a projection and its image under a free unitary Brownian motion. (arXiv:1302.4844v2 [math.PR] UPDATED)
- An Explicit Martingale Version of Brenier&#39;s Theorem. (arXiv:1302.4854v3 [q-fin.CP] UPDATED)
- Finite Time Ruin Probabilities for Tempered Stable Insurance Risk Processes. (arXiv:1302.4795v2 [math.PR] UPDATED)
- Path Planning under Time-Dependent Uncertainty. (arXiv:1302.4987v1 [cs.AI])
- Causal Inference in the Presence of Latent Variables and Selection Bias. (arXiv:1302.4983v1 [cs.AI])
- On the Complexity of Solving Markov Decision Problems. (arXiv:1302.4971v1 [cs.AI])
- Stochastic Simulation Algorithms for Dynamic Probabilistic Networks. (arXiv:1302.4965v1 [cs.AI])
- Learning Bayesian Networks: A Unification for Discrete and Gaussian Domains. (arXiv:1302.4957v1 [cs.AI])
- A Definition and Graphical Representation for Causality. (arXiv:1302.4956v1 [cs.AI])
- Independence Concepts for Convex Sets of Probabilities. (arXiv:1302.4940v1 [cs.AI])
- Chain Graphs for Learning. (arXiv:1302.4933v1 [cs.AI])
- Non-stationary extremal eigenvalue approximations in iterative solutions of linear systems and estimators for relative error. (arXiv:1302.4824v1 [math.NA])
- Is Matching Pursuit Solving Convex Problems?. (arXiv:1302.5010v1 [cs.CV])
- Equations in simple matrix groups: algebra, geometry, arithmetic, dynamics. (arXiv:1302.4667v1 [math.AG])
- In Love With a Robot: the Dawn of Machine-To-Machine Marketing. (arXiv:1302.4475v2 [cs.AI] UPDATED)
- Random-walk domination in large graphs: problem definitions and fast solutions. (arXiv:1302.4546v1 [cs.SI])
- Large cliques in sparse random intersection graphs. (arXiv:1302.4627v1 [math.CO])
- A numerical algorithm for a class of BSDEs via branching process. (arXiv:1302.4624v2 [math.NA] UPDATED)
- Convergent sequences of sparse graphs: A large deviations approach. (arXiv:1302.4615v1 [math.PR])
- Hitting times of Bessel processes, volume of Wiener sausages and zeros of Macdonald functions. (arXiv:1302.4526v1 [math.PR])
- Undiscounted Markov chain BSDEs to stopping times. (arXiv:1302.4637v1 [math.PR])
- Metrics for Multivariate Dictionaries. (arXiv:1302.4242v2 [cs.LG] UPDATED)
- Reasoning about Independence in Probabilistic Models of Relational Data. (arXiv:1302.4381v1 [cs.AI])
- A Class of Solvable Optimal Stopping Problems of Spectrally Negative Jump Diffusions. (arXiv:1302.4181v1 [q-fin.PR])
- Multi-input Schr\&quot;odinger equation: controllability, tracking, and application to the quantum angular momentum. (arXiv:1302.4173v1 [math.OC])
- Decentralized Event-Triggering for Control of Nonlinear Systems. (arXiv:1302.4019v2 [cs.SY] UPDATED)
- An Algebraic Approach for Identification of Linear Systems with Fractional Derivatives. (arXiv:1302.4071v1 [math.OC])
- Understanding Deep Learning by Revisiting Boltzmann Machines: An Information Geometry Approach. (arXiv:1302.3931v5 [cs.NE] UPDATED)
- Diffusivity in multiple scattering systems. (arXiv:1302.4339v1 [math.PR])
- Weak Convergence Methods for Approximation of Path-dependent Functionals. (arXiv:1302.4278v1 [math.PR])
- Probabilistic existence of regular combinatorial structures. (arXiv:1302.4295v1 [math.CO])
- Conditioning of Gaussian processes and a zero area Brownian bridge. (arXiv:1302.4186v1 [math.PR])
- Branching Brownian Motion with catalytic branching at the origin. (arXiv:1302.4087v1 [math.PR])
- Branching Random Walk in an inhomogeneous breeding potential. (arXiv:1302.4084v1 [math.PR])
- Approximation of stable random measures and applications to linear fractional stable integrals. (arXiv:1302.4011v1 [math.PR])
- Directed Information on Abstract Spaces: Properties and Variational Equalities. (arXiv:1302.3971v1 [cs.IT])
- Traveling Wave Solutions in a Reaction-Diffusion Model for Criminal Activity. (arXiv:1302.4333v1 [math.AP])
- Nonparametric regression for locally stationary time series. (arXiv:1302.4198v1 [math.ST])
- Posterior Consistency for Bayesian (elliptic) Inverse Problems through Stability and Regression Results. (arXiv:1302.4101v2 [math.ST] UPDATED)
- Derivation of an EM algorithm for constrained and unconstrained multivariate autoregressive state-space (MARSS) models. (arXiv:1302.3919v1 [stat.ME])
- A comprehensive characterization of recurrences in time series. (arXiv:1302.3704v1 [physics.data-an])
- Toward a Market Model for Bayesian Inference. (arXiv:1302.3593v1 [cs.GT])
- An Efficient Implementation of the Ensemble Kalman Filter Based on an Iterative Sherman-Morrison Formula. (arXiv:1302.3876v1 [cs.NA])
- A second-order stock market model. (arXiv:1302.3870v1 [q-fin.ST])
- Homotopy Probability Theory I. (arXiv:1302.3684v2 [math.PR] UPDATED)
- Thompson Sampling in Switching Environments with Bayesian Online Change Point Detection. (arXiv:1302.3721v1 [cs.LG])
- A Latent Source Model for Online Time Series Classification. (arXiv:1302.3639v3 [stat.ML] UPDATED)
- Renyi entropies as a measure of the complexity of counting problems. (arXiv:1302.2826v2 [cond-mat.stat-mech] UPDATED)
- Endless self-avoiding walks. (arXiv:1302.2796v2 [cond-mat.stat-mech] UPDATED)
- Extrinsic noise driven phenotype switching in a self-regulating gene. (arXiv:1302.2724v1 [q-bio.MN])
- Acquaintance Time of a Graph. (arXiv:1302.2787v2 [cs.CC] UPDATED)
- Competing With Strategies. (arXiv:1302.2672v1 [stat.ML])
- Latent Point Process Models for Spatial-Temporal Networks. (arXiv:1302.2671v2 [cs.SI] UPDATED)
- Matrix Completion and Tensor Rank. (arXiv:1302.2639v1 [math.OC])
- Markov chain Monte Carlo methods for the regular two-level fractional factorial designs and cut ideals. (arXiv:1302.2882v1 [math.ST])
- Posterior convergence rates for estimating large precision matrices using Graphical models. (arXiv:1302.2677v1 [math.ST])
- Percolation on uniform infinite planar maps. (arXiv:1302.2851v1 [math.PR])
- Parameter dependent optimal thresholds, indifference levels and inverse optimal stopping problems. (arXiv:1302.2769v1 [math.PR])
- Multidimensional sticky Brownian motions as limits of exclusion processes. (arXiv:1302.2678v1 [math.PR])
- Exploring network dynamics with a mathematical triple jump. (arXiv:1302.2743v2 [math.DS] UPDATED)
- A computational tool for comparing all linear PDE solvers -- Optimal methods are meshless. (arXiv:1302.2784v1 [math.NA])
- A Tensor Spectral Approach to Learning Mixed Membership Community Models. (arXiv:1302.2684v1 [cs.LG] CROSS LISTED)
- Queues with random back-offs. (arXiv:1302.3144v1 [math.PR])
- On a stochastic Ricker competition model. (arXiv:1302.3147v1 [math.PR])
- Fine regularity of L\&#39;evy processes and linear (multi)fractional stable motion. (arXiv:1302.3140v1 [math.PR])
- Tail asymptotic of the stationary distribution for the state dependent (1,R)-reflecting random walk: near critical. (arXiv:1302.3069v3 [math.PR] UPDATED)
- Stationary max-stable processes with the Markov property. (arXiv:1302.3041v1 [math.PR])
- A Local Central Limit Theorem and Loss of Rotational Symmetry of Planar Simple Random Walk. (arXiv:1302.2971v1 [math.PR])
- A spectral result for Hardy inequalities. (arXiv:1302.3039v1 [math.SP])
- Measurement error in GLMMs with INLA. (arXiv:1302.3065v1 [stat.ME])
- Eigenfunctions of the Edge-Based Laplacian on a Graph. (arXiv:1302.3433v1 [cs.DM])
- Pontryagin Maximum Principle for finite dimensional nonlinear optimal control problems on time scales. (arXiv:1302.3513v1 [math.OC])
- Stochastic Minimum Principle for Partially Observed Systems Subject to Continuous and Jump Diffusion Processes and Driven by Relaxed Controls. (arXiv:1302.3455v1 [math.OC])
- Stochastic dynamical model of a growing network based on self-exciting point process. (arXiv:1210.0756v1 [physics.soc-ph] CROSS LISTED)
- Alpha-diversity processes and normalized inverse-Gaussian diffusions. (arXiv:1302.3000v1 [math.PR] CROSS LISTED)
- Centralized Versus Decentralized Team Games of Distributed Stochastic Differential Decision Systems with Noiseless Information Structures-Part I: General Theory. (arXiv:1302.3452v1 [math.OC])
- A consistent clustering-based approach to estimating the number of change-points in highly dependent time-series. (arXiv:1302.3407v1 [stat.ML])
- A Kushner-Stratonovich Monte Carlo Filter for Nonlinear Dynamical System Identification. (arXiv:1302.3330v1 [stat.ME])
- Non-linear noise excitation of intermittent stochastic PDEs and the topology of LCA groups. (arXiv:1302.3266v1 [math.PR])
- Optimal rates of convergence for sparse covariance matrix estimation. (arXiv:1302.3030v1 [math.ST])
- Dimension-Independent MCMC Sampling for Elliptic Inverse Problems with Non-Gaussian Priors. (arXiv:1302.2213v1 [math.ST])
- Time-Symmetry Breaking in Hamiltonian Mechanics. (arXiv:1302.2533v2 [cond-mat.stat-mech] UPDATED)
- Pathwise solutions and attractors for retarded SPDEs with time smooth diffusion coefficients. (arXiv:1302.2400v1 [math.DS])
- Collective dynamics in systems of active Brownian particles with dissipative interactions. (arXiv:1302.2280v2 [cond-mat.stat-mech] UPDATED)
- Dual subgradient algorithms for large-scale nonsmooth learning problems. (arXiv:1302.2349v1 [math.OC])
- Early-warning signals of topological collapse in interbank networks. (arXiv:1302.2063v1 [physics.data-an])
- Exchangeable random measures. (arXiv:1302.2116v4 [math.PR] UPDATED)
- Variance optimal hedging for continuous time additive processes and applications. (arXiv:1302.1965v1 [q-fin.PR])
- Hypergraph limits via martingales. (arXiv:1302.1634v1 [math.CO])
- Fast Value Iteration for Goal-Directed Markov Decision Processes. (arXiv:1302.1575v1 [cs.AI])
- Probabilistic Acceptance. (arXiv:1302.1556v1 [cs.AI])
- Composition of Probability Measures on Finite Spaces. (arXiv:1302.1551v1 [cs.AI])
- Time-Critical Reasoning: Representations and Application. (arXiv:1302.1548v1 [cs.AI])
- A Scheme for Approximating Probabilistic Inference. (arXiv:1302.1534v1 [cs.AI])
- Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes. (arXiv:1302.1533v1 [cs.AI])
- Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes. (arXiv:1302.1525v1 [cs.AI])
- Algorithms for Learning Decomposable Models and Chordal Graphs. (arXiv:1302.1524v1 [cs.AI])
- Learning Bayesian Networks from Incomplete Databases. (arXiv:1302.1565v1 [cs.AI])
- Structure and Parameter Learning for Causal Independence and Causal Interaction Models. (arXiv:1302.1561v1 [cs.AI])
- Levy flights and nonlocal quantum dynamics. (arXiv:1302.1478v1 [quant-ph])
- Smoothing equations for large P\&#39;olya urns. (arXiv:1302.1412v1 [math.PR])
- A large deviation principle for networks of rate neurons with correlated synaptic weights. (arXiv:1302.1029v3 [math.PR] UPDATED)
- GENERIC formalism of a Vlasov-Fokker-Planck equation and connection to large-deviation principles. (arXiv:1302.1024v1 [math.AP])
- Local conditioning in Dawson-Watanabe superprocesses. (arXiv:1302.0968v1 [math.PR])
- Large deviations for random walks in a random environment on a strip. (arXiv:1302.0888v2 [math.PR] UPDATED)
- Analytic reconstruction of some dynamical systems. (arXiv:1302.1164v2 [nlin.CD] UPDATED)
- Stochastic differential games for fully coupled FBSDEs with jumps. (arXiv:1302.0938v1 [math.OC])
- Optimal control problems of fully coupled FBSDEs and viscosity solutions of Hamilton-Jacobi-Bellman equations. (arXiv:1302.0935v1 [math.OC])
- A Generalized Ito Formula. (arXiv:1302.1142v1 [math.AP])
- The magneto-geostrophic equations: a survey. (arXiv:1302.0925v1 [math.AP])
- Total variation distance between two double Wiener-It\^o integrals. (arXiv:1302.1171v1 [math.ST])
- Analysis Based Blind Compressive Sensing. (arXiv:1302.1094v2 [cs.IT] UPDATED)
- Exact Sparse Recovery with L0 Projections. (arXiv:1302.0895v1 [stat.ML])
- A PDE approach to fractional diffusion in general domains: a priori error analysis. (arXiv:1302.0698v1 [math.NA])
- On the convergence of the Metropolis-Hastings Markov chains. (arXiv:1302.0654v3 [math.ST] UPDATED)
- Efficient Importance Sampling for Rare Event Simulation with Applications. (arXiv:1302.0583v1 [stat.ME])
- Slow dynamics of the contact process on complex networks. (arXiv:1302.0808v1 [cond-mat.stat-mech])
- Area law for random graph states. (arXiv:1302.0709v1 [math-ph])
- Backward stochastic differential equations associated to jump Markov processes and applications. (arXiv:1302.0679v1 [math.PR])
- A Class of infinite dimensional stochastic Processes with unbounded Diffusion. (arXiv:1302.0673v1 [math.PR])
- Stochastic Control Representations for Penalized Backward Stochastic Differential Equations. (arXiv:1302.0480v2 [math.PR] UPDATED)
- Numerical scheme for a semilinear Stochastic PDEs via Backward Doubly Stochastic Differential Equations. (arXiv:1302.0440v3 [math.PR] UPDATED)
- Design of optimal sparse interconnection graphs for synchronization of oscillator networks. (arXiv:1302.0449v1 [math.OC])
- Stochastic maximum principle for optimal control of SPDEs. (arXiv:1302.0286v1 [math.OC])
- Projection Design For Statistical Compressive Sensing: A Tight Frame Based Approach. (arXiv:1302.0635v1 [cs.IT])
- Beyond Markov Chains, Towards Adaptive Memristor Network-based Music Generation. (arXiv:1302.0785v1 [cs.ET])
- An Exact Relationship Between Invasion Probability and Endemic Prevalence for Markovian SIS Dynamics on Networks. (arXiv:1302.0255v1 [q-bio.PE])
- Parameter estimation and model testing for Markov processes via conditional characteristic functions. (arXiv:1302.0122v1 [math.ST])
- Empirical likelihood-based tests for stochastic ordering. (arXiv:1302.0163v1 [math.ST])
- Inference for modulated stationary processes. (arXiv:1302.0114v1 [math.ST])
- Learning From What You Don&#39;t Observe. (arXiv:1301.7407v1 [cs.AI])
- Measure Selection: Notions of Rationality and Representation Independence. (arXiv:1301.7387v1 [cs.AI])
- On the Semi-Markov Equivalence of Causal Models. (arXiv:1301.7370v1 [cs.AI])
- Tractable Inference for Complex Stochastic Processes. (arXiv:1301.7362v1 [cs.AI])
- On ergodic least-squares estimators of the generalized diffusion coefficient for fractional Brownian motion. (arXiv:1301.7638v1 [cond-mat.stat-mech])
- Stochastic dynamics on slow manifolds. (arXiv:1301.7697v1 [cond-mat.stat-mech])
- Waiting times for particles in a branching Brownian motion to reach the rightmost position. (arXiv:1301.7606v1 [math.PR])
- Approximately Optimal Trajectory Tracking for Continuous Time Nonlinear Systems. (arXiv:1301.7664v1 [cs.SY])
- Large Deviation Methods for Approximate Probabilistic Inference. (arXiv:1301.7392v1 [cs.LG])
- Learning Mixtures of DAG Models. (arXiv:1301.7415v1 [cs.LG])
- Strange uniform random variables. (arXiv:1301.7148v1 [math.PR])
- On Multi-Particle Brownian Survivals and the Spherical Laplacian. (arXiv:1301.6202v2 [math.SP] CROSS LISTED)
- Markovian acyclic directed mixed graphs for discrete data. (arXiv:1301.6624v1 [math.ST])
- Tight is better: Performance Improvement of the Compressive Classifier Using Equi-Norm Tight Frames. (arXiv:1301.6256v1 [cs.IT])
- Option pricing with market impact and non-linear Black and Scholes pde&#39;s. (arXiv:1301.6252v1 [q-fin.PR])
- On the relation between forecast precision and trading profitability of financial analysts. (arXiv:1301.6638v1 [q-fin.TR])
- On solutions of Kolmogorov&#39;s equations for jump Markov processes. (arXiv:1301.6998v3 [math.PR] UPDATED)
- Link prediction for partially observed networks. (arXiv:1301.7047v1 [stat.ML])
- Guarantees of Total Variation Minimization for Signal Recovery. (arXiv:1301.6791v2 [cs.IT] UPDATED)
- Accelerating EM: An Empirical Study. (arXiv:1301.6730v1 [cs.LG])
- Learning Bayesian Network Structure from Massive Datasets: The &quot;Sparse Candidate&quot; Algorithm. (arXiv:1301.6696v1 [cs.LG])
- Discovering the Hidden Structure of Complex Dynamic Systems. (arXiv:1301.6683v1 [cs.AI])
- Model-Based Bayesian Exploration. (arXiv:1301.6690v1 [cs.AI])
- Fast Learning from Sparse Data. (arXiv:1301.6685v1 [cs.LG])
- Time-Critical Dynamic Decision Making. (arXiv:1301.6750v1 [cs.AI])
- Loopy Belief Propagation for Approximate Inference: An Empirical Study. (arXiv:1301.6725v1 [cs.AI])
- A Variational Approximation for Bayesian Networks with Discrete and Continuous Latent Variables. (arXiv:1301.6724v1 [cs.AI])
- Solving POMDPs by Searching the Space of Finite Policies. (arXiv:1301.6720v1 [cs.AI])
- Causal Discovery from a Mixture of Experimental and Observational Data. (arXiv:1301.6686v1 [cs.AI])
- Possibilistic logic bases and possibilistic graphs. (arXiv:1301.6679v1 [cs.AI])
- Quadratic Basis Pursuit. (arXiv:1301.7002v2 [cs.IT] UPDATED)
- Central limit theorem related to MDR-method. (arXiv:1301.6609v1 [math.PR])
- Can local particle filters beat the curse of dimensionality?. (arXiv:1301.6585v1 [math.ST])
- From pseudo-random walk to pseudo-Brownian motion: first exit time from a one-sided or a two-sided interval. (arXiv:1301.6579v1 [math.PR])
- Maximum likelihood estimation for small noise multiscale diffusions. (arXiv:1301.6413v1 [math.ST])
- Simple random walk on distance-regular graphs. (arXiv:1301.6394v1 [math.PR])
- Iteration of the lent particle method for existence of smooth densities of Poisson functionals. (arXiv:1301.6389v1 [math.PR])
- Drichlet forms for Poisson measures and L\&#39;evy processes : the lent particle method. (arXiv:1301.6390v1 [math.PR])
- The lent particle method for marked point processes. (arXiv:1301.6387v1 [math.PR])
- Optimal Sequential Joint Detection and Estimation. (arXiv:1301.6206v3 [stat.ME] UPDATED)
- Causal Theories: A Categorical Perspective on Bayesian Networks. (arXiv:1301.6201v1 [math.PR])
- Mean field limit for disordered diffusions with singular interactions. (arXiv:1301.6521v1 [math.PR])
- Analytical approximations for spiral waves. (arXiv:1301.6271v2 [nlin.PS] UPDATED)
- A local CLT for convolution equations with an application to weakly self-avoiding random walks. (arXiv:1301.6071v1 [math.PR])
- Exponential Ergodicity for Nonlinear SPDEs Driven by Jump Processes. (arXiv:1301.6024v1 [math.PR])
- Singular probability distribution of shot-noise driven systems. (arXiv:1301.5966v1 [cond-mat.mes-hall])
- Reaction-diffusion model Monte Carlo simulations on the GPU. (arXiv:1301.6082v1 [physics.comp-ph])
- A stochastic algorithm for computing global minimizer of generalized conic functions. (arXiv:1301.6112v1 [math.OC])
- On the Dynamics of Large Particle Systems in the Mean Field Limit. (arXiv:1301.5494v1 [math.AP])
- Recurrence and transience of critical branching processes in random environment with immigration and an application to excited random walks. (arXiv:1301.5450v1 [math.PR])
- Random intersection graph process. (arXiv:1301.5579v1 [math.PR])
- An Essay on the Double Nature of the Probability. (arXiv:1301.5443v1 [math.PR])
- Weingarten calculus for matrix ensembles associated with compact symmetric spaces. (arXiv:1301.5401v2 [math.PR] UPDATED)
- Optimal Sequential Vector Estimation. (arXiv:1301.5701v1 [stat.AP] CROSS LISTED)
- Counting Triangles in Massive Graphs with MapReduce. (arXiv:1301.5887v1 [cs.SI])
- Spike detection from inaccurate samplings. (arXiv:1301.5873v1 [math.ST])
- Asymptotically Optimal Detection of Changes in Stochastic Models with Switching Regimes. (arXiv:1301.5722v1 [math.ST])
- The covariation for Banach space valued processes and applications. (arXiv:1301.5715v1 [math.PR])
- Random walks in the quarter plane, harmonic functions and conformal mappings. (arXiv:1301.5716v1 [math.PR])
- Malliavin calculus approach to statistical inference for Levy driven SDE&#39;s. (arXiv:1301.5141v1 [math.PR])
- Coarse-graining complex dynamics: Continuous Time Random Walks vs. Record Dynamics. (arXiv:1301.5199v1 [cond-mat.stat-mech])
- Bayesian Non-Parametric Portfolio Decisions with Financial Time Series. (arXiv:1301.5129v1 [q-fin.PM])
- A Note on Probabilistic Models over Strings: Inference and Representation with Indexed Matrices. (arXiv:1301.5054v1 [q-bio.PE])
- The Regularity problem for second order elliptic operators with complex-valued bounded measurable coefficients. (arXiv:1301.5209v1 [math.AP])
- Random matrix ensembles: Wang-Landau algorithm for spectral densities. (arXiv:1301.5179v1 [cond-mat.stat-mech])
- Stochastic Averaging Principle for Dynamical Systems with Fractional Brownian Motion. (arXiv:1301.4788v1 [math.DS])
- On the spectrum and eigenfunctions of the operator $(Vf)(x) = \int_0^{x^\alpha} f(t) dt$. (arXiv:1301.4886v1 [math.SP])
- Sampling from a polytope and hard-disk Monte Carlo. (arXiv:1301.4901v1 [cond-mat.stat-mech])
- Gradient estimates for SDEs Driven by Multiplicative L\&#39;evy Noise. (arXiv:1301.4528v1 [math.PR])
- Gaussian estimates for Schroedinger perturbations. (arXiv:1301.4627v1 [math.AP])
- On the use of fractional calculus for the probabilistic characterization of random variables. (arXiv:1301.4867v1 [math-ph])
- Fractional absolute moments of heavy tailed distributions. (arXiv:1301.4804v1 [math.ST])
- Sparse/Robust Estimation and Kalman Smoothing with Nonsmooth Log-Concave Densities: Modeling, Computation, and Theory. (arXiv:1301.4566v1 [stat.ML])
- A Linearly Convergent Conditional Gradient Algorithm with Applications to Online and Stochastic Optimization. (arXiv:1301.4666v4 [cs.LG] UPDATED)
- Non-random overshoots of L\&#39;evy processes. (arXiv:1301.4463v1 [math.PR])
- Optimal bilinear control of nonlinear Schr\&quot;{o}dinger equations with singular potentials. (arXiv:1301.4335v1 [math.AP])
- Nonlinear dynamical systems and linearly forced isotropic turbulence. (arXiv:1301.4383v2 [physics.flu-dyn] UPDATED)
- Probabilities of Causation: Bounds and Identification. (arXiv:1301.3898v1 [cs.AI])
- Building a Stochastic Dynamic Model of Application Use. (arXiv:1301.3859v1 [cs.AI])
- Stochastic Logic Programs: Sampling, Inference and Applications. (arXiv:1301.3846v1 [cs.AI])
- A Differential Approach to Inference in Bayesian Networks. (arXiv:1301.3847v1 [cs.AI])
- PEGASUS: A Policy Search Method for Large MDPs and POMDPs. (arXiv:1301.3878v1 [cs.AI])
- Polya&#39;s random walk theorem. (arXiv:1301.3916v2 [math.PR] UPDATED)
- Diffusive Limits for Adaptive MCMC for Normal Target densities. (arXiv:1301.4030v1 [math.ST])
- De-noising procedures for frame operators. (arXiv:1301.3949v1 [stat.ME])
- Variational Approximations between Mean Field Theory and the Junction Tree Algorithm. (arXiv:1301.3901v1 [cs.LG])
- Gaussian Process Networks. (arXiv:1301.3857v1 [cs.AI])
- Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks. (arXiv:1301.3853v1 [cs.LG])
- Experiments with Random Projection. (arXiv:1301.3849v1 [cs.LG])
- Reversible Jump MCMC Simulated Annealing for Neural Networks. (arXiv:1301.3833v1 [cs.LG])
- A Levy-area between Brownian motion and rough paths with applications to robust non-linear filtering and RPDEs. (arXiv:1301.3799v1 [math.PR])
- Sequential Bayesian Inference in Hidden Markov Stochastic Kinetic Models with Application to Detection and Response to Seasonal Epidemics. (arXiv:1301.3617v1 [stat.CO])
- Random directed forest and the Brownian web. (arXiv:1301.3766v1 [math.PR])
- Cramer-Rao Lower Bound and Information Geometry. (arXiv:1301.3578v2 [cs.IT] UPDATED)
- Causal Discovery from Changes. (arXiv:1301.2312v1 [cs.AI])
- Value-Directed Sampling Methods for POMDPs. (arXiv:1301.2305v1 [cs.AI])
- A Tractable POMDP for a Class of Sequencing Problems. (arXiv:1301.2308v1 [cs.AI])
- Aggregating Learned Probabilistic Beliefs. (arXiv:1301.2293v1 [cs.AI])
- Probabilistic Logic Programming under Inheritance with Overriding. (arXiv:1301.2290v1 [cs.AI])
- A Bayesian Approach to Tackling Hard Computational Problems. (arXiv:1301.2279v1 [cs.AI])
- Plausible reasoning from spatial observations. (arXiv:1301.2285v1 [cs.AI])
- Graphical Models for Game Theory. (arXiv:1301.2281v1 [cs.GT])
- A Calculus for Causal Relevance. (arXiv:1301.2257v1 [cs.AI])
- Lattice Particle Filters. (arXiv:1301.2298v1 [cs.AI])
- Incorporating Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables. (arXiv:1301.2268v1 [cs.AI])
- Variational MCMC. (arXiv:1301.2266v1 [cs.LG])
- Lecture notes on non-asymptotic theory of random matrices. (arXiv:1301.2382v1 [math.PR])
- A sequential algorithm for fast fitting of Dirichlet process mixture models. (arXiv:1301.2897v1 [stat.CO])
- Blind source separation methods for deconvolution of complex signals in cancer biology. (arXiv:1301.2634v1 [q-bio.QM])
- Neutral Backward Stochastic Functional Differential Equations and Their Application. (arXiv:1301.3081v1 [math.OC])
- Positivity and boundedness preserving schemes for the fractional reaction-diffusion equation. (arXiv:1301.2861v1 [math.NA])
- On an Optimal Stopping Problem of an Insider. (arXiv:1301.3100v2 [math.PR] UPDATED)
- Exact simulation for solutions of one-dimensional Stochastic Differential Equations with discontinuous drift. (arXiv:1301.3019v2 [math.PR] UPDATED)
- Liouville Brownian motion. (arXiv:1301.2876v2 [math.PR] UPDATED)
- Audio Classical Composer Identification by Deep Neural Network. (arXiv:1301.3195v6 [cs.NE] UPDATED)
- Deterministically driven random walks on a finite state space. (arXiv:1301.3179v1 [math.DS])
- Deterministically driven random walks in a random environment on Z. (arXiv:1301.3176v1 [math.DS])
- Learning Graphical Model Parameters with Approximate Marginal Inference. (arXiv:1301.3193v1 [cs.LG])
- Multi-agent learning using Fictitious Play and Extended Kalman Filter. (arXiv:1301.3347v1 [cs.MA])
- A remarkably simple and accurate method for computing the Bayes Factor from a Markov chain Monte Carlo Simulation of the Posterior Distribution in high dimension. (arXiv:1301.3156v1 [astro-ph.IM])
- Inverse problem of group analysis for autonomous differential systems. (arXiv:1301.3425v1 [math.DS])
- The detection of signals buried in noise. (arXiv:1301.1528v1 [physics.data-an])
- Bayesian Optimization in a Billion Dimensions via Random Embeddings. (arXiv:1301.1942v1 [stat.ML])
- A toolbox for fitting complex spatial point process models using integrated nested Laplace approximation (INLA). (arXiv:1301.1817v1 [stat.AP])
- The Coagulation - Fragmentation Equation and its Stochastic Counterpart. (arXiv:1301.1934v1 [math.PR])
- Fiducial theory and optimal inference. (arXiv:1301.1717v2 [math.ST] UPDATED)
- A graph discretization of the Laplace-Beltrami operator. (arXiv:1301.2222v2 [math.AP] UPDATED)
- Evolutionary dynamics of group interactions on structured populations: A review. (arXiv:1301.2247v1 [physics.soc-ph])
- q-Fourier Transform and Nonextensive Statistical Mechanics. (arXiv:1301.2155v1 [math-ph])
- Inference for Multi-Dimensional High-Frequency Data: Equivalence of Methods, Central Limit Theorems, and an Application to Conditional Independence Testing. (arXiv:1301.2074v1 [math.ST])
- Distributed soft thresholding for sparse signal recovery. (arXiv:1301.2130v1 [cs.IT])
- A Feynman-Kac-It\^o Formula for magnetic Schr\&quot;odinger operators on graphs. (arXiv:1301.1304v1 [math-ph])
- BV-regularity for the Malliavin Derivative of the Maximum of the Wiener Process. (arXiv:1301.1199v1 [math.PR])
- Zebra-percolation on Cayley trees. (arXiv:1301.1165v1 [math.PR])
- The martingale representation in a progressive enlargement of a filtration with jumps. (arXiv:1301.1119v1 [math.PR])
- Spectral norm of random Toeplitz matrices. (arXiv:1301.0938v2 [math.PR] UPDATED)
- Hawkes model for price and trades high-frequency dynamics. (arXiv:1301.1135v1 [q-fin.TR])
- Coupling between time series: a network view. (arXiv:1301.1010v1 [physics.data-an])
- A mean field analysis of the solid/fluid phase transition. (arXiv:1301.1256v3 [math-ph] UPDATED)
- Integrals of general birth-death processes. (arXiv:1301.1305v1 [stat.ME])
- Statistical estimation by power variations in mixed models. (arXiv:1301.0993v1 [math.PR])
- Automated Variational Inference in Probabilistic Programming. (arXiv:1301.1299v1 [stat.ML])
- Compressed Sensing under Matrix Uncertainty: Optimum Thresholds and Robust Approximate Message Passing. (arXiv:1301.0901v1 [cs.IT])
- Entropy and the Shannon-McMillan-Breiman theorem for beta random matrix ensembles. (arXiv:1301.0342v3 [math.PR] UPDATED)
- Partial Linear Eigenvalue Statistics for Wigner and Sample Covariance Random Matrices. (arXiv:1301.0368v2 [math.PR] UPDATED)
- Invariant measure of the stochastic Allen-Cahn equation: the regime of small noise and large system size. (arXiv:1301.0408v1 [math.PR])
- On a ternary coalescent process. (arXiv:1301.0409v1 [math.PR])
- Invariant measures of reflected stochastic delay differential equations with jumps. (arXiv:1301.0442v1 [math.PR])
- Compressed Sensing Matrices from Fourier Matrices. (arXiv:1301.0373v1 [cs.IT])
- Eventual linear convergence of the Douglas Rachford iteration for basis pursuit. (arXiv:1301.0542v1 [math.NA])
- Real-Time Inference with Large-Scale Temporal Bayes Nets. (arXiv:1301.0603v1 [cs.AI])
- Factorization of Discrete Probability Distributions. (arXiv:1301.0568v1 [cs.AI])
- Airy processes and variational problems. (arXiv:1301.0750v1 [math.PR])
- Mixing of Poisson random measures under interacting transformations. (arXiv:1301.0672v1 [math.PR])
- Convergence of a variational Lagrangian scheme for a nonlinear drift diffusion equation. (arXiv:1301.0747v1 [math.NA])
- Reinforcement Learning with Partially Known World Dynamics. (arXiv:1301.0601v1 [cs.LG])
- Decayed MCMC Filtering. (arXiv:1301.0584v1 [cs.AI])
- Predictability and control of extreme events in complex systems. (arXiv:1301.0244v1 [nlin.CD])
- Gaussian approximation of suprema of empirical processes. (arXiv:1212.6885v2 [math.PR] UPDATED)
- Small ball probability, Inverse theorems, and applications. (arXiv:1301.0019v1 [math.CO])
- Robust Optimal Stopping under Volatility Uncertainty. (arXiv:1301.0091v3 [math.PR] UPDATED)
- Noise-Induced Spatial Pattern Formation in Stochastic Reaction-Diffusion Systems. (arXiv:1301.0170v1 [q-bio.QM])
- Central limit theorems for linear statistics of heavy tailed random matrices. (arXiv:1301.0448v2 [math.PR] UPDATED)
- Persistence of fractional Brownian motion with moving boundaries and applications. (arXiv:1301.0424v1 [math.PR])
Saved in 2012
- A control strategy algorithm for finite alternating transition systems. (arXiv:1212.6607v1 [math.OC])
- On necessary boundary conditions for strictly optimal control in infinite horizon control problems. (arXiv:1212.6309v1 [math.OC])
- Algebraically integrable quadratic dynamical systems. (arXiv:1212.6675v2 [math.DS] UPDATED)
- On algebraic Riccati equations associated with M-Matrices. (arXiv:1212.6461v1 [math.NA])
- Perturbed Linear-Quadratic Control Problems and Their Probabilistic Representations. (arXiv:1212.6694v1 [math.OC])
- A paradox on the spectral representation of stationary random processes. (arXiv:1212.6339v1 [stat.OT])
- Pathwise uniqueness for stochastic reaction-diffusion equations in Banach spaces with an H\&quot;{o}lder drift component. (arXiv:1212.5377v1 [math.AP])
- Efficient Gibbs Sampling for Markov Switching GARCH Models. (arXiv:1212.5397v1 [math.ST])
- Interactions and dynamical systems of type (n,m) - A case study. (arXiv:1212.5963v2 [math.OA] UPDATED)
- Forward-Douglas-Rachford splitting and forward-partial inverse method for solving monotone inclusions. (arXiv:1212.5942v1 [math.OC])
- Bayesian shrinkage. (arXiv:1212.6088v1 [math.ST])
- Influence Analysis in the Blogosphere. (arXiv:1212.5863v1 [cs.SI])
- Modeling Financial Volatility in the Presence of Abrupt Changes. (arXiv:1212.6016v1 [q-fin.ST])
- The Kernel-SME Filter for Multiple Target Tracking. (arXiv:1212.5882v1 [cs.SY])
- An introduction to $BV$ functions in Wiener spaces. (arXiv:1212.5926v1 [math.AP])
- Small noise asymptotic expansions for stochastic PDE&#39;s driven by dissipative nonlinearity and L\&#39;evy noise. (arXiv:1212.5804v1 [math.PR])
- Embedding measure spaces. (arXiv:1212.5666v1 [math.GN])
- Space-time discretization of the heat equation. A concise Matlab implementation. (arXiv:1212.6037v1 [math.NA])
- Generating Motion Patterns Using Evolutionary Computation in Digital Soccer. (arXiv:1212.6216v2 [cs.AI] UPDATED)
- Comparison Theorem, Feynman-Kac Formula and Girsanov Transformation for BSDEs Driven by G-Brownian Motion. (arXiv:1212.5403v1 [math.PR])
- Lie geometry of 2x2 Markov matrices. (arXiv:1212.5311v1 [math.ST])
- On controller-stopper problems with jumps and their applications to indifference pricing of American options. (arXiv:1212.4894v1 [math.PR])
- Variational Optimization. (arXiv:1212.4507v2 [stat.ML] UPDATED)
- Concentration of Measure Inequalities in Information Theory, Communications and Coding. (arXiv:1212.4663v3 [cs.IT] UPDATED)
- L\&#39;{e}vy Laplacian for Square Roots of Measures. (arXiv:1212.4205v2 [math.FA] UPDATED)
- Learning Markov Decision Processes for Model Checking. (arXiv:1212.3873v1 [cs.LG])
- Ornstein-Uhlenbeck processes driven by cylindrical L\&#39;evy processes. (arXiv:1212.3832v2 [math.PR] UPDATED)
- Critical branching Brownian motion with absorption: survival probability. (arXiv:1212.3821v2 [math.PR] UPDATED)
- Online Learning for Ground Trajectory Prediction. (arXiv:1212.3998v1 [cs.AI])
- Increasing Air Traffic: What is the Problem?. (arXiv:1212.3996v1 [cs.AI])
- Probability Bracket Notation: Markov State Chain Projector, Hidden Markov Models and Dynamic Bayesian Networks. (arXiv:1212.3817v1 [cs.AI])
- Sparse Dynamics for Partial Differential Equations. (arXiv:1212.4132v1 [math.NA])
- A Latent-Variable Bayesian Nonparametric Regression Model. (arXiv:1212.3712v2 [stat.ME] UPDATED)
- Approximate continuous-discrete filters for the estimation of diffusion processes from partial and noisy observations. (arXiv:1212.3721v1 [math.OC])
- Optimal stopping problems for a Brownian motion with a disorder on a finite interval. (arXiv:1212.3709v1 [math.ST])
- Belief Propagation for Continuous State Spaces: Stochastic Message-Passing with Quantitative Guarantees. (arXiv:1212.3850v1 [cs.IT])
- An inventory model for group-buying auction. (arXiv:1212.3541v1 [math.OC])
- Forward and inverse problems in fundamental and applied magnetohydrodynamics. (arXiv:1212.3447v1 [physics.flu-dyn])
- Linearly Reconfigurable Kalman Filtering for a Vector Process. (arXiv:1212.3376v2 [cs.IT] UPDATED)
- Entropy of Conditional Markov Trajectories. (arXiv:1212.2831v1 [cs.IT])
- Non stationary multifractality in stock returns. (arXiv:1212.3195v1 [q-fin.ST])
- Multi-target tracking algorithms in 3D. (arXiv:1212.3034v1 [cs.CV])
- On large deviations for small noise It\^o processes. (arXiv:1212.3223v2 [math.PR] UPDATED)
- L\&#39;evy processes, martingales, reversed martingales and orthogonal polynomials. (arXiv:1212.3121v2 [math.PR] UPDATED)
- Riemannian Calculus of Variations using Strongly Typed Tensor Calculus. (arXiv:1212.2376v1 [math.DG])
- A Generalized Mean Field Algorithm for Variational Inference in Exponential Families. (arXiv:1212.2512v1 [cs.LG])
- Learning Continuous Time Bayesian Networks. (arXiv:1212.2498v1 [cs.LG])
- Learning Generative Models of Similarity Matrices. (arXiv:1212.2494v1 [cs.LG])
- Approximate Inference and Constrained Optimization. (arXiv:1212.2480v1 [cs.LG])
- Stochastic PDEs and Quantitative Finance: The Black-Scholes-Merton Model of Options Pricing and Riskless Trading. (arXiv:1212.1919v1 [q-fin.PR])
- Feynman-Kac representation for Hamilton-Jacobi-Bellman IPDE. (arXiv:1212.2000v1 [math.PR])
- Approximation to multifractional Riemann-Liouville Brownian sheet. (arXiv:1212.1818v2 [math.PR] UPDATED)
- On diffusion approximation of a slow component for solution of stochastic differential equation of Ito. (arXiv:1212.1872v1 [math.PR])
- A class of random fields on complete graphs with tractable partition function. (arXiv:1212.2136v1 [cs.CV])
- A Scale-Space Theory for Text. (arXiv:1212.2145v1 [cs.IR])
- Approximate discrete-time schemes for the estimation of diffusion processes from complete observations. (arXiv:1212.1788v1 [math.OC])
- Stochastic Perron&#39;s method for Hamilton-Jacobi-Bellman equations. (arXiv:1212.2170v3 [math.PR] UPDATED)
- Layer-wise learning of deep generative models. (arXiv:1212.1524v2 [cs.NE] UPDATED)
- Modeling for Control of Symmetric Aerial Vehicles Subjected to Aerodynamic Forces. (arXiv:1212.1629v1 [cs.SY])
- Projections and dimension conservation for random self-similar measures and sets. (arXiv:1212.1345v2 [math.DS] UPDATED)
- A series approach to stochastic Volterra equations of convolution time. (arXiv:1212.1254v1 [math.PR])
- Large deviation principle for certain spatially lifted Gaussian rough path. (arXiv:1212.1249v1 [math.PR])
- A fundamental mean-square convergence theorem for SDEs with locally Lipschitz coefficients and its applications. (arXiv:1212.1352v1 [math.NA])
- Discrete Total Variation Flows Without Regularization. (arXiv:1212.1137v1 [math.NA])
- Stationary two-dimensional turbulence statistics using a Markovian forcing scheme. (arXiv:1212.0916v1 [physics.flu-dyn])
- Some remarks on integral parameters of Wiener process. (arXiv:1212.1152v1 [math.SP])
- Spectral properties of Google matrix of Wikipedia and other networks. (arXiv:1212.1068v1 [cs.IR])
- Stochastic Models of Misinformation Distribution in Online Social Networks. (arXiv:1212.1002v1 [cs.SI])
- Self-Organizing Flows in Social Networks. (arXiv:1212.0952v1 [cs.SI])
- Multiscale Markov Decision Problems: Compression, Solution, and Transfer Learning. (arXiv:1212.1143v1 [cs.AI])
- Estimating the Static Parameters in Linear Gaussian Multiple Target Tracking Models. (arXiv:1212.0849v1 [stat.AP])
- Limit theorems for random walks on a strip in subdiffusive regime. (arXiv:1212.0599v1 [math.PR])
- Nonlinear elliptic Partial Differential Equations and p-harmonic functions on graphs. (arXiv:1212.0834v2 [math.AP] UPDATED)
- Directed random walks on hierarchic trees with continuous branching: a renormalization group approach. (arXiv:1212.0688v1 [cond-mat.stat-mech])
- Compositional Stochastic Modeling and Probabilistic Programming. (arXiv:1212.0582v1 [cs.AI])
- A Mixed Linear Quadratic Optimal Control Problem with a Controlled Time Horizon. (arXiv:1212.0594v1 [math.OC])
- Information Geometry and Sequential Monte Carlo. (arXiv:1212.0764v1 [stat.ME])
- Building blocks of turbulence. (arXiv:1212.0230v3 [physics.flu-dyn] UPDATED)
- Theory of simple glasses. (arXiv:1212.0390v1 [cond-mat.stat-mech])
- Reversibility in Queueing Models. (arXiv:1212.0398v2 [math.PR] UPDATED)
- Monte Carlo simulation with fixed steplength for diffusion processes in nonhomogeneous media. (arXiv:1212.0362v1 [physics.comp-ph])
- A causal analysis of mother&#39;s education on birth inequalities. (arXiv:1212.0372v1 [stat.ME])
- Large deviations from a stationary measure for a class of dissipative PDE&#39;s with random kicks. (arXiv:1212.0527v1 [math.AP] CROSS LISTED)
- Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses. (arXiv:1212.0478v1 [stat.ML])
- Time series forecasting: model evaluation and selection using nonparametric risk bounds. (arXiv:1212.0463v1 [math.ST])
- A variational approach to modeling slow processes in stochastic dynamical systems. (arXiv:1211.7103v1 [math-ph])
- Feynman-Kac particle integration with geometric interacting jumps. (arXiv:1211.7191v1 [math.PR])
- Statistical Microeconomics. (arXiv:1211.7172v1 [q-fin.GN])
- Mean-variance hedging via stochastic control and BSDEs for general semimartingales. (arXiv:1211.6820v1 [math.PR])
- Nonparametric Bayesian Mixed-effect Model: a Sparse Gaussian Process Approach. (arXiv:1211.6653v1 [cs.LG])
- On the $l^p$ spectrum of Laplacians on graphs. (arXiv:1211.6536v1 [math.SP])
- Strong solution of the stochastic Burgers equation. (arXiv:1211.6622v2 [math.FA] UPDATED)
- A class of multifractal processes constructed using an embedded branching process. (arXiv:1211.6599v1 [math.PR])
- Statistical mechanics of reputation systems in autonomous networks. (arXiv:1211.6462v3 [cond-mat.dis-nn] UPDATED)
- A contraction theory-based analysis of the stability of the Extended Kalman Filter. (arXiv:1211.6624v2 [cs.SY] UPDATED)
- Diffusion in nonuniform temperature and its geometric analog. (arXiv:1211.6580v3 [cond-mat.stat-mech] UPDATED)
- A limit theorem for the sum of squared differences of an integrated Ito process with application to inverse scattering. (arXiv:1211.6413v1 [math.PR])
- A Liouville theorem for solutions of degenerate Monge-Amp\`ere equations. (arXiv:1211.6183v1 [math.AP])
- Hamiltonian dynamics of a particle interacting with a wave field. (arXiv:1211.6154v1 [math-ph])
- Limiting distributions of continuous-time random walks with superheavy-tailed waiting times. (arXiv:1211.6389v2 [cond-mat.stat-mech] UPDATED)
- A decomposition approach for the discrete-time approximation of BSDEs with a jump II: the quadratic case. (arXiv:1211.6231v1 [math.OC])
- Self-organizing traffic lights: A realistic simulation. (arXiv:nlin/0610040v2 [nlin.AO] UPDATED)
- Random Projections for Support Vector Machines. (arXiv:1211.6085v3 [cs.LG] UPDATED)
- Non-spectral relaxation in one dimensional Ornstein-Uhlenbeck processes. (arXiv:1211.5945v2 [cond-mat.soft] UPDATED)
- Online Stochastic Optimization with Multiple Objectives. (arXiv:1211.6013v1 [cs.LG])
- Application of simplest random walk algorithms for pricing barrier options. (arXiv:1211.5726v1 [q-fin.CP])
- Random Stopping Times in Stopping Problems and Stopping Games. (arXiv:1211.5802v1 [math.PR])
- Degenerate backward SPDEs in domains: non-local boundary conditions and applications to finance. (arXiv:1211.5858v2 [math.PR] UPDATED)
- A simple strong solution to non-linear stochastic HJB PDEs: an application to the portfolio model. (arXiv:1211.5816v1 [q-fin.PM])
- Bayesian learning of noisy Markov decision processes. (arXiv:1211.5901v1 [stat.ML])
- \epsilon-Strong simulation of the Brownian path. (arXiv:1110.0110v2 [stat.CO] CROSS LISTED)
- Convexity of reachable sets of nonlinear ordinary differential equations. (arXiv:1211.6080v1 [math.OC])
- Degenerate backward SPDEs in domains: non-local boundary conditions and applications to finance. (arXiv:1211.5858v2 [math.PR] UPDATED)
- A semiuniform ergodic theorem for random dynamical systems. (arXiv:1211.5885v1 [math.DS])
- A survey of uncertainty principles and some signal processing applications. (arXiv:1211.5914v1 [cs.IT])
- Records in stochastic processes -- Theory and applications. (arXiv:1211.6005v1 [cond-mat.stat-mech])
- Brownian motion of free particles on curved surfaces. (arXiv:1211.5799v1 [cond-mat.stat-mech])
- Large deviations for diffusions interacting through their ranks. (arXiv:1211.5223v1 [math.PR])
- Sparsity-Aware Learning and Compressed Sensing: An Overview. (arXiv:1211.5231v1 [cs.IT])
- Extended It\^{o} calculus for symmetric Markov processes. (arXiv:1211.5272v1 [math.ST])
- On differentiability with respect to the initial data of a solution of an SDE with L\&#39;evy noise and discontinuous coefficients. (arXiv:1211.4975v2 [math.PR] UPDATED)
- Convergence to SPDE of the Schrodinger equation with large, random potential. (arXiv:1211.4894v1 [math.AP])
- Computational aspects of Bayesian spectral density estimation. (arXiv:1211.4483v1 [stat.CO])
- A Dataset for StarCraft AI \&amp; an Example of Armies Clustering. (arXiv:1211.4552v1 [cs.AI])
- Compact Support Biorthogonal Wavelet Filterbanks for Arbitrary Undirected Graphs. (arXiv:1210.8129v2 [cs.IT] CROSS LISTED)
- Inference on Sets in Finance. (arXiv:1211.4282v1 [stat.AP])
- Lagrangian Dynamical Monte Carlo. (arXiv:1211.3759v2 [stat.CO] UPDATED)
- Path Integral Formulation for L\&#39;{e}vy Flights - Evaluation of the Propagator for Free, Linear and Harmonic Potentials in the Over- and Underdamped Limits. (arXiv:1211.4083v2 [cond-mat.stat-mech] UPDATED)
- Asymptotic theory for Brownian semi-stationary processes with application to turbulence. (arXiv:1211.4221v1 [math.PR])
- Limit Theorems For Marked Hawkes Processes With Application to a Risk Model. (arXiv:1211.4039v1 [math.PR])
- Applying Dynamic Model for Multiple Manoeuvring Target Tracking Using Particle Filtering. (arXiv:1211.4524v1 [cs.CV])
- Dynamics and Control of a Chain Pendulum on a Cart. (arXiv:1211.4604v1 [math.OC])
- Approximation of stationary solutions to SDEs driven by multiplicative fractional noise. (arXiv:1211.4813v1 [math.PR])
- On Fourier analytic properties of graphs. (arXiv:1211.4803v2 [math.CA] UPDATED)
- On the Strong Recurrence of Recurrent RWRE. (arXiv:1211.4770v1 [math.PR])
- MCMC inference for Markov Jump Processes via the Linear Noise Approximation. (arXiv:1211.4801v1 [stat.CO])
- Small World MCMC with Tempering: Ergodicity and Spectral Gap. (arXiv:1211.4675v1 [stat.ME])
- Smoothing Dynamic Systems with State-Dependent Covariance Matrices. (arXiv:1211.4601v1 [math.OC])
- A Unifying Variational Perspective on Some Fundamental Information Theoretic Inequalities. (arXiv:1211.4795v1 [cs.IT])
- A mathematical framework for inverse wave problems in heterogeneous media. (arXiv:1211.4656v1 [math-ph])
- On the martingale problem for degenerate-parabolic partial differential operators with unbounded coefficients and a mimicking theorem for Ito processes. (arXiv:1211.4636v1 [math.PR])
- Conditional hitting time estimation in a nonlinear filtering model by the Brownian bridge method. (arXiv:1211.4553v1 [math.PR])
- On a class of self-similar processes with stationary increments in higher order Wiener chaoses. (arXiv:1211.4343v1 [math.PR])
- Asymptotic theory for Brownian semi-stationary processes with application to turbulence. (arXiv:1211.4221v1 [math.PR])
- Mean Field Forward-Backward Stochastic Differential Equations. (arXiv:1211.4186v1 [math.PR])
- Observability with Random Observations. (arXiv:1211.4077v1 [cs.SY])
- A probabilistic approach to Dirichlet problems of semilinear elliptic PDEs with singular coefficients. (arXiv:1211.3820v1 [math.PR])
- On a Class of Boundary Control Problems. (arXiv:1211.3634v2 [math.OC] UPDATED)
- Statistical inference on errorfully observed graphs. (arXiv:1211.3601v1 [stat.ML])
- The Bernstein-von Mises theorem for non-regular generalised linear inverse problems. (arXiv:1211.3434v2 [math.ST] UPDATED)
- Adaptive Estimation of Convex Sets and Convex Polytopes from Noisy Data. (arXiv:1211.3224v1 [math.ST])
- The relation between frequentist confidence intervals and Bayesian credible intervals. (arXiv:1211.3343v1 [physics.data-an] CROSS LISTED)
- The relation between Granger causality and directed information theory: a review. (arXiv:1211.3169v1 [cs.IT])
- Time-series Scenario Forecasting. (arXiv:1211.3010v1 [stat.ML])
- Volatility around the clock: Bayesian modeling and forecasting of intraday volatility in the financial crisis. (arXiv:1211.2961v1 [stat.AP])
- Study design in causal models. (arXiv:1211.2958v2 [stat.ME] UPDATED)
- Analysis of short term price trends in daily stock-market index data. (arXiv:1211.3060v1 [q-fin.ST])
- Recovering Optimal Solution by Dual Random Projection. (arXiv:1211.3046v3 [cs.LG] UPDATED)
- Unbounded Probability Theory and Its Applications. (arXiv:1211.3037v2 [math-ph] UPDATED)
- It\^o calculus and jump diffusions for $G$-L\&#39;evy processes. (arXiv:1211.2973v1 [math.PR])
- Martingale Problems under Nonlinear Expectation. (arXiv:1211.2869v1 [math.PR])
- Expectation Propagation in Gaussian Process Dynamical Systems: Extended Version. (arXiv:1207.2940v3 [stat.ML] CROSS LISTED)
- Sequentially interacting Markov chain Monte Carlo methods. (arXiv:1211.2582v1 [math.ST])
- Bayesian prediction for stochastic processes. (arXiv:1211.2300v1 [math.ST])
- Optimal Detection For Sparse Mixtures. (arXiv:1211.2265v1 [cs.IT])
- An Inverse Problem for Sturm-Liouville Operators on the Half-line Having Bessel-type Singularity in an Interior Point. (arXiv:1211.2395v1 [math.SP])
- Markov chain Monte Carlo for computing rare-event probabilities for a heavy-tailed random walk. (arXiv:1211.2207v1 [math.PR])
- Fluctuations of Martingales and Winning Probabilities of Game Contestants. (arXiv:1211.2045v1 [math.PR])
- Exact minimax estimation of the predictive density in sparse Gaussian models. (arXiv:1211.2071v2 [math.ST] UPDATED)
- Computational and Statistical Tradeoffs via Convex Relaxation. (arXiv:1211.1073v2 [math.ST] UPDATED)
- Inverse problems in approximate uniform generation. (arXiv:1211.1722v1 [cs.CC])
- Continuous random walks and fractional powers of operators. (arXiv:1211.1846v1 [math.PR])
- Concentration inequalities for mean field particle models. (arXiv:1211.1837v1 [math.PR])
- Stochastic viability and comparison theorems for mixed stochastic differential equations. (arXiv:1211.1814v1 [math.PR])
- High-Frequency Trading Synchronizes Prices in Financial Markets. (arXiv:1211.1919v1 [q-fin.TR])
- Central limit theorem for functionals of two independent fractional Brownian motions. (arXiv:1211.1967v1 [math.PR])
- An optimal control problem of forward-backward stochastic Volterra integral equations with state constraints. (arXiv:1211.1740v1 [math-ph])
- Area coverage of radial Levy flights with periodic boundary conditions. (arXiv:1211.1849v1 [cond-mat.stat-mech])
- Random walk in random environment, corrector equation, and homogenized coefficients: from theory to numerics, back and forth. (arXiv:1211.1834v1 [math.NA])
- Conditional inferential models: combining information for prior-free probabilistic inference. (arXiv:1211.1530v1 [math.ST])
- Time averaged Einstein relation and fluctuating diffusivities for the L\&#39;evy walk. (arXiv:1211.1539v2 [cond-mat.stat-mech] UPDATED)
- Brownian motion at short time scales. (arXiv:1211.1458v1 [cond-mat.stat-mech])
- Optimal expulsion and optimal confinement of a Brownian particle with a switching cost. (arXiv:1211.1595v1 [math.PR])
- Testing time series irreversibility using complex network methods. (arXiv:1211.1162v2 [physics.data-an] UPDATED)
- BSDEs with terminal conditions that have bounded Malliavin derivative. (arXiv:1211.1089v1 [math.PR])
- Brownian dynamics simulations with hard-body interactions: Spherical particles. (arXiv:1211.1308v1 [physics.comp-ph])
- Motion Planning via Optimal Control for Stochastic Processes. (arXiv:1211.1138v1 [math.OC])
- A generalized Polya&#39;s urn with graph based interactions. (arXiv:1211.1247v2 [math.PR] UPDATED)
- Random walk kernels and learning curves for Gaussian process regression on random graphs. (arXiv:1211.1328v1 [stat.ML])
- Stochastic perturbations in open chaotic systems: random versus noisy maps. (arXiv:1211.0698v1 [nlin.CD])
- Coupling and tracking of regime-switching martingales. (arXiv:1209.0180v1 [math.PR] CROSS LISTED)
- Rejoinder: Latent variable graphical model selection via convex optimization. (arXiv:1211.0835v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0817v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0813v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0811v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0808v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0806v1 [math.ST])
- Discussion: Latent variable graphical model selection via convex optimization. (arXiv:1211.0801v1 [math.ST])
- Sharp Bounds on Random Walk Eigenvalues via Spectral Embedding. (arXiv:1211.0589v1 [math.PR])
- Complete Algebraic Reconstruction of Piecewise-Smooth Functions from Fourier Data. (arXiv:1211.0680v1 [math.NA])
- Markov chain approximations for transition densities of L\&#39;evy processes. (arXiv:1211.0476v1 [math.PR])
- Large Deviations for SPDEs of Jump Type. (arXiv:1211.0466v1 [math.PR])
- On an Integral Equation for the Free Boundary of Stochastic, Irreversible Investment Problems. (arXiv:1211.0412v1 [q-fin.PM])
- The Emerging Field of Signal Processing on Graphs: Extending High-Dimensional Data Analysis to Networks and Other Irregular Domains. (arXiv:1211.0053v2 [cs.DM] UPDATED)
- Mean Field Theory of Dynamical Systems Driven by External Signals. (arXiv:1210.8260v2 [nlin.CD] UPDATED)
- Decision dynamics in complex networks subject to mass media and social contact transmission mechanisms. (arXiv:1210.8193v1 [physics.soc-ph])
- A stopping criterion for Markov chains when generating independent random graphs. (arXiv:1210.8184v1 [cs.SI])
- Linear-Nonlinear-Poisson Neuron Networks Perform Bayesian Inference On Boltzmann Machines. (arXiv:1210.8442v3 [cs.AI] UPDATED)
- Efficient simulation of nonlinear parabolic SPDEs with additive noise. (arXiv:1210.8320v1 [math.PR])
- A probabilistic numerical method for optimal multiple switching problem and application to investments in electricity generation. (arXiv:1210.8175v1 [math.NA])
- Partially Gaussian Stationary Stochastic Processes in Discrete Time. (arXiv:1210.7773v1 [math.PR])
- Derivative-variable correlation reveals the structure of dynamical networks. (arXiv:1210.7446v2 [physics.data-an] UPDATED)
- Randomized Matrix Computations. (arXiv:1210.7476v1 [math.NA])
- Inverse Spectral Theory for Sturm-Liouville Operators with Distributional Coefficients. (arXiv:1210.7628v1 [math.SP])
- Convolutional Compressed Sensing Using Deterministic Sequences. (arXiv:1210.7506v1 [cs.IT])
- A statistical mechanics approach to the sample deconvolution problem. (arXiv:1210.7508v1 [q-bio.QM])
- Tensor decompositions for learning latent variable models. (arXiv:1210.7559v2 [cs.LG] UPDATED)
- A central limit theorem for projections of the cube. (arXiv:1210.7012v2 [math.PR] UPDATED)
- An Exponential Lower Bound on the Complexity of Regularization Paths. (arXiv:0903.4817v3 [cs.LG] CROSS LISTED)
- The anti-Bayesian moment and its passing. (arXiv:1210.7225v1 [stat.OT])
- Managing sparsity, time, and quality of inference in topic models. (arXiv:1210.7053v2 [stat.ML] UPDATED)
- Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings. (arXiv:1210.6766v1 [cs.LG])
- Nested Hierarchical Dirichlet Processes. (arXiv:1210.6738v2 [stat.ML] UPDATED)
- Sparse Stochastic Processes and Discretization of Linear Inverse Problems. (arXiv:1210.5839v2 [cs.IT] UPDATED)
- [1210.4752] Discrete Signal Processing on Graphs
- D.: [1210.4752] Discrete Signal Processing on Graphs
- A Bayesian Approach to Constraint Based Causal Inference. (arXiv:1210.4866v1 [cs.AI])
- Stochastic Thermodynamics, Reversible Dynamical Systems and Information Theory. (arXiv:1210.5071v1 [cond-mat.stat-mech])
- Graph-Coupled HMMs for Modeling the Spread of Infection. (arXiv:1210.4864v1 [cs.SI])
- Discrete Signal Processing on Graphs. (arXiv:1210.4752v2 [cs.SI] UPDATED)
- Large Deviations for the solution of a Kac-type kinetic equation. (arXiv:1210.4468v1 [math.PR])
- The Zakai equation of nonlinear filtering for jump-diffusion observation: existence and uniqueness. (arXiv:1210.4279v2 [math.PR] UPDATED)
- Bayesian Estimation of Inverse Gaussian Distribution. (arXiv:1210.4524v1 [stat.ME])
- Poisson intensity parameter estimation for stationary Gibbs point processes of finite interaction range. (arXiv:1210.4402v2 [math.ST] UPDATED)
- Hilbert Space Embedding for Dirichlet Process Mixtures. (arXiv:1210.4347v1 [stat.ML])
- The Kernel Pitman-Yor Process. (arXiv:1210.4184v1 [cs.LG])
- An introduction to particle integration methods: with applications to risk and insurance. (arXiv:1210.3851v2 [q-fin.CP] UPDATED)
- Functional Methods in Stochastic Systems. (arXiv:1210.3934v1 [math-ph])