Apache Spark - MaRDI portal

Cited in

(only showing first 100 items - show all)

A Bayesian perspective of statistical machine learning for big data
GSA for machine learning problems: a comprehensive overview
Parametric Gaussian process regression for big data
User-defined tensor data analysis
Genetic programming + proof search = automatic improvement
Traditional and context-specific spam detection in low resource settings
A new accelerated proximal boosting machine with convergence rate \(O(1/t^2)\)
A semi-parallel framework for greedy information-theoretic feature selection
Distributed cooperative learning over time-varying random networks using a gossip-based communication protocol
Elephant against Goliath: performance of big data versus high-performance computing DBSCAN clustering implementations
An intuitive fuzzy approach for evaluating financial resiliency of supply chain
GPS
GoFFish
Modern Datalog Engines
An effective and efficient MapReduce algorithm for computing BFS-based traversals of large-scale RDF graphs
MLlib: machine learning in Apache Spark
scientific article; zbMATH DE number 7306895 (Why is no real title available?)
From distributed coordination to field calculus and aggregate computing
Equivalence classes and conditional hardness in massively parallel computations
\(k\)-means, Ward and probabilistic distance-based clustering methods with contiguity constraint
Spark solutions for discovering fuzzy association rules in big data
Regression Neural Networks with a Highly Robust Loss Function
Full likelihood inference from the site frequency spectrum based on the optimal tree resolution
MLP-ANN-based execution time prediction model and assessment of input parameters through structural modeling
KATZ centrality with biogeography-based optimization for influence maximization problem
Translating Scala programs to Isabelle/HOL. System description
Novel data-driven method for non-probabilistic uncertainty analysis of engineering structures based on ellipsoid model
A three-way cluster ensemble approach for large-scale data
Scaling up Bayesian variational inference using distributed computing clusters
Temporal concatenation for Markov decision processes
Computation Against a Neighbour: Addressing Large-Scale Distribution and Adaptivity with Functional Programming and Scala
GEODIS: towards the optimization of data locality-aware job scheduling in geo-distributed data centers
Statistical challenges of big brain network data
Computational fluid dynamics simulation based on hadoop ecosystem and heterogeneous computing
MuLOT: multi-level optimization of the canonical polyadic tensor decomposition at large-scale
A cloud computing-based intelligent forecasting method for cross-border e-commerce logistics costs
Performance Comparison of Machine Learning Platforms
Iterative selection of categorical variables for log data anomaly detection
Widening: using parallel resources to improve model quality
Mermaid
MapReduce
StreamIt
MOEA
ViennaCL
OPT4J
CADO-NFS
Scala
Mahout
G-Hadoop
Hadoop
Dryad
rapidminer
CELF++
SDEF
GraphLab
Pregel
pmml
Breeze
GraphX
MLlib
MLbase
PLANET
PSF
RAST
Vispark
RhpcBLASctl
GRADIENT
GGSA
Giraph
SAPPER
GraphLog
AMIDST
Flapjax
Helena
Djinn
Gen-O-Fix
GenProg
SparkSW
DryadLINQ
MetaSpark
VC3
ROSEFW-RF
stream
Spark
Azure
Elixir
Hive
Apache Pig
kdANN+
5tbl
MLaut
BigDatalog
AIDE
DSCOVR
DiSCO
eofs
Swift
rslurm
HPDBSCAN
Nak

This page was built for software: Apache Spark