Rainer Gemulla

Max-Planck-Institut für Informatik
Department 5: Databases and Information Systems
Campus E1 4, Room 404
66123 Saarbrücken
Germany

Phone: +49 681 9325 5004
Fax: +49 681 9325 599

I head the research group on Scalable Management of Uncertain Data in Department 5 of the Max-Planck-Institut für Informatik.


Research Interests




Teaching


     Current semester:
     Past semesters:

Publications


2011

R. Gemulla, P. J. Haas, Y. Sismanis, C. Teflioudi, F. Makari
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent. [pdf, slides]
In NIPS 2011 Biglearn workshop, 2011. (best paper award)

R. Gemulla, E. Nijkamp, P. J. Haas, Y. Sismanis
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent. [pdf, slides]
In KDD, pp. 69-77, 2011.

K. Beyer, V. Ercegovac, R. Gemulla, A. Balmin, M. Eltabakh, C. C. Kanne, F. Özcan, E. Shekita
Jaql: A Scripting Language for Large Scale Semistructured Data Analysis [pdf]
In PVLDB (industrial track), 4(11), pp. 1272-1283, 2011.

M. Y. Eltabakh, Y. Tian, F. Özcan, R. Gemulla, A. Krettek, J. McPherson
CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop. [pdf]
In PVLDB, 4(9), pp. 575-585, 2011.

R. Gemulla, P. J. Haas, E. Nijkamp, Y. Sismanis
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent. [pdf]
IBM Research Report RJ10481, March 2011.

B. Schlegel, R. Gemulla, W. Lehner
Memory-Efficient Frequent-Itemset Mining. [pdf]
In EDBT, pp. 461-472, 2011.

2010

S. Das, Y. Sismanis, K. S. Beyer, R. Gemulla, P. J. Haas, J. McPherson.
Ricardo: Integrating R and Hadoop. [pdf]
In SIGMOD (industrial track), pp. 987-998, 2010.

B. Schlegel, R. Gemulla, W. Lehner.
Fast Integer Compression using SIMD Instructions. [pdf]
In DAMON, pp. 34-40, 2010.

2009

K. Beyer, R. Gemulla, P. J. Haas, B. Reinwald, Y. Sismanis.
Distinct-Value Synopses for Multiset Operations. [pdf]
In Commun. ACM, 52(10), pp. 87-95, 2009.
Technical perspective by Surajit Chaudhuri.

B. Schlegel, R. Gemulla, W. Lehner.
k-Ary Search on Modern Processors. [pdf, slides]
In DAMON, pp. 52-60, 2009.

2008

R. Gemulla.
Sampling Algorithms for Evolving Datasets. [pdf, summary, slides]
Ph.D. thesis, Technische Universität Dresden, 2009.
URL for citations: http://nbn-resolving.de/urn:nbn:de:bsz:14-ds-1224861856184-11644

R. Gemulla, P. Rösch and W. Lehner.
Linked Bernoulli Synopses: Sampling Along Foreign Keys. [pdf, slides]
In SSDBM, pp. 6-23, 2008.

R. Gemulla and W. Lehner.
Sampling Time-Based Sliding Windows in Bounded Space. [pdf, slides]
In SIGMOD, pp. 379-392, 2008.

P. Rösch, R. Gemulla and W. Lehner.
Designing Random Sample Synopses with Outliers. (Poster) [pdf, poster]
In ICDE, pp. 1400-1402, 2008.

2007

R. Gemulla, W. Lehner and P. J. Haas.
Maintaining Bounded-Size Sample Synopses of Evolving Datasets. [pdf]
In The VLDB Journal, Special Issue: Best Papers of VLDB 2006, pp. 173-201, 2007.

K. Beyer, P. J. Haas, B. Reinwald, Y. Sismanis and R. Gemulla.
On Synopses for Distinct-Value Estimation Under Multiset Operations. [pdf, slides]
In SIGMOD, pp. 199-210, 2007.

R. Gemulla, W. Lehner and P. J. Haas.
Maintaining Bernoulli Samples over Evolving Multisets. [pdf, slides]
In PODS, pp. 93-102, 2007.

2006

R. Gemulla, W. Lehner and P. J. Haas.
A Dip in the Reservoir: Maintaining Sample Synopses of Evolving Datasets. [pdf, slides]
In VLDB, pp. 595-606, 2006.

Klein, R. Gemulla, P. Rösch and W. Lehner.
Derby/S: A DBMS for Sample-Based Query Answering. [pdf, poster1, poster2]
In SIGMOD (demo), pp. 757-759, 2006.

R. Gemulla and W. Lehner.
Deferred Maintenance of Disk-Based Random Samples. [pdf, slides]
In EDBT, pp. 423-441, 2006.