CHAITANYA K. BARU
Distinguished Scientist
San Diego
Supercomputer Center
Director,
Center for Large-scale Data Systems research (CLDS), clds.sdsc.edu
Director,
Advanced Cyberinfrastructure Development Group (ACID), acid.sdsc.edu
Address: University
of California, San Diego, 9500 Gilman Drive, La Jolla,
CA 92093-0505
Phone:
858-534-5035
email:
baru@sdsc.edu
RESEARCH INTERESTS
Scientific data management. Large-scale data systems. Data integration. Data analytics.
Parallel database systems.
Employment
San Diego Supercomputer Center, UC
San Diego
2008 – Distinguished Scientist, SDSC. Also, Chair, SDSC Project Advisory Council and Director of my
R&D group, Advanced
Cyberinfrastructure Development (ACID).
2004
– 2008 Division Director, Science R&D Division.
Member of the SDSC Senior Management Team. Provided
leadership and advice to researchers in this Division, while also serving as a
PI in my own right. Also, Director of the SDSC/Calit2
Synthesis Center (www.syncenter.org)
and co-Director, National Laboratory for Advanced Data Research (NLADR, www.nladr.org).
2004 – Member, California Institute for
Information Technology and Telecommunication
Principal Investigator at Calit2. Also, promoted
collaborations between Calit2 and SDSC, including creation of the SDSC/Calit2 Synthesis Center.
2001 – 04 Co-Director, Data and Knowledge Systems Program.
Provided
leadership for all data-oriented R&D activities, including establishing
ÒLabsÓ at SDSC. Also, served as a PI in my own right.
2000 – 01 Assistant Director, Data Intensive
Computing Environments (DICE) group.
Duties
were the same as my previous job at SDSC (see below for details), which was
technically with a different employer.
San Diego Supercomputer Center, General Atomics, La Jolla,
CA
1996–2000 Senior Principal Scientist, Data
Intensive Computing Environments (DICE) group.
Technical
Project Manager for the Distributed
Object Computation Testbed (DOCT) project led by Reagan Moore. Provided
leadership in data-related R&D activities for the group.
IBM Corporation
1992
– 95 Advisory Programmer, Database Technology Institute, IBM
Almaden Research Labs, San Jose, CA (1995).
Advisory Development Analyst and Group Lead, Database Technology Group, IBM Toronto Labs
(1992-95).
One
of the three team leads responsible for the design and
development of DB2 Parallel Edition V1, released December 1995.
University of Michigan, Ann Arbor, MI
1985 – 92 Assistant Professor, Department of
Electrical Engineering and Computer Science, University of Michigan, Ann Arbor,
MI 48109-2122.
á
Member
of Advanced Computer Architecture Lab (ACAL)
á
Member
of Software Systems Research Lab (SSRL)
á
Member,
Executive Committee, Univ. of Michigan Human Genome Center,
led by Dr. Francis Collins.
Performed research and taught
undergraduate and graduate courses in computer science and engineering.
Received
NSF and other research awards; graduated 3 Ph.D. students; received Outstanding
Teacher award from Dean of Engineering.
Cyberinfrastructure Leadership Activities
1.
Project Director, The Geosciences
Network (GEON, www.geongrid.org), since
2002.
PI of the original large NSF ITR grant ($11.5M total funding) involving 12 PI institutions.
The project was renewed as GEON 2.0 and is currently active. It also spun off
another independently funded project called, OpenTopography.org.
2.
Director of Cyberinfrastructure for
NSF National Earthquake Engineering Simulations (NEESit), 2007-present.
Joined NEESit as an
Cyberinfrastructure Advisor in 2006. In 2007, helped retain funding for NEESit
and continue operations at UCSD. Served as Director for NEESIt
since Oct 2007.
3.
Cyberinfrastructure Lead/PI for the
Tropical Ecology, Assessment and Monitoring Network (TEAM), 2007-present.
TEAM is a project of Conservation
International, funded by the Moore Foundation. The grant is through Calit2.
4.
Member of Senior Management Team,
NSF National Ecological Observatory Network (NEON, www.neoninc.org), 2005-2007.
Cyberinfrastructure Lead for the
NEON Testbed, and co-PI of the NEON Cyberinfrastructure Diagnostic Testbed.
5.
Lead, KatrinaSafe
Database Project.
Worked in close collaboration with American Red Cross to develop KatrinaSafe, a ÒliveÓ database of victims of Hurricane
Katrina. This led to the development of DisasterSafe,
hosted at SDSC—a standard service offered by Red Cross for victims of any
disaster.
6.
Member, IRIS Data Management System
Standing Committee, 2007-2009.
IRIS is the NSF data archives for seismological data.
7.
Co-Director, National Laboratory for
Advanced Data Research (NLADR, www.nladr.org)
Joint activity
with the National Computational Science Alliance (NCSA) with Dr. Michael Welge as the other co-Director.
8.
Executive Director, SDSC/Calit2
Synthesis Center (www.syncenter.org),
2005-2008.
Joint facility consisting of SDSC staff
and equipment located at Calit2.
9.
Member of Cyberinfrastructure
Advisory Committee, Long-Term Ecological Research Network (LTER, www.lternet.org), 2006.
10.
Member of Advisory Board, CLEANER
Project Office, 2005-2006.
11.
Co-Convener, NSF Earth Science CyberInfrastructure (ES-CI) Task Force (with Lee Allison
and Tom Jordan), 2004.
12.
SDSC PI for CUAHSI Hydrologic
Information System (http://his.cuahsi.org), 2004-2008.
13.
Member of Leadership Team,
Biomedical Informatics Research Network (BIRN, www.nbirn.net),
2001-2004. One of
the co-Investigators of the original BIRN Coordinating Center (BIRN-CC).
UCSD and other UC
Collaborations
1. CARTA: Cyberinfrastructure and
bioinformatics lead for the UCSD / Salk Institute-led ORU Center for Advanced
Research and Training in Anthropogeny led by Profs. Ajit Varki, Margaret Schoeninger, and Rusty Gage (Salk). Funded by the Mathers Foundation. Duration: 2007—ongoing.
2. CYCORE: Co-PI of Cyberinfrastructure for
Comparative Effectiveness Research project funded by NIH. Project
is led by Dr. Kevin Patrick (SOM & Calit2) in collaboration with
M.D. Anderson Cancer Center, Houston. Duration: October 2009—September
2010.
3. CISA3: Co-PI with Profs. Tom Levy and Falko Kuester of the Mediterranean Archaeology Network (MedArchNet). Funded by the UCSD ChancellorÕs Collaboratory initiative, for 2009-2010 academic year.
4. 911: PI of NSF-funded project on
Spatiotemporal Analysis of 911 Call Stream Data, with Prof. William Hodgkiss, SIO, as co-PI. Duration: 2004-2008.
5. Hazards: Coordinated a hazards initiative on
campus with funding from OVCR, JSOE, SIO, and SDSC. Pre-proposal on
Cyberinfrastructure Center for Urgent Response to Emergencies (CICURE) submitted
to the NSF STC program (not selected). Other proposal planning activities are
under way.
6. WIISARD: Co-investigator (with Prof. Leslie
Lenert as PI) on the original Wireless Internet
Information System for Medical Response in Disasters project. Funded by NIH,
2005-2007. Was responsible for the data management component.
7. BIRN: Collaborated with Prof. Mark Elisman as co-Investigator on the original BIRN-CC project,
funded by NIH, 2001. Was responsible for the data integration component.
8. I2T: PI of the NSF-funded Information
Integration Testbed (I2T) project with Prof. Yannis Papakonstantinou (CSE) as co-PI. Duration: 2002-2004.
9. UC-SGH: Co-PI on a proposal for a Center
of Excellence on Disasters to the UC School of Global Health with Prof. Craig
Van Dyke, UCSF (PI), and Profs. Gretchen Kalonji
(UCOP) and Nicholas Sitar (UCB). (not selected).
10. RISC MRU: Co-PI on a multi-campus research
unit (MRU) proposal on Rapid Information for Science during Catastrophes
(RISC), led by Prof. Emily Brodsky, UCSC. (not selected).
Software Development
1.
One
of the group leaders and developers of IBMÕs DB2 Parallel Edition Version 1.0,
released commercially in December 1995.
2.
One
of the designers of the SDSC Storage Resource Broker (SRB). Version 1 was
released in September 1997.
3.
One
of the designers of the Data Integration Cartª technology for ontology-based
data integration (invention disclosure filed: 2007).
U.S. Patents
1.
Persistent Archives, R. Moore, A. Rajasekar, C. Baru, B. Ludaescher, A. Gupta, R. Marciano, US Patent 7,349,915,
March 25, 2008.
2.
Persistent Archives, R. Moore, A. Rajasekar, C. Baru, B. Ludaescher, A. Gupta, R. Marciano, US Patent 6,963,875,
November 8, 2005.
3.
System and method for construction,
storage, and transport of presentation-independent multimedia content, C. Baru, J. Chase, T. Elvins, R. Fassett, E. Nebel, Patent No. 7,028,252, March 22, 2001.
4.
Method and apparatus for achieving
uniform data distribution in a parallel database system, C. Baru and F. Koo, Patent No.US5970495, IBM, Oct.19, 1999.
5.
Method and apparatus for
implementing partial declustering in a parallel
database system, C. Baru, G. Fecteau,
J. Kirton, L. Kollar, F.
Koo, Patent No. US5878409, IBM, March 2, 1999.
Ph.D. Committees Chaired
á Ophir Frieder. Dissertation title: "Database processing on a cube-connected multicomputer
system," EECS Dept., University of Michigan, Dec. 1987. Recipient of IBM Graduate Fellowship
Award.
Currently, Chaired Professor,
Illinois Institute of Technology, Chicago, IL.
á Piyush Goel. Dissertation title: "Dataflow query processing and optimization," EECS Dept.,
University of Michigan, May 1992.
Was co-Founder of Everypath.com, San
Jose, CA.
á Sriram Padmanabhan. Dissertation title: "Data placement in shared-nothing parallel database
systems," EECS Dept., University of Michigan, July 1992. Recipient of
IBM Graduate Fellowship Award.
Currently, Distinguished Engineer,
IBM Silicon Valley Labs.
Key Committee Memberships
á Lead, TeraGrid Data Working Group,
2001–2002.
á Member of Review Committee, Canada
Research Chairs program, Natural Sciences and Engineering Research Council
(NSERC) of Canada, 2000–2002.
á Member of the Architecture Working
Group, California Digital Library, University of California, Office of the
President, Oakland, CA, 1998–2000.
á Member of the Grants Selection Committee (GSC) for Computer and Information
Sciences, Natural Sciences and Engineering Research Council (NSERC) of Canada,
1994-97. (The GSC is responsible
for annually reviewing grant proposals from computer science faculty in Canada
and making funding decisions).
á IBM representative on the
Transaction Processing Council's TPC-D Benchmark Standard Subcommittee,
1993-95. Participated in drafting the original TPC-D specification.
Funded Research: Principal
Investigator (23 grants, $15.3M funding)
1. Geoinformatics: GEON 2.0: A Data Integration
Facility for the Earth Sciences (Co-PIs: Randy Keller, U.
Oklahoma; Ramon Arrowsmith, Arizona State; Charles Meertens, UNAVCO; Ann Gates,
UT El Paso)
Funding: $1,158,150, Aug 2008-July 2010
Sponsor: NSF GEO/Earth Sciences: Geoinformatics
2. Performance Evaluation of On-Demand
Provisioning of Data Intensive Applications (Co-PI: Sriram Krishnan, SDSC)
Funding: $450,000, Jan 2009-Dec 2010
Sponsor: NSF CISE/Cluster Exploratory program
3. OpenTopography - A National Hub for High Resolution
Topographic Data, Tools, and Knowledge (Co-PI: Ramon Arrowsmith, Arizona
State)
Funding: $1,400,000, Sept 2009-Aug 2012
Sponsor: NSF GEO/Earth Science Facilities
4. Cyberinfrastructure for the Tropical Ecology
Assessment and Monitoring Network
Funding: $916,417, Aug 2006 – current
Sponsor: Moore Foundation (via Conservation
International)
5. Cyberinfrastructure Preparedness for Emergency Response and
Relief: Learning the lessons from Hurricane
Katrina
Funding: $200,000, 2007
Sponsor: NSF CISE/Integrated
Information Systems
6.
Cyberinfrastructure
Design for NEON
Funding: $156,861, 2006-2007
Sponsor: NSF BIO
7. Geoinformatics 2006
Funding: $38,986, 2006
Sponsor: NSF
EAR/Geoinformatics
8.
Cyberinfrastructure
Summer Institute for Geoscientists
Funding $70,000, 2004
Sponsor: NSF EAR/Geoinformatics
9. GEON: The Geosciences Network—A Research
Project to Create Cyberinfrastructure for the Geosciences
Collaborative grant with UCSD as
lead institution and 10 other PI institutions and 2 subcontracts.
Funding: Total: $11,6000,000 (UCSD: $5,987,927), October 2002—September 2007
Sponsor: NSF ITR
Program
10. Development of Informatics Infrastructure for Hydrologic Sciences (Co-PI: Ilya
Zaslavsky.
Collaborative project with UT Austin, UIUC, Drexel)
Funding: $1,271,221, April 2004-March 2006
Sponsor: NSF GEO/EAR
11. Spatiotemporal Analysis of 9-1-1 Call Stream
Data
Funding: $918,970,
August 2004—September 2007
Sponsor: NSF CISE/Digital Government
12. Data Intensive Grid Benchmarks (Co-PIs:
Allan Snavely, Henri Casanova, SDSC)
Funding: $749,999,
October 2002—September 2006
Sponsor: National Middleware Initiative (NMI), CISE, National Science Foundation
13. INGREIN Phase II: Integrated Georeferenced
Environmental Information Network
Funding: $250,000, September 2002-September
2003
Sponsor: University of California Life
Science Informatics Program (LSIT) and Environmental Systems Research Institute
Inc. (ESRI), Redlands
14. Building a Scalable Biological Information Network - A
Workshop Proposal (Co-PI: Peter Arzberger)
Funding: $93,106, 2006
Sponsor: NSF BIO
15. I2T: An Information Integration Testbed for Digital Government (Co-PIs: Yannis
Papakonstantinou, Amarnath
Gupta) (Collaboration with U.Penn and U.Michigan)
Funding: $720,000, July 2000-June 2003
Sponsor: National Science Foundation
16. INGREIN: Integrated Georeferenced
Environmental Information Network
Funding: $250,000, May 2001-April 2002
Sponsor: University of California Life Science
Informatics Program (LSIT) and Environmental Systems Research Institute Inc.
(ESRI), Redlands
17. Current Research Information Systems
Funding: $200,000, January 2000 to December
2001
Sponsor: Community of Science, Inc.,
Baltimore, MD
18. Evaluation of the ArcXML Specification
Funding: $124,000, 1.5 years. June 2000 to
December 2001
Sponsor: ESRI Inc., Redlands, CA
19. Information Integration Architecture for a National Statistical Data
Infrastructure
(Co-PI: Amarnath Gupta)
Funding:
$38,000, April 2000-July 2000
Sponsor: National Science Foundation
20. Experimental facility for research
in PArallel Database Systems (PADS)
Funding: $40,205, one-time, equipment grant
Sponsor: AT&T Foundation
21. Multicomputer system for parallel database processing
Funding: $94,728, one-time, equipment grant
Sponsor: National Science Foundation
22. Database processing on a cube-connected multicomputer system
Funding: $59,650, September 1987 to February
1990
Sponsor: National Science Foundation
23. Parallel architecture features for supporting efficient database
computation
Funding: $9,930, May 1986 to April 1987
Sponsor: Rackham Graduate School, University
of Michigan
Funded Research: Co-principal
Investigator (10 grants, $8M funding)
1. Management and Analysis of Environmental Observatory Data
Using the Kepler Scientific Workflow System (PI: Matt Jones, UC Santa Barbara).
Funding: $2,713,203, 2006-2010
Sponsor: NSF BIO
2. NEON CI/ECI Preliminary
Design and Diagnostic Testbed (PI: David Schimel,
NEON Inc)
Funding: $482,348, 2007-2008
Sponsor: NSF BIO
3. Transforming Morphological Systematics From Desktop to Web
Applications: Development of the Online Workspace Morphobank.org 3.0 (PI: Maureen OÕLeary,
SUNY Stony Brook).
Funding: $610.614
Sponsor: NSF BIO
4. Enabling the Creation and Use of GeoGrids for
Next Generation Geospatial Information (PI: Peggy Agouris, Univ of Maine).
Funding: $448,717,
September 2001—August 2006.
Sponsor: NSF CISE/Digital Government
5.
CHRONOS Network for Earth Systems
History: Development of Integrated Databases and Toolkits Accessible Through a
Common Portal (PI: Cinzia Cervato, Iowa State Univ)
(Collaborative project, with Iowa State)
Funding: $281,000, August 2003-July 2005
Sponsor: NSF GEO/EAR
6.
CLEANER: Research for the Coastal
Margin (PI: Ilya
Zaslavsky, SDSC)
(Collaborative project led by Texas A&M)
Funding: $5,421 (SDSC allocation), July
2004-May 2005
Sponsor: NSF OCI
7.
Building the Geoinformatics System:
A Workshop on Planning and Coordinating Environmental Cyberinfrastructure Activities
in the Earth Sciences
(PI: Lee Alison)
Funding: $61,258
8.
Biomedical Imaging and Research
Network Coordinating Center (PI: Mark
Ellisman, UCSD)
Funding: $4.057,157, May 2004-April 2005
Sponsor: NCRR, National Institutes of Health
9.
Data management for electronic
commerce applications
(Co-PI, with Yannis Papakonstantinou,
UCSD)
Funding: $100,000, January 1997 to December
2000
Sponsor: Center for Advanced Studies, IBM
Toronto Labs
10.
Study of performance monitoring and
evaluation techniques for parallel database systems (Co-PI, with Jeffrey Naughton, Univ. of Wisconsin)
Funding: $100,000, January 1995 to December
1998
Sponsor: Center for Advanced Studies, IBM
Toronto Labs
Funded Research: Senior Personnel
Served as Senior Personnel in over
20 projects.
Books
1. Geoinformatics: Cyberinfrastructure for the Solid Earth Sciences. Edited by Randy Keller and
Chaitanya Baru. Cambridge University Press. Under preparation.
2. Portal Science and Archaeology – View from the Mediterranean Lands. Edited by Thomas E. Levy, Stephen
H. Savage, and Chaitanya Baru, and Oystein LaBianca. Equinox Press. Under preparation.
Book Chapters
1. ÒScience
Gateways: Harnessing clouds and software services for science,Ó N. Wilkins-Diehr, C. Baru, D. Gannon, K. Keahey,
J. McGee, M. Pierce, R. Wolski, W. Wu. To appear in Cloud Computing and Software Services:
Theory and Techniques, under preparation. CRC Press.
2. ÒData
Intensive Computing,Ó (with Moore, R., Marciano, R., Rajasekar, A., and Wan,
M.), in The Grid: Blueprint for a New
Computing Infrastructure, edited by Ian Foster and Carl Kesselman, January
1999, Morgan Kaufmann.
3. ÒVirtualization
Services for Data GridsÓ (with Moore, R.), to appear in, Grid Computing:
Making the Global Infrastructure a Reality, edited by Fran Berman, Geoffrey
Fox, and Tony Hey, to be published February 2003, Wiley.
Journal Articles
1.
ÒMediating
among GeoSciML resources,Ó C. Baru and K. Lin. Volume 2, Supplement 1, 2009. International Journal of Digital Earth. Publishers:
Taylor & Francis.
2.
ÒThe
GEON service-oriented architecture,Ó C. Baru, S. Chandra, K. Lin, A. Memon, C. Youn. Volume 2, Supplement 1, 2009. International Journal of Digital Earth. Publishers: Taylor &
Francis.
3.
"An
extensible information model for shared scientific data collections,"
(with A. Gupta), Journal on Future
Generation Computer Systems, 1999.
4.
"DB2
Parallel Edition," (with Fecteau et al), IBM Systems Journal, April 1995.
5.
"Site
selection and query scheduling policies for a multicomputer database
system," (with O. Frieder), IEEE
Transactions on Knowledge and Data Engineering, August 1994.
6.
"Inter-subcube
routing in hypercubes," (with S. Padmanabhan), accepted for publication in
the IEEE Transactions on Parallel and
Distributed Systems, 1993.
7.
"Join
and data redistribution algorithms for hypercubes," (with S. Padmanabhan),
IEEE Transactions on Knowledge and Data
Engineering, June 1993.
8.
"Squashed
embedding of E-R schemas in hypercubes," (with P. Goel), Journal of Parallel and Distributed
Computing, Vol.8, No.4, April 1990, pp.340-348, Academic Press.
9.
"Database
operations in a cube-connected multicomputer system," (with O. Frieder), IEEE Trans. on Computers, Vol. 38, No.
6, June 1989, pp.920-927.
10. "The Architecture of SM3: A
dynamically partitionable multicomputer system," (with S.Y.W. Su), IEEE Trans. on Computers, Vol. 35, No.
9, Sept. 1986, pp.790-802. Also
appeared in the IEEE Tutorial on Parallel Architecture for Database Systems,
IEEE Computer Society Press, 1989.
11. "Dynamically partitionable multicomputers
with switchable memories," (with S.Y.W. Su), Journal of Parallel and Distributed Computing, Vol. 1, No. 2,
Academic Press, Nov. 1984, pp.152-184.
Conference
Presentations
1. M. Smeekens, C. Baru, G.R. Keller,
R. Arrowsmith, C. Crosby, ÒTeaching and Training in Geoinformatics: Experiences
from the Cyberinfrastructure Summer Institute for Geoscientists (CSIG),Ó to be
presented at the Fall AGU, Dec. 2009, San Francisco.
2. C. Crosby, V. Nandigam,
J. R. Arrowsmith, S. Balakrishnan, N. Alex, C. Baru
(2008), ÒA Cyberinfrastructure Platform for Distribution of GeoEarthScope
LiDAR Topography Data,Ó Eos Trans. AGU, 89(53), Fall Meet. Suppl., Abstract
IN51A-1146.
3. C. Baru and K. Lin, ÒFederating
metadata catalogs,Ó (invited), European Geosciences Union, General Assembly,
Vienna, Austria, 19-24, 2009, EGU2009-11848.
4. S. Andelman,
C. Baru, S. Chandra, E. Fergaus, K. Lin, R. Unwin (2009), ÒA data scheduling and management
infrastructure for the TEAM Network,Ó European Geosciences Union, General
Assembly, Vienna, Austria, 19-24, 2009, EGU2009-11095.
5. C. Crosby, V. Nandigam,
C. Baru (2009), ÒRecommendation systems for Geoscience data portals –
Harnessing usage patterns,Ó European Geosciences Union, General Assembly,
Vienna, Austria, 19-24, 2009, EGU2009-6620.
6. A. Agarwal,
C. Baru, C. Crosby, R. Keller, K. V. Subbarao, V. Nandigam,
ÒiGEON-India: An international collaborative activity
of the GEON Project,Ó European Geosciences Union, General Assembly, Vienna,
Austria, 19-24, 2009, EGU2009-11792.
7. ÒGenerating composite thematic maps
from semantically-different collections of shapefiles and map services,Ó
November 2004, Memon, G., A. Memon, K. Lin, I. Zaslavsky,
C. Baru, ESRI 2005
8. Geoinformatics: Coordination of
Environmental Cyberinfrastructure for the Earth Sciences
November 2002, Allison, M.L., C. Baru, T.H. Jordan,
Abstract, GSA Annual Meeting, November 2-5, 2003, Seattle, WA
9. ÒThe GEON Grid Software
Architecture,Ó Twenty-fourth Annual ESRI International User Conference, August
9-13, 2004, San Diego, CA.
10. ÒStandards-based Secure Invocation
of ArcWeb Services,Ó (with Ashraf Memon, Ilya
Zaslavsky, Steve Mock, and Amit Behere),
Twenty-fourth Annual ESRI International User Conference, August 9-13, 2004, San
Diego, CA
11. ÒInteroperability of Databases and
Software Tools: Building Geoinformatics and the Cyberinfrastructure,Ó poster
(with Lee Allison), American Association of Petroleum Geologists Annual
Meeting, April 18-21, 2004, Dallas, TX
12. ÒGEON: Cyberinfrastructure for the Geosciences,Ó
(with Dogan Seber, Randy Keller, and Krishna Sinha), American Geophysical
Union, Fall Meeting, 2003, San Francisco, CA.
13. ÒCreating Grid Services to Enable
Data Inter-operability: An Example from the GEON Project,Ó (with Bhatia, K.,
Memon, A., Zaslavsky, I., Seber, D), GSA Annual Meeting, November 2-5, 2003,
Seattle, WA.
14. ÒSemantic Mediation Services in Geologic
Data Integration: A Case Study from the GEON Grid,Ó (with Lin, Kai, Bertram
Ludaescher, Boyan Brodaric, Dogan Seber, and Krishna Sinha), GSA Annual
Meeting, November 2-5, 2003, Seattle, WA
15. ÒBuilding the Geoinformatics System:
Coordination of the environmental Cyperinfrastructure
for the Earth Sciences,Ó (with Lee Allison and Tom Jordan), GSA Annual Meeting,
November 2-5, 2003, Seattle, WA.
16. ÒGEON: The Geoinformatics Network,Ó
(with Krishna Sinha), Annual Meeting of the Geological Society of America,
October 26-30, 2002, Denver, CO.
17. ÒGrid-Enabled Mediation Services for
Geospatial Information,Ó (with Ilya Zaslavsky), Workshop on Next-Generation
Geospatial Information, October 20-21, 2003, Cambridge, MA.
18. ÒData Access and Management Services
on the Grid,Ó (with Raman, V., Narang, I., Crone, C.,
Haas, L., Malaika, S., Mukai,
T., Wolfson, D.), Global Grid Forum 5, July 21-24,
2002, Edinburgh, Scotland.
19. ÒInformation integration of geospatial
informationÓ, Demo, NSF Digital Government Conference, May 19-21, 2002, Redondo
Beach, CA.
20. "Information Integration
Technologies for Current Research Information Systems," CRIS 2000 Conference, May 25-27, 2000,
Helsinki, Finland.
21. "XML-based information
mediation for digital libraries," (with Papakonstantinou, Gupta, Marciano,
Ludaescher, Velikhov), demo at ACM Digital Libraries'99, Berkeley, CA, August 1999.
22. "XML-based information
mediation with MIX," (with Papakonstantinou, Gupta, Marciano, Ludaescher, Velikhov), demo at ACM-SIGMOD'99,
Philadelphia, PA, June 1999.
23. "Querying Web Mediators,"
(with Chu, Gupta, Ludaescher, Marciano, Papakonstantinou, Velikhov),
demo at the Internet2 meeting,
Washington, DC, April 1999.
24. "Xviews:
XML views of relational schemas," in Database
and Expert Systems Applications (DEXA'99), Sept. 2, 1999, Florence, Italy.
25. ÒIntegrating GIS and Imagery through
XML-Based Information Mediation,Ó (with Gupta, Marciano, Zaslavsky), NSF
International Workshop on Integrated Spatial Databases: Digital Images and GIS,
June 14-16, 1999, Portland, Maine.
26. "The SDSC Storage Resource
Broker," (with R. Moore, A. Rajasekar, M. Wan), in the Proceedings of CASCON'98, Nov.30-Dec.3,
1998, Toronto, ON.
27. "A Hierarchical Access Control
Scheme for Digital Libraries," (short paper, with A. Rajasekar), in the Proceedings of the ACM Digital Libraries
Conference 1998, June 24-26, 1998, Pittsburgh, PA.
28. "Data Handling Architecture for
a Prototype Federal Application," (with Choi et al), Proceedings of the IEEE Mass Storage Systems Conference, March
1998, College Park, MD.
29. "Managing Very Large Scientific
Data Collections," (poster), 5th
International Conference on High Performance Computing (HiPC'98),
Dec.17-20, 1998, Chennai, India.
30. ÒFeatures and requirements for an
XML view definition language: Lessons from XML information mediation,Ó (with
Ludaescher, Papakonstantinou, Vianu), position paper
in QL'98-The Query Languages Workshop, World Wide Web Consortium, Dec.3-4 1998,
Boston, MA.
31. "Integrating DB2 with
HPSS," (poster, with M. Lo, S. Padmanabhan, and V. Gottemukalla)
to be presented at Supercomputing'98,
Nov.10-12, Orlando, FL.
32. "Archiving Metadata,"
(poster), European Conference on Digital
Libraries, EuroDL'98, September 21-23, 1998, Crete, Greece.
33. "Information Based
Computing," (with Moore, Karin, Rajasekar) Proceedings of the Workshop on Research and Development Opportunities
in Federal Information Services, March 1997.
34. "Metadata to Support
Information-Based Computing Environments," (with Moore et al), IEEE Conference on Metadata, Metadata-97,
September 10-11, 1997, Silver Springs, MD.
35. "An Overview of DB2 Parallel
Edition," (with G. Fecteau et al), ACM SIGMOD-95, International Conference on the
Management of Data, San Jose,
CA, May 1995.
36. "TPC-D: Analysis of an industry
standard decision support systems benchmark," (with P. Larson, T.S. Liu,
B. Schiefer, and S. Retnasena),
Proceedings of DB/EXPO-94, San
Francisco, May 1994.
37. "Data reorganization in
parallel database systems," (with Daniele C. Zilio, IEEE Workshop on Advances in Parallel and Distributed Systems,
October 6, 1993, Princeton, N.J.
38. "Data placement in
shared-nothing parallel database systems," (with S. Padmanabhan), Proceedings of the Conference on Information
and Knowledge Management, November 1992, Baltimore, MD.
39. "Routing between subcubes in a hypercube," (with S. Padmanabhan), Proceedings of the 6th Distributed Memory
Computing Conference, Portland, OR, April 1991.
40. "A comparison of join
algorithms for hypercubes," (with S.
Padmanabhan, Q. Stout, and B. Wagar), Proceedings of the 4th Hypercube Conference,
Monterey, CA., March 1989.
41. "Mapping E-R schema graphs onto
hypercubes," (with P. Goel), Proceedings of the 4th Hypercube
Conference, Monterey, CA., March 1989.
42. "Query scheduling and site
selection algorithms for a cube-connected multicomputer system," (with O.
Frieder), Proceedings of the
International Conference On Distributed Computer Systems, June 1988, San
Jose, CA.
43. "Study of dynamic data
redistribution for database processing," (with O. Frieder),
Proceedings of the 3rd Hypercube
Conference, Pasadena, CA., Jan. 1988.
44. "Join on a cube: Analysis,
simulation, and implementation," (with O. Frieder, D. Kandlur, and M.
Segal), Proceedings of the 5th International
Workshop on Database Machines, Oct.
1987, Karuizawa, Japan, pp.74-87.
45. "Data distribution and query
scheduling policies for a cube-connected multicomputer system," (with O.
Frieder), Proceedings of the 2nd
International Conference on Supercomputing Systems, San Francisco, CA. May
1987.
46. "Issues in query processing on
a cube-connected multicomputer system," (with O. Frieder), Proceedings of the 6th IEEE Pheonix
Conference on Computers and Communications, Scottsdale, AZ., Feb. 25-27,
1987, pp.315-319.
47. "Implementing relational
database operations in a cube-connected multicomputer system," (with O.
Frieder), Proceedings of the 3rd IEEE
International Conference on Data Engineering, Los Angeles, CA., Feb.2-6, 1987.
48. "Matrix multiplication on a
multicomputer system with switchable main memory modules," (with A.
Thakore and S.Y.W. Su), Proceedings of
the 1st International Supercomputing Systems Conference, Tarpon Springs,
FL., Dec.17-20, 1985, pp.650-659.
49. "Performance evaluation of the
statistical aggregation by categorization in the SM3 system," (with S.Y.W.
Su), Proceedings of the ACM-SIGMOD
International Conference on Management of Data, Boston, MA., June 18-21, 1984, pp.77-89.
50. "SM3: A dynamically
partitionable multicomputer system with switchable main memory modules," (with T.
Fei and S.Y.W. Su), Proceedings of the
IEEE International Conference on Data Engineering, Los Angeles, CA., April
24-27, 1984, pp.42-49.
Conference Posters
1. C. Baru, S. Krishnan, K. Lin, J.
Moreland, D. Nadeau (2009), ÒService-oriented infrastructure for scientific
data mashups,Ó to be presented at the Fall AGU
Meeting, Dec. 2009, San Francisco.
2. J. Moreland, D. Nadeau, C. Baru, C.
Crosby (2009), ÒAn OpenEarth Framework (OEF) for integrating and visualizing
earth science data,Ó to be presented at the Fall AGU Meeting, Dec. 2009, San
Francisco.
3. V. Nandigam,
C. Crosby, C. Baru (2009), ÒPromoting scientific collaboration and research
through integrated social networking capabilities,Ó European Geosciences Union,
General Assembly, Vienna, Austria, 19-24, 2009, EGU2009-9430.
4. K. Lin, A. Memon, C. Baru (2008),
ÒLinking the EarthScope Data Virtual Catalog to the GEON Portal,Ó Eos Trans.
AGU 89(53), Fall Meet. Suppl. Abstract IN51A-1144.
5. A. Memon, K. Lin, C. Baru (2008),
ÒCatalog federation and interoperability for Geoinformatics,Ó Eos Trans. AGU
89(53), Fall Meet. Suppl. Abstract IN33A-1156.
6. C. Baru, G. R. Keller, B. Wallet, C.
Crosby, J. Moreland, D. Nadeau, ÒIntegrating diverse geophysical and geological
data to construct multi-dimensional earth models: The OpenEarth Framework,Ó Eos
Trans. AGU 89(53), Fall Meet. Suppl. Abstract IN51A-1147.
7. C. Crosby, J.L. Blair, V. Nandigam, A. Memon, C. Baru, J. R. Arrowsmith, ÒKML-based
access and visualization of high resolution LiDAR topography,Ó Eos Trans. AGU
89(53), Fall Meet. Suppl. Abstract IN41B-1149.
Conference and
Workshop Organization
Involved as organizer, co-organizer, Program Committee
member, or session chair for over 30 different conferences / workshops.
Teaching
á Undergraduate:
Courses in Data Structures and Algorithms, Logic Design, and Database
Management Systems, EECS Dept., University of Michigan, Ann Arbor.
á Graduate:
Courses in Database Machine Architecture, Database Management Systems, and
Advanced Topics in Database Systems, EECS Dept., University of Michigan, Ann
Arbor.
Tutorials
á Tutorial on "Warehousing
Scientific Data," NASA Workshop on Data Mining and Data Warehousing,
August 19-20, 1997, Silver Springs, MD.
á Tutorial on "Managing Large
Data Sets," CASCON'96, November 12-14, 1996, IBM Toronto Labs, Toronto,
ON.
á Tutorial on "Parallel Database
Systems: The First Generation Products,"
International Conference on Parallel and Distributed Information Systems (PDIS),
September 28-30, 1994, Austin, TX.
Invited Talks
Have given over 50 invited talks at various institutions and
conferences.
Education
1983 – 85 Doctor
of Philosophy (Ph.D.) in Electrical
Engineering, University of Florida, Gainesville, FL, 1985.
1981 – 83 Master's
in Engineering (M.E.) in Electrical
Engineering, University of Florida, Gainesville, FL,
1983 (received Outstanding MasterÕs Thesis Award).
1974 – 79 Bachelor
of Technology (B.Tech.)
in Electrical Engineering, Indian Institute of Technology, Madras, India, 1979.