CHAITANYA K. BARU

Distinguished Scientist

San Diego Supercomputer Center

 

Director, Center for Large-scale Data Systems Research (CLDS), clds.sdsc.edu

Director, Advanced Cyberinfrastructure Development Group (ACID), acid.sdsc.edu

 

Address: University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0505

Phone: 858-534-5035

email: baru@sdsc.edu

 

RESEARCH INTERESTS

 

Scientific data management. Large-scale data systems. Data integration. Data analytics. Parallel database systems.

 

Employment

San Diego Supercomputer Center, UC San Diego

2008 –             Distinguished Scientist, SDSC. Also, Chair, SDSC Project Advisory Council and Director of my R&D group, Advanced Cyberinfrastructure Development (ACID).

2004 – 2008    Division Director, Science R&D Division.

                        Member of the SDSC Senior Management Team. Provided leadership and advice to researchers in this Division, while also serving as a PI in my own right. Also, Director of the SDSC/Calit2 Synthesis Center (www.syncenter.org) and co-Director, National Laboratory for Advanced Data Research (NLADR, www.nladr.org).

2004 –             Member, California Institute for Telecommunications and Information Technology (Calit2)

                        Principal Investigator at Calit2. Also, promoted collaborations between Calit2 and SDSC, including creation of the SDSC/Calit2 Synthesis Center.

2001 – 04        Co-Director, Data and Knowledge Systems Program.

                        Provided leadership for all data-oriented R&D activities, including establishing “Labs” at SDSC. Also, served as a PI in my own right.

2000 – 01        Assistant Director, Data Intensive Computing Environments (DICE) group.

                        Duties were the same as my previous job at SDSC (see below for details), which was technically with a different employer.

 

San Diego Supercomputer Center, General Atomics, La Jolla, CA

1996–2000      Senior Principal Scientist, Data Intensive Computing Environments (DICE) group.

                        Technical Project Manager for the Distributed Object Computation Testbed (DOCT) project led by Reagan Moore. Provided leadership in data-related R&D activities for the group.

 

IBM Corporation     

1992 – 95        Advisory Programmer, Database Technology Institute, IBM Almaden Research Labs, San Jose, CA (1995).

                        Advisory Development Analyst and Group Lead, Database Technology Group, IBM Toronto Labs (1992-95).

                        One of the three team leads responsible for the design and development of DB2 Parallel Edition V1, released December 1995.

 

University of Michigan, Ann Arbor, MI

1985 – 92        Assistant Professor, Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109-2122.

•                Member of Advanced Computer Architecture Lab (ACAL)

•                Member of Software Systems Research Lab (SSRL)

•                Member, Executive Committee, Univ. of Michigan Human Genome Center, led by Dr. Francis Collins.

Performed research and taught undergraduate and graduate courses in computer science and engineering.

                        Received NSF and other research awards; graduated 3 Ph.D. students; received Outstanding Teacher award from Dean of Engineering.

 

Cyberinfrastructure Leadership Activities

1.     Project Director, The Geosciences Network (GEON, www.geongrid.org), since 2002.

PI of the original large NSF ITR grant ($11.5M total funding) involving 12 PI institutions. The project was renewed as GEON 2.0 and is currently active. It also spun off another independently funded project called OpenTopography.org.

2.     Director of Cyberinfrastructure for NSF National Earthquake Engineering Simulations (NEESit), 2007-present.

Joined NEESit as a Cyberinfrastructure Advisor in 2006. In 2007, helped retain funding for NEESit and continue operations at UCSD. Served as Director of NEESit since Oct 2007.

3.     Cyberinfrastructure Lead/PI for the Tropical Ecology, Assessment and Monitoring Network (TEAM), 2007-present.

TEAM is a project of Conservation International, funded by the Moore Foundation. The grant is through Calit2.

4.     Member of Senior Management Team, NSF National Ecological Observatory Network (NEON, www.neoninc.org), 2005-2007.

Cyberinfrastructure Lead for the NEON Testbed, and co-PI of the NEON Cyberinfrastructure Diagnostic Testbed.

5.     Lead, KatrinaSafe Database Project. Worked in close collaboration with American Red Cross to develop KatrinaSafe, a “live” database of victims of Hurricane Katrina. This led to the development of DisasterSafe, hosted at SDSC—a standard service offered by Red Cross for victims of any disaster.

6.     Member, IRIS Data Management System Standing Committee, 2007-2009.

IRIS is the NSF data archives for seismological data.

7.     Co-Director, National Laboratory for Advanced Data Research (NLADR, www.nladr.org)

Joint activity with the National Computational Science Alliance (NCSA) with Dr. Michael Welge as the other co-Director.

8.     Executive Director, SDSC/Calit2 Synthesis Center (www.syncenter.org), 2005-2008.

Joint facility consisting of SDSC staff and equipment located at Calit2.

9.     Member of Cyberinfrastructure Advisory Committee, Long-Term Ecological Research Network (LTER, www.lternet.org), 2006.

10.  Member of Advisory Board, CLEANER Project Office, 2005-2006.

11.  Co-Convener, NSF Earth Science CyberInfrastructure (ES-CI) Task Force (with Lee Allison and Tom Jordan), 2004.

12.  SDSC PI for CUAHSI Hydrologic Information System (http://his.cuahsi.org), 2004-2008.

13.  Member of Leadership Team, Biomedical Informatics Research Network (BIRN, www.nbirn.net), 2001-2004. One of the co-Investigators of the original BIRN Coordinating Center (BIRN-CC).

 

UCSD and other UC Collaborations

1.     CARTA: Cyberinfrastructure and bioinformatics lead for the UCSD / Salk Institute-led ORU Center for Advanced Research and Training in Anthropogeny led by Profs. Ajit Varki, Margaret Schoeninger, and Rusty Gage (Salk). Funded by the Mathers Foundation. Duration: 2007—ongoing.

2.     CYCORE: Co-PI of Cyberinfrastructure for Comparative Effectiveness Research project funded by NIH. Project is led by Dr. Kevin Patrick (SOM & Calit2) in collaboration with M.D. Anderson Cancer Center, Houston. Duration: October 2009—September 2010.

3.     CISA3: Co-PI with Profs. Tom Levy and Falko Kuester of the Mediterranean Archaeology Network (MedArchNet). Funded by the UCSD Chancellor’s Collaboratory initiative, for 2009-2010 academic year.

4.     911: PI of NSF-funded project on Spatiotemporal Analysis of 911 Call Stream Data, with Prof. William Hodgkiss, SIO, as co-PI. Duration: 2004-2008.

5.     Hazards: Coordinated a hazards initiative on campus with funding from OVCR, JSOE, SIO, and SDSC. Pre-proposal on Cyberinfrastructure Center for Urgent Response to Emergencies (CICURE) submitted to the NSF STC program (not selected). Other proposal planning activities are under way.

6.     WIISARD: Co-investigator (with Prof. Leslie Lenert as PI) on the original Wireless Internet Information System for Medical Response in Disasters project. Funded by NIH, 2005-2007. Was responsible for the data management component.

7.     BIRN: Collaborated with Prof. Mark Ellisman as co-Investigator on the original BIRN-CC project, funded by NIH, 2001. Was responsible for the data integration component.

8.     I2T: PI of the NSF-funded Information Integration Testbed (I2T) project with Prof. Yannis Papakonstantinou (CSE) as co-PI. Duration: 2002-2004.

9.     UC-SGH: Co-PI on a proposal for a Center of Excellence on Disasters to the UC School of Global Health with Prof. Craig Van Dyke, UCSF (PI), and Profs. Gretchen Kalonji (UCOP) and Nicholas Sitar (UCB). (not selected).

10.  RISC MRU: Co-PI on a multi-campus research unit (MRU) proposal on Rapid Information for Science during Catastrophes (RISC), led by Prof. Emily Brodsky, UCSC. (not selected).

 

Software Development

1.     One of the group leaders and developers of IBM’s DB2 Parallel Edition Version 1.0, released commercially in December 1995.

2.     One of the designers of the SDSC Storage Resource Broker (SRB). Version 1 was released in September 1997.

3.     One of the designers of the Data Integration Cart™ technology for ontology-based data integration (invention disclosure filed: 2007).

 

U.S. Patents

1.     Persistent Archives, R. Moore, A. Rajasekar, C. Baru, B. Ludaescher, A. Gupta, R. Marciano, US Patent 7,349,915, March 25, 2008.

2.     Persistent Archives, R. Moore, A. Rajasekar, C. Baru, B. Ludaescher, A. Gupta, R. Marciano, US Patent 6,963,875, November 8, 2005.

3.     System and method for construction, storage, and transport of presentation-independent multimedia content, C. Baru, J. Chase, T. Elvins, R. Fassett, E. Nebel, Patent No. 7,028,252, March 22, 2001.

4.     Method and apparatus for achieving uniform data distribution in a parallel database system, C. Baru and F. Koo, Patent No. US5970495, IBM, Oct. 19, 1999.

5.     Method and apparatus for implementing partial declustering in a parallel database system, C. Baru, G. Fecteau, J. Kirton, L. Kollar, F. Koo, Patent No. US5878409, IBM, March 2, 1999.

 

Ph.D. Committees Chaired

•       Ophir Frieder.  Dissertation title: "Database processing on a cube-connected multicomputer system," EECS Dept., University of Michigan, Dec. 1987.  Recipient of IBM Graduate Fellowship Award.

Currently, Chaired Professor, Illinois Institute of Technology, Chicago, IL.

•       Piyush Goel.  Dissertation title: "Dataflow query processing and optimization," EECS Dept., University of Michigan, May 1992.

Was co-Founder of Everypath.com, San Jose, CA.

•       Sriram Padmanabhan.  Dissertation title: "Data placement in shared-nothing parallel database systems," EECS Dept., University of Michigan, July 1992. Recipient of IBM Graduate Fellowship Award.

Currently, Distinguished Engineer, IBM Silicon Valley Labs.

 

Key Committee Memberships

•       Lead, TeraGrid Data Working Group, 2001–2002.

•       Member of Review Committee, Canada Research Chairs program, Natural Sciences and Engineering Research Council (NSERC) of Canada, 2000–2002.

•       Member of the Architecture Working Group, California Digital Library, University of California, Office of the President, Oakland, CA, 1998–2000.

•       Member of the Grants Selection Committee (GSC) for Computer and Information Sciences, Natural Sciences and Engineering Research Council (NSERC) of Canada, 1994-97.  (The GSC is responsible for annually reviewing grant proposals from computer science faculty in Canada and making funding decisions).

•       IBM representative on the Transaction Processing Performance Council's TPC-D Benchmark Standard Subcommittee, 1993-95. Participated in drafting the original TPC-D specification.

 

Funded Research: Principal Investigator (23 grants, $15.3M funding)

1.     Geoinformatics: GEON 2.0: A Data Integration Facility for the Earth Sciences (Co-PIs: Randy Keller, U. Oklahoma; Ramon Arrowsmith, Arizona State; Charles Meertens, UNAVCO; Ann Gates, UT El Paso)

Funding: $1,158,150, Aug 2008-July 2010

Sponsor: NSF GEO/Earth Sciences: Geoinformatics

 

2.     Performance Evaluation of On-Demand Provisioning of Data Intensive Applications (Co-PI: Sriram Krishnan, SDSC)

Funding: $450,000, Jan 2009-Dec 2010

Sponsor: NSF CISE/Cluster Exploratory program

 

3.     OpenTopography - A National Hub for High Resolution Topographic Data, Tools, and Knowledge (Co-PI: Ramon Arrowsmith, Arizona State)

Funding: $1,400,000, Sept 2009-Aug 2012

Sponsor: NSF GEO/Earth Science Facilities

 

4.     Cyberinfrastructure for the Tropical Ecology Assessment and Monitoring Network

Funding: $916,417, Aug 2006 – current

Sponsor: Moore Foundation (via Conservation International)

 

5.     Cyberinfrastructure Preparedness for Emergency Response and Relief:  Learning the lessons from Hurricane Katrina

Funding: $200,000, 2007

Sponsor: NSF CISE/Integrated Information Systems

 

6.     Cyberinfrastructure Design for NEON

Funding: $156,861, 2006-2007

Sponsor: NSF BIO

 

7.     Geoinformatics 2006

Funding: $38,986, 2006

Sponsor: NSF EAR/Geoinformatics

 

8.     Cyberinfrastructure Summer Institute for Geoscientists

Funding: $70,000, 2004

Sponsor: NSF EAR/Geoinformatics

 

9.     GEON: The Geosciences Network—A Research Project to Create Cyberinfrastructure for the Geosciences

Collaborative grant with UCSD as lead institution and 10 other PI institutions and 2 subcontracts.

Funding: Total: $11,600,000 (UCSD: $5,987,927), October 2002–September 2007

Sponsor: NSF ITR Program

 

10.  Development of Informatics Infrastructure for Hydrologic Sciences (Co-PI: Ilya Zaslavsky)

(Collaborative project with UT Austin, UIUC, Drexel)

Funding: $1,271,221, April 2004-March 2006

Sponsor: NSF GEO/EAR

 

11.  Spatiotemporal Analysis of 9-1-1 Call Stream Data

Funding: $918,970, August 2004—September 2007

Sponsor: NSF CISE/Digital Government

 

12.  Data Intensive Grid Benchmarks (Co-PIs: Allan Snavely, Henri Casanova, SDSC)

Funding: $749,999, October 2002—September 2006

Sponsor: National Middleware Initiative (NMI), CISE, National Science Foundation

 

13.  INGREIN Phase II: Integrated Georeferenced Environmental Information Network

Funding: $250,000, September 2002-September 2003

Sponsor: University of California Life Science Informatics Program (LSIT) and Environmental Systems Research Institute Inc. (ESRI), Redlands

 

14.  Building a Scalable Biological Information Network - A Workshop Proposal (Co-PI: Peter Arzberger)

Funding: $93,106, 2006

Sponsor: NSF BIO

 

15.  I2T: An Information Integration Testbed for Digital Government (Co-PIs: Yannis Papakonstantinou, Amarnath Gupta) (Collaboration with U.Penn and U.Michigan)

Funding: $720,000, July 2000-June 2003

Sponsor: National Science Foundation

 

16.  INGREIN: Integrated Georeferenced Environmental Information Network

Funding: $250,000, May 2001-April 2002

Sponsor: University of California Life Science Informatics Program (LSIT) and Environmental Systems Research Institute Inc. (ESRI), Redlands

 

17.  Current Research Information Systems

Funding: $200,000, January 2000 to December 2001

Sponsor: Community of Science, Inc., Baltimore, MD

 

18.  Evaluation of the ArcXML Specification

Funding: $124,000, 1.5 years. June 2000 to December 2001

Sponsor: ESRI Inc., Redlands, CA

 

19.  Information Integration Architecture for a National Statistical Data Infrastructure (Co-PI: Amarnath Gupta)

Funding: $38,000, April 2000-July 2000                              

Sponsor: National Science Foundation

 

20.  Experimental facility for research in PArallel Database Systems (PADS)

Funding: $40,205, one-time, equipment grant

Sponsor: AT&T Foundation

 

21.  Multicomputer system for parallel database processing

Funding: $94,728, one-time, equipment grant

Sponsor: National Science Foundation

 

22.  Database processing on a cube-connected multicomputer system

Funding: $59,650, September 1987 to February 1990

Sponsor: National Science Foundation

 

23.  Parallel architecture features for supporting efficient database computation

Funding: $9,930, May 1986 to April 1987

Sponsor: Rackham Graduate School, University of Michigan

 

Funded Research: Co-principal Investigator (10 grants, $8M funding)

1.     Management and Analysis of Environmental Observatory Data Using the Kepler Scientific Workflow System (PI: Matt Jones, UC Santa Barbara).

Funding: $2,713,203, 2006-2010

Sponsor: NSF BIO

 

2.     NEON CI/ECI Preliminary Design and Diagnostic Testbed (PI: David Schimel, NEON Inc)

Funding: $482,348, 2007-2008

Sponsor: NSF BIO

 

3.     Transforming Morphological Systematics From Desktop to Web Applications: Development of the Online Workspace Morphobank.org 3.0 (PI: Maureen O’Leary, SUNY Stony Brook).

Funding: $610,614

Sponsor: NSF BIO

 

4.     Enabling the Creation and Use of GeoGrids for Next Generation Geospatial Information (PI: Peggy Agouris, Univ of Maine).

Funding: $448,717, September 2001—August 2006.

Sponsor: NSF CISE/Digital Government

 

5.     CHRONOS Network for Earth Systems History: Development of Integrated Databases and Toolkits Accessible Through a Common Portal (PI: Cinzia Cervato, Iowa State Univ)

(Collaborative project, with Iowa State)

Funding: $281,000, August 2003-July 2005

Sponsor: NSF GEO/EAR

 

6.     CLEANER: Research for the Coastal Margin  (PI: Ilya Zaslavsky, SDSC)

(Collaborative project led by Texas A&M)

Funding: $5,421 (SDSC allocation), July 2004-May 2005

Sponsor: NSF OCI

 

7.     Building the Geoinformatics System: A Workshop on Planning and Coordinating Environmental Cyberinfrastructure Activities in the Earth Sciences (PI: Lee Allison)

Funding: $61,258

 

8.     Biomedical Imaging and Research Network Coordinating Center  (PI: Mark Ellisman, UCSD)

Funding: $4,057,157, May 2004-April 2005

Sponsor: NCRR, National Institutes of Health

 

9.     Data management for electronic commerce applications (Co-PI, with Yannis Papakonstantinou, UCSD)

Funding: $100,000, January 1997 to December 2000

Sponsor: Center for Advanced Studies, IBM Toronto Labs

 

10.  Study of performance monitoring and evaluation techniques for parallel database systems (Co-PI, with Jeffrey Naughton, Univ. of Wisconsin)

Funding: $100,000, January 1995 to December 1998

Sponsor: Center for Advanced Studies, IBM Toronto Labs

 

Funded Research: Senior Personnel

Served as Senior Personnel in over 20 projects.

 

Books

1.     Geoinformatics: Cyberinfrastructure for the Solid Earth Sciences. Edited by Randy Keller and Chaitanya Baru. Cambridge University Press. Under preparation.

2.     Portal Science and Archaeology – View from the Mediterranean Lands. Edited by Thomas E. Levy, Stephen H. Savage, Chaitanya Baru, and Oystein LaBianca. Equinox Press. Under preparation.

 

Book Chapters

1.     “Science Gateways: Harnessing clouds and software services for science,” N. Wilkins-Diehr, C. Baru, D. Gannon, K. Keahey, J. McGee, M. Pierce, R. Wolski, W. Wu. To appear in Cloud Computing and Software Services: Theory and Techniques, under preparation. CRC Press.

2.     “Data Intensive Computing,” (with Moore, R., Marciano, R., Rajasekar, A., and Wan, M.), in The Grid: Blueprint for a New Computing Infrastructure, edited by Ian Foster and Carl Kesselman, January 1999, Morgan Kaufmann.

3.     “Virtualization Services for Data Grids” (with Moore, R.), to appear in Grid Computing: Making the Global Infrastructure a Reality, edited by Fran Berman, Geoffrey Fox, and Tony Hey, to be published February 2003, Wiley.

 

Journal Articles

1.     “Mediating among GeoSciML resources,” C. Baru and K. Lin. Volume 2, Supplement 1, 2009. International Journal of Digital Earth. Publishers: Taylor & Francis.

2.     “The GEON service-oriented architecture,” C. Baru, S. Chandra, K. Lin, A. Memon, C. Youn. Volume 2, Supplement 1, 2009. International Journal of Digital Earth. Publishers: Taylor & Francis.

3.     "An extensible information model for shared scientific data collections," (with A. Gupta), Journal on Future Generation Computer Systems, 1999.

4.     "DB2 Parallel Edition," (with Fecteau et al), IBM Systems Journal, April 1995.

5.     "Site selection and query scheduling policies for a multicomputer database system," (with O. Frieder), IEEE Transactions on Knowledge and Data Engineering, August 1994.

6.     "Inter-subcube routing in hypercubes," (with S. Padmanabhan), accepted for publication in the IEEE Transactions on Parallel and Distributed Systems, 1993.

7.     "Join and data redistribution algorithms for hypercubes," (with S. Padmanabhan), IEEE Transactions on Knowledge and Data Engineering, June 1993.

8.     "Squashed embedding of E-R schemas in hypercubes," (with P. Goel), Journal of Parallel and Distributed Computing, Vol.8, No.4, April 1990,  pp.340-348, Academic Press.

9.     "Database operations in a cube-connected multicomputer system," (with O. Frieder), IEEE Trans. on Computers, Vol. 38, No. 6, June 1989, pp.920-927.

10.  "The Architecture of SM3: A dynamically partitionable multicomputer system," (with S.Y.W. Su), IEEE Trans. on Computers, Vol. 35, No. 9, Sept. 1986, pp.790-802.  Also appeared in the IEEE Tutorial on Parallel Architecture for Database Systems, IEEE Computer Society Press, 1989.

11.  "Dynamically partitionable multicomputers with switchable memories," (with S.Y.W. Su), Journal of Parallel and Distributed Computing, Vol. 1, No. 2, Academic Press, Nov. 1984, pp.152-184.

 

Conference Presentations

1.     M. Smeekens, C. Baru, G.R. Keller, R. Arrowsmith, C. Crosby, “Teaching and Training in Geoinformatics: Experiences from the Cyberinfrastructure Summer Institute for Geoscientists (CSIG),” to be presented at the Fall AGU, Dec. 2009, San Francisco.

2.     C. Crosby, V. Nandigam, J. R. Arrowsmith, S. Balakrishnan, N. Alex, C. Baru (2008), “A Cyberinfrastructure Platform for Distribution of GeoEarthScope LiDAR Topography Data,” Eos Trans. AGU, 89(53), Fall Meet. Suppl., Abstract IN51A-1146.

3.     C. Baru and K. Lin, “Federating metadata catalogs,” (invited), European Geosciences Union, General Assembly, Vienna, Austria, April 19-24, 2009, EGU2009-11848.

4.     S. Andelman, C. Baru, S. Chandra, E. Fergaus, K. Lin, R. Unwin (2009), “A data scheduling and management infrastructure for the TEAM Network,” European Geosciences Union, General Assembly, Vienna, Austria, April 19-24, 2009, EGU2009-11095.

5.     C. Crosby, V. Nandigam, C. Baru (2009), “Recommendation systems for Geoscience data portals – Harnessing usage patterns,” European Geosciences Union, General Assembly, Vienna, Austria, April 19-24, 2009, EGU2009-6620.

6.     A. Agarwal, C. Baru, C. Crosby, R. Keller, K. V. Subbarao, V. Nandigam, “iGEON-India: An international collaborative activity of the GEON Project,” European Geosciences Union, General Assembly, Vienna, Austria, April 19-24, 2009, EGU2009-11792.

7.     “Generating composite thematic maps from semantically-different collections of shapefiles and map services,” November 2004, Memon, G., A. Memon, K. Lin, I. Zaslavsky, C. Baru, ESRI 2005

8.     Geoinformatics: Coordination of Environmental Cyberinfrastructure for the Earth Sciences

November 2002, Allison, M.L., C. Baru, T.H. Jordan, Abstract, GSA Annual Meeting, November 2-5, 2003, Seattle, WA

9.     “The GEON Grid Software Architecture,” Twenty-fourth Annual ESRI International User Conference, August 9-13, 2004, San Diego, CA.

10.  “Standards-based Secure Invocation of ArcWeb Services,” (with Ashraf Memon, Ilya Zaslavsky, Steve Mock, and Amit Behere), Twenty-fourth Annual ESRI International User Conference, August 9-13, 2004, San Diego, CA

11.  “Interoperability of Databases and Software Tools: Building Geoinformatics and the Cyberinfrastructure,” poster (with Lee Allison), American Association of Petroleum Geologists Annual Meeting, April 18-21, 2004, Dallas, TX

12.  “GEON: Cyberinfrastructure for the Geosciences,” (with Dogan Seber, Randy Keller, and Krishna Sinha), American Geophysical Union, Fall Meeting, 2003, San Francisco, CA.

13.  “Creating Grid Services to Enable Data Inter-operability: An Example from the GEON Project,” (with Bhatia, K., Memon, A., Zaslavsky, I., Seber, D.), GSA Annual Meeting, November 2-5, 2003, Seattle, WA.

14.   “Semantic Mediation Services in Geologic Data Integration: A Case Study from the GEON Grid,” (with Lin, Kai, Bertram Ludaescher, Boyan Brodaric, Dogan Seber, and Krishna Sinha), GSA Annual Meeting, November 2-5, 2003, Seattle, WA

15.  “Building the Geoinformatics System: Coordination of the environmental Cyberinfrastructure for the Earth Sciences,” (with Lee Allison and Tom Jordan), GSA Annual Meeting, November 2-5, 2003, Seattle, WA.

16.  “GEON: The Geoinformatics Network,” (with Krishna Sinha), Annual Meeting of the Geological Society of America, October 26-30, 2002, Denver, CO.

17.  “Grid-Enabled Mediation Services for Geospatial Information,” (with Ilya Zaslavsky), Workshop on Next-Generation Geospatial Information, October 20-21, 2003, Cambridge, MA.

18.  “Data Access and Management Services on the Grid,” (with Raman, V., Narang, I., Crone, C., Haas, L., Malaika, S., Mukai, T., Wolfson, D.), Global Grid Forum 5, July 21-24, 2002, Edinburgh, Scotland.

19.   “Information integration of geospatial information”, Demo, NSF Digital Government Conference, May 19-21, 2002, Redondo Beach, CA.

20.  "Information Integration Technologies for Current Research Information Systems," CRIS 2000 Conference, May 25-27, 2000, Helsinki, Finland.

21.  "XML-based information mediation for digital libraries," (with Papakonstantinou, Gupta, Marciano, Ludaescher, Velikhov), demo at ACM Digital Libraries'99, Berkeley, CA, August 1999.

22.  "XML-based information mediation with MIX," (with Papakonstantinou, Gupta, Marciano, Ludaescher, Velikhov), demo at ACM-SIGMOD'99, Philadelphia, PA, June 1999.

23.  "Querying Web Mediators," (with Chu, Gupta, Ludaescher, Marciano, Papakonstantinou, Velikhov), demo at the Internet2 meeting, Washington, DC, April 1999.

24.  "Xviews: XML views of relational schemas," in Database and Expert Systems Applications (DEXA'99), Sept. 2, 1999, Florence, Italy.

25.  “Integrating GIS and Imagery through XML-Based Information Mediation,” (with Gupta, Marciano, Zaslavsky), NSF International Workshop on Integrated Spatial Databases: Digital Images and GIS, June 14-16, 1999, Portland, Maine.

26.  "The SDSC Storage Resource Broker," (with R. Moore, A. Rajasekar, M. Wan), in the Proceedings of CASCON'98, Nov.30-Dec.3, 1998, Toronto, ON.

27.  "A Hierarchical Access Control Scheme for Digital Libraries," (short paper, with A. Rajasekar), in the Proceedings of the ACM Digital Libraries Conference 1998, June 24-26, 1998, Pittsburgh, PA.

28.  "Data Handling Architecture for a Prototype Federal Application," (with Choi et al), Proceedings of the IEEE Mass Storage Systems Conference, March 1998, College Park, MD.

29.  "Managing Very Large Scientific Data Collections," (poster), 5th International Conference on High Performance Computing (HiPC'98), Dec.17-20, 1998, Chennai, India.

30.  “Features and requirements for an XML view definition language: Lessons from XML information mediation,” (with Ludaescher, Papakonstantinou, Vianu), position paper in QL'98-The Query Languages Workshop, World Wide Web Consortium, Dec.3-4 1998, Boston, MA.

31.  "Integrating DB2 with HPSS," (poster, with M. Lo, S. Padmanabhan, and V. Gottemukalla) to be presented at Supercomputing'98, Nov.10-12, Orlando, FL.

32.  "Archiving Metadata," (poster), European Conference on Digital Libraries, EuroDL'98, September 21-23, 1998, Crete, Greece.

33.  "Information Based Computing," (with Moore, Karin, Rajasekar) Proceedings of the Workshop on Research and Development Opportunities in Federal Information Services, March 1997.

34.  "Metadata to Support Information-Based Computing Environments," (with Moore et al), IEEE Conference on Metadata, Metadata-97, September 10-11, 1997, Silver Spring, MD.

35.  "An Overview of DB2 Parallel Edition," (with G. Fecteau et al), ACM SIGMOD-95,  International Conference on the Management of Data, San Jose,  CA, May 1995.

36.  "TPC-D: Analysis of an industry standard decision support systems benchmark," (with P. Larson, T.S. Liu, B. Schiefer, and S. Retnasena), Proceedings of DB/EXPO-94, San Francisco, May 1994.

37.  "Data reorganization in parallel database systems," (with Daniele C. Zilio), IEEE Workshop on Advances in Parallel and Distributed Systems, October 6, 1993, Princeton, N.J.

38.  "Data placement in shared-nothing parallel database systems," (with S. Padmanabhan), Proceedings of the Conference on Information and Knowledge Management, November 1992, Baltimore, MD.

39.  "Routing between subcubes in a hypercube," (with S. Padmanabhan), Proceedings of the 6th Distributed Memory Computing Conference, Portland, OR, April 1991.

40.  "A comparison of join algorithms for hypercubes," (with S. Padmanabhan, Q. Stout, and B. Wagar), Proceedings of the 4th Hypercube Conference, Monterey, CA., March 1989.

41.  "Mapping E-R schema graphs onto hypercubes," (with P. Goel), Proceedings of the 4th Hypercube Conference, Monterey, CA., March 1989.

42.  "Query scheduling and site selection algorithms for a cube-connected multicomputer system," (with O. Frieder), Proceedings of the International Conference On Distributed Computer Systems, June 1988, San Jose, CA.

43.  "Study of dynamic data redistribution for database processing," (with O. Frieder), Proceedings of the 3rd Hypercube Conference, Pasadena, CA., Jan. 1988.

44.  "Join on a cube: Analysis, simulation, and implementation," (with O. Frieder, D. Kandlur, and M. Segal), Proceedings of the 5th International Workshop on Database Machines, Oct.  1987, Karuizawa, Japan, pp.74-87.

45.  "Data distribution and query scheduling policies for a cube-connected multicomputer system," (with O. Frieder), Proceedings of the 2nd International Conference on Supercomputing Systems, San Francisco, CA. May 1987.

46.  "Issues in query processing on a cube-connected multicomputer system," (with O. Frieder), Proceedings of the 6th IEEE Phoenix Conference on Computers and Communications, Scottsdale, AZ., Feb. 25-27, 1987, pp.315-319.

47.  "Implementing relational database operations in a cube-connected multicomputer system," (with O. Frieder), Proceedings of the 3rd IEEE International Conference on Data Engineering,  Los Angeles, CA., Feb.2-6, 1987.

48.  "Matrix multiplication on a multicomputer system with switchable main memory modules," (with A. Thakore and S.Y.W. Su), Proceedings of the 1st International Supercomputing Systems Conference, Tarpon Springs, FL., Dec.17-20, 1985, pp.650-659.

49.  "Performance evaluation of the statistical aggregation by categorization in the SM3 system," (with S.Y.W. Su), Proceedings of the ACM-SIGMOD International Conference on Management of Data, Boston, MA., June 18-21, 1984, pp.77-89.

50.  "SM3: A dynamically partitionable multicomputer system with switchable main  memory modules," (with T. Fei and S.Y.W. Su), Proceedings of the IEEE International Conference on Data Engineering, Los Angeles, CA., April 24-27, 1984, pp.42-49.

 

Conference Posters

1.     C. Baru, S. Krishnan, K. Lin, J. Moreland, D. Nadeau (2009), “Service-oriented infrastructure for scientific data mashups,” to be presented at the Fall AGU Meeting, Dec. 2009, San Francisco.

2.     J. Moreland, D. Nadeau, C. Baru, C. Crosby (2009), “An OpenEarth Framework (OEF) for integrating and visualizing earth science data,” to be presented at the Fall AGU Meeting, Dec. 2009, San Francisco.

3.     V. Nandigam, C. Crosby, C. Baru (2009), “Promoting scientific collaboration and research through integrated social networking capabilities,” European Geosciences Union, General Assembly, Vienna, Austria, April 19-24, 2009, EGU2009-9430.

4.     K. Lin, A. Memon, C. Baru (2008), “Linking the EarthScope Data Virtual Catalog to the GEON Portal,” Eos Trans. AGU 89(53), Fall Meet. Suppl. Abstract IN51A-1144.

5.     A. Memon, K. Lin, C. Baru (2008), “Catalog federation and interoperability for Geoinformatics,” Eos Trans. AGU 89(53), Fall Meet. Suppl. Abstract IN33A-1156.

6.     C. Baru, G. R. Keller, B. Wallet, C. Crosby, J. Moreland, D. Nadeau, “Integrating diverse geophysical and geological data to construct multi-dimensional earth models: The OpenEarth Framework,” Eos Trans. AGU 89(53), Fall Meet. Suppl. Abstract IN51A-1147.

7.     C. Crosby, J.L. Blair, V. Nandigam, A. Memon, C. Baru, J. R. Arrowsmith, “KML-based access and visualization of high resolution LiDAR topography,” Eos Trans. AGU 89(53), Fall Meet. Suppl. Abstract IN41B-1149.

 

Conference and Workshop Organization

Involved as organizer, co-organizer, Program Committee member, or session chair for over 30 different conferences / workshops.

 

Teaching

•       Undergraduate: Courses in Data Structures and Algorithms, Logic Design, and Database Management Systems, EECS Dept., University of Michigan, Ann Arbor.

•       Graduate: Courses in Database Machine Architecture, Database Management Systems, and Advanced Topics in Database Systems, EECS Dept., University of Michigan, Ann Arbor.

Tutorials

•       Tutorial on "Warehousing Scientific Data," NASA Workshop on Data Mining and Data Warehousing, August 19-20, 1997, Silver Spring, MD.

•       Tutorial on "Managing Large Data Sets," CASCON'96, November 12-14, 1996, IBM Toronto Labs, Toronto, ON.

•       Tutorial on "Parallel Database Systems: The First Generation Products," International Conference on Parallel and Distributed Information Systems (PDIS), September 28-30, 1994, Austin, TX.

 

Invited Talks

Have given over 50 invited talks at various institutions and conferences.

 

Education

1983 – 85        Doctor of Philosophy (Ph.D.) in Electrical Engineering, University of Florida, Gainesville, FL, 1985.

1981 – 83        Master's in Engineering (M.E.) in Electrical Engineering, University of Florida, Gainesville, FL, 1983 (received Outstanding Master’s Thesis Award).

1974 – 79        Bachelor of Technology (B.Tech.) in Electrical Engineering, Indian Institute of Technology, Madras, India, 1979.