(Time is God and In god I trust)
Arun S. Jagatheesan
San Diego Supercomputer Center
9500 Gilman Drive, MC0505
La Jolla, CA 92093
Phone : (858) 822 5452
Fax : (858) 534 5077
E-Mail: arun AT sdsc.edu
- Internet and Grid Computing (Distributed Computing)
- Data Grid Management Systems (Collaborative data storage management)
- Information Infrastructure Networks (Infrastructure for XL-size information management)
- Workflow and automation for data storage management
Current research and development
- Dataflow specialist (Data Grids), San Diego Supercomputer Center, University of California, San Diego
- Co-Chair, Grid File Systems Working Group, Global Grid Forum
- Lead, SDSC Matrix Project
- (Also the important role of being just another human being who wants to add value to things around me ;-)
LUSciD coLLaboration: LUSciD collaboration is a joint collaborative effort between Lawrence Livermore National Laboratory (LLNL), University of California and San Diego Supercomputer Center.
The objective of the coLLaboration is to apply advanced scientific data management technologies to improve the conduct of large scale science.
Most of the these requirements could be categorized as what i call as "rebels and misfits of existing technology".
If we analyze the history of computer science (especially data storage) there may be few accidental discoveries.
But, most of the new technologies are a result of some requirements that were trying to push the limits of an existing technology.
These esoteric requirements were usually from high-end users who wanted "more" in terms of performance, throughput or scalability.
The requirements of these high-end users could not be satisfied by the existing technology.
From the existing technology’s perspective, these users could be considered as "rebels or misfits of technology" who were pushing the envelope too much and are trying to overthrow the existing technology.
LLNL, SDSC SRB,
What is a Grid?, History of the Grid
LSST Project: The Large Synoptic Survey Telescope (LSST) is a proposed ground-based 8.4-meter, 10 square-degree-field telescope that will provide digital imaging of faint astronomical objects across the entire sky, night after night.
The IT part here is to manage Petabytes of data. Nature's Editorial cites this project to be "Steering the future of Computing".
We might be storing and managing over 150 Petabytes of digital data. Our work here is two fold (Hardware infrastructure and Software infrastructure). SDSC is the data access center for LSST providing hardware infrastructure (we just worked on estimating its cost).
We are also involved in middleware for data management, working on the software for LSST.
I started to work on this project voluntarily, based on the request from LLNL. I now take part in the data management and architecture discussions (usually from a data grid perspective). A good group of people to work with.
LSST in BBC,
LSST in Forbes,
LSST in Nature, Data Challenge, NCSA,
LLNL GDO: This is not a real big project now. But, i hope it will turn out to be something big or useful at LLNL. The task here is to design a data management architecture for data sharing between LLNL and its external partners.
Apart from the IT perspective, we need to make sure a pragmatic approach is designed so that users can use the resources without exploiting them.
There are several security and access control restrictions that need to be met. All data in GDO are considered to be public or UNCLASSIFIED (so i can mention this project here). I collaborate with Jeff Long from LLNL on this work.
Related Links: LLNL GDO, SDSC SRB
SDSC Matrix Project:
Matrix is a Grid Workflow process management system. Matrix provides the
protocols and software infrastructure needed by Inter-organizational data
management services to create, access and manage grid workflow pipelines.
Matrix uses the Data Grid Language, which can be used to describe,
query and control process-flow pipelines. Matrix provides the software
mechanisms to define and execute long lived datagrid administrative tasks.
While data grids like SRB provide logical namespaces to manage unstructured
inter-organizational data, Matrix provides mechanisms to provide mappings
from logical namespace to process namespace and vice-versa. (e.g) an insert
or delete on a logical namespace could trigger a process that has to be
operated in the datagrid.
Related Links: SDSC Matrix Project
Data Grid Management System (DGMS): Each organization in a data
grid needs a system composed of services that will enable it to
dynamically form or join communities and coordinate the management of
inter/intra-organizational data and resources. DGMS is a P2P middleware that provides a
logical view of inter/intra-organizational data and resources to its applications.
The key difference between the DBMS and DGMS is that whereas the
physical organization of resources (storage) is hidden in DBMS, it is provided
as another logical view to the applications in DGMS along with data. Grid Applications can use the logical layer
of distributed data as they do in DBMS (with out being worried about the physical location of data). In addition they can use the
logical view of the shared, distributed and heterogeneous resources
in the grid environment. The basic operations (like simple query plan or ingestion
of data) in the data grid can use this logical view resources (storage) to
decide on which shared physical resources to use. More protocols and more challenges
lie ahead for DGMS.
Grid File System (GGF-WG):
A standard mechanism to describe and organize file-based data is essential for
facilitating access to this large amount of data. The GGF Grid File System
Working Group (GFS-WG) will provide specifications of Grid File System
Directory Services and Architecture of Grid File System Services. The GFS
standards might serve as the common denominator for different datagrid
systems. GFS is a collaborative effort along with Storage Networking Industry
Disclaimer: These documents are made available as a means to ensure
timely dissemination of scholarly and technical work on a non-commercial basis.
Copyright and all rights therein are maintained by the authors or by other
copyright holders, notwithstanding that they have offered their works here
electronically. It is understood that all persons copying this information will
adhere to the terms and constraints invoked by each copyright holder. These
works may not be reposted without the explicit permission of the copyright
- DBLP Bibliography Link
- Grid Computing : Data Grid Management Systems
- Jagatheesan, A., Moore, R. and Brieger, L., “The Storage Resource Broker (SRB) Data Grid Management System : FAQ for Technical Managers and Decision Makers ”
SDSC Technical Report, March, 2006, San Diego Supercomputer Center . SDSC Tech Reports
- Jagatheesan, A., Weinberg, J., Mathew, R., Ding, A., Vandekieft. E., Moore. D., Moore. R., Gilbert. L., Tran. M., and Kuramoto. J., “Datagridflows: Managing Long-run Processes on Datagrids, ”
VLDB Workshop on Data Management in Grids (VLDB DMG 2005) and Lecture Notes on Computer Science-3836
- Jagatheesan, A., (Editor) “The GGF Grid File System (GFS) Architecture Workbook, ”
The Global Grid Forum, April, 2005, GGF. doc
- Weinberg, J., Jagatheesan, A., Ding, A., Fareman, M. and Hu, Y., “Gridflow Description, Query, and Execution at SCEC using the SDSC Matrix, ”
Proceedings of the 13th IEEE International Symposium on High-Performance Distributed Computing (HPDC), June 4-6, 2004, Honolulu, Hawaii, USA. [poster paper]
- Moore, R.W., Jagatheesan, A., Rajasekar, A., Wan, M. and Schroeder, W., “Data Grid Management Systems, ”
Proceedings of the 21stIEEE/NASA Conference on Mass Storage Systems and Technologies (MSST), April 13-16, 2004, College Park, Maryland, USA.
- Rajasekar, A., Wan, M., Moore, R.W., Schroeder, W., Kremenek, G., Jagatheesan, A., Cowart, C., Zhu, B., Chen, S.Y. and Olschanowsky, R., “Storage Resource Broker - Managing Distributed Data in a Grid, ”
Journal of Computer Society of India, special issue on SAN, 2003.
- Rajasekar, A., Wan, M., Moore, R.W., Jagatheesan, A. and Kremenek, G., “Real Experiences with Data Grids – Case-studies in using the SRB,”
Proceedings of the 6th International Conference/Exhibition
on High Performance Computing Conference in Asia Pacific Region (HPC-Asia),
December 2002, Bangalore, India.
- Jagatheesan, A., Moore, R.,
Rajasekar, A. and Zhu, B., “Virtual Services in Data Grids”, In
the 11th IEEE International Symposium on High Performance
Distributed Computing (HPDC), July 2002, Scotland. (pdf),(doc) [poster]
- Service Oriented Architecture : Web Service Brokering
- Helal, A., Su, S.Y.W., Meng, J., Krithivasan, R. and
Jagatheesan. A., “The Internet Enterprise,” Proceedings of Second
IEEE/IPSJ Symposium on Applications and the Internet (SAINT 02), February
2002, Japan. (pdf)
- Helal, A., Wang, M., Jagatheesan,
A. and Krithivasan, R., “Brokering Based Self Organizing E-Service Communities,”
Proceedings of the fifth IEEE International Symposium on Autonomous Decentralized
Systems (ISADS) with an Emphasis on Electronic Commerce, March 2001,
Dallas, Texas. (pdf)
- Jagatheesan, A., “Architecture
and Protocols for Sangam Communities and Sangam E-Services Broker,” Technical
(Master's Thesis) CISE Department, University of Florida, 2001
- Helal, A., Wang, M. and Jagatheesan,
A., “Service-Centric Brokering in Dynamic E-business Agent Communities,”
Journal of Electronic Commerce Research (JECR), Baltzar Science Publishers,
- Fuzzy Logic based Applications and Software Agents
- Jagatheesan, A., Pinheiro, J. and Sreenivasan, N., "Software Agents for Information Brokerage", Technical Report, Kumaraguru College of Technology’, Coimbatore, India.
- Subramanium, T. and Jagatheesan, A., "Fuzzy-Logic based controller in Wind Energy Applications", Proceedings
of SENSORS’98, Regional Engineering College, Trichy, India.
- Subramanium, T. and Jagatheesan, A., "Application of Fuzzy-Logic in SAWECS", Proceedings of
WAVES’97, Student Technical Symposium at Anna University, India. (awarded best paper)
Talks and Tutorials
Upcoming Tutorial talks
- Data Grid Management Systems Tutorial on DGMS concepts. Also the relevance of DGMS to the emerging Indian IT trends in government and industry.
The 13th International Conference on Management of Data , December 14-16, 2006. Indian Institute of Technology (IIT),Delhi, India
- Data Grid Management Systems Tutorial on DGMS concepts. Includes a section on how DGMS concepts could be used in the emerging Brazil IT infrastructure including e-Gov, universities and private enterprises.
The Brazilian Symposium on Databases , October 16-20, 2006, Florianópolis, Brazil
- Status of Grid File System (GFS) at GGF-17 Remote Talk at the Global Grid Forum 17, May 11, 2006, Tokyo. ppt
- Data Grid Management Systems: From Concepts to Markets A small tutorial/talk on DGMS concepts presented to industry folks at SNW with focus on how DGMS helped the academic projects and how it might help the emerging commercial markets for these technologies. ComputerWorld Storage Networking World (SNW) April 3-6, 2006, San Diego, California USA. talk summary, ppt
- Looking ahead for Grid File System (GFS) Remote Talk at the Global Grid Forum 16, February, 2006, Athens, Greece. ppt
- Grid File System (GFS) and Data Grids Talk at the SNIA Grid BoF, SNIA Winter Symposium 2006, January 23-27, 2006, San Jose. ppt
- An Introduction to Data Grid Management Systems Tutorial (featuring the Simpsons) at the Sixth International Conference on Web Information Systems Engineering (WISE 2005), November 20 - 22, 2005, New York City, USA.
- (Data Grids and) A Data Storage Language for the Requirements of Rebels and Misfits Talk at the 11th Biennial High Performance Transaction Systems Workshop (HPTS). I was very excited to attend this 'invite-only' workshop at Asilomar - It had people from System R (the god father of all databases that started it all) and also hands-on tech folks from industry like Google, eBay, Amazon etc. September 25-28, 2005, Asilomar, Another nice refuge by the california coast. ppt
- NCHC Tutorial for SRB as a Data Grid Solution Invited tutorial talks on SRB as a Data Grid Solution August 3, 2005, NCHC Hsinchu, Taiwan. ppt (With a introductory Taiwanese (?) for a kick)
- GGF Grid File System Architecture v1.0 GFS Architecture. Presented at GGF-14, June 28-30, 2005, Chicago, USA doc
- A Data Storage Language for the Rebels and Misfits A teaser about the new Data Grid Language (DGL). 22nd IEEE/NASA Goddard Conference on Mass Storage Systems and Technologies April 11-14, 2005, Monterey, California USA. ppt
- GGF Grid File System Architecture v0.09 Initial outline of GFS Architecture based on the multi-vendor/institutional discussions. Presented at GGF-13, March 16, 2005, Seoul, Korea (Thanks to VTC/Polycon) ppt
- SDSC SRB 2004 User Survey Results Report on SDSC SRB User survey. Only a few generic results are presented in this publically available slides. SDSC SRB StepForward day at the awesome SCCOOS facility, February 4, 2005. ppt
- GGF Grid File System Working Group Talk introducing GFS-WG to SNIA members, SNIA Winter Symposium, January 18, 2005, San Jose. ppt
- Programming SRB Gridflows using Matrix Talk focussing on developers and their usage of Data Grid Language, November 17, 2004, San Diego. ppt
- SDSC Matrix: Data Gridflows in SRB Informal talk about Matrix and Data Grid Language at CCLRC Rutherford Appleton Laboratory (RAL), November 05, 2004, Oxfordshire, UK. ppt
- Data Grid Management Systems Tutorial at the Fourth IEEE International Conference on Data Mining (ICDM 2004), November 01 - 04, 2004, Brighton, UK.
- Data Grid Management Systems Tutorial at the ACM/IFIP/USENIX 5th International Middleware Conference (Middleware 2004), October 18th - 22nd, 2004, Toronto, Ontario, Canada.
- Designing the Architecture of Grid File Systems (GFS) Session at the 12th Global Grid Forum (GGF 12), September 20-23, 2004, Brussels, Belgium. ppt design picture drawn on board
- Dataflows in SRB using SDSC Matrix Talk/Lecture at the 10th Annual NPACI/SDSC Summer Computing Institute, August 23-27, 2004, San Diego, California, USA. ppt
- Data Grid and Gridflow Management Systems Tutorial at the IEEE International Conference on Web Services (ICWS 2004), July 6-9, 2004, San Diego, California, USA.
- Architecture of the Grid File System Talk at the Global Grid Forum (GGF 11), June 6-10, 2004, Honolulu, Hawaii. ppt html
These charts explain about the GFS architecture as was discussed in the GGF. I strongly believe this is feasible and will turn out to be commercially useful in the near future. If you have any questions about this open community standardization effort, i will be glad to provide answers.
- Data Grids and Workflows Tutorial at the Joint Conference on Digital Libraries (JCDL 2004), June 7-11, 2004, Tucson, Arizona.
- DGL: The Assembly Language for Grid Computing Demo at the IEEE International Symposium on High-Performance Distributed Computing (HPDC), 2004, June 4-6, 2004, Honolulu, Hawaii, USA. ppt (3 MB) (Thanks to Cynthia Bailey Lee for presenting this)
- DGL: The Assembly Language for Grid Computing Talk at the NSF GriPhyN All Hands Meeting, 2004, May 17-19, 2004, University of Chicago, Chicago. ppt (3 MB)
- A Language for Grid File System Talk at the GGF Grid File System Face to face meeting, April 23, 2004, IBM Almaden Research Center, Almaden. ppt
- Data Grid Managemnet Systems (& SRB) Tutorial at the IEEE/NASA Goddard Conference on Mass Storage Systems and Technologies, April 13-16, 2004, College Park, Maryland.
- Data Grid Management Systems Invited Talk at the Storage Networking World, Spring 2004, April 5-8, 2004, Phoenix, Arizona.
- SAGA and Grid File System Working Group Relevance of SAGA document and the GGF Grid File System Working Group. Also talks about the DGMS Cyberinfrastructure from the GGF perspective, GGF Data Area Workshop,
Global Grid Forum 10, March 9~13, 2004, Berlin, Germany. ppt
- SDSC MATRIX Project: A Passionate Workflow towards Scientific Perfection Introduction to Grid Workflows, Matrix Project and demo of Matrix (2.1.1), Supercomputing conference,
November 15~21, 2003, Phoenix. ppt
- Data Grids: Opportunities and Technical Challenges Ahead Invited
talk at the Pacific Neighborhood Consortium 2003 (PNC 2003), November 7-9, Bangkok, Thailand.
- What is the Matrix? Introduces Grid
Workflow and SDSC Matrix Project at high level without jumping into details.
DAKS Meeting, November 3, SDSC. ppt
- POSIX-like OGSA/SOAP Services Introduces Grid File System(GFS) Group and the need for schemas in GFS. At the Global Grid Forum-9,
October 7, 2003, Chicago. ppt
- Grid File System-BoF Introduction to Grid File System Group at the Global Grid Forum,
October 7, 2003, Chicago. ppt, Prepared along with the other co-chairs: Osamu Tatebe, AIST and Jane Xu, IBM.
- Grid Data Management Systems & Services
Tutorial at 29th International Conference on Very Large Databases (VLDB 2003),
September 11, 2003, Berlin, Germany. Co-lectured with Reagan Moore, Paul
Watson and Norman W. Paton.
Data Grid Services and Pipelines (Data Grid Language and Data Grid Pipelines)
NPACI Summer Computing Institute, August 22, 2003, San Diego. ppt (1.19 MB).
Data Grid Management Systems.
Tutorial at ACM Special Interest Group on Management of Data (ACM SIGMOD), June 12, 2003, San Diego, USA. . Co-lectured with Arcot Rajasekar
Data Grid Management System.
Tutorial at IEEE International Conference on High Performance Computing (HiPC), December 18, 2002, Bangalore, India. . Co-lectured with Arcot Rajasekar
- Schematic description of Grid Exceptions.
Grid Reliability Workshop, Information Sciences Institute (ISI), University of Southern California. (ppt)
- Data Grid Management System (DGMS).
GGF GCE Workshop Indiana University
- Data Grid Management System (DGMS). GGF Grid Computing Environments RG session, Feb 19, 2002, Toronto. (ppt)
(Presented by Arcot Rajasekar) <I will have to admit:The slides here
are not the correct interpretation of DGMS. It talks only about SRB webservices.
I had to go back to my drawing board to design DGMS - look out for the tutorial
slides from HiPC 2002>
Grid Services -> Grid Webservices.
GGF Grid Computing Environments Workshop on Grid Webservices, Dec 6-7/2001,
Palo Alto. (ppt)
My Interests & Activities
"The views and opinions expressed in this page are strictly those of the page author. The contents of this page have not been reviewed or approved by SDSC." - SDSC Web Policy