Jump to content

Parallel database: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Good faith revert of edit(s) by 27.123.125.20 using STiki
No edit summary
Line 1: Line 1:
{{Unreferenced|date=September 2009}}
{{Unreferenced|date=September 2009}}


A '''parallel [[database]]''' system seeks to improve performance through [[parallelization]] of various operations, such as loading data, building indexes and evaluating queries. Although data may be stored in a distributed fashion, the distribution is governed solely by performance considerations. Parallel databases improve processing and [[input/output]] speeds by using multiple [[CPU]]s and disks in parallel. Centralized and [[client–server]] database systems are not powerful enough to handle such applications. In parallel processing, many operations are performed simultaneously, as opposed to serial processing, in which the computational steps are performed sequentially.
A '''parallel [[database]]''' system seeks to improve performance through [[parallelization]] of various operations, such as loading data, building indexes and evaluating queries. Although data may be stored in a distributed fashion, the distribution is governed solely by performance considerations. Parallel databases improve processing and [[input/output]] speeds by using multiple [[CPU]]s and di, where multiple [[Central processing unit|processors]] share the [[main memory]] space, as well as mass storage (e.g. [[hard disk drive]]s).

Parallel databases can be roughly divided into two groups, the first group of architecture is the multiprocessor architecture, the alternatives of which are the followings :
* '''[[Shared memory#In hardware|Shared memory architecture]]''', where multiple [[Central processing unit|processors]] share the [[main memory]] space, as well as mass storage (e.g. [[hard disk drive]]s).
* '''Shared disk architecture''', where each node has its own main memory, but all nodes share mass storage, usually a [[storage area network]]. In practice, each node usually also has multiple processors.
* '''Shared disk architecture''', where each node has its own main memory, but all nodes share mass storage, usually a [[storage area network]]. In practice, each node usually also has multiple processors.
* '''[[Shared nothing architecture]]''', where each node has its own mass storage as well as main memory.
* '''[[Shared nothing architecture]]''', where each node has its own mass storage as well as main memory.

Revision as of 20:30, 27 May 2012

A parallel database system seeks to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. Although data may be stored in a distributed fashion, the distribution is governed solely by performance considerations. Parallel databases improve processing and input/output speeds by using multiple CPUs and di, where multiple processors share the main memory space, as well as mass storage (e.g. hard disk drives).

  • Shared disk architecture, where each node has its own main memory, but all nodes share mass storage, usually a storage area network. In practice, each node usually also has multiple processors.
  • Shared nothing architecture, where each node has its own mass storage as well as main memory.

The other architecture group is called hybrid architecture, which includes:

  • Non-Uniform Memory Architecture (NUMA), which involves the Non-Uniform Memory Access.
  • Cluster (shared nothing + shared disk: SAN/NAS), which is formed by a group of connected computers.

References

Communications of the ACM: Parallel database systems: the future of high performance database systems