- SQL Server Reference Guide
- Introduction
- SQL Server Reference Guide Overview
- Table of Contents
- Microsoft SQL Server Defined
- SQL Server Editions
- SQL Server Access
- Informit Articles and Sample Chapters
- Online Resources
- Microsoft SQL Server Features
- SQL Server Books Online
- Clustering Services
- Data Transformation Services (DTS) Overview
- Replication Services
- Database Mirroring
- Natural Language Processing (NLP)
- Analysis Services
- Microsot SQL Server Reporting Services
- XML Overview
- Notification Services for the DBA
- Full-Text Search
- SQL Server 2005 - Service Broker
- Using SQL Server as a Web Service
- SQL Server Encryption Options Overview
- SQL Server 2008 Overview
- SQL Server 2008 R2 Overview
- SQL Azure
- The Utility Control Point and Data Application Component, Part 1
- The Utility Control Point and Data Application Component, Part 2
- Microsoft SQL Server Administration
- The DBA Survival Guide: The 10 Minute SQL Server Overview
- Preparing (or Tuning) a Windows System for SQL Server, Part 1
- Preparing (or Tuning) a Windows System for SQL Server, Part 2
- Installing SQL Server
- Upgrading SQL Server
- SQL Server 2000 Management Tools
- SQL Server 2005 Management Tools
- SQL Server 2008 Management Tools
- SQL Azure Tools
- Automating Tasks with SQL Server Agent
- Run Operating System Commands in SQL Agent using PowerShell
- Automating Tasks Without SQL Server Agent
- Storage – SQL Server I/O
- Service Packs, Hotfixes and Cumulative Upgrades
- Tracking SQL Server Information with Error and Event Logs
- Change Management
- SQL Server Metadata, Part One
- SQL Server Meta-Data, Part Two
- Monitoring - SQL Server 2005 Dynamic Views and Functions
- Monitoring - Performance Monitor
- Unattended Performance Monitoring for SQL Server
- Monitoring - User-Defined Performance Counters
- Monitoring: SQL Server Activity Monitor
- SQL Server Instances
- DBCC Commands
- SQL Server and Mail
- Database Maintenance Checklist
- The Maintenance Wizard: SQL Server 2000 and Earlier
- The Maintenance Wizard: SQL Server 2005 (SP2) and Later
- The Web Assistant Wizard
- Creating Web Pages from SQL Server
- SQL Server Security
- Securing the SQL Server Platform, Part 1
- Securing the SQL Server Platform, Part 2
- SQL Server Security: Users and other Principals
- SQL Server Security – Roles
- SQL Server Security: Objects (Securables)
- Security: Using the Command Line
- SQL Server Security - Encrypting Connections
- SQL Server Security: Encrypting Data
- SQL Server Security Audit
- High Availability - SQL Server Clustering
- SQL Server Configuration, Part 1
- SQL Server Configuration, Part 2
- Database Configuration Options
- 32- vs 64-bit Computing for SQL Server
- SQL Server and Memory
- Performance Tuning: Introduction to Indexes
- Statistical Indexes
- Backup and Recovery
- Backup and Recovery Examples, Part One
- Backup and Recovery Examples, Part Two: Transferring Databases to Another System (Even Without Backups)
- SQL Profiler - Reverse Engineering An Application
- SQL Trace
- SQL Server Alerts
- Files and Filegroups
- Partitioning
- Full-Text Indexes
- Read-Only Data
- SQL Server Locks
- Monitoring Locking and Deadlocking
- Controlling Locks in SQL Server
- SQL Server Policy-Based Management, Part One
- SQL Server Policy-Based Management, Part Two
- SQL Server Policy-Based Management, Part Three
- Microsoft SQL Server Programming
- An Outline for Development
- Database
- Database Services
- Database Objects: Databases
- Database Objects: Tables
- Database Objects: Table Relationships
- Database Objects: Keys
- Database Objects: Constraints
- Database Objects: Data Types
- Database Objects: Views
- Database Objects: Stored Procedures
- Database Objects: Indexes
- Database Objects: User Defined Functions
- Database Objects: Triggers
- Database Design: Requirements, Entities, and Attributes
- Business Process Model Notation (BPMN) and the Data Professional
- Business Questions for Database Design, Part One
- Business Questions for Database Design, Part Two
- Database Design: Finalizing Requirements and Defining Relationships
- Database Design: Creating an Entity Relationship Diagram
- Database Design: The Logical ERD
- Database Design: Adjusting The Model
- Database Design: Normalizing the Model
- Creating The Physical Model
- Database Design: Changing Attributes to Columns
- Database Design: Creating The Physical Database
- Database Design Example: Curriculum Vitae
- NULLs
- The SQL Server Sample Databases
- The SQL Server Sample Databases: pubs
- The SQL Server Sample Databases: NorthWind
- The SQL Server Sample Databases: AdventureWorks
- The SQL Server Sample Databases: Adventureworks Derivatives
- UniversalDB: The Demo and Testing Database, Part 1
- UniversalDB: The Demo and Testing Database, Part 2
- UniversalDB: The Demo and Testing Database, Part 3
- UniversalDB: The Demo and Testing Database, Part 4
- Getting Started with Transact-SQL
- Transact-SQL: Data Definition Language (DDL) Basics
- Transact-SQL: Limiting Results
- Transact-SQL: More Operators
- Transact-SQL: Ordering and Aggregating Data
- Transact-SQL: Subqueries
- Transact-SQL: Joins
- Transact-SQL: Complex Joins - Building a View with Multiple JOINs
- Transact-SQL: Inserts, Updates, and Deletes
- An Introduction to the CLR in SQL Server 2005
- Design Elements Part 1: Programming Flow Overview, Code Format and Commenting your Code
- Design Elements Part 2: Controlling SQL's Scope
- Design Elements Part 3: Error Handling
- Design Elements Part 4: Variables
- Design Elements Part 5: Where Does The Code Live?
- Design Elements Part 6: Math Operators and Functions
- Design Elements Part 7: Statistical Functions
- Design Elements Part 8: Summarization Statistical Algorithms
- Design Elements Part 9:Representing Data with Statistical Algorithms
- Design Elements Part 10: Interpreting the Data—Regression
- Design Elements Part 11: String Manipulation
- Design Elements Part 12: Loops
- Design Elements Part 13: Recursion
- Design Elements Part 14: Arrays
- Design Elements Part 15: Event-Driven Programming Vs. Scheduled Processes
- Design Elements Part 16: Event-Driven Programming
- Design Elements Part 17: Program Flow
- Forming Queries Part 1: Design
- Forming Queries Part 2: Query Basics
- Forming Queries Part 3: Query Optimization
- Forming Queries Part 4: SET Options
- Forming Queries Part 5: Table Optimization Hints
- Using SQL Server Templates
- Transact-SQL Unit Testing
- Index Tuning Wizard
- Unicode and SQL Server
- SQL Server Development Tools
- The SQL Server Transact-SQL Debugger
- The Transact-SQL Debugger, Part 2
- Basic Troubleshooting for Transact-SQL Code
- An Introduction to Spatial Data in SQL Server 2008
- Performance Tuning
- Performance Tuning SQL Server: Tools and Processes
- Performance Tuning SQL Server: Tools Overview
- Creating a Performance Tuning Audit - Defining Components
- Creating a Performance Tuning Audit - Evaluation Part One
- Creating a Performance Tuning Audit - Evaluation Part Two
- Creating a Performance Tuning Audit - Interpretation
- Creating a Performance Tuning Audit - Developing an Action Plan
- Understanding SQL Server Query Plans
- Performance Tuning: Implementing Indexes
- Performance Monitoring Tools: Windows 2008 (and Higher) Server Utilities, Part 1
- Performance Monitoring Tools: Windows 2008 (and Higher) Server Utilities, Part 2
- Performance Monitoring Tools: Windows System Monitor
- Performance Monitoring Tools: Logging with System Monitor
- Performance Monitoring Tools: User Defined Counters
- General Transact-SQL (T-SQL) Performance Tuning, Part 1
- General Transact-SQL (T-SQL) Performance Tuning, Part 2
- General Transact-SQL (T-SQL) Performance Tuning, Part 3
- Performance Monitoring Tools: An Introduction to SQL Profiler
- Performance Tuning: Introduction to Indexes
- Performance Monitoring Tools: SQL Server 2000 Index Tuning Wizard
- Performance Monitoring Tools: SQL Server 2005 Database Tuning Advisor
- Performance Monitoring Tools: SQL Server Management Studio Reports
- Performance Monitoring Tools: SQL Server 2008 Activity Monitor
- The SQL Server 2008 Management Data Warehouse and Data Collector
- Performance Monitoring Tools: Evaluating Wait States with PowerShell and Excel
- Practical Applications
- Choosing the Back End
- The DBA's Toolbox, Part 1
- The DBA's Toolbox, Part 2
- Scripting Solutions for SQL Server
- Building a SQL Server Lab
- Using Graphics Files with SQL Server
- Enterprise Resource Planning
- Customer Relationship Management (CRM)
- Building a Reporting Data Server
- Building a Database Documenter, Part 1
- Building a Database Documenter, Part 2
- Data Management Objects
- Data Management Objects: The Server Object
- Data Management Objects: Server Object Methods
- Data Management Objects: Collections and the Database Object
- Data Management Objects: Database Information
- Data Management Objects: Database Control
- Data Management Objects: Database Maintenance
- Data Management Objects: Logging the Process
- Data Management Objects: Running SQL Statements
- Data Management Objects: Multiple Row Returns
- Data Management Objects: Other Database Objects
- Data Management Objects: Security
- Data Management Objects: Scripting
- Powershell and SQL Server - Overview
- PowerShell and SQL Server - Objects and Providers
- Powershell and SQL Server - A Script Framework
- Powershell and SQL Server - Logging the Process
- Powershell and SQL Server - Reading a Control File
- Powershell and SQL Server - SQL Server Access
- Powershell and SQL Server - Web Pages from a SQL Query
- Powershell and SQL Server - Scrubbing the Event Logs
- SQL Server 2008 PowerShell Provider
- SQL Server I/O: Importing and Exporting Data
- SQL Server I/O: XML in Database Terms
- SQL Server I/O: Creating XML Output
- SQL Server I/O: Reading XML Documents
- SQL Server I/O: Using XML Control Mechanisms
- SQL Server I/O: Creating Hierarchies
- SQL Server I/O: Using HTTP with SQL Server XML
- SQL Server I/O: Using HTTP with SQL Server XML Templates
- SQL Server I/O: Remote Queries
- SQL Server I/O: Working with Text Files
- Using Microsoft SQL Server on Handheld Devices
- Front-Ends 101: Microsoft Access
- Comparing Two SQL Server Databases
- English Query - Part 1
- English Query - Part 2
- English Query - Part 3
- English Query - Part 4
- English Query - Part 5
- RSS Feeds from SQL Server
- Using SQL Server Agent to Monitor Backups
- Reporting Services - Creating a Maintenance Report
- SQL Server Chargeback Strategies, Part 1
- SQL Server Chargeback Strategies, Part 2
- SQL Server Replication Example
- Creating a Master Agent and Alert Server
- The SQL Server Central Management System: Definition
- The SQL Server Central Management System: Base Tables
- The SQL Server Central Management System: Execution of Server Information (Part 1)
- The SQL Server Central Management System: Execution of Server Information (Part 2)
- The SQL Server Central Management System: Collecting Performance Metrics
- The SQL Server Central Management System: Centralizing Agent Jobs, Events and Scripts
- The SQL Server Central Management System: Reporting the Data and Project Summary
- Time Tracking for SQL Server Operations
- Migrating Departmental Data Stores to SQL Server
- Migrating Departmental Data Stores to SQL Server: Model the System
- Migrating Departmental Data Stores to SQL Server: Model the System, Continued
- Migrating Departmental Data Stores to SQL Server: Decide on the Destination
- Migrating Departmental Data Stores to SQL Server: Design the ETL
- Migrating Departmental Data Stores to SQL Server: Design the ETL, Continued
- Migrating Departmental Data Stores to SQL Server: Attach the Front End, Test, and Monitor
- Tracking SQL Server Timed Events, Part 1
- Tracking SQL Server Timed Events, Part 2
- Patterns and Practices for the Data Professional
- Managing Vendor Databases
- Consolidation Options
- Connecting to a SQL Azure Database from Microsoft Access
- SharePoint 2007 and SQL Server, Part One
- SharePoint 2007 and SQL Server, Part Two
- SharePoint 2007 and SQL Server, Part Three
- Querying Multiple Data Sources from a Single Location (Distributed Queries)
- Importing and Exporting Data for SQL Azure
- Working on Distributed Teams
- Professional Development
- Becoming a DBA
- Certification
- DBA Levels
- Becoming a Data Professional
- SQL Server Professional Development Plan, Part 1
- SQL Server Professional Development Plan, Part 2
- SQL Server Professional Development Plan, Part 3
- Evaluating Technical Options
- System Sizing
- Creating a Disaster Recovery Plan
- Anatomy of a Disaster (Response Plan)
- Database Troubleshooting
- Conducting an Effective Code Review
- Developing an Exit Strategy
- Data Retention Strategy
- Keeping Your DBA/Developer Job in Troubled Times
- The SQL Server Runbook
- Creating and Maintaining a SQL Server Configuration History, Part 1
- Creating and Maintaining a SQL Server Configuration History, Part 2
- Creating an Application Profile, Part 1
- Creating an Application Profile, Part 2
- How to Attend a Technical Conference
- Tips for Maximizing Your IT Budget This Year
- The Importance of Blue-Sky Planning
- Application Architecture Assessments
- Transact-SQL Code Reviews, Part One
- Transact-SQL Code Reviews, Part Two
- Cloud Computing (Distributed Computing) Paradigms
- NoSQL for the SQL Server Professional, Part One
- NoSQL for the SQL Server Professional, Part Two
- Object-Role Modeling (ORM) for the Database Professional
- Business Intelligence
- BI Explained
- Developing a Data Dictionary
- BI Security
- Gathering BI Requirements
- Source System Extracts and Transforms
- ETL Mechanisms
- Business Intelligence Landscapes
- Business Intelligence Layouts and the Build or Buy Decision
- A Single Version of the Truth
- The Operational Data Store (ODS)
- Data Marts – Combining and Transforming Data
- Designing Data Elements
- The Enterprise Data Warehouse — Aggregations and the Star Schema
- On-Line Analytical Processing (OLAP)
- Data Mining
- Key Performance Indicators
- BI Presentation - Client Tools
- BI Presentation - Portals
- Implementing ETL - Introduction to SQL Server 2005 Integration Services
- Building a Business Intelligence Solution, Part 1
- Building a Business Intelligence Solution, Part 2
- Building a Business Intelligence Solution, Part 3
- Tips and Troubleshooting
- SQL Server and Microsoft Excel Integration
- Tips for the SQL Server Tools: SQL Server 2000
- Tips for the SQL Server Tools – SQL Server 2005
- Transaction Log Troubles
- SQL Server Connection Problems
- Orphaned Database Users
- Additional Resources
- Tools and Downloads
- Utilities (Free)
- Tool Review (Free): DBDesignerFork
- Aqua Data Studio
- Microsoft SQL Server Best Practices Analyzer
- Utilities (Cost)
- Quest Software's TOAD for SQL Server
- Quest Software's Spotlight on SQL Server
- SQL Server on Microsoft's Virtual PC
- Red Gate SQL Bundle
- Microsoft's Visio for Database Folks
- Quest Capacity Manager
- SQL Server Help
- Visual Studio Team Edition for Database Professionals
- Microsoft Assessment and Planning Solution Accelerator
- Aggregating Server Data from the MAPS Tool
Replication is a feature included with all editions of SQL Server from version 2000 and higher (to some degree) that allows you to copy data from one database to another. In this feature overview I’ll explain (at a high-level) the concepts, how the process works, and explain a practical example of the way I’ve implemented it in the past. In future tutorials I’ll walk through setting up replication from one server to another that you can test using only a single system.
The basic process for setting up replication is to define the way you want to move the data, the servers that send and receive it, and then the data you want to copy. In this tutorial, we’ll examine the process for planning the replication setup, called a replication schema.
Concepts Overview
Microsoft uses a publisher-subscriber metaphor to describe their implementation of replication. In other words, think of a magazine and someone who buys the magazine. There are several parties involved in making the magazine, getting it to the reader, and reading it. This is a pretty common way of replicating data, whether a database or some other function. Here is the structure Microsoft uses boiled down to a single sentence:
A Publisher has a Publication which is composed of Articles that are sent (picked up) by the Distributor to one or more Subscribers which have Subscriptions.
Yes, it’s kind of simple, but keeping it simple like that will help you understand replication when things get a bit more...complicated. I’ll use that sentence to guide you through the rest of this article.
The Microsoft documentation starts out by planning the physical layout of the process (servers-out), then the data. I normally ask questions that begin the plan at the other end (data-in), and then decide the physical layout based on the answers. It really doesn’t matter which route you choose, so long as you create your plan before you start setting up the servers. Microsoft provides a full set of wizards to guide you through the entire setup process on all parts of the replication schema, but you should understand what you want to accomplish before you run them.
I’ll begin by defining some of the terms I introduced in that paragraph earlier. Along the way I’ll add a few more terms. You can install all of these functions on one system, but in production you should have multiple servers involved in each part of the process. You’ll want to pay attention to what purpose the server will be used for, and adjust the number of servers based on the load. For instance, in a large replication farm, you might have one server or even multiple servers whose only role is one function within the process. In smaller installations, one server might perform two or more roles.
The Publisher
Starting from the data you want to send, the first system in the chain is the Publisher. The Publisher is the SQL Server Database Server that has the data you want to send to other systems. As you progress through the Replication setup wizard, you’ll be asked which system will act as the Publisher. The Publisher has one or more databases that hold the data that you will send to another system.
In the magazine example, this is like the “head office” that has all of the articles for the magazine in it. Not everything a writer types goes in a particular magazine, but we’ll come to that.
Publications and Articles
The Publication is the grouping of data that you want to replicate. You’ll create a Publication during the second phase of the wizard, and once the Publisher is configured you’ll start the wizard by creating more Publications. A Publication is composed of one or more Articles. You’ll always work with the data from here on out at the level of the Publication.
Articles are the individual data parts that make up a Publication. An Article can be a table, or the results of a stored procedure or a view. In this way the Publisher holds the data, the Publication packages it up, and the Articles have the actual data inside. If you think of each of those things as a circle, with the Articles inside the Publication, you have the idea.
Remember that you send out a Publication to another server, not an Article. In other words, the Publication is the smallest unit of data that a receiving server works with.
In the magazine analogy, you can think of the Publication as an issue of the magazine, and the Articles as, well, articles in that magazine. Your readers buy an magazine (Publication), not an individual article (in printed magazines, anyway).
The Distributor
The Distributor is the next part of the process. This server (and remember that this function can be on the same box as the Publisher) does the work of transferring the Publication out. In large replication scenarios, this is often a separate server, or sometimes multiple servers. During the wizard process you’re asked to choose the server which will act as the Distributor.
The important thing to keep in mind is that the Distributor has a large requirement for two key components: drive space and network connectivity. The Distributor has to be able to transfer all of the data to the servers that want it, waiting until the last one requests and receives the data.
In the magazine example, you can think of this as the place where the magazine is dropped off to be picked up by trucks to take to various newsstands, airports, stores and so on. It’s like the in/out warehouse. Obviously, this building has to be large enough to hold all of the magazines that will move in or out.
The Subscriber and Subscriptions
The final server in the mix is the Subscriber. This server is the one that receives the data, using a Subscription to a Publication. There can be several Subscriber systems to one Publication, or a single system can have multiple Subscriptions.
The Subscription is the final end of replication. It’s the meta-data associated with the way the Subscriber will receive the data. Always remember that a Subscription is to a Publication, not an Article. In other words, the Subscriber doesn’t choose the data that is sent. It receives the entire Publication.
In the magazine example, this is the purchaser of the magazine. Might be a single person, or might be a newsstand.
Agents and Jobs
SQL Server 2000 uses many Agents, which are a combination of SQL Server Jobs and stored procedures, that watch the Publications and creates schedules to move the data along.
The Replication Process
With all of the servers in place, you need to understand a little about how they all fir together. The two main concepts here are the types of replication and the direction they flow. It’s here that the magazine analogy breaks down a little.
Replication Types
Now that you have the servers down, I’ll talk a bit about the types of replication. There are three types of replication: Snapshot, Transactional, and Merge.
The simplest type of replication is the Snapshot. This type takes an entire Publication sends it to another server’s database, overwriting the current data. It doesn’t overwrite the whole database, just the data in the Publication.
The next type of replication is Transactional. This type also takes a Snapshot, but after that replicates only the changed data from one system to another.
The final type, Merge (or peer to peer) replication, takes a comparison between two databases and then transfers the differences between them. You have the option of making one server the winner of any conflicts, or of having those conflicts stored so that you can resolve them manually. What is really happening here is that each system is both a Publisher and a Subscriber to each other – but that’s another aticle.
Replication Directions
You can set up the servers to either Push or Pull the data in the Subscription. This choice sets whether the Publisher decides when the data is to be sent (Push), or that the Subscriber decides when to ask for the data (Pull). This choice has implications for all the servers. For instance, if you choose to pull data only once in a while, the Distributor will have to hold that data until it’s asked for. That can add up to some serious storage and bandwidth transfer considerations.
Practical Application
Now that you understand all of the parts of the process and the choices for the types of data that go back and forth and who will send or pick up the data, it’s time to put it all together. You can read more about this in Books Online, but here is the process I’ve followed in the past.
First, I define what the data needs are. Next, I set consider the schedule I need for the data, and then who "owns" the data. Finally I decide the location and type of system that will receive the data. Here’s a typical example that I’ve implemented.
The sales team approached me with the need for product information to be stored on their laptops, using an application that allows them to review catalog records, and they told me that they didn’t need to update the information. They needed the data while they were on the road. They used the SQL Server Express engine with a local application our team wrote to see the data.
SQL Server Express can participate in all types of replication as long as it is the subscriber. SQL Server can also replicate to other types of database engines such as Oracle and Microsoft Access, but that’s another tutorial.
To help my sales team I first detailed the data they needed, and then located the server that had that data , designating it as the Publisher.
I then decided if they need only the data changes or the whole set of data each time. At first I thought they might just need the changes, but on investigation I found our catalog changed frequently, so I decide to use the Snapshot type of replication. That way they get the entire catalog every time they synch up. I verified that they didn’t need to edit the data, just that they needed to see it.
Since Snapshot replication sends everything, I verified that the data in the Publication was small enough to be able to transfer over what might be a poor connection. Since our entire catalog could be represented as one view that returns only 10,000 rows, Snapshot replication was the right choice.
Had the data set been quite large, I would have the clients synchronize the data on our LAN before they left, and I would have then chosen the Transactional type of replication since it sends only the changed rows.
Next I needed to detail the schedule that the sales force required. Since they were on the road most of the time, I never knew when they would be able to connect. This meant I had to set up their database as the Subscriber using a Pull Subscription. When they connected, we coded a button in their client program that allowed them to initiate the transfer.
Finally, I knew that the Publisher "owns" the data. This means that the client would not need to update the data, and their copy can be overwritten. If the needs had been different, say that they needed to update inventory, then I would have chosen Merge replication.
You can see from the simple exercise above that your plan can describe the servers, types of replication and so forth that you need to implement.
InformIT Articles and Sample Chapters
SQL Server 2005 and higher adds more capabilities to Replication. You can read about some of those in this section of Enterprise Data Management in SQL Server 2005.
Books and eBooks
I cover more information on SQL Server replication, along with an example in my book, Administrator's Guide to SQL Server 2005. The information largely still holds true for the later versions.
Online Resources
More information on SQL Server Replication from Microsoft is here.