Plan network requirements for Skype for Business

Summary: Review the network component considerations below before implementing Skype for Business Server.

The information in these articles is also discussed in the whitepaper Network Planning, Monitoring, and Troubleshooting with Lync Server with more details and depth. While the content refers explicitly to Lync 2010 and Lync 2013, the considerations for Skype for Business Server are unchanged.

Likewise, if your network involves wi-fi as well as wired access, the whitepaper Delivering Lync 2013 Real-Time Communications over Wi-Fi is a good reference and is equally applicable to Skype for Business Server.

Server hardware

The network adapter of each server in the Skype for Business Server topology must support at least 1 gigabit per second (Gbps). In general, you should connect all server roles within the Skype for Business Server topology using a low latency and high bandwidth local area network (LAN). The size of the LAN depends on the size of the topology:

For public switched telephone network (PSTN) integration, you can integrate by using either T1/E1 lines or SIP trunking.

Audio/Video network requirements

Network requirements for audio/video (A/V) in a Skype for Business Server deployment include the following:

Important If you have an Edge pool and are using a hardware load balancer, you must use public IP addresses on the Edge Servers and you can't use NAT for the servers or the pool at your NAT-capable device (for example, a firewall appliance or LAN switch. For details, see Edge Server scenarios in Skype for Business Server.

To provide optimal media quality, do the following:

IPsec exceptions

For enterprise networks where Internet Protocol security (IPsec) (see IETF RFC 4301-4309) has been deployed, IPsec must be disabled over the range of ports used for the delivery of audio, video, and panorama video. The recommendation is motivated by the need to avoid any delay in the allocation of media ports due to IPsec negotiation.

The following table explains the recommended IPsec exception settings.

Recommended IPsec Exceptions

Rule name Source IP Destination IP Protocol Source port Destination port Authentication Requirement
A/V Edge Server Internal Inbound Any A/V Edge Server Internal UDP and TCP Any Any Don't authenticate
A/V Edge Server External Inbound Any A/V Edge Server External UDP and TCP Any Any Don't authenticate
A/V Edge Server Internal Outbound A/V Edge Server Internal A/V Edge Server External UDP and TCP Any Any Don't authenticate
A/V Edge Server External Outbound A/V Edge Server External Any UDP and TCP Any Any Don't authenticate
Mediation Server Inbound Any Mediation Server(s) UDP and TCP Any Any Don't authenticate
Mediation Server Outbound Mediation Server(s) Any UDP and TCP Any Any Don't authenticate
Conferencing Attendant Inbound Any Front End Server running Conferencing Attendant UDP and TCP Any Any Don't authenticate
Conferencing Attendant Outbound Front End Server running Conferencing Attendant Any UDP and TCP Any Any Don't authenticate
A/V Conferencing Inbound Any Front End Servers UDP and TCP Any Any Don't authenticate
A/V Conferencing Outbound Front End Servers Any UDP and TCP Any Any Don't authenticate
Exchange Inbound Any Exchange Unified Messaging UDP and TCP Any Any Don't authenticate
Application Sharing Servers Inbound Any Application Sharing Servers UDP and TCP Any Any Don't authenticate
Application Sharing Server Outbound Application Sharing Servers Any UDP and TCP Any Any Don't authenticate
Exchange Outbound Exchange Unified Messaging Any UDP and TCP Any Any Don't authenticate
Clients Any Any UDP and TCP Any Any Don't authenticate

Conferencing network requirements

The bandwidth used to download conference content from the Internet Information Services (IIS) server depends on the size of the content. You may choose to monitor the actual usage and adjust bandwidth planning accordingly.

Network bandwidth requirements for media traffic

An important part of network planning is ensuring that your network can handle the media traffic generated by Skype for Business Server. This section helps you plan for that media traffic.

Media traffic network usage

The media traffic bandwidth usage can be challenging to calculate because of the number of different variables, such as codec usage, resolution, and activity levels. The bandwidth usage is a function of the codec that is used to and the activity of the stream, which can vary between scenarios. The following table lists the audio codecs typically used in Skype for Business Server scenarios.

Audio codec bandwidth

Audio codec Scenario Audio payload bit rate (KBPS) Bandwidth audio payload and IP header only (Kbps) Bandwidth audio payload, IP header, UDP, RTP and SRTP (Kbps) Bandwidth audio payload, IP header, UDP, RTP, SRTP and forward error correction (Kbps)
RTAudio Wideband
Peer-to-peer
29.0
45.0
57.0
86.0
RTAudio Narrowband
Peer-to-peer PSTN
11.8
27.8
39.8
51.6
G.722
Conferencing
64.0
80.0
95.6
159.6
G.722 Stereo
Peer-to-peer Conferencing
128.0
144.0
159.6
223.6
G.711
PSTN, Conferencing
64.0
80.0
92.0
156.0
Siren
Conferencing
16.0
32.0
47.6
63.6
SILK Wideband
Peer-to-peer
36.0
52.0
64.0
100.0
SILK Wideband
Peer-to-peer
26.0
42.0
54.0
80.0
SILK Wideband
Peer-to-peer
20.0
36.0
48.0
68.0
SILK wideband/narrowband
Peer-to-peer
13.0
29.0
41.0
54.0

PSTN calls from the Skype for Business client usually use the G.711 codec, which requires a high bandwidth. If enough bandwidth is not available for that codec, then calls can fail with an error that resembles the following in the Media logs: Atleast one codec must be enabled, hr: c0042004. Media logs (.blog files) are encrypted and can be decoded only by Microsoft support personnel.

The bandwidth numbers in the previous table are based on 20 ms packetization (50 packets per second) and for the Siren and G.722 codecs include the other secure real-time transport protocol (SRTP) overhead from conferencing scenarios and assume the stream is 100% active. Forward Error Correction (FEC) is used dynamically when there's packet loss on the link to help maintain the quality of the audio stream.

The stereo version of the G.722 codec is used by systems that are based on the Lync Room System, which uses a single stereo microphone or a pair of mono microphones to allow listeners to better distinguish multiple speakers in the meeting room.

Video Resolution Bandwidth

Video codec Resolution and aspect ratio Maximum video payload bit rate (Kbps) Minimum video payload bit rate (Kbps)
H.264
320x180 (16:9)
212x160 (4:3)
250
15
H.264/RTVideo
424x240 (16:9)
320x240 (4:3)
350
100
H.264
480x270 (16:9)
424x320 (4:3)
450
200
H.264/RTVideo
640x360 (16:9)
640x480 (4:3)
800
300
H.264
848x480 (16:9)
1500
400
H.264
960x540 (16:9)
2000
500
H.264/RTVideo
1280x720 (16:9)
2500
700
H.264
1920x1080 (16:9)
4000
1500
H.264/RTVideo
960x144 (20:3)
500
15
H.264
1280x192 (20:3)
1000
250
H.264
1920x288 (20:3)
2000
500

The default codec for video is the H.264 /MPEG-4 Part 10 Advanced Video Coding standard, together with its scalable video coding extensions for temporal scalability. To maintain interoperability with legacy clients, the RTVideo codec is still used for peer-to-peer calls between Skype for Business Server and legacy clients. In conference sessions with both Skype for Business Server and legacy clients the Skype for Business Server endpoint may encode the video using both video codecs and send the H.264 bitstream to the Skype for Business Server clients and the RTVideo bitstream to legacy clients.

The bandwidth required depends on the resolution, quality, frame rate, and the amount of motion or change in the picture. For each resolution, there are two pertinent bit rates:

Skype for Business Server supports many resolutions. This allows Skype for Business Server to adjust to different network bandwidth and receiving client capabilities. The default aspect ratio for Skype for Business Server is 16:9. The legacy 4:3 aspect ratio is still supported for webcams, which don't allow capture in the 16:9 aspect ratio.

Video FEC is always included in the video payload bit rate when it's used so there are no separate values for with video FEC and without video FEC.

Endpoints don't stream audio or video packets continuously. Depending on the scenario there are different levels of stream activity, which indicate how often packets are sent for a stream. The activity of a stream depends on the media and the scenario, and doesn't depend on the codec being used. In a peer-to-peer scenario:

In a conferencing scenario:

In addition to the bandwidth required for the real-time transport protocol (RTP) traffic for audio and video media, bandwidth is required for real-time transport control protocol (RTCP). RTCP is used for reporting statistics and out-of-band control of the RTP stream. For planning, use the bandwidth numbers in the following table for RTCP traffic. These values represent the maximum bandwidth used for RTCP and are different for audio and video streams because of differences in the control data

RTCP Bandwidth

Media RTCP maximum bandwidth (Kbps)
Audio
5
Video (Only H.264 or RTVideo being sent/received)
10
Video (H.264 and RTVideo being sent/received)
15

For capacity planning, the following two statistics are of interest:

The following tables also list an other bandwidth value, Typical bandwidth. This is the average bandwidth that a stream consumes. This includes the typical activity of the stream and the typical codec that is used in the scenario. This bandwidth can be used for approximating how much bandwidth is being consumed by media traffic at a specific time, but shouldn't be used for capacity planning, because individual calls will exceed this value when the activity level is greater than average. The typical video stream bandwidth in the tables below is based on a mix of different video resolutions as observed in measured customer data, and smaller installations are likely to have actual numbers that differ from the table data. For example, in peer-to-peer sessions most users would use the default video render window whereas some percentage of users would increase or maximize the Skype for Business Server application to allow better video resolutions.

The following tables provide values for the various scenarios.

Audio/Video Capacity Planning for Peer-to-Peer Sessions

Media Codec Typical stream bandwidth (Kbps) Maximum stream bandwidth without FEC Maximum stream bandwidth with FEC
Audio
RTAudio Wideband
39.8
62
91
Audio
RTAudio Narrowband
29.3
44.8
56.6
Audio
SILK Wideband
44.3
69
105
Main video when calling Skype for Business Server endpoints
H.264
460
4010 (for maximum resolution of 1920x1080)
Already included
Main video when calling Lync 2010 or Office Communicator 2007 R2 endpoints
RTVideo
460
2510 (for maximum resolution of 1280x720)
Already included
Panoramic video when calling Skype for Business Server endpoints
H.264
190
2010 (for maximum resolution of 1920x288)
Already included
Panoramic video when calling Lync 2010 endpoints
RTVideo
190
510 (for maximum resolution of 960x144)
Already included

Audio/Video Capacity Planning for Conferences

Media Typical codec Typical stream bandwidth (Kbps) Maximum stream bandwidth without FEC Maximum stream bandwidth with FEC
Audio
G.722
46.1
100.6
164.6
Audio
Siren
25.5
52.6
68.6
Main video receive
H.264 and RTVideo¹
260
8015
Not applicable
Main video send
H.264 and RTVideo
270
8015
Not applicable
Panoramic video receive
H.264 and RTVideo
190
2010 (for maximum resolution of 1920x288)
Not applicable
Panoramic video send
H.264 and RTVideo
190
2515 ²
Not applicable
  1. RT Video is sent in addition to H.264 when Lync 2010 clients are connected to the conference.
  2. If there are multiple streams, they dynamically share the allocated bandwidth.

For the main video the typical stream bandwidth is the aggregated bandwidth over all received video streams and the maximum stream is the bandwidth over all send video streams. Even with multiple video streams the typical video bandwidth is smaller than in the peer-to-peer scenario because many video conferences are using content sharing that leads to much smaller video windows and therefore smaller video resolutions. The maximum supported aggregated video payload bandwidth is 8000 Kbps for both, send and receive streams, which would be used (for example, if there are two incoming 1920x1080p video streams). Maximum values are only rarely seen in actual implementations.

When building out a multiparty conference that uses the gallery view feature, bandwidth utilization increases initially as participants join, then decreases as resolutions are dropped to fit within the maximum.

2 Participants 3 Participants 4 Participants 5 Participants 6 Participants
Max resolutions received
1920x1080
1280x720
640x360
640x360 320x240
640x360 320x240
Total average bit rate
2128
4050
1304
1224
1565
Total Maximum bit rate
4063
5890
2860
2699
3017

The typical stream bandwidth for panoramic video is based on devices that only stream up to 960x144 panoramic video. Expect the typical stream bandwidth to increase when using devices with 1920x288 panoramic video.

Audio Capacity Planning for PSTN

Media Typical codec Typical stream bandwidth (Kbps) Maximum stream bandwidth without FEC Maximum stream bandwidth with FEC
Audio
G.711 (this includes PSTN participants in conferences)
64.8
97
161
Audio
RTAudio Narrowband
30.9
44.8
56.6

The network bandwidth numbers in these tables represent one-way traffic only and include 5 Kbps for RTCP traffic overhead for each stream.

Managing Quality of Service

Quality of Service (QoS) is a networking technology that is used in some organizations to help provide an optimal end-user experience for audio and video communications. QoS is most frequently used on networks where bandwidth is limited: with a large number of network packets competing for a fairly small amount of available bandwidth, QoS enables administrators to assign higher priorities to packets carrying audio or video data. By giving these packets a higher priority, audio and video communications are likely to complete faster, and with less interruption, than network sessions involving things such as file transfers, web browsing, or database backups. That's because network packets used for file transfers or database backups are assigned a "best effort" priority.

As a rule, QoS applies only to communication sessions on your internal network. When you implement QoS, you configure your servers and routers to support packet marking in a particular manner that may not be supported on the Internet or on other networks. Even if Quality of Service is supported on other networks, there is no guarantee that QoS will be configured in exactly the same way you configured the service. If you are using MPLS, you'll need to work with your MPLS provider.

Skype for Business Server doesn't require QoS, but it's recommended. If you experience packet loss issues on the network your available solutions are to add more bandwidth or to implement QoS. If adding more bandwidth isn't possible, then implementing QoS might be your only toll to resolve the problem.

Skype for Business Server offers full support for QoS: that means that organizations that are already using QoS can easily integrate Skype for Business Server into their existing network infrastructure. To do this you must follow these steps:

If you are using Windows Server 2012 or Windows Server 2012 R2 you might be interested in the new set of Windows PowerShell cmdlets available for managing QoS on that platform. For more information, see Windows PowerShell Cmdlets for Networking.

QoS is also discussed in the whitepaper Network Planning, Monitoring, and Troubleshooting with Lync Server with more details and depth. While the content refers explicitly to Lync 2010 and Lync 2013, the considerations for Skype for Business Server are unchanged.