Controlling Samsung TVs Using Swift-Based Websockets

3 minute read

Samsung Smart TVs can be controlled remotely using WebSockets, allowing developers to send commands for navigation, volume control, power, and more. In this post, we’ll explore a basic approach to handling WebSocket connections, authorization, and command-writing with a Samsung TV using Swift.

For a more complete version of this that includes more commands, features, unit tests and error-handling, feel free to visit my TVCommanderKit repo on GitHub.

1. Establishing a WebSocket Connection

To begin, we need to build a URL using a given TV IP address, client app name, and optional token value. Afterwards, we’ll import Starscream and use this URL to create a WebSocket and establish a secure connection to the TV’s WebSocket server.

Connecting to the TV

First, we’ll declare a simple TVWebSocket class along with a delegate protocol and simple certificate pinner:

Note - For the purpose of this example, I am using a simple certificate pinner TVCertificatePinner. If you’re worried about MITM attacks, make sure to use a more refined approach for certificate-pinning.

import Foundation
import Starscream

protocol TVWebSocketDelegate: AnyObject { 
}

class TVWebSocket {
    private var webSocket: WebSocket?
    private weak var delegate: (any TVWebSocketDelegate)?

    func configure(delegate: (any TVWebSocketDelegate)?) {
        self.delegate = delegate
    }

    func connect(to ipAddress: String, app: String, token: String?) {
        var components = URLComponents()
        components.path = "/api/v2/channels/samsung.remote.control"
        components.port = 8002
        components.scheme = "wss"
        components.host = ipAddress
        var queryItems = [URLQueryItem]()
        if let encodedApp = app.data(using: .utf8)?.base64EncodedString() {
            queryItems.append(.init(name: "name", value: encodedApp))
        }
        if let token = token {
            queryItems.append(.init(name: "token", value: token))
        }
        components.queryItems = queryItems
        guard let url = components.url?.absoluteString
            .removingPercentEncoding.flatMap(URL.init) else {
            return
        }
        webSocket = WebSocket(request: .init(url: url), certPinner: TVCertificatePinner())
        webSocket?.connect()
    }

    func disconnect() {
        webSocket?.disconnect()
    }
}

class TVCertificatePinner: CertificatePinning {
    func evaluateTrust(trust: SecTrust, domain: String?, completion: ((PinningState) -> ())) {
        completion(.success)
    }
}

2. Handling Authorization

Samsung TVs require authorization when sending commands. Upon the first connection, the TV will prompt the user to allow or deny control. After choosing “allow”, the TV will send a packet containing our new token, indicating the connection was authorized.

Receiving Authorization Token

Add the following data types to encapsulate the auth response we need to parse:

struct TVResponse<Body: Codable>: Codable {
    let data: Body?
    let event: String
}

struct TVAuthResponseBody: Codable {
    let clients: [TVClient]
    let token: String?
}

struct TVClient: Codable {
    struct Attributes: Codable {
        let name: String?
        let token: String?
    }

    let attributes: Attributes
    let deviceName: String
}

Update TVWebSocketDelegate:

protocol TVWebSocketDelegate: AnyObject { 
    func tvDidConnect()
    func tvDidDisconnect()
    func tvDidReceiveError(_ error: Error?)
    func tvDidReceiveToken(_ token: String?)
}

Conform TVWebSocket to WebSocketDelegate, set webSocket?.delegate within the initial connection method, and add the following in order to receive connection updates, errors, and packets containing authorization info for new and refreshed tokens:

func didReceive(event: WebSocketEvent, client: WebSocketClient) {
    switch event {
    case .connected:
        delegate?.tvDidConnect()
    case .cancelled, .disconnected:
        delegate?.tvDidDisconnect()
    case .text(let text):
        text.data(using: .utf8).flatMap(handlePacket(_:))
    case .binary(let data):
        handlePacket(data)
    case .error(let error):
        delegate?.tvDidReceiveError(error)
    default:
        break
    }
}

private func handlePacket(_ data: Data) {
    guard let authResponse = try? JSONDecoder().decode(
        TVResponse<TVAuthResponseBody>.self, from: data),
            authResponse.event == "ms.channel.connect" else {
        return
    }
    if let newToken = authResponse.data?.token {
        delegate?.tvDidReceiveToken(newToken)
    } else if let existingToken = authResponse.data?.clients.first?.attributes.token {
        delegate?.tvDidReceiveToken(existingToken)
    }
}

3. Sending Commands to the TV

With authorization complete, commands can be sent over the WebSocket connection.

Writing a Command

Add the following data type for command packets:

struct TVCommand: Codable {
    struct Params: Codable {
        let cmd: String
        let dataOfCmd: String
        let option: Bool
        let typeOfRemote: String

        enum CodingKeys: String, CodingKey {
            case cmd = "Cmd"
            case dataOfCmd = "DataOfCmd"
            case option = "Option"
            case typeOfRemote = "TypeOfRemote"
        }
    }

    let method: String
    let params: Params
}

Add the following to TVWebSocket to send commands:

func sendCommand(_ key: String, completion: @escaping () -> Void) {
    let command = TVCommand(
        method: "ms.remote.control",
        params: .init(
            cmd: "Click",
            dataOfCmd: key,
            option: false,
            typeOfRemote: "SendRemoteKey"
        )
    )
    if let data = try? JSONEncoder().encode(command),
       let packet = String(data: data, encoding: .utf8) {
        webSocket?.write(string: packet, completion: completion)
    }
}

Here are a few common codes to start with:

Action Key Code
Volume Up KEY_VOLUP
Volume Down KEY_VOLDOWN
Mute KEY_MUTE
Channel Up KEY_CHUP
Channel Down KEY_CHDOWN
Power Off KEY_POWEROFF
Home KEY_HOME