NovaWindows2 Driver
===================

NovaWindows2 Driver is a custom Appium driver designed to tackle the limitations of existing Windows automation solutions like WinAppDriver. It supports testing Universal Windows Platform (UWP), Windows Forms (WinForms), Windows Presentation Foundation (WPF), and Classic Windows (Win32) apps on Windows 10 and later.

Built to improve performance and reliability for traditional desktop applications, it offers:
- Faster XPath locator performance — Reduces element lookup times, even in complex UIs.
- RawView element support — Access elements typically hidden from the default ControlView/ContentView.
- Enhanced text input handling — Fast text entry with support for various keyboard layouts.
- Platform-specific commands — Supports direct window manipulation, advanced UI interactions, and more.
- Seamless Setup — Designed to work without Developer Mode or additional software.

---

📑 Table of Contents

- Getting Started
- Configuration
- Example Usage
- Key Features
- Element Location
- Attribute Retrieval
- PowerShell Execution
- Platform-Specific Extensions
- Mouse & Pointer
- Keyboard
- Element Operations
- Selection Management
- Window Management
- System & State
- Development

---

🚀 Getting Started

$3

The driver is built for Appium 3. To install it, run:
``

bash
appium driver install --source=npm appium-novawindows2-driver


$3

- Host OS: Windows 10 or later.
- No Developer Mode or extra dependencies required.
---
⚙️ Configuration
NovaWindows2 Driver supports the following capabilities:

| Capability Name | Description | Default | Example | | :--- | :--- | :--- | :--- | |platformName | Must be set to Windows (case-insensitive). | (Required) | Windows| |automationName | Must be set to NovaWindows2 (case-insensitive). | (Required) | NovaWindows2| |smoothPointerMove | CSS-like easing function (including valid Bezier curve). This controls the smooth movement of the mouse for delayBeforeClick ms. | (None) | ease-in, linear, ease, ease-in, ease-out, ease-in-out, cubic-bezier(0.42, 0, 0.58, 1)| |delayBeforeClick | Time in milliseconds before a click is performed. | 0 | 500| |delayAfterClick | Time in milliseconds after a click is performed. | 0 | 500| |appTopLevelWindow | The handle of an existing application top-level window to attach to. It can be a number or string (not necessarily hexadecimal). | (None) | 12345, 0x12345| |shouldCloseApp | Whether to close the window of the application in test after the session finishes. | true | false| |appArguments | Optional string of arguments to pass to the app on launch. | (None) | --debug| |appWorkingDir | Optional working directory path for the application. | (None) | C:\Temp| |prerun | An object containing either script or command key. The value of each key must be a valid PowerShell script or command to be executed prior to the WinAppDriver session startup. See Power Shell commands execution for more details. | (None) | {script: 'Get-Process outlook -ErrorAction SilentlyContinue'}| |postrun | An object containing either script or command key. The value of each key must be a valid PowerShell script or command to be executed after WinAppDriver session is stopped. See Power Shell commands execution for more details. | (None) | {command: '...'}| |isolatedScriptExecution | Whether PowerShell scripts are executed in an isolated session. | false | true| |powerShellCommandTimeout | Timeout (ms) for PowerShell script execution. | 60000 | 30000| |convertAbsoluteXPathToRelativeFromElement | Convert absolute XPath to relative when searching from an element. | true | true| |includeContextElementInSearch | Include the context element itself in the search. | true | true| |releaseModifierKeys | Whether to release modifier keys after sendKeys. | true | true| |typeDelay | Time in milliseconds to wait after inputting each character. Note that this delay does not apply to modifier keys (Shift, Ctrl, Alt, Win). | 0 | 100 |

---

`💡 Example Usage`

Check out the examples/refactor directory for comprehensive examples.

`$3`

python
from appium import webdriver
from appium.options.windows import WindowsOptions
options = WindowsOptions()
options.app = 'C:\\Windows\\System32\\notepad.exe'
options.automation_name = 'NovaWindows2'
driver = webdriver.Remote('http://127.0.0.1:4723', options=options)
... tests ...

driver.quit()


---
✨ Key Features
$3

Appium Windows Driver supports the same location strategies the WinAppDriver supports, but also includes Windows UIAutomation conditions:

`$3`


Retrieve comprehensive details about UI elements using standard or bulk methods.

- Bulk Retrieval: Use the "all"keyword to get 80+ properties in a single JSON object. - Dotted Names: Access pattern-specific properties directly (e.g.,Window.CanMaximize, LegacyIAccessible.Name).

`python

`getAttributes returns all properties as a JSON string`


all_attributes = element.get_attribute("all")


$3

Execute internal PowerShell scripts or commands directly from your test. This requires the

power_shell insecure feature to be enabled on the Appium server.

It is possible to execute a single PowerShell command or a whole script. Note that powerShell is case-insensitive.

`python

`Execute a command string`


driver.execute_script('powerShell', {'command': 'Get-Process Notepad'})
Execute a script string

driver.execute_script('powerShell', {'script': '$p = Get-Process Notepad; $p.Kill();'})
Shorthand (executes as command/script depending on context)

driver.execute_script('powerShell', 'Get-Process')


$3

You can specify the delay directly within the text string using the

[delay:ms] pattern. This overrides the session setting (set via windows: typeDelay or typeDelay capability) for that specific action.

`python driver.find_element(...).send_keys("[delay:500]Slow text")`

---

`🛠 Platform-Specific Extensions`

All extensions are invoked via driver.executeScript("windows: ", ...args). Below are the detailed descriptions and arguments for each command.

> Note > In most cases, commands can be used more intuitively by passing the element as the first argument (if required) and other parameters subsequently.

`$3`

#### windows: clickThis is a shortcut for a single mouse click gesture.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |elementId | string | no | Hexadecimal identifier of the element to click on. If this parameter is missing then given coordinates will be parsed as absolute ones. Otherwise they are parsed as relative to the top left corner of this element. | 123e4567-e89b...| |x | number | no | Integer horizontal coordinate of the click point. Both x and y coordinates must be provided or none of them if elementId is present. | 100| |y | number | no | Integer vertical coordinate of the click point. Both x and y coordinates must be provided or none of them if elementId is present. | 100| |button | string | no | Name of the mouse button to be clicked. Supported button names are: left, middle, right, back, forward. The default value is left. | right| |modifierKeys | string[] \| string | no | List of possible keys or a single key name to depress while the click is being performed. Supported key names are: Shift, Ctrl, Alt, Win. | ['ctrl', 'alt']| |durationMs | number | no | The number of milliseconds to wait between pressing and releasing the mouse button. By default no delay is applied. | 500| |times | number | no | How many times the click must be performed. One by default. | 2| |interClickDelayMs | number | no | Duration of the pause between each click gesture. Only makes sense if times is greater than one. 100ms by default. | 10 |

#### Usage

Scenario 1: Using Element ID (Clicks Center)`python driver.execute_script('windows: click', { 'elementId': element.id, 'button': 'right', 'times': 2 })`

Scenario 2: Using Absolute Coordinates`python driver.execute_script('windows: click', { 'x': 500, 'y': 300, 'button': 'left' })`

Scenario 3: Using Element ID with Offset (Relative to Top-Left)`python driver.execute_script('windows: click', { 'elementId': element.id, 'x': 10, # 10px from the left edge of the element 'y': 10 # 10px from the top edge of the element })`

#### windows: clickAndDragPerforms a click-and-drag gesture.

#### Usage

Scenario 1: Element to Element (Center to Center)`python driver.execute_script('windows: clickAndDrag', { 'startElementId': element1.id, 'endElementId': element2.id, 'durationMs': 2000 })`

Scenario 2: Absolute Coordinates`python driver.execute_script('windows: clickAndDrag', { 'startX': 100, 'startY': 100, 'endX': 500, 'endY': 500, 'smoothPointerMove': 'linear' })`

Scenario 3: Element with Offset (Drag from specific point inside element)`python driver.execute_script('windows: clickAndDrag', { 'startElementId': element1.id, 'startX': 10, # Start 10px from left of element1 'startY': 10, # Start 10px from top of element1 'endElementId': element2.id, 'endX': 50, # End 50px from left of element2 'endY': 50 # End 50px from top of element2 })`

#### windows: scrollThis is a shortcut for a mouse wheel scroll gesture. The API is a thin wrapper over the SendInput WinApi call.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |elementId | string | no | Same as in windows: click. | 123e4567-e89b...| |x | number | no | Same as in windows: click. | 100| |y | number | no | Same as in windows: click. | 100| |deltaX | number | no | The amount of horizontal wheel movement measured in wheel clicks. Positive = right, Negative = left. | -5| |deltaY | number | no | The amount of vertical wheel movement. Positive = forward (away), Negative = backward (toward). | 5| |modifierKeys | string[] \| string | no | Same as in windows: click. | win |

#### Usage`python driver.execute_script('windows: scroll', { 'elementId': element.id, 'deltaY': -120, # Scroll down 3 lines 'modifierKeys': 'shift' })`

#### windows: hoverThis is a shortcut for a hover gesture.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |startElementId | string | no | Same as in windows: click. | 123e4567-e89b...| |startX | number | no | Same as in windows: click. | 100| |startY | number | no | Same as in windows: click. | 100| |endElementId | string | no | Same as in windows: click. | 123e4567-e89b...| |endX | number | no | Same as in windows: click. | 200| |endY | number | no | Same as in windows: click. | 200| |modifierKeys | string[] \| string | no | Same as in windows: click. | shift| |durationMs | number | no | The number of milliseconds between moving the cursor from the starting to the ending hover point. 500ms by default. | 700 |

#### Usage

Scenario 1: Element to Element (Center to Center)`python driver.execute_script('windows: hover', { 'startElementId': element1.id, 'endElementId': element2.id, 'durationMs': 1000 })`

Scenario 2: Absolute Coordinates`python driver.execute_script('windows: hover', { 'startX': 100, 'startY': 100, 'endX': 500, 'endY': 500 })`

Scenario 3: Element with Offset (Hover specific point)`python driver.execute_script('windows: hover', { 'startElementId': element1.id, 'startX': 5, 'startY': 5, 'endElementId': element2.id, 'endX': 5, 'endY': 5 })`

`$3`

#### windows: typeDelaySets the delay between key injections in milliseconds. This persistent setting applies to the entire session until changed.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |delay | number | yes | Delay in milliseconds. | 100 |

#### Usage`python driver.execute_script('windows: typeDelay', {'delay': 500})

`or shorthand`


driver.execute_script('windows: typeDelay', '500')

#### windows: keysThis is a shortcut for a customized keyboard input. Selenium keys should also work as modifier keys.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |actions | KeyAction[] \| KeyAction | yes | One or more KeyAction dictionaries. | [{'virtualKeyCode': 0x10, 'down': true}]| |forceUnicode | boolean | no | Forces the characters to be sent as unicode characters. | true |

##### KeyAction Dictionary

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |pause | number | no | Allows to set a delay in milliseconds between key input series. | 100| |text | string | no | Non-empty string of Unicode text to type. | Hello| |virtualKeyCode | number | no | Valid virtual key code. | 0x10| |down | boolean | no | If set to true then the corresponding key will be depressed, false - released. | true |

#### Usage`python driver.execute_script('windows: keys', { 'actions': [ {'virtualKeyCode': 0x10, 'down': True}, # Shift Down {'text': 'Hello World'}, {'virtualKeyCode': 0x10, 'down': False} # Shift Up ] })`

`$3`

#### windows: setClipboardSets Windows clipboard content to the given text or a PNG image.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |b64Content | string | yes | Base64-encoded content of the clipboard to be set. | QXBwaXVt| |contentType | string | no | Set to plaintext (default) or image. | image |

#### Usage`python driver.execute_script('windows: setClipboard', { 'b64Content': 'SGVsbG8=', # "Hello" in Base64 'contentType': 'plaintext' })`

#### windows: getClipboardRetrieves Windows clipboard content.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |contentType | string | no | Set to plaintext (default) or image. | image |

#### Usage`python content = driver.execute_script('windows: getClipboard', { 'contentType': 'plaintext' }) print(content)`

#### windows: pushCacheRequestThis is an asynchronous function that sends cache requests based on specific conditions.

#### Usage`python driver.execute_script('windows: pushCacheRequest', { 'treeFilter': 'RawView', 'treeScope': 'SubTree' })`

`$3`

#### windows: invokeInvokes a UI element pattern, simulating an interaction like clicking or activating the element.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element on which the InvokePattern is called. | element |

#### Usage`python driver.execute_script('windows: invoke', element)`

#### windows: expandExpands a UI element that supports theExpandPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to expand. | element |

#### Usage`python driver.execute_script('windows: expand', element)`

#### windows: collapseCollapses a UI element that supports theCollapsePattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to collapse. | element |

#### Usage`python driver.execute_script('windows: collapse', element)`

#### windows: setValueSets the value of a UI element using theValuePattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element whose value will be set. | element| | 2 |string | The value to be set. | "new value" |

#### Usage`python driver.execute_script('windows: setValue', element, 'New Value')`

#### windows: getValueGets the current value of a UI element that supports theValuePattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element from which to retrieve the value. | element |

#### Usage`python value = driver.execute_script('windows: getValue', element)`

#### windows: scrollIntoViewScrolls the UI element into view using theScrollItemPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to bring into view. | element |

#### Usage`python driver.execute_script('windows: scrollIntoView', element)`

> Note > You can also use the standard JavaScript way: >`python > driver.execute_script('arguments[0].scrollIntoView()', element) >`

#### windows: toggleToggles a UI element’s state using theTogglePattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to toggle. | element |

#### Usage`python driver.execute_script('windows: toggle', element)`

`$3`

#### windows: selectSelects a UI element using theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to select. | element |

#### Usage`python driver.execute_script('windows: select', element)`

#### windows: addToSelectionAdds an element to the current selection on a UI element that supports theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to add to the selection. | element |

#### Usage`python driver.execute_script('windows: addToSelection', element)`

#### windows: removeFromSelectionRemoves an element from the current selection on a UI element that supports theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to remove from the selection. | element |

#### Usage`python driver.execute_script('windows: removeFromSelection', element)`

#### windows: isMultipleChecks if a UI element supports multiple selection using theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to check. | element |

#### Usage`python is_multi = driver.execute_script('windows: isMultiple', element)`

#### windows: selectedItemGets the selected item from a UI element that supports theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element from which to retrieve the selected item. | element |

#### Usage`python selected_el = driver.execute_script('windows: selectedItem', element)`

#### windows: allSelectedItemsGets all selected items from a UI element that supports theSelectionPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element from which to retrieve all selected items. | element |

#### Usage`python selected_els = driver.execute_script('windows: allSelectedItems', element)`

`$3`

#### windows: maximizeMaximizes a window or UI element using theWindowPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The window or UI element to maximize. | element |

#### Usage`python driver.execute_script('windows: maximize', element)`

#### windows: minimizeMinimizes a window or UI element using theWindowPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The window or UI element to minimize. | element |

#### Usage`python driver.execute_script('windows: minimize', element)`

#### windows: restoreRestores a window or UI element to its normal state using theWindowPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The window or UI element to restore. | element |

#### Usage`python driver.execute_script('windows: restore', element)`

#### windows: closeCloses a window or UI element using theWindowPattern.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The window or UI element to close. | element |

#### Usage`python driver.execute_script('windows: close', element)`

#### windows: setProcessForegroundBrings the main window of the specified process to the foreground.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |process | string | yes | The name of the process whose window should be brought to the foreground. | notepad.exe |

#### Usage`python driver.execute_script('windows: setProcessForeground', { 'process': 'notepad.exe' })`

#### windows: setFocusSets focus to the specified UI element using UIAutomationElement'sSetFocus method.

| Position | Type | Description | Example | | :--- | :--- | :--- | :--- | | 1 |Element | The UI element to set focus on. | element |

#### Usage`python driver.execute_script('windows: setFocus', element)`

---

`🛠 Development`

Recommended VS Code plugin: Comment tagged templates for syntax highlighting.

`bash npm install # Setup dependencies npm run lint # Code quality check npm run build # Transpile TypeScript to JS``

NovaWindows2 Driver
===================

---

📑 Table of Contents

---

🚀 Getting Started

$3

The driver is built for Appium 3. To install it, run:
``

bash
appium driver install --source=npm appium-novawindows2-driver


$3

- Host OS: Windows 10 or later.
- No Developer Mode or extra dependencies required.
---
⚙️ Configuration
NovaWindows2 Driver supports the following capabilities:

---

`💡 Example Usage`

Check out the examples/refactor directory for comprehensive examples.

`$3`

python
from appium import webdriver
from appium.options.windows import WindowsOptions
options = WindowsOptions()
options.app = 'C:\\Windows\\System32\\notepad.exe'
options.automation_name = 'NovaWindows2'
driver = webdriver.Remote('http://127.0.0.1:4723', options=options)
... tests ...

driver.quit()


---
✨ Key Features
$3

Appium Windows Driver supports the same location strategies the WinAppDriver supports, but also includes Windows UIAutomation conditions:

`$3`


Retrieve comprehensive details about UI elements using standard or bulk methods.

`python

`getAttributes returns all properties as a JSON string`


all_attributes = element.get_attribute("all")


$3

Execute internal PowerShell scripts or commands directly from your test. This requires the

power_shell insecure feature to be enabled on the Appium server.

It is possible to execute a single PowerShell command or a whole script. Note that powerShell is case-insensitive.

`python

`Execute a command string`


driver.execute_script('powerShell', {'command': 'Get-Process Notepad'})
Execute a script string

driver.execute_script('powerShell', {'script': '$p = Get-Process Notepad; $p.Kill();'})
Shorthand (executes as command/script depending on context)

driver.execute_script('powerShell', 'Get-Process')


$3

You can specify the delay directly within the text string using the

[delay:ms] pattern. This overrides the session setting (set via windows: typeDelay or typeDelay capability) for that specific action.

`python driver.find_element(...).send_keys("[delay:500]Slow text")`

---

`🛠 Platform-Specific Extensions`

All extensions are invoked via driver.executeScript("windows: ", ...args). Below are the detailed descriptions and arguments for each command.

> Note > In most cases, commands can be used more intuitively by passing the element as the first argument (if required) and other parameters subsequently.

`$3`

#### windows: clickThis is a shortcut for a single mouse click gesture.

#### Usage

Scenario 1: Using Element ID (Clicks Center)`python driver.execute_script('windows: click', { 'elementId': element.id, 'button': 'right', 'times': 2 })`

Scenario 2: Using Absolute Coordinates`python driver.execute_script('windows: click', { 'x': 500, 'y': 300, 'button': 'left' })`

#### windows: clickAndDragPerforms a click-and-drag gesture.

#### Usage

Scenario 1: Element to Element (Center to Center)`python driver.execute_script('windows: clickAndDrag', { 'startElementId': element1.id, 'endElementId': element2.id, 'durationMs': 2000 })`

Scenario 2: Absolute Coordinates`python driver.execute_script('windows: clickAndDrag', { 'startX': 100, 'startY': 100, 'endX': 500, 'endY': 500, 'smoothPointerMove': 'linear' })`

#### windows: scrollThis is a shortcut for a mouse wheel scroll gesture. The API is a thin wrapper over the SendInput WinApi call.

#### Usage`python driver.execute_script('windows: scroll', { 'elementId': element.id, 'deltaY': -120, # Scroll down 3 lines 'modifierKeys': 'shift' })`

#### windows: hoverThis is a shortcut for a hover gesture.

#### Usage

Scenario 1: Element to Element (Center to Center)`python driver.execute_script('windows: hover', { 'startElementId': element1.id, 'endElementId': element2.id, 'durationMs': 1000 })`

Scenario 2: Absolute Coordinates`python driver.execute_script('windows: hover', { 'startX': 100, 'startY': 100, 'endX': 500, 'endY': 500 })`

`$3`

#### windows: typeDelaySets the delay between key injections in milliseconds. This persistent setting applies to the entire session until changed.

| Name | Type | Required | Description | Example | | :--- | :--- | :--- | :--- | :--- | |delay | number | yes | Delay in milliseconds. | 100 |

#### Usage`python driver.execute_script('windows: typeDelay', {'delay': 500})

`or shorthand`


driver.execute_script('windows: typeDelay', '500')