.. SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
.. SPDX-License-Identifier: Apache-2.0

.. currentmodule:: cuda.core.experimental

Overview
========

- and much more!

Rather than providing 1:1 equivalents of the CUDA driver and runtime APIs
(for that, see `cuda.bindings <https://nvidia.github.io/cuda-python/cuda-bindings/latest/>`_), ``cuda.core`` provides high-level constructs such as:

- :class:`Device` class for GPU device operations and context management.
- :class:`Buffer` and :class:`MemoryResource` classes for memory allocation and management.
- :class:`Program` for JIT compilation of CUDA kernels.
- :class:`GraphBuilder` for building and executing CUDA graphs.
- :class:`Stream` and :class:`Event` for asynchronous execution and timing.

Example: Compiling and Launching a CUDA kernel
----------------------------------------------

To get a taste for ``cuda.core``, let's walk through a simple example that compiles and launches a vector addition kernel.
You can find the complete example in `vector_add.py <https://github.com/NVIDIA/cuda-python/tree/main/cuda_core/examples/vector_add.py>`_.

First, we define a string containing the CUDA C++ kernel. Note that this is a templated kernel:

.. code-block:: python

   # compute c = a + b
   code = """
   template<typename T>
   __global__ void vector_add(const T* A,
                              const T* B,
                              T* C,
                              size_t N) {
       const unsigned int tid = threadIdx.x + blockIdx.x * blockDim.x;
       if (tid < N) {
           C[tid] = A[tid] + B[tid];
       }
   }
   """
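To make the per-thread computation concrete, here is a plain NumPy sketch of what the kernel computes: each thread index ``tid`` writes one output element ``C[tid] = A[tid] + B[tid]``. This is illustrative only; the helper name ``vector_add_reference`` is ours and not part of ``cuda.core``:

```python
import numpy as np

def vector_add_reference(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    # Mirrors the CUDA kernel's per-thread logic: one element per "thread" tid.
    c = np.empty_like(a)
    for tid in range(a.size):
        c[tid] = a[tid] + b[tid]
    return c

a = np.array([1.0, 2.0, 3.0], dtype=np.float32)
b = np.array([4.0, 5.0, 6.0], dtype=np.float32)
print(vector_add_reference(a, b))  # [5. 7. 9.]
```

On the GPU, of course, all of these per-element additions run in parallel rather than in a Python loop.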

Next, we create a :class:`Device` object
and a corresponding :class:`Stream`.
Don't forget to use :meth:`Device.set_current`!

.. code-block:: python

   dev = Device()
   dev.set_current()
   s = dev.create_stream()

Next, we compile the CUDA C++ kernel from earlier using the :class:`Program` class.
The result of the compilation is saved as a CUBIN.
Note the use of the ``name_expressions`` parameter to the :meth:`Program.compile` method to specify which kernel template instantiations to compile:

.. code-block:: python

   prog = Program(code, code_type="c++")
   mod = prog.compile("cubin", name_expressions=("vector_add<float>",))

Next, we retrieve the compiled kernel from the CUBIN and prepare the arguments and kernel configuration.
We're using `CuPy <https://cupy.dev/>`_ arrays as inputs for this example, but you can use PyTorch tensors too
(we show how to do this in one of our `examples <https://github.com/NVIDIA/cuda-python/tree/main/cuda_core/examples>`_).

.. code-block:: python

   ker = mod.get_kernel("vector_add<float>")

   # Prepare input/output arrays (using CuPy)
   size = 50000
   rng = cp.random.default_rng()
   a = rng.random(size, dtype=cp.float32)
   b = rng.random(size, dtype=cp.float32)
   c = cp.empty_like(a)

   # Configure launch parameters
   block = 256
   grid = (size + block - 1) // block
   config = LaunchConfig(grid=grid, block=block)
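The grid size above is a ceiling division: it chooses just enough blocks so that ``grid * block >= size``, even when ``size`` is not a multiple of the block size (any surplus threads must be guarded by a bounds check in the kernel). A quick sketch, with a helper name of our own choosing:

```python
def blocks_needed(size: int, block: int) -> int:
    # Ceiling division: smallest grid such that grid * block >= size.
    return (size + block - 1) // block

print(blocks_needed(50000, 256))  # 196
```

With ``size = 50000`` and ``block = 256``, 195 full blocks cover only 49920 elements, so a 196th (partially used) block is required.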

Finally, we use the :func:`launch` function to execute our kernel on the specified stream with the given configuration and arguments. Note the use of ``.data.ptr`` to get the pointer to the array data.

.. code-block:: python

   launch(s, config, ker, c.data.ptr, a.data.ptr, b.data.ptr, cp.uint64(size))
   s.sync()

Examples and Recipes
--------------------

As we mentioned before, ``cuda.core`` can do much more than just compile and launch kernels.

The best way to explore and learn the different features of ``cuda.core`` is through
our `examples <https://github.com/NVIDIA/cuda-python/tree/main/cuda_core/examples>`_. Find one that matches your use-case, and modify it to fit your needs!