from root, run (either CPU or GPU)
pytest test/distributed/tensor/test_dtensor.py
run specific test cases and print stdout/stderr:
pytest test/distributed/tensor/test_dtensor.py -s -k test_from_local