[RUNTIME] Proper Device Attribute Query for AMD GPU #4305

petrex · 2019-11-11T19:10:21Z

This PR implements proper device attribute queries through the HIP runtime API.

One primary motivation is to support devices across different architectures. Previously some values were hardcoded (for example, max threads per block and warp size), which may not be optimal for newer architectures.

One minor change: replace hipGetDeviceProperties() with hipDeviceGetAttribute() for better performance.
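For readers unfamiliar with the two calls, here is a minimal sketch of the per-attribute query pattern. It is not the code from this PR. `QueryDeviceAttr` and `PrintLaunchLimits` are hypothetical helpers, and the fallback stubs (with their placeholder values 64 and 1024) are assumptions that exist only so the snippet compiles on a machine without ROCm.

```cpp
// Sketch only: query individual device attributes instead of the full property struct.
#include <cstdio>

#if defined(__has_include)
#  if __has_include(<hip/hip_runtime_api.h>)
#    include <hip/hip_runtime_api.h>
#    define SKETCH_HAVE_HIP 1
#  endif
#endif

#ifndef SKETCH_HAVE_HIP
// Stand-ins (assumption) so the sketch compiles without ROCm installed.
// They mirror the HIP names used below; the returned numbers are placeholders,
// not real hardware limits.
enum hipError_t { hipSuccess = 0, hipErrorInvalidValue = 1 };
enum hipDeviceAttribute_t {
  hipDeviceAttributeMaxThreadsPerBlock,
  hipDeviceAttributeWarpSize
};
inline hipError_t hipDeviceGetAttribute(int* value, hipDeviceAttribute_t attr, int /*device*/) {
  if (value == nullptr) return hipErrorInvalidValue;
  *value = (attr == hipDeviceAttributeWarpSize) ? 64 : 1024;
  return hipSuccess;
}
#endif

// Hypothetical helper: returns true and writes the attribute value on success.
inline bool QueryDeviceAttr(hipDeviceAttribute_t attr, int device_id, int* out) {
  return hipDeviceGetAttribute(out, attr, device_id) == hipSuccess;
}

// Hypothetical helper: prints two limits of the kind that were previously hardcoded.
inline void PrintLaunchLimits(int device_id) {
  int max_threads = 0;
  int warp_size = 0;
  if (QueryDeviceAttr(hipDeviceAttributeMaxThreadsPerBlock, device_id, &max_threads) &&
      QueryDeviceAttr(hipDeviceAttributeWarpSize, device_id, &warp_size)) {
    std::printf("max threads per block: %d, warp size: %d\n", max_threads, warp_size);
  } else {
    std::printf("attribute query failed for device %d\n", device_id);
  }
}
```

With ROCm installed this would typically be built with hipcc, and `PrintLaunchLimits(0)` would report the limits for device 0. Each call fetches a single integer, whereas hipGetDeviceProperties() fills the whole hipDeviceProp_t struct even when only one field is needed.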

petrex · 2019-11-12T21:14:03Z

The error doesn't seem related to this PR (the test data download failed with HTTP 503), so let me kick off the CI again.

___________________________ test_forward_placeholder ___________________________
    def test_forward_placeholder():
        '''test a simple pb with Placeholder node in the end of GraphDef'''
        with tf.Graph().as_default():
>           graph_def = tf_testing.get_workload("Custom/placeholder.pb")
tests/python/frontend/tensorflow/test_forward.py:1879: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
python/tvm/relay/testing/tf.py:209: in get_workload
    path_model = download_testdata(model_url, model_path, module='tf')
python/tvm/contrib/download.py:158: in download_testdata
    download(url, abspath, overwrite=False, size_compare=True)
python/tvm/contrib/download.py:63: in download
    res_get = urllib2.urlopen(url)
/usr/lib/python3.6/urllib/request.py:223: in urlopen
    return opener.open(url, data, timeout)
/usr/lib/python3.6/urllib/request.py:532: in open
    response = meth(req, response)
/usr/lib/python3.6/urllib/request.py:642: in http_response
    'http', request, response, code, msg, hdrs)
/usr/lib/python3.6/urllib/request.py:570: in error
    return self._call_chain(*args)
/usr/lib/python3.6/urllib/request.py:504: in _call_chain
    result = func(*args)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
self = <urllib.request.HTTPDefaultErrorHandler object at 0x7f8c54531f98>
req = <urllib.request.Request object at 0x7f8c9c49a748>
fp = <http.client.HTTPResponse object at 0x7f8c9c2e5278>, code = 503
msg = 'Service Unavailable'
hdrs = <http.client.HTTPMessage object at 0x7f8c801b3080>
    def http_error_default(self, req, fp, code, msg, hdrs):
>       raise HTTPError(req.full_url, code, msg, hdrs, fp)
E       urllib.error.HTTPError: HTTP Error 503: Service Unavailable
/usr/lib/python3.6/urllib/request.py:650: HTTPError

src/runtime/rocm/rocm_device_api.cc

petrex · 2019-11-14T18:33:06Z

See wiki. One architecture with different compute capabilities.

masahi · 2019-11-15T04:53:13Z

@petrex sorry I merged #4341 first and there is a conflict. Can you rebase?

masahi · 2019-11-15T23:20:56Z

@petrex can you check indentation issues? I see some weird indentation done by clang-format. It should be consistent with the rest of the code base.

petrex · 2019-11-16T00:15:11Z

Morning @masahi. That weird clang-format behavior was due to a missing { from a manual merge. I've pushed the fix (passing linter and build, now with unit tests). Let's wait for the signal from CI. Thanks.

masahi self-assigned this Nov 12, 2019

petrex force-pushed the rocm_device branch from 34a46e7 to fcb8bb5 Compare November 12, 2019 21:17

t-vi reviewed Nov 14, 2019

src/runtime/rocm/rocm_device_api.cc

petrex mentioned this pull request Nov 14, 2019

[RUNTIME] Add device query for AMD GcnArch #4341

Merged

petrex changed the title ~~Proper Device Attribute Query for AMD GPU~~ [RUNTIME] Proper Device Attribute Query for AMD GPU Nov 14, 2019

petrex mentioned this pull request Nov 14, 2019

Add workgroup size attribute to AMDGPU functions in codegen #4342

Merged

petrex force-pushed the rocm_device branch from fcb8bb5 to c642a3e Compare November 14, 2019 23:32

petrex mentioned this pull request Nov 14, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed

masahi approved these changes Nov 15, 2019

petrex force-pushed the rocm_device branch 3 times, most recently from 18df399 to 60064ec Compare November 15, 2019 22:11

proper device query through rocm api

3d46e93

petrex force-pushed the rocm_device branch from 60064ec to 3d46e93 Compare November 15, 2019 23:43

masahi merged commit 022b285 into apache:master Nov 16, 2019

zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Nov 26, 2019

proper device query through rocm api (apache#4305)

bdcc64b

yongwww pushed a commit to neo-ai/tvm that referenced this pull request Nov 26, 2019

proper device query through rocm api (apache#4305)

ba45d25
