Skip to content

Conversation

YuliangLiu0306
Copy link
Contributor

@YuliangLiu0306 YuliangLiu0306 commented Jan 4, 2023

What does this PR do

This PR provide a alpha-beta profiler to profile the alpha and beta value of given device list.

This profiler could be used individually to profile the metrics of communication performance for a cluster, but also could be used in auto-parallel system to give infomation to search the best logical device mesh of the given device list.

Usage:

    >>> physical_devices = [0, 1, 4, 5]
    >>> ab_profiler = AlphaBetaProfiler(physical_devices)
    >>> ab_dict = profiler.profile_ab()
    >>> print(ab_dict)
    {(0, 1): (1.9641406834125518e-05, 4.74049549614719e-12), (0, 4): (1.9506998360157013e-05, 6.97421973297474e-11), (0, 5): (2.293858677148819e-05, 7.129930361393644e-11),
     (1, 4): (1.9010603427886962e-05, 7.077968863788975e-11), (1, 5): (1.9807778298854827e-05, 6.928845708992215e-11), (4, 5): (1.8681809306144713e-05, 4.7522367291330524e-12),
     (1, 0): (1.9641406834125518e-05, 4.74049549614719e-12), (4, 0): (1.9506998360157013e-05, 6.97421973297474e-11), (5, 0): (2.293858677148819e-05, 7.129930361393644e-11),
     (4, 1): (1.9010603427886962e-05, 7.077968863788975e-11), (5, 1): (1.9807778298854827e-05, 6.928845708992215e-11), (5, 4): (1.8681809306144713e-05, 4.7522367291330524e-12)}

@YuliangLiu0306 YuliangLiu0306 force-pushed the feature/alpha_beta_profiler branch from b1fae9f to 236d840 Compare January 5, 2023 07:19
@YuliangLiu0306 YuliangLiu0306 merged commit 9c9246c into hpcaitech:main Jan 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants