To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????T 00111111001111110011111100111111001111110011111100111111001111110011111101010100 3f3f3f3f3f3f3f3f3f54
SJIS-WIN ???汝??釗??T 001111110011111100111111100100111111000000111111001111111111101110111011001111110011111101010100 3f3f3f93f03f3ffbbb3f3f54
EUC-JP ???汝??釗??T 00111111001111110011111111000110111100100011111100111111100011111110001110100110001111110011111101010100 3f3f3fc6f23f3f8fe3a63f3f54
UTF-8 溜삠뀛汝싲졎釗숇젙T 11101111101001111000101111101100100000101010000011101011100000001001101111100110101100011001110111101100100010111011001011101100101000011000111011101001100001111001011111101100100010001000011111101100101000001001100101010100 efa78bec82a0eb809be6b19dec8bb2eca18ee98797ec8887eca09954
UHC 溜삠뀛汝싲졎釗숇젙T 11101010111111101011101111100011100001011001010011100110101000111001101011101011101000001011101111100001111100101001100111101011101000001001010101010100 eafebbe38594e6a39aeba0bbe1f299eba09554

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)