Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????S 00111111001111110011111100111111001111110011111100111111001111110011111101010011 3f3f3f3f3f3f3f3f3f53
SJIS-WIN 繞??認??怨??S 11100011100001010011111100111111100101000100011000111111001111111000100110000101001111110011111101010011 e3853f3f94463f3f89853f3f53
EUC-JP 繞??認??怨??S 11100101111001010011111100111111110001111010011100111111001111111011000111100101001111110011111101010011 e5e53f3fc7a73f3fb1e53f3f53
UTF-8 繞섏슃認⑼쭓怨뺤졅S 11100111101110011001111011101100100001001000111111101100100010101000001111101000101010101000110111100010100100011011110011101100101011011001001111100110100000001010100011101011101110101010010011101100101000011000010101010011 e7b99eec848fec8a83e8aa8de291bcecad93e680a8ebbaa4eca18553
UHC 繞섏슃認⑼쭓怨뺤졅S 11101001101001001001100011101100100110101001010111101100111000111010100111101111101001111000101111101010101100111001010111101100101000001011011001010011 e9a498ec9a95ece3a9efa78beab395eca0b653

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
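
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932`, `euc_jp`, and `cp949` are close enough stand-ins for SJIS-WIN, EUC-JP, and UHC (their mapping tables may differ from this tool's in edge cases), and that characters a charset cannot represent are replaced with `?` (0x3f), as in the rows above. The names `CHARSETS` and `encode_table` are made up for this illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: these codecs approximate the converter's charsets.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, shown text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Decode again to see what survives the round trip.
        shown = raw.decode(codec)
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("あS"):
        print(label, shown, bits, hexstr)
```

For the input `あS`, the ISO-8859-1 row comes out as `?S` with hex `3f53`, because Latin-1 has no hiragana, while UTF-8 yields `e3818253`.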