To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ??咐??八健ダ⊂⊂??咐??〓??濯\ 0011111100111111100110011111001100111111001111111001010010101010100011001001001010000011010111111000000110111100100000011011110000111111001111111001100111110011001111110011111110000001101011000011111100111111100100011111001101011100 3f3f99f33f3f94aa8c92835f81bc81bc3f3f99f33f3f81ac3f3f91f35c
EUC-JP ??咐??八健ダ⊂⊂??咐??〓??濯\ 0011111100111111110100101111010100111111001111111100100010101100101101111111001010100101110000001010001010111110101000101011111000111111001111111101001011110101001111110011111110100010101011100011111100111111110000101111010101011100 3f3fd2f53f3fc8acb7f2a5c0a2bea2be3f3fd2f53f3fa2ae3f3fc2f55c
UTF-8 룶웩咐룶웩八健ダ⊂⊂룶웩咐룶엌〓룶웩濯\ 11101011101000111011011011101100100110111010100111100101100100101001000011101011101000111011011011101100100110111010100111100101100001011010101111100101100000011010010111100011100000111000000011100010100010101000001011100010100010101000001011101011101000111011011011101100100110111010100111100101100100101001000011101011101000111011011011101100100101111000110011100011100000001001001111101011101000111011011011101100100110111010100111100110101111111010111101011100 eba3b6ec9ba9e59290eba3b6ec9ba9e585abe581a5e38380e28a82e28a82eba3b6ec9ba9e59290eba3b6ec978ce38093eba3b6ec9ba9e6bfaf5c
UHC 룶웩咐룶웩八健ダ⊂⊂룶웩咐룶엌〓룶웩濯\ 100011111010101111000000101000011101110011111011100011111010101111000000101000011111100010100010110010111110110110101011110000001010000111111000101000011111100010001111101010111100000010100001110111001111101110001111101010111011111011111101101000011110101110001111101010111100000010100001111101101111101101011100 8fabc0a1dcfb8fabc0a1f8a2cbedabc0a1f8a1f88fabc0a1dcfb8fabbefda1eb8fabc0a1f6fb5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)