To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??⊂??八健ダ⊂⊂??⊂??八健ダ⊂〓 0011111100111111100000011011110000111111001111111001010010101010100011001001001010000011010111111000000110111100100000011011110000111111001111111000000110111100001111110011111110010100101010101000110010010010100000110101111110000001101111001000000110101100 3f3f81bc3f3f94aa8c92835f81bc81bc3f3f81bc3f3f94aa8c92835f81bc81ac
EUC-JP ??⊂??八健ダ⊂⊂??⊂??八健ダ⊂〓 0011111100111111101000101011111000111111001111111100100010101100101101111111001010100101110000001010001010111110101000101011111000111111001111111010001010111110001111110011111111001000101011001011011111110010101001011100000010100010101111101010001010101110 3f3fa2be3f3fc8acb7f2a5c0a2bea2be3f3fa2be3f3fc8acb7f2a5c0a2bea2ae
UTF-8 룶엌⊂룶웩八健ダ⊂⊂룶엌⊂룶웩八健ダ⊂〓 111010111010001110110110111011001001011110001100111000101000101010000010111010111010001110110110111011001001101110101001111001011000010110101011111001011000000110100101111000111000001110000000111000101000101010000010111000101000101010000010111010111010001110110110111011001001011110001100111000101000101010000010111010111010001110110110111011001001101110101001111001011000010110101011111001011000000110100101111000111000001110000000111000101000101010000010111000111000000010010011 eba3b6ec978ce28a82eba3b6ec9ba9e585abe581a5e38380e28a82e28a82eba3b6ec978ce28a82eba3b6ec9ba9e585abe581a5e38380e28a82e38093
UHC 룶엌⊂룶웩八健ダ⊂⊂룶엌⊂룶웩八健ダ⊂〓 10001111101010111011111011111101101000011111100010001111101010111100000010100001111110001010001011001011111011011010101111000000101000011111100010100001111110001000111110101011101111101111110110100001111110001000111110101011110000001010000111111000101000101100101111101101101010111100000010100001111110001010000111101011 8fabbefda1f88fabc0a1f8a2cbedabc0a1f8a1f88fabbefda1f88fabc0a1f8a2cbedabc0a1f8a1eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)