To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????@???????????@B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000001000010 3f3f3f3f3f3f3f3f3f3f3f403f3f3f3f3f3f3f3f3f3f3f4042
SJIS-WIN ??⊂??八健ダ⊂?竊@??⊂??八健ダ⊂?竊@B 00111111001111111000000110111100001111110011111110010100101010101000110010010010100000110101111110000001101111000011111111100010100001100100000000111111001111111000000110111100001111110011111110010100101010101000110010010010100000110101111110000001101111000011111111100010100001100100000001000010 3f3f81bc3f3f94aa8c92835f81bc3fe286403f3f81bc3f3f94aa8c92835f81bc3fe2864042
EUC-JP ??⊂??八健ダ⊂?竊@??⊂??八健ダ⊂?竊@B 00111111001111111010001010111110001111110011111111001000101011001011011111110010101001011100000010100010101111100011111111100011111001100100000000111111001111111010001010111110001111110011111111001000101011001011011111110010101001011100000010100010101111100011111111100011111001100100000001000010 3f3fa2be3f3fc8acb7f2a5c0a2be3fe3e6403f3fa2be3f3fc8acb7f2a5c0a2be3fe3e64042
UTF-8 룶엌⊂룶웩八健ダ⊂룫竊@룶엌⊂룶웩八健ダ⊂룫竊@B 111010111010001110110110111011001001011110001100111000101000101010000010111010111010001110110110111011001001101110101001111001011000010110101011111001011000000110100101111000111000001110000000111000101000101010000010111010111010001110101011111001111010101110001010010000001110101110100011101101101110110010010111100011001110001010001010100000101110101110100011101101101110110010011011101010011110010110000101101010111110010110000001101001011110001110000011100000001110001010001010100000101110101110100011101010111110011110101011100010100100000001000010 eba3b6ec978ce28a82eba3b6ec9ba9e585abe581a5e38380e28a82eba3abe7ab8a40eba3b6ec978ce28a82eba3b6ec9ba9e585abe581a5e38380e28a82eba3abe7ab8a4042
UHC 룶엌⊂룶웩八健ダ⊂룫竊@룶엌⊂룶웩八健ダ⊂룫竊@B 1000111110101011101111101111110110100001111110001000111110101011110000001010000111111000101000101100101111101101101010111100000010100001111110001000111110100010111011111011110001000000100011111010101110111110111111011010000111111000100011111010101111000000101000011111100010100010110010111110110110101011110000001010000111111000100011111010001011101111101111000100000001000010 8fabbefda1f88fabc0a1f8a2cbedabc0a1f88fa2efbc408fabbefda1f88fabc0a1f8a2cbedabc0a1f88fa2efbc4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)