What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????汝??[??????汝??[^ 0011111100111111001111110011111100111111001111111001001111110000001111110011111101011011001111110011111100111111001111110011111100111111100100111111000000111111001111110101101101011110 3f3f3f3f3f3f93f03f3f5b3f3f3f3f3f3f93f03f3f5b5e
EUC-JP 薏??沅??汝??[薏??沅??汝??[^ 10001111110110011101111000111111001111111000111111000110111010010011111100111111110001101111001000111111001111110101101110001111110110011101111000111111001111111000111111000110111010010011111100111111110001101111001000111111001111110101101101011110 8fd9de3f3f8fc6e93f3fc6f23f3f5b8fd9de3f3f8fc6e93f3fc6f23f3f5b5e
UTF-8 薏몄젚沅룡쐿汝끿젲[薏몄젚沅룡쐿汝끿젲[^ 111010001001011010001111111010111010101010000100111011001010000010011010111001101011001010000101111010111010001110100001111011001001000010111111111001101011000110011101111010111000000110111111111011001010000010110010010110111110100010010110100011111110101110101010100001001110110010100000100110101110011010110010100001011110101110100011101000011110110010010000101111111110011010110001100111011110101110000001101111111110110010100000101100100101101101011110 e8968febaa84eca09ae6b285eba3a1ec90bfe6b19deb81bfeca0b25be8968febaa84eca09ae6b285eba3a1ec90bfe6b19deb81bfeca0b25b5e
UHC 薏몄젚沅룡쐿汝끿젲[薏몄젚沅룡쐿汝끿젲[^ 111010111111101110111000111011001010000010010110111010101011011010110111111001101001110010011111111001101010001110000101111001111010000010100110010110111110101111111011101110001110110010100000100101101110101010110110101101111110011010011100100111111110011010100011100001011110011110100000101001100101101101011110 ebfbb8eca096eab6b7e69c9fe6a385e7a0a65bebfbb8eca096eab6b7e69c9fe6a385e7a0a65b5e

SJIS-win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
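
The same kind of conversion can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name encode_table and the choice of codec names are assumptions: cp932 stands in for SJIS-win and cp949 for UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table above appear to be.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    encode_table is a hypothetical helper; the charset names are Python
    codec names, assumed here to match the page's labels.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("あ"):
        print(name, bits, hexstr)
```

For example, the hiragana "あ" (U+3042) comes out as e38182 in UTF-8, 82a0 in CP932 and a4a2 in EUC-JP, while ISO-8859-1 has no such character and yields the replacement byte 3f.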