To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 狎??誼??循??悟??狎??誼??循??悟??B 111000001011111000111111001111111000101101100010001111110011111110001111011110100011111100111111100011001110010100111111001111111110000010111110001111110011111110001011011000100011111100111111100011110111101000111111001111111000110011100101001111110011111101000010 e0be3f3f8b623f3f8f7a3f3f8ce53f3fe0be3f3f8b623f3f8f7a3f3f8ce53f3f42
EUC-JP 狎??誼??循??悟??狎??誼??循??悟??B 111000001100000000111111001111111011010111000011001111110011111110111101110110110011111100111111101110001110011100111111001111111110000011000000001111110011111110110101110000110011111100111111101111011101101100111111001111111011100011100111001111110011111101000010 e0c03f3fb5c33f3fbddb3f3fb8e73f3fe0c03f3fb5c33f3fbddb3f3fb8e73f3f42
UTF-8 狎쀫갭誼쀥럳循뚮펶悟딅컖狎쀫갭誼쀥럳循뚮펶悟딅컖B 11100111100010111000111011101100100000001010101111101010101100001010110111101000101010101011110011101100100000001010010111101011100111111011001111100101101111101010101011101011100110101010111011101101100011101011011011100110100000101001111111101011100101001000010111101100101110111001011011100111100010111000111011101100100000001010101111101010101100001010110111101000101010101011110011101100100000001010010111101011100111111011001111100101101111101010101011101011100110101010111011101101100011101011011011100110100000101001111111101011100101001000010111101100101110111001011001000010 e78b8eec80abeab0ade8aabcec80a5eb9fb3e5beaaeb9aaeed8eb6e6829feb9485ecbb96e78b8eec80abeab0ade8aabcec80a5eb9fb3e5beaaeb9aaeed8eb6e6829feb9485ecbb9642
UHC 狎쀫갭誼쀥럳循뚮펶悟딅컖狎쀫갭誼쀥럳循뚮펶悟딅컖B 11100100111001001001011111101011101100001011100011101011111111101001011111100101100011101001001111100010111000001000110011101011101111001000011111100111111101101000101011101011101100001000000111100100111001001001011111101011101100001011100011101011111111101001011111100101100011101001001111100010111000001000110011101011101111001000011111100111111101101000101011101011101100001000000101000010 e4e497ebb0b8ebfe97e58e93e2e08cebbc87e7f68aebb081e4e497ebb0b8ebfe97e58e93e2e08cebbc87e7f68aebb08142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)