Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 瓦??鈺??奄?n}瓦??鈺??奄?n{^ 100010101010001000111111001111111111101111000100001111110011111110001001100000100011111101101110011111011000101010100010001111110011111111111011110001000011111100111111100010011000001000111111011011100111101101011110 8aa23f3ffbc43f3f89823f6e7d8aa23f3ffbc43f3f89823f6e7b5e
EUC-JP 瓦??鈺??奄?n}瓦??鈺??奄?n{^ 1011010010100100001111110011111110001111111000111101010100111111001111111011000111100010001111110110111001111101101101001010010000111111001111111000111111100011110101010011111100111111101100011110001000111111011011100111101101011110 b4a43f3f8fe3d53f3fb1e23f6e7db4a43f3f8fe3d53f3fb1e23f6e7b5e
UTF-8 瓦븝슴鈺됧룚奄턮n}瓦븝슴鈺됧룚奄턮n{^ 1110011110010011101001101110101110111000100111011110110010001010101101001110100110001000101110101110101110010000101001111110101110100011100110101110010110100101100001001110110110000100101011100110111001111101111001111001001110100110111010111011100010011101111011001000101010110100111010011000100010111010111010111001000010100111111010111010001110011010111001011010010110000100111011011000010010101110011011100111101101011110 e793a6ebb89dec8ab4e988baeb90a7eba39ae5a584ed84ae6e7de793a6ebb89dec8ab4e988baeb90a7eba39ae5a584ed84ae6e7b5e
UHC 瓦븝슴鈺됧룚奄턮n}瓦븝슴鈺됧룚奄턮n{^ 11101000101111111011101011101111101111011011111111101000101011011000100111100101100011111001011011100101111100101011011001101111011011100111110111101000101111111011101011101111101111011011111111101000101011011000100111100101100011111001011011100101111100101011011001101111011011100111101101011110 e8bfbaefbdbfe8ad89e58f96e5f2b66f6e7de8bfbaefbdbfe8ad89e58f96e5f2b66f6e7b5e

SJIS-Win, EUC-JP: Classic charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
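
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's own code. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_row` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes visible in the rows above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the converter itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "瓦n}"  # hypothetical input: one CJK character plus ASCII
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

Each output line follows the table's column order: charset, character, binary bit string, hexadecimal bit string. Results for a few characters may differ from the table, because codec implementations do not always agree on vendor-specific mappings.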