To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 矮??暗?????娃??矮??暗?????娃??^ 11100001111000100011111100111111100010001100001100111111001111110011111100111111001111111000100010100001001111110011111111100001111000100011111100111111100010001100001100111111001111110011111100111111001111111000100010100001001111110011111101011110 e1e23f3f88c33f3f3f3f3f88a13f3fe1e23f3f88c33f3f3f3f3f88a13f3f5e
EUC-JP 矮??暗?????娃??矮??暗?????娃??^ 11100010111001000011111100111111101100001100010100111111001111110011111100111111001111111011000010100011001111110011111111100010111001000011111100111111101100001100010100111111001111110011111100111111001111111011000010100011001111110011111101011110 e2e43f3fb0c53f3f3f3f3fb0a33f3fe2e43f3fb0c53f3f3f3f3fb0a33f3f5e
UTF-8 矮듬젵暗뽯젿略듬젘娃뗭겑矮듬젵暗뽯젿略듬젘娃뗭걽^ 11100111100111111010111011101011100100111010110011101100101000001011010111100110100110101001011111101011101111011010111111101100101000001011111111101111101001011011011011101011100100111010110011101100101000001001100011100101101010001000001111101011100101111010110111101010101100101001000111100111100111111010111011101011100100111010110011101100101000001011010111100110100110101001011111101011101111011010111111101100101000001011111111101111101001011011011011101011100100111010110011101100101000001001100011100101101010001000001111101011100101111010110111101010101100011011110101011110 e79faeeb93aceca0b5e69a97ebbdafeca0bfefa5b6eb93aceca098e5a883eb97adeab291e79faeeb93aceca0b5e69a97ebbdafeca0bfefa5b6eb93aceca098e5a883eb97adeab1bd5e
UHC 矮듬젵暗뽯젿略듬젘娃뗭겑矮듬젵暗뽯젿略듬젘娃뗭걽^ 11101000111000011011010111101011101000001010100111100100110111101001011011101011101000001011000111100101101100101011010111101011101000001001010011101000110111111000101111101100100000011010100111101000111000011011010111101011101000001010100111100100110111101001011011101011101000001011000111100101101100101011010111101011101000001001010011101000110111111000101111101100100000011010000001011110 e8e1b5eba0a9e4de96eba0b1e5b2b5eba094e8df8bec81a9e8e1b5eba0a9e4de96eba0b1e5b2b5eba094e8df8bec81a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)