What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???飮??級異?????鸚??揖??兪??? 0011111100111111001111111001111101011010001111110011111110001011100010011000100011011001001111110011111100111111001111110011111111101010010111110011111100111111100101110100101100111111001111111001100101100000001111110011111100111111 3f3f3f9f5a3f3f8b8988d93f3f3f3f3fea5f3f3f974b3f3f99603f3f3f
EUC-JP ???飮??級異?????鸚??揖??兪??? 0011111100111111001111111101110110111011001111110011111110110101111010011011000011011011001111110011111100111111001111110011111111110011110000000011111100111111110011011010110000111111001111111101000111000001001111110011111100111111 3f3f3fddbb3f3fb5e9b0db3f3f3f3f3ff3c03f3fcdac3f3fd1c13f3f3f
UTF-8 凉깅냵飮긷럳級異뜻벴戮녹댅鸚쒖눦揖쇔선兪낆댋凉 111011111010010110111001111010101011100110000101111010111000001110110101111010011010001110101110111010101011100010110111111010111001111110110011111001111011010010011010111001111001010110110000111010111001110010111011111010111011001010110100111011111010011110010010111010111000010110111001111010111000110010000101111010011011100010011010111011001001001010010110111010111000100010100110111001101000111110010110111011001000011110010100111011001000010010100000111001011000010110101010111010111000001010000110111010111000110010001011111011111010010110111001 efa5b9eab985eb83b5e9a3aeeab8b7eb9fb3e7b49ae795b0eb9cbbebb2b4efa792eb85b9eb8c85e9b89aec9296eb88a6e68f96ec8794ec84a0e585aaeb8286eb8c8befa5b9
UHC 凉깅냵飮긷럳級異뜻벴戮녹댅鸚쒖눦揖쇔선兪낆댋凉 11100101101111001011000111101011100001101000010111101011111001101011000111100101100011101001001111010000111001001110110010110110101101101110011010111010101010111110101110111101101100111110110010001000101011111110010110100100100111001110110010000111101111011110101111100111101111001110010110111100101100011110101011100100100001011110110010001000101101001110010110111100 e5bcb1eb8685ebe6b1e58e93d0e4ecb6b6e6baabebbdb3ec88afe5a49cec87bdebe7bce5bcb1eae485ec88b4e5bc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
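
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_in_charsets` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Hypothetical mapping: table labels -> Python codec names (an assumption,
# not taken from the page itself).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("級").items():
        print(label, bits, hex_string)
```

For the single character 級, this yields e7b49a in UTF-8, 8b89 in SJIS-WIN, and b5e9 in EUC-JP, the same byte sequences that appear for that character in the rows above.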