Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥≪?議??柔レ?筌??揖??飮??余 10011010100010111000000111100001001111111000101101100011001111110011111110001111010111111000001110001100001111111110001010100011001111110011111110010111010010110011111100111111100111110101101000111111001111111001011101011101 9a8b81e13f8b633f3f8f5f838c3fe2a33f3f974b3f3f9f5a3f3f975d
EUC-JP 嚥≪?議??柔レ?筌??揖??飮??余 11010011111010111010001011100011001111111011010111000100001111110011111110111101110000001010010111101100001111111110010010100101001111110011111111001101101011000011111100111111110111011011101100111111001111111100110110111110 d3eba2e33fb5c43f3fbdc0a5ec3fe4a53f3fcdac3f3fddbb3f3fcdbe
UTF-8 嚥≪늾議끾썫柔レ쨧筌뗫쓬揖득에飮뉗죪余 111001011001101010100101111000101000100110101010111010111000101010111110111010001010110110110000111010111000000110111110111011001000110110101011111001101001111110010100111000111000001110101100111011001010100010100111111001111010110110001100111010111001011110101011111011001001001110101100111001101000111110010110111010111001001110011101111011001001011110010000111010011010001110101110111010111000100110010111111011001010001110101010111001001011110110011001 e59aa5e289aaeb8abee8adb0eb81beec8dabe69f94e383aceca8a7e7ad8ceb97abec93ace68f96eb939dec9790e9a3aeeb8997eca3aae4bd99
UHC 嚥≪늾議끾썫柔レ쨧筌뗫쓬揖득에飮뉗죪余 1110011010111111101000011110110010001000100001111110110010100001100001011110011010011011100111001110101011110101101010111110110010100100100000101110111110100111100010111110101110011101100011001110101111100111101101011110011010111111101000011110101111100110100001111110110010100001100001011110010111111001 e6bfa1ec8887eca185e69b9ceaf5abeca482efa78beb9d8cebe7b5e6bfa1ebe687eca185e5f9

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
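
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It makes several assumptions: the codec names in `CODECS` are Python's standard codecs chosen to match the page's charset labels (`cp932` for SJIS-WIN, `cp949` for UHC), the helper names `encode_row` and `encode_table` are hypothetical, and characters a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (bits, hexa) in encode_table("余").items():
        print(label, bits, hexa)
```

For example, `encode_row("余", "utf-8")` yields the hexadecimal string `e4bd99`, which matches the final three bytes of the UTF-8 row in the table.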