To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???嚥????嚥 0011111100111111001111111001101010001011001111110011111100111111001111111001101010001011 3f3f3f9a8b3f3f3f3f9a8b
EUC-JP ???嚥????嚥 0011111100111111001111111101001111101011001111110011111100111111001111111101001111101011 3f3f3fd3eb3f3f3f3fd3eb
UTF-8 曆꿨맃嚥쨑曆꿨맃嚥 111011111010011010001011111010101011111110101000111010111010011110000011111001011001101010100101111011001010100010010001111011111010011010001011111010101011111110101000111010111010011110000011111001011001101010100101 efa68beabfa8eba783e59aa5eca891efa68beabfa8eba783e59aa5
UHC 曆꿨맃嚥쨑曆꿨맃嚥 111001101011011110110010111001011001000010011101111001101011111110100100011010001110011010110111101100101110010110010000100111011110011010111111 e6b7b2e5909de6bfa468e6b7b2e5909de6bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)