To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???魏??懿??嚥 00111111001111110011111111101001101100000011111100111111100111001111001000111111001111111001101010001011 3f3f3fe9b03f3f9cf23f3f9a8b
EUC-JP ???魏??懿??嚥 00111111001111110011111111110010101100100011111100111111110110001111010000111111001111111101001111101011 3f3f3ff2b23f3fd8f43f3fd3eb
UTF-8 咽됱빍魏섊럦懿몄쨭嚥 111011111010011010011110111010111001000010110001111010111011100110001101111010011010110110001111111011001000010010001010111010111001111110100110111001101000011110111111111010111010101010000100111011001010100010101101111001011001101010100101 efa69eeb90b1ebb98de9ad8fec848aeb9fa6e687bfebaa84eca8ade59aa5
UHC 咽됱빍魏섊럦懿몄쨭嚥 1110011011101100100010011110110010010101101100101110101011100000100110001110011110001110100010011110101111110011101110001110110010100100100001111110011010111111 e6ec89ec95b2eae098e78e89ebf3b8eca487e6bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
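
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names are assumptions: `cp932` stands in for SJIS-WIN (per the note above), `cp949` for UHC, and `errors="replace"` is assumed to mirror the tool's behavior of emitting `?` (0x3f) for characters a charset cannot represent. The helper name `encode_rows` is hypothetical.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("魏"):
        print(label, bits, hex_string)
```

For the single character 魏, this yields `3f` for ISO-8859-1, `e9b0` for SJIS-WIN, `f2b2` for EUC-JP, and `e9ad8f` for UTF-8, which matches the corresponding bytes in the table above.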