To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???瓦わ????? 001111110011111100111111100010101010001010000010111011010011111100111111001111110011111100111111 3f3f3f8aa282ed3f3f3f3f3f
EUC-JP ???瓦わ????? 001111110011111100111111101101001010010010100100111011110011111100111111001111110011111100111111 3f3f3fb4a4a4ef3f3f3f3f3f
UTF-8 了먲쉥瓦わ슭僚뽩쨰掠 111011111010011010111010111010111010100010110010111011001000100110100101111001111001001110100110111000111000001010001111111011001000101010101101111011111010011010111011111010111011110110101001111011001010100010110000111011111010010110110101 efa6baeba8b2ec89a5e793a6e3828fec8aadefa6bbebbda9eca8b0efa5b5
UHC 了먲쉥瓦わ슭僚뽩쨰掠 1110100011100111100100001110111110111101101010111110100010111111101010101110111110111101101111101110100011101000100101101110010110100100100010101110010110110001 e8e790efbdabe8bfaaefbdbee8e896e5a48ae5b1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)