To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????A 00111111001111110011111100111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f41
SJIS-WIN ???移?/酉??A 00111111001111110011111110001000110110100011111110000001010111101001001111010001001111110011111101000001 3f3f3f88da3f815e93d13f3f41
EUC-JP ???移?/酉??A 00111111001111110011111110110000110111000011111110100001101111111100011011010011001111110011111101000001 3f3f3fb0dc3fa1bfc6d33f3f41
UTF-8 蓮곴퀓移뷴/酉귦뫞A 11101111101001101001100111101010101100111011010011101101100000001001001111100111101001111011101111101011101101111011010011101111101111001000111111101001100001011000100111101010101101111010011011101011101010111001111001000001 efa699eab3b4ed8093e7a7bbebb7b4efbc8fe98589eab7a6ebab9e41
UHC 蓮곴퀓移뷴/酉귦뫞A 11100110111001011000000111101010101100111000100011101100101110011011101011100101101000111010111111101011101101111000001011101101100100011011111001000001 e6e581eab388ecb9bae5a3afebb782ed91be41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)