To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 妖?霽?妖?訂? 100101110110010000111111111010001100011100111111100101110110010000111111100100101111100100111111 97643fe8c73f97643f92f93f
EUC-JP 妖?霽?妖?訂? 110011011100010100111111111100001100100100111111110011011100010100111111110001001111101100111111 cdc53ff0c93fcdc53fc4fb3f
UTF-8 妖렢霽렢妖렢訂렦 111001011010011010010110111010111010000010100010111010011001110010111101111010111010000010100010111001011010011010010110111010111010000010100010111010001010100010000010111010111010000010100110 e5a696eba0a2e99cbdeba0a2e5a696eba0a2e8a882eba0a6
UHC 妖렢霽렢妖렢訂렦 11101000111011011000111010110011111100001011100010001110101100111110100011101101100011101011001111101111111101001000111010110101 e8ed8eb3f0b88eb3e8ed8eb3eff48eb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)