To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN ??屑堯}??屑堯{^ 001111110011111110001011111110111110101010011111011111010011111100111111100010111111101111101010100111110111101101011110 3f3f8bfbea9f7d3f3f8bfbea9f7b5e
EUC-JP ??屑堯}??屑堯{^ 001111110011111110110110111111011111010010100001011111010011111100111111101101101111110111110100101000010111101101011110 3f3fb6fdf4a17d3f3fb6fdf4a17b5e
UTF-8 쐛숷屑堯}쐛숷屑堯{^ 111011001001000010011011111011001000100010110111111001011011000110010001111001011010000010101111011111011110110010010000100110111110110010001000101101111110010110110001100100011110010110100000101011110111101101011110 ec909bec88b7e5b191e5a0af7dec909bec88b7e5b191e5a0af7b5e
UHC 쐛숷屑堯}쐛숷屑堯{^ 10011100100000011001101001001100111000001101101011101000111010110111110110011100100000011001101001001100111000001101101011101000111010110111101101011110 9c819a4ce0dae8eb7d9c819a4ce0dae8eb7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)