To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 襍ヲ螳檗襍ヲ螳檗B 111010001011010110100110111001011010111010011111010000001110100010110101101001101110010110101110100111110100000001000010 e8b5a6e5ae9f40e8b5a6e5ae9f4042
EUC-JP 襍ヲ螳檗襍ヲ螳檗B 1111000010110111100011101010011011101010101100001101110110100001111100001011011110001110101001101110101010110000110111011010000101000010 f0b78ea6eab0dda1f0b78ea6eab0dda142
UTF-8 襍ヲ螳檗襍ヲ螳檗B 11101000101001011000110111101111101111011010011011101000100111101011001111100110101010101001011111101000101001011000110111101111101111011010011011101000100111101011001111100110101010101001011101000010 e8a58defbda6e89eb3e6aa97e8a58defbda6e89eb3e6aa9742
UHC ??螳檗??螳檗B 00111111001111111101001111011001110110111111110000111111001111111101001111011001110110111111110001000010 3f3fd3d9dbfc3f3fd3d9dbfc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)