To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??+???猷ユ?B 00111111001111111000000101111011001111110011111100111111100101110101000110000011100001100011111101000010 3f3f817b3f3f3f975183863f42
EUC-JP ??+孼??猷ユ?B 001111110011111110100001110111001000111110111010110000110011111100111111110011011011001010100101111001100011111101000010 3f3fa1dc8fbac33f3fcdb2a5e63f42
UTF-8 惡롫+孼싪솊猷ユ룠B 11101111101001101011100111101011101000011010101111101111101111001000101111100101101011011011110011101100100010111010101011101100100001101000101011100111100011001011011111100011100000111010011011101011101000111010000001000010 efa6b9eba1abefbc8be5adbcec8baaec868ae78cb7e383a6eba3a042
UHC 惡롫+孼싪솊猷ユ룠B 11100111111101111000111011101011101000111010101111100101111011011001101011101000100110011000111011101011101000111010101111100110100011111001101001000010 e7f78eeba3abe5ed9ae8998eeba3abe68f9a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)