To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?鋼????鋼???^ 00111111100011010111110000111111001111110011111100111111100011010111110000111111001111110011111101011110 3f8d7c3f3f3f3f8d7c3f3f3f5e
EUC-JP ?鋼????鋼???^ 00111111101110011101110100111111001111110011111100111111101110011101110100111111001111110011111101011110 3fb9dd3f3f3f3fb9dd3f3f3f5e
UTF-8 뤶鋼쭗흐섐뤶鋼쭗흐섐^ 11101011101001001011011011101001100010111011110011101100101011011001011111101101100111011001000011101100100001001001000011101011101001001011011011101001100010111011110011101100101011011001011111101101100111011001000011101100100001001001000001011110 eba4b6e98bbcecad97ed9d90ec8490eba4b6e98bbcecad97ed9d90ec84905e
UHC 뤶鋼쭗흐섐뤶鋼쭗흐섐^ 100011111110010011001011101111001010011110001111110010001110010110111100101010111000111111100100110010111011110010100111100011111100100011100101101111001010101101011110 8fe4cbbca78fc8e5bcab8fe4cbbca78fc8e5bcab5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)