Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 襍ヲ譚楢伏迴撮 11101000101101011010011011100110100111011001001111101000100101011001101011100111100011111000111001000010 e8b5a6e69d93e8959ae78f8e42
EUC-JP 襍ヲ譚楢伏迴撮 1111000010110111100011101010011011101011111111011100011011101010110010011111101011101101111011111011101110100011 f0b78ea6ebfdc6eac9faedefbba3
UTF-8 襍ヲ譚楢伏迴撮 111010001010010110001101111011111011110110100110111010001010110110011010111001101010010110100010111001001011110010001111111010001011111110110100111001101001001010101110 e8a58defbda6e8ad9ae6a5a2e4bc8fe8bfb4e692ae
UHC ??譚楢伏?撮 0011111100111111110100111100100111101010111110011101110011010001001111111111010111001001 3f3fd3c9eaf9dcd13ff5c9
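
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 and UHC rows above.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for `text` encoded in `charset`."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("あ", codec)
    print(label, binary, hexadecimal)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.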

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).