To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN ?洩洩?洩洩B 0011111110001001011010111000100101101011001111111000100101101011100010010110101101000010 3f896b896b3f896b896b42
EUC-JP ?洩洩?洩洩B 0011111110110001110011001011000111001100001111111011000111001100101100011100110001000010 3fb1ccb1cc3fb1ccb1cc42
UTF-8 蟬洩洩蟬洩洩B 11101000100111111010110011100110101101001010100111100110101101001010100111101000100111111010110011100110101101001010100111100110101101001010100101000010 e89face6b4a9e6b4a9e89face6b4a9e6b4a942
UHC 蟬洩洩蟬洩洩B 11100000110100011110000011011101111000001101110111100000110100011110000011011101111000001101110101000010 e0d1e0dde0dde0d1e0dde0dd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)