To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??鎰??矣??永 1110000010111110001111110011111111101000010011000011111100111111111000011110000100111111001111111000100101101001 e0be3f3fe84c3f3fe1e13f3f8969
EUC-JP 狎??鎰??矣??永 1110000011000000001111110011111111101111101011010011111100111111111000101110001100111111001111111011000111001010 e0c03f3fefad3f3fe2e33f3fb1ca
UTF-8 狎놁닂鎰쒐독矣먮굶永 111001111000101110001110111010111000011010000001111010111000101110000010111010011000111010110000111011001001001010010000111010111000111110000101111001111001111110100011111010111010100010101110111010101011010110110110111001101011000010111000 e78b8eeb8681eb8b82e98eb0ec9290eb8f85e79fa3eba8aeeab5b6e6b0b8
UHC 狎놁닂鎰쒐독矣먮굶永 1110010011100100100001101110110010001000100010111110110011110000100111001110011110110101101101101110101111111000100100001110101110110001101111101110011110110101 e4e486ec888becf09ce7b5b6ebf890ebb1bee7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)