What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??濡?????鵝 11101010010000000011111100111111100101000100011100111111001111110011111100111111001111111110101001000000 ea403f3f94473f3f3f3f3fea40
EUC-JP 鵝??濡?????鵝 11110011101000010011111100111111110001111010100000111111001111110011111100111111001111111111001110100001 f3a13f3fc7a83f3f3f3f3ff3a1
UTF-8 鵝롫젲濡뗫뙎溜곕젺鵝 111010011011010110011101111010111010000110101011111011001010000010110010111001101011111110100001111010111001011110101011111010111001100110001110111011111010011110001011111010101011001110010101111011001010000010111010111010011011010110011101 e9b59deba1abeca0b2e6bfa1eb97abeb998eefa78beab395eca0bae9b59d
UHC 鵝롫젲濡뗫뙎溜곕젺鵝 1110010010111101100011101110101110100000101001101110101110100001100010111110101110001100100100111110101011111110101100001110101110100000101011011110010010111101 e4bd8eeba0a6eba18beb8c93eafeb0eba0ade4bd

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
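
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the mapping from the charset labels above to Python codec names (for example SJIS-WIN to "cp932" and UHC to "cp949") and the helper name to_bit_strings are assumptions. With errors="replace", Python substitutes "?" (byte 3f) for any character the target charset cannot represent, which resembles the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper: characters that the codec cannot represent
    are replaced with "?" (byte 0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample input.
    sample = "あ"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output row follows the column order of the table: charset, character, bit string in binary, bit string in hexadecimal.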