To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴨??援ц?怨??永 100010101001101100111111001111111000100110000111100001001000100000111111100010011000010100111111001111111000100101101001 8a9b3f3f898784883f89853f3f8969
EUC-JP 鴨??援ц?怨??永 101100111111101100111111001111111011000111100111101001111110100000111111101100011110010100111111001111111011000111001010 b3fb3f3fb1e7a7e83fb1e53f3fb1ca
UTF-8 鴨뱀옊援ц쵟怨꾧뉘永 1110100110110100101010001110101110110001100000001110110010011000100010101110011010001111101101001101000110000110111011001011010110011111111001101000000010101000111010101011111010100111111010111000100110011000111001101011000010111000 e9b4a8ebb180ec988ae68fb4d186ecb59fe680a8eabea7eb8998e6b0b8
UHC 鴨뱀옊援ц쵟怨꾧뉘永 1110010011100101101110011110110010011110100100101110101010110101101011001110100010101100101000001110101010110011100001001110101010110100101101011110011110110101 e4e5b9ec9e92eab5ace8aca0eab384eab4b5e7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)