To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 獰??壓??姙??? 11100000110101100011111100111111100110101101100000111111001111111001101101001011001111110011111100111111 e0d63f3f9ad83f3f9b4b3f3f3f
EUC-JP 獰??壓??姙??? 11100000110110000011111100111111110101001101101000111111001111111101010110101100001111110011111100111111 e0d83f3fd4da3f3fd5ac3f3f3f
UTF-8 獰쎈젩壓꾨쑙姙방셀溜 111001111000110110110000111011001000111010001000111011001010000010101001111001011010001110010011111010101011111010101000111011001001000110011001111001011010011110011001111010111011000010101001111011001000010110000000111011111010011110001011 e78db0ec8e88eca0a9e5a393eabea8ec9199e5a799ebb0a9ec8580efa78b
UHC 獰쎈젩壓꾨쑙姙방셀溜 1110011110111110101111011110101110100000101000011110010011100010100001001110101110011100101110001110110011110101101110011110011010111100101111111110101011111110 e7bebdeba0a1e4e284eb9cb8ecf5b9e6bcbfeafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)