To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 形??型??亦??亦 1000110001100000001111110011111110001100010111100011111100111111100101101001001000111111001111111001011010010010 8c603f3f8c5e3f3f96923f3f9692
EUC-JP 形??型??亦??亦 1011011111000001001111110011111110110111101111110011111100111111110010111111001000111111001111111100101111110010 b7c13f3fb7bf3f3fcbf23f3fcbf2
UTF-8 形앾슈型됵슬亦샛씗亦 111001011011110110100010111011001001010110111110111011001000101010001000111001011001111010001011111010111001000010110101111011001000101010101100111001001011101010100110111011001000001110011011111011001001010010010111111001001011101010100110 e5bda2ec95beec8a88e59e8beb90b5ec8aace4baa6ec839bec9497e4baa6
UHC 形앾슈型됵슬亦샛씗亦 1111101110100001100111011110111110111101101101001111101011111110100010011110111110111101101111011110011010110010101110111111101110011101101011001110011010110010 fba19defbdb4fafe89efbdbde6b2bbfb9dace6b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)