To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓?????矣??艶c?椅?┐誘??域 111010001011110100111111001111110011111100111111001111111110000111100001001111110011111110001001100100001000001010000011001111111000100011010110001111111000010010100010100101110101010100111111001111111000100011100110 e8bd3f3f3f3f3fe1e13f3f899082833f88d63f84a297553f3f88e6
EUC-JP 霓??沅??矣??艶c?椅?┐誘??域 1111000010111111001111110011111110001111110001101110100100111111001111111110001011100011001111110011111110110001111100001010001111100011001111111011000011011000001111111010100010100100110011011011011000111111001111111011000011101000 f0bf3f3f8fc6e93f3fe2e33f3fb1f0a3e33fb0d83fa8a4cdb63f3fb0e8
UTF-8 霓얠떝沅좑㎖矣섎짎艶c끇椅좑┐誘좊쳴域 111010011001110010010011111011001001011010100000111010111001011010011101111001101011001010000101111011001010001010010001111000111000111010010110111001111001111110100011111011001000010010001110111011001010011110001110111010001000100110110110111011111011110110000011111010111000000110000111111001101010010010000101111011001010001010010001111000101001010010010000111010001010101010011000111011001010001010001010111011001011001110110100111001011001111110011111 e99c93ec96a0eb969de6b285eca291e38e96e79fa3ec848eeca78ee889b6efbd83eb8187e6a485eca291e29490e8aa98eca28aecb3b4e59f9f
UHC 霓얠떝沅좑㎖矣섎짎艶c끇椅좑┐誘좊쳴域 1110011111100111101111101110110010001011101100111110101010110110101000001110111110100111101000101110101111111000100110001110101110100011100110101110011011111101101000111110001110000101101110111110101111110101101000001110111110100110101001001110101110101111101000001110101110101011100101111110011010110100 e7e7beec8bb3eab6a0efa7a2ebf898eba39ae6fda3e385bbebf5a0efa6a4ebafa0ebab97e6b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)