What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????瀕?????????瀕???^ 0011111100111111001111110011111100111111001111111001010101101101001111110011111100111111001111110011111100111111001111110011111100111111100101010110110100111111001111110011111101011110 3f3f3f3f3f3f956d3f3f3f3f3f3f3f3f3f956d3f3f3f5e
EUC-JP 侄?????瀕???侄?????瀕???^ 100011111011000011111110001111110011111100111111001111110011111111001001110011100011111100111111001111111000111110110000111111100011111100111111001111110011111100111111110010011100111000111111001111110011111101011110 8fb0fe3f3f3f3f3fc9ce3f3f3f8fb0fe3f3f3f3f3fc9ce3f3f3f5e
UTF-8 侄롖뤰탮컣툘瀕렒롅롚侄롖뤰탮컣툘瀕렒롅롘^ 11100100101111101000010011101011101000011001011011101011101001001011000011101101100000111010111011101100101110111010001111101101100010001001100011100111100000001001010111101011101000001001001011101011101000011000010111101011101000011001101011100100101111101000010011101011101000011001011011101011101001001011000011101101100000111010111011101100101110111010001111101101100010001001100011100111100000001001010111101011101000001001001011101011101000011000010111101011101000011001100001011110 e4be84eba196eba4b0ed83aeecbba3ed8898e78095eba092eba185eba19ae4be84eba196eba4b0ed83aeecbba3ed8898e78095eba092eba185eba1985e
UHC 侄롖뤰탮컣툘瀕렒롅롚侄롖뤰탮컣툘瀕렒롅롘^ 1111001011101001100011101101101010001111110111101011010110001110101100001000111010111000100011111101111010110101100011101010011110001110110010111000111011011110111100101110100110001110110110101000111111011110101101011000111010110000100011101011100010001111110111101011010110001110101001111000111011001011100011101101110001011110 f2e98eda8fdeb58eb08eb88fdeb58ea78ecb8edef2e98eda8fdeb58eb08eb88fdeb58ea78ecb8edc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
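
The kind of conversion shown in the table can be approximated with a few lines of Python. This is only a sketch and not the tool's actual implementation, which is not shown on this page. The `CODECS` mapping and the function names `to_bit_strings` and `encode_table` are assumptions made for illustration. It also assumes Python's `cp932` and `cp949` codecs are reasonable stand-ins for SJIS-WIN and UHC, and that unencodable characters become `?` (0x3f), as they appear to in the table above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes b"?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column. Results for characters outside ASCII may differ from the table where Python's codecs and the tool's converters disagree on a mapping.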