What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????­? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010110100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fad3f
SJIS-WIN ?????ぜ矣??阿??油??柔ろ?鈺??釗 001111110011111100111111001111110011111110000010101110101110000111100001001111110011111110001000101000100011111100111111100101101111101100111111001111111000111101011111100000101110101100111111111110111100010000111111001111111111101110111011 3f3f3f3f3f82bae1e13f3f88a23f3f96fb3f3f8f5f82eb3ffbc43f3ffbbb
EUC-JP ???靷?ぜ矣??阿??油??柔ろ?鈺??釗 00111111001111110011111110001111111001111011110100111111101001001011110011100010111000110011111100111111101100001010010000111111001111111100110011111101001111110011111110111101110000001010010011101101001111111000111111100011110101010011111100111111100011111110001110100110 3f3f3f8fe7bd3fa4bce2e33f3fb0a43f3fccfd3f3fbdc0a4ed3f8fe3d53f3f8fe3a6
UTF-8 嶺뚮뿫靷뽬ぜ矣몄춷阿숇벊油롧춯柔ろ돪鈺곕­釗 1110111110100110101010111110101110011010101011101110101110111111101010111110100110011101101101111110101110111101101011001110001110000001100111001110011110011111101000111110101110101010100001001110110010110110101101111110100110011000101111111110110010001000100001111110101110110010100010101110011010110010101110011110101110100001101001111110110010110110101011111110011010011111100101001110001110000010100011011110101110001111101010101110100110001000101110101110101010110011100101011100001010101101111010011000011110010111 efa6abeb9aaeebbfabe99db7ebbdace3819ce79fa3ebaa84ecb6b7e998bfec8887ebb28ae6b2b9eba1a7ecb6afe69f94e3828deb8faae988baeab395c2ade98797
UHC 嶺뚮뿫靷뽬ぜ矣몄춷阿숇벊油롧춯柔ろ돪鈺곕­釗 1110011110101101100011001110101110010111101010111110110011100110100101101110100010101010101111001110101111111000101110001110110010101101100100111110010010111001100110011110101110010011101011011110101011111010100011101110011110101101100011001110101011110101101010101110110110001001101011011110100010101101101100001110101110100001101010011110000111110010 e7ad8ceb97abece696e8aabcebf8b8ecad93e4b999eb93adeafa8ee7ad8ceaf5aaed89ade8adb0eba1a9e1f2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
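
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's own implementation (which is not shown here). The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("ぜ").items():
        print(label, bits, hex_string)
```

For the single character `ぜ`, this sketch yields `e3819c` for UTF-8, `82ba` for SJIS-WIN, and `a4bc` for EUC-JP, the same byte sequences that appear for that character in the corresponding table rows. Results for other characters may differ from the tool's where its converter and Python's codecs disagree on vendor-specific mappings.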