To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???維??惟?????溢??揄щ??l? 001111110011111100111111100010001101101100111111001111111000100011010010001111110011111100111111001111110011111110001000111011000011111100111111100111011000100110000100100010110011111100111111100000101000110000111111 3f3f3f88db3f3f88d23f3f3f3f3f88ec3f3f9d89848b3f3f828c3f
EUC-JP ???維??惟?????溢??揄щ??l? 001111110011111100111111101100001101110100111111001111111011000011010100001111110011111100111111001111110011111110110000111011100011111100111111110110011110100110100111111010110011111100111111101000111110110000111111 3f3f3fb0dd3f3fb0d43f3f3f3f3fb0ee3f3fd9e9a7eb3f3fa3ec3f
UTF-8 嶺뚢꽓維곻㎕惟㎯뀋亮쎈맩溢띄춯揄щ㎦力l뼗 1110111110100110101010111110101110011010101000101110101010111101100100111110011110110110101011011110101010110011101110111110001110001110100101011110011010000011100111111110001110001110101011111110101110000000100010111110111110100101101101111110110010001110100010001110101110100111101010011110011010111010101000101110101110011101100001001110110010110110101011111110011010001111100001001101000110001001111000111000111010100110111011111010011010001010111011111011110110001100111010111011110010010111 efa6abeb9aa2eabd93e7b6adeab3bbe38e95e6839fe38eafeb808befa5b7ec8e88eba7a9e6baa2eb9d84ecb6afe68f84d189e38ea6efa68aefbd8cebbc97
UHC 嶺뚢꽓維곻㎕惟㎯뀋亮쎈맩溢띄춯揄щ㎦力l뼗 111001111010110110001100111000101000010010100010111010111010101110000001111011111010011110100001111010101110111010100111111000111000010110000111111001011011100110111101111010111001000010110001111011001110111010110110111001111010110110001100111010101111000110101100111010111010011110101010111001101011001110100011111011001001011010011111 e7ad8ce284a2ebab81efa7a1eaeea7e38587e5b9bdeb90b1eceeb6e7ad8ceaf1aceba7aae6b3a3ec969f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)