To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼??濡レ?潁??艤?????筌??伊 0011111100111111001111111000101101100010001111110011111110010100010001111000001110001100001111111001111111110001001111110011111111100100011111100011111100111111001111110011111100111111111000101010001100111111001111111000100011001001 3f3f3f8b623f3f9447838c3f9ff13f3fe47e3f3f3f3f3fe2a33f3f88c9
EUC-JP ??Ŧ誼??濡レ?潁??艤??洧??筌??伊 001111110011111110001111101010011010111110110101110000110011111100111111110001111010100010100101111011000011111111011110111100110011111100111111111001111101111100111111001111111000111111000111101101000011111100111111111001001010010100111111001111111011000011001011 3f3f8fa9afb5c33f3fc7a8a5ec3fdef33f3fe7df3f3f8fc7b43f3fe4a53f3fb0cb
UTF-8 樂낅Ŧ誼숅턁濡レ졎潁딉퐞艤섓쭓洧뱀퐡筌뚮뛾伊 1110111110100110101111111110101110000010100001011100010110100110111010001010101010111100111011001000100010000101111011011000010010000001111001101011111110100001111000111000001110101100111011001010000110001110111001101011110110000001111010111001010010001001111011011001000010011110111010001000100110100100111011001000010010010011111011001010110110010011111001101011010010100111111010111011000110000000111011011001000010100001111001111010110110001100111010111001101010101110111010111001101110111110111001001011110010001010 efa6bfeb8285c5a6e8aabcec8885ed8481e6bfa1e383aceca18ee6bd81eb9489ed909ee889a4ec8493ecad93e6b4a7ebb180ed90a1e7ad8ceb9aaeeb9bbee4bc8a
UHC 樂낅Ŧ誼숅턁濡レ졎潁딉퐞艤섓쭓洧뱀퐡筌뚮뛾伊 1110100011111001100001011110101110101000101011101110101111111110100110011110100110110101100111011110101110100001101010111110110010100000101110111110011110111000100010101110111110111101100001111110101111111010100110001110111110100111100010111110101011111011101110011110110010111101100010101110111110100111100011001110101110001101100001001110110010100101 e8f985eba8aeebfe99e9b59deba1abeca0bbe7b88aefbd87ebfa98efa78beafbb9ecbd8aefa78ceb8d84eca5

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
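
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the converter's own implementation. The codec names below (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions about which Python codecs correspond to the table's labels, and the helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), which matches the runs of 3f in the rows above, although individual mappings may differ from the converter's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("伊").items():
        print(label, bits, hex_string)
```

For the single character 伊, the UTF-8 result is e4bc8a, the SJIS-WIN result is 88c9, and the EUC-JP result is b0cb, which agree with the trailing bytes of the corresponding rows in the table.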