To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?l?誼??貫誼??貫誼??肯宜??巡??伍 0011111110000010100011000011111110001011011000100011111100111111100010101101000110001011011000100011111100111111100010101101000110001011011000100011111100111111100011010110110110001011010110000011111100111111100011111000010000111111001111111000110011011110 3f828c3f8b623f3f8ad18b623f3f8ad18b623f3f8d6d8b583f3f8f843f3f8cde
EUC-JP 渶l?誼??貫誼??貫誼??肯宜??巡??伍 10001111110001111110110110100011111011000011111110110101110000110011111100111111101101001101001110110101110000110011111100111111101101001101001110110101110000110011111100111111101110011100111010110101101110010011111100111111101111011110010000111111001111111011100011100000 8fc7eda3ec3fb5c33f3fb4d3b5c33f3fb4d3b5c33f3fb9ceb5b93f3fbde43f3fb8e0
UTF-8 渶l쉶誼숁굝貫誼삥굝貫誼삣맫肯宜삼쭓巡볦졎伍 111001101011100010110110111011111011110110001100111011001000100110110110111010001010101010111100111011001000100010000001111010101011010110011101111010001011001010101011111010001010101010111100111011001000001010100101111010101011010110011101111010001011001010101011111010001010101010111100111011001000001010100011111010111010011110101011111010001000001010101111111001011010111010011100111011001000001010111100111011001010110110010011111001011011011110100001111010111011001110100110111011001010000110001110111001001011110010001101 e6b8b6efbd8cec89b6e8aabcec8881eab59de8b2abe8aabcec82a5eab59de8b2abe8aabcec82a3eba7abe882afe5ae9cec82bcecad93e5b7a1ebb3a6eca18ee4bc8d
UHC 渶l쉶誼숁굝貫誼삥굝貫誼삣맫肯宜삼쭓巡볦졎伍 1110011110110111101000111110110010011010100011001110101111111110100110011110011010000010100001011100111010111011111010111111111010111011111001101000001010000101110011101011101111101011111111101011101111100101100100001011001111010000111010011110101111110001101110111110111110100111100010111110001011011110100100111110110010100000101110111110011111101010 e7b7a3ec9a8cebfe99e68285cebbebfebbe68285cebbebfebbe590b3d0e9ebf1bbefa78be2de93eca0bbe7ea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)