To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霑・繧画コ夜b谿芽ソ・籵画コ冶烽譌ャ^ 11101000101111111010010111100011100000101000100111100110101110101001011011101001100000101000001011100110101011101000100111101000101111111010010111100010111000001000100111100110101110101001011011101000111000001000001011100110100101111010110001011110 e8bfa5e38289e6ba96e98282e6ae89e8bfa5e2e089e6ba96e8e082e697ac5e
EUC-JP 霑・繧画コ夜b谿芽ソ・籵画コ冶烽譌ャ^ 11110000110000011000111010100101111001011110001010110010111010001000111010111010110011001110101110100011111000101110110010110000101100101110101010001110101111111000111010100101111001001110001010110010111010001000111010111010110011001110101011011111111000101110101111110111100011101010110001011110 f0c18ea5e5e2b2e88ebacceba3e2ecb0b2ea8ebf8ea5e4e2b2e88ebacceadfe2ebf78eac5e
UTF-8 霑・繧画コ夜b谿芽ソ・籵画コ冶烽譌ャ^ 11101001100111001001000111101111101111011010010111100111101110011010011111100111100101001011101111101111101111011011101011100101101001001001110011101111101111011000001011101000101100001011111111101000100010101011110111101111101111011011111111101111101111011010010111100111101100011011010111100111100101001011101111101111101111011011101011100101100001101011011011100111100000111011110111101000101011011000110011101111101111011010110001011110 e99c91efbda5e7b9a7e794bbefbdbae5a49cefbd82e8b0bfe88abdefbdbfefbda5e7b1b5e794bbefbdbae586b6e783bde8ad8cefbdac5e
UHC 霑????夜b谿芽?????冶烽??^ 1110111111000101001111110011111100111111001111111110010110101000101000111110001011001101101011001110010010110100001111110011111100111111001111110011111111100101101001111101110011101011001111110011111101011110 efc53f3f3f3fe5a8a3e2cdace4b43f3f3f3f3fe5a7dceb3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)