What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ー霑溿ヲラホ}ー霑溿ヲラホ{^ 11110010101100111011000011101000101111111111101101001010111101111000111010100110111100011110000111010111110011100111110111110010101100111011000011101000101111111111101101001010111101111000111010100110111100011110000111010111110011100111101101011110 f2b3b0e8bffb4af78ea6f1e1d7ce7df2b3b0e8bffb4af78ea6f1e1d7ce7b5e
EUC-JP ?ー霑溿?ヲ?ラホ}?ー霑溿?ヲ?ラホ{^ 0011111110001110101100001111000011000001100011111100100010110001001111111000111010100110001111111000111011010111100011101100111001111101001111111000111010110000111100001100000110001111110010001011000100111111100011101010011000111111100011101101011110001110110011100111101101011110 3f8eb0f0c18fc8b13f8ea63f8ed78ece7d3f8eb0f0c18fc8b13f8ea63f8ed78ece7b5e
UTF-8 ー霑溿ヲラホ}ー霑溿ヲラホ{^ 111011101000011110101010111011111011110110110000111010011001110010010001111001101011101010111111111011101001010110110001111011111011110110100110111011101000010110011100111011111011111010010111111011111011111010001110011111011110111010000111101010101110111110111101101100001110100110011100100100011110011010111010101111111110111010010101101100011110111110111101101001101110111010000101100111001110111110111110100101111110111110111110100011100111101101011110 ee87aaefbdb0e99c91e6babfee95b1efbda6ee859cefbe97efbe8e7dee87aaefbdb0e99c91e6babfee95b1efbda6ee859cefbe97efbe8e7b5e
UHC ??霑??????}??霑??????{^ 0011111100111111111011111100010100111111001111110011111100111111001111110011111101111101001111110011111111101111110001010011111100111111001111110011111100111111001111110111101101011110 3f3fefc53f3f3f3f3f3f7d3f3fefc53f3f3f3f3f3f7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
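
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table, but individual results may differ from this page because each converter uses its own mapping tables.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, then join them.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ{^"):
        print(label, bits, hex_string)
```

ASCII characters such as `{` and `^` encode to the same bytes (`7b`, `5e`) in all five charsets, which is why every row of the table ends in `7b5e`.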