To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ー霑溿ヲラモ[ー霑溿ヲラモ[^ 11110010101100111011000011101000101111111111101101001010111101111000111010100110111100011110000111010111110100110101101111110010101100111011000011101000101111111111101101001010111101111000111010100110111100011110000111010111110100110101101101011110 f2b3b0e8bffb4af78ea6f1e1d7d35bf2b3b0e8bffb4af78ea6f1e1d7d35b5e
EUC-JP ?ー霑溿?ヲ?ラモ[?ー霑溿?ヲ?ラモ[^ 0011111110001110101100001111000011000001100011111100100010110001001111111000111010100110001111111000111011010111100011101101001101011011001111111000111010110000111100001100000110001111110010001011000100111111100011101010011000111111100011101101011110001110110100110101101101011110 3f8eb0f0c18fc8b13f8ea63f8ed78ed35b3f8eb0f0c18fc8b13f8ea63f8ed78ed35b5e
UTF-8 ー霑溿ヲラモ[ー霑溿ヲラモ[^ 111011101000011110101010111011111011110110110000111010011001110010010001111001101011101010111111111011101001010110110001111011111011110110100110111011101000010110011100111011111011111010010111111011111011111010010011010110111110111010000111101010101110111110111101101100001110100110011100100100011110011010111010101111111110111010010101101100011110111110111101101001101110111010000101100111001110111110111110100101111110111110111110100100110101101101011110 ee87aaefbdb0e99c91e6babfee95b1efbda6ee859cefbe97efbe935bee87aaefbdb0e99c91e6babfee95b1efbda6ee859cefbe97efbe935b5e
UHC ??霑??????[??霑??????[^ 0011111100111111111011111100010100111111001111110011111100111111001111110011111101011011001111110011111111101111110001010011111100111111001111110011111100111111001111110101101101011110 3f3fefc53f3f3f3f3f3f5b3f3fefc53f3f3f3f3f3f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)