To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?杖?贈?禹絲??縡?杖?贈?禹絲??^ 11100011011100010011111110001111111100010011111110010001101000010011111111100010010110101110001101001110001111110011111111100011011100010011111110001111111100010011111110010001101000010011111111100010010110101110001101001110001111110011111101011110 e3713f8ff13f91a13fe25ae34e3f3fe3713f8ff13f91a13fe25ae34e3f3f5e
EUC-JP 縡?杖?贈?禹絲??縡?杖?贈?禹絲??^ 11100101110100100011111110111110111100110011111111000010101000110011111111100011101110111110010110101111001111110011111111100101110100100011111110111110111100110011111111000010101000110011111111100011101110111110010110101111001111110011111101011110 e5d23fbef33fc2a33fe3bbe5af3f3fe5d23fbef33fc2a33fe3bbe5af3f3f5e
UTF-8 縡렭杖렱贈렱禹絲렋렠縡렭杖렱贈렱禹絲렋렟^ 11100111101110001010000111101011101000001010110111100110100111011001011011101011101000001011000111101000101101001000100011101011101000001011000111100111101001101011100111100111101101011011001011101011101000001000101111101011101000001010000011100111101110001010000111101011101000001010110111100110100111011001011011101011101000001011000111101000101101001000100011101011101000001011000111100111101001101011100111100111101101011011001011101011101000001000101111101011101000001001111101011110 e7b8a1eba0ade69d96eba0b1e8b488eba0b1e7a6b9e7b5b2eba08beba0a0e7b8a1eba0ade69d96eba0b1e8b488eba0b1e7a6b9e7b5b2eba08beba09f5e
UHC 縡렭杖렱贈렱禹絲렋렠縡렭杖렱贈렱禹絲렋렟^ 1110111010101101100011101011101011101101111010001000111010111110111100011111110010001110101111101110100111100000110111101110101010001110101000101000111010110001111011101010110110001110101110101110110111101000100011101011111011110001111111001000111010111110111010011110000011011110111010101000111010100010100011101011000001011110 eead8ebaede88ebef1fc8ebee9e0deea8ea28eb1eead8ebaede88ebef1fc8ebee9e0deea8ea28eb05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)