To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN セヘ社蕉ヤ舎症ヌ}vセヘ社蕉ヤ舎症ヌ}vB 1011111011001101100011101101000010001111110101001101010010001110110010011000111111000111110001110111110101110110101111101100110110001110110100001000111111010100110101001000111011001001100011111100011111000111011111010111011001000010 becd8ed08fd4d48ec98fc7c77d76becd8ed08fd4d48ec98fc7c77d7642
EUC-JP セヘ社蕉ヤ舎症ヌ}vセヘ社蕉ヤ舎症ヌ}vB 10001110101111101000111011001101101111001101001010111110110101101000111011010100101111001100101110111110110010011000111011000111011111010111011010001110101111101000111011001101101111001101001010111110110101101000111011010100101111001100101110111110110010011000111011000111011111010111011001000010 8ebe8ecdbcd2bed68ed4bccbbec98ec77d768ebe8ecdbcd2bed68ed4bccbbec98ec77d7642
UTF-8 セヘ社蕉ヤ舎症ヌ}vセヘ社蕉ヤ舎症ヌ}vB 1110111110111101101111101110111110111110100011011110011110100100101111101110100010010101100010011110111110111110100101001110100010001000100011101110011110010111100001111110111110111110100001110111110101110110111011111011110110111110111011111011111010001101111001111010010010111110111010001001010110001001111011111011111010010100111010001000100010001110111001111001011110000111111011111011111010000111011111010111011001000010 efbdbeefbe8de7a4bee89589efbe94e8888ee79787efbe877d76efbdbeefbe8de7a4bee89589efbe94e8888ee79787efbe877d7642
UHC ??社蕉??症?}v??社蕉??症?}vB 001111110011111111011110111001001111010110101111001111110011111111110001111110000011111101111101011101100011111100111111110111101110010011110101101011110011111100111111111100011111100000111111011111010111011001000010 3f3fdee4f5af3f3ff1f83f7d763f3fdee4f5af3f3ff1f83f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)