To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 妺七霄ース示七妺七霄ース示ユソ 1111101010100101100011101011010111101000101110101011000011110001100011101011110110001110101001101111000111100111100011101011010111111010101001011000111010110101111010001011101010110000111100011000111010111101100011101010011011110001111001111101010110111111 faa58eb5e8bab0f18ebd8ea6f1e78eb5faa58eb5e8bab0f18ebd8ea6f1e7d5bf
EUC-JP 妺七霄ー?ス示?七妺七霄ー?ス示?ユソ 100011111011100110110111101111001011011111110000101111001000111010110000001111111000111010111101101111001010100000111111101111001011011110001111101110011011011110111100101101111111000010111100100011101011000000111111100011101011110110111100101010000011111110001110110101011000111010111111 8fb9b7bcb7f0bc8eb03f8ebdbca83fbcb78fb9b7bcb7f0bc8eb03f8ebdbca83f8ed58ebf
UTF-8 妺七霄ース示七妺七霄ース示ユソ 111001011010011010111010111001001011100010000011111010011001110010000100111011111011110110110000111011101000010010001001111011111011110110111101111001111010010010111010111011101000010110100010111001001011100010000011111001011010011010111010111001001011100010000011111010011001110010000100111011111011110110110000111011101000010010001001111011111011110110111101111001111010010010111010111011101000010110100010111011111011111010010101111011111011110110111111 e5a6bae4b883e99c84efbdb0ee8489efbdbde7a4baee85a2e4b883e5a6bae4b883e99c84efbdb0ee8489efbdbde7a4baee85a2efbe95efbdbf
UHC ?七????示?七?七????示??? 001111111111011011010010001111110011111100111111001111111110001111000110001111111111011011010010001111111111011011010010001111110011111100111111001111111110001111000110001111110011111100111111 3ff6d23f3f3f3fe3c63ff6d23ff6d23f3f3f3fe3c63f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)