Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????????????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e |
| SJIS-WIN | 辷オ蜀咎エォ蠎願蕪辷オ蜀咎エォ蠎願争^ | 11100111100010001011010111100101100001101001100111101001101101001010101111100101101110101000101011101000100101011001001111100111100010001011010111100101100001101001100111101001101101001010101111100101101110101000101011101000100100011000100001011110 | e788b5e58699e9b4abe5ba8ae89593e788b5e58699e9b4abe5ba8ae891885e |
| EUC-JP | 辷オ蜀咎エォ蠎願蕪辷オ蜀咎エォ蠎願争^ | 11101101111010001000111010110101111010011110011011010010111010111000111010110100100011101010101111101010101111001011010011101010110010011111001111101101111010001000111010110101111010011110011011010010111010111000111010110100100011101010101111101010101111001011010011101010110000011110100001011110 | ede88eb5e9e6d2eb8eb48eabeabcb4eac9f3ede88eb5e9e6d2eb8eb48eabeabcb4eac1e85e |
| UTF-8 | 辷オ蜀咎エォ蠎願蕪辷オ蜀咎エォ蠎願争^ | 11101000101111101011011111101111101111011011010111101000100111001000000011100101100100101000111011101111101111011011010011101111101111011010101111101000101000001000111011101001101000011001100011101000100101011010101011101000101111101011011111101111101111011011010111101000100111001000000011100101100100101000111011101111101111011011010011101111101111011010101111101000101000001000111011101001101000011001100011100100101110101000100101011110 | e8beb7efbdb5e89c80e5928eefbdb4efbdabe8a08ee9a198e895aae8beb7efbdb5e89c80e5928eefbdb4efbdabe8a08ee9a198e4ba895e |
| UHC | ??蜀咎???願蕪??蜀咎???願?^ | 0011111100111111111101011011100111001111101001000011111100111111001111111110101011000011110110011111001100111111001111111111010110111001110011111010010000111111001111110011111111101010110000110011111101011110 | 3f3ff5b9cfa43f3f3feac3d9f33f3ff5b9cfa43f3f3feac33f5e |

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
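
A conversion of this kind can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. The codec names (`cp932` for SJIS-Win, `euc_jp`, `cp949` for UHC) and the helper name `encode_rows` are assumptions made for the example. Characters that a charset cannot represent are replaced with `?` (hex `3f`), as in the ISO-8859-1 row. Other implementations may map some characters differently.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, shown text, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survived the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_rows("あ^"):
        print(*row)
```

Each byte becomes two hexadecimal digits and eight binary digits, so the binary column is always four times as long as the hexadecimal one.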