To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鬯ゥ蠅凪隗驍冗「托スュ螟願棺鞜擾スッB 11101001101011001010100111100101101000101001001111100010111010001010011111101001100000101000111111100111101000101001000111101111101111011010110111100101101001001000101011101000100010101011101111101000110111111000111111101111101111011010111101000010 e9aca9e5a293e2e8a7e9828fe7a291efbdade5a48ae88abbe8df8fefbdaf42
EUC-JP 鬯ゥ蠅凪隗驍冗「托スュ螟願棺鞜擾スッB 11110010101011101000111010101001111010101010010011000110111001001111000010101001111100011110001010111110111010011000111010100010110000101111000110001110101111011000111010101101111010101010011010110100111010101011010010111101111100001110000110111110111100011000111010111101100011101010111101000010 f2ae8ea9eaa4c6e4f0a9f1e2bee98ea2c2f18ebd8eadeaa6b4eab4bdf0e1bef18ebd8eaf42
UTF-8 鬯ゥ蠅凪隗驍冗「托スュ螟願棺鞜擾スッB 11101001101011001010111111101111101111011010100111101000101000001000010111100101100001111010101011101001100110101001011111101001101010011000110111100101100001101001011111101111101111011010001011100110100010011001100011101111101111011011110111101111101111011010110111101000100111101001111111101001101000011001100011100110101000111011101011101001100111101001110011100110100100111011111011101111101111011011110111101111101111011010111101000010 e9acafefbda9e8a085e587aae99a97e9a98de58697efbda2e68998efbdbdefbdade89e9fe9a198e6a3bae99e9ce693beefbdbdefbdaf42
UHC ??蠅??驍冗?托??螟願棺?擾??B 001111110011111111100011101100100011111100111111111111011010010011101001101101110011111111110110111101010011111100111111110110011010110111101010110000111100111010110010001111111110100011110110001111110011111101000010 3f3fe3b23f3ffda4e9b73ff6f53f3fd9adeac3ceb23fe8f63f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)