To what bit string is a character encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陷雁」ア迢ュ蟲オ蟄コ陷雁」ア迢ュ蟲オ蟄ク^ 11101000100111001000101011100101101000111011000111100111100010111010110111100101101100111011010111100101101011011011101011101000100111001000101011100101101000111011000111100111100010111010110111100101101100111011010111100101101011011011100001011110 e89c8ae5a3b1e78bade5b3b5e5adbae89c8ae5a3b1e78bade5b3b5e5adb85e
EUC-JP 陷雁」ア迢ュ蟲オ蟄コ陷雁」ア迢ュ蟲オ蟄ク^ 1110111111111100101101001110011110001110101000111000111010110001111011011110101110001110101011011110101010110101100011101011010111101010101011111000111010111010111011111111110010110100111001111000111010100011100011101011000111101101111010111000111010101101111010101011010110001110101101011110101010101111100011101011100001011110 effcb4e78ea38eb1edeb8eadeab58eb5eaaf8ebaeffcb4e78ea38eb1edeb8eadeab58eb5eaaf8eb85e
UTF-8 陷雁」ア迢ュ蟲オ蟄コ陷雁」ア迢ュ蟲オ蟄ク^ 11101001100110011011011111101001100110111000000111101111101111011010001111101111101111011011000111101000101111111010001011101111101111011010110111101000100111111011001011101111101111011011010111101000100111111000010011101111101111011011101011101001100110011011011111101001100110111000000111101111101111011010001111101111101111011011000111101000101111111010001011101111101111011010110111101000100111111011001011101111101111011011010111101000100111111000010011101111101111011011100001011110 e999b7e99b81efbda3efbdb1e8bfa2efbdade89fb2efbdb5e89f84efbdbae999b7e99b81efbda3efbdb1e8bfa2efbdade89fb2efbdb5e89f84efbdb85e
UHC 陷雁????蟲?蟄?陷雁????蟲?蟄?^ 1111100111101000111001001101001000111111001111110011111100111111111101011111100100111111111101101101111000111111111110011110100011100100110100100011111100111111001111110011111111110101111110010011111111110110110111100011111101011110 f9e8e4d23f3f3f3ff5f93ff6de3ff9e8e4d23f3f3f3ff5f93ff6de3f5e
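
A rough Python sketch of how rows like these could be computed. The codec names are Python's, and mapping them to the page's labels (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. The helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (3f), which is what the ISO-8859-1 row appears to show. Python's conversion tables may differ from this tool's for some characters.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ^").items():
        print(label, bits, hex_string)
```

The trailing `^` in the sample input is plain ASCII, so it encodes to the single byte 5e in every charset, as in the last byte of each row above.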

SJIS-Win, EUC-JP: Classic charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
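
Because the two charsets assign different bytes to the same character, a byte sequence only makes sense when decoded with the charset it was written in. A minimal Python sketch, assuming Python's `cp932` and `euc_jp` codecs stand in for SJIS-Win and EUC-JP (the helper name `decode_hex` is hypothetical):

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string using the given Python codec."""
    return bytes.fromhex(hex_string).decode(codec)


if __name__ == "__main__":
    # The hiragana "あ" is 82 a0 in CP932 and a4 a2 in EUC-JP.
    print(decode_hex("82a0", "cp932"))
    print(decode_hex("a4a2", "euc_jp"))
    # Reading the EUC-JP bytes as CP932 yields two half-width
    # punctuation marks instead of the original character.
    print(decode_hex("a4a2", "cp932"))
```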