What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 貉俶ィー鬯假セ搾セ孺貉俶ィー鬯假セ搾セ學^ 111001101011100110011000111001101010100010110000111010011010110010011000111011111011111010001101111011111011111010011011011111011110011010111001100110001110011010101000101100001110100110101100100110001110111110111110100011011110111110111110100110110111101101011110 e6b998e6a8b0e9ac98efbe8defbe9b7de6b998e6a8b0e9ac98efbe8defbe9b7b5e
EUC-JP 貉俶ィー鬯假セ搾セ孺貉俶ィー鬯假セ搾セ學^ 1110110010111011110100001110100010001110101010001000111010110000111100101010111011010000111100011000111010111110101110101111000110001110101111101101010111011110111011001011101111010000111010001000111010101000100011101011000011110010101011101101000011110001100011101011111010111010111100011000111010111110110101011101110001011110 ecbbd0e88ea88eb0f2aed0f18ebebaf18ebed5deecbbd0e88ea88eb0f2aed0f18ebebaf18ebed5dc5e
UTF-8 貉俶ィー鬯假セ搾セ孺貉俶ィー鬯假セ搾セ學^ 11101000101100101000100111100100101111111011011011101111101111011010100011101111101111011011000011101001101011001010111111100101100000011000011111101111101111011011111011100110100100001011111011101111101111011011111011100101101011011011101011101000101100101000100111100100101111111011011011101111101111011010100011101111101111011011000011101001101011001010111111100101100000011000011111101111101111011011111011100110100100001011111011101111101111011011111011100101101011011011100001011110 e8b289e4bfb6efbda8efbdb0e9acafe58187efbdbee690beefbdbee5adbae8b289e4bfb6efbda8efbdb0e9acafe58187efbdbee690beefbdbee5adb85e
UHC ?????假?搾?孺?????假?搾?學^ 001111110011111100111111001111110011111111001010101000110011111111110011101101100011111111101010111010000011111100111111001111110011111100111111110010101010001100111111111100111011011000111111111110011100101001011110 3f3f3f3f3fcaa33ff3b63feae83f3f3f3f3fcaa33ff3b63ff9ca5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
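
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names in `CHARSETS` are assumptions about how the table's labels map onto Python's codecs (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper `to_bit_strings` is a hypothetical name. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is Python's cp949 codec
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ^"  # hypothetical input: one hiragana letter plus an ASCII caret
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

The ASCII character `^` encodes to the single byte `5e` in every charset listed, which is why each hexadecimal string in the table ends in `5e`.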