To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 陷域ぁ萓幢セゑスー}陷域ぁ萓幢セゑスー{^ 111010001001110010001000111001101000001010011111111001001011111010011011111011111011111010000010111011111011110110110000011111011110100010011100100010001110011010000010100111111110010010111110100110111110111110111110100000101110111110111101101100000111101101011110 e89c88e6829fe4be9befbe82efbdb07de89c88e6829fe4be9befbe82efbdb07b5e
EUC-JP 陷域ぁ萓幢セゑスー}陷域ぁ萓幢セゑスー{^ 111011111111110010110000111010001010010010100001111010001100000011010110111100011000111010111110101001001111000110001110101111011000111010110000011111011110111111111100101100001110100010100100101000011110100011000000110101101111000110001110101111101010010011110001100011101011110110001110101100000111101101011110 effcb0e8a4a1e8c0d6f18ebea4f18ebd8eb07deffcb0e8a4a1e8c0d6f18ebea4f18ebd8eb07b5e
UTF-8 陷域ぁ萓幢セゑスー}陷域ぁ萓幢セゑスー{^ 111010011001100110110111111001011001111110011111111000111000000110000001111010001001000010010011111001011011100110100010111011111011110110111110111000111000001010010001111011111011110110111101111011111011110110110000011111011110100110011001101101111110010110011111100111111110001110000001100000011110100010010000100100111110010110111001101000101110111110111101101111101110001110000010100100011110111110111101101111011110111110111101101100000111101101011110 e999b7e59f9fe38181e89093e5b9a2efbdbee38291efbdbdefbdb07de999b7e59f9fe38181e89093e5b9a2efbdbee38291efbdbdefbdb07b5e
UHC 陷域ぁ?幢?ゑ??}陷域ぁ?幢?ゑ??{^ 11111001111010001110011010110100101010101010000100111111110100111101001100111111101010101111000100111111001111110111110111111001111010001110011010110100101010101010000100111111110100111101001100111111101010101111000100111111001111110111101101011110 f9e8e6b4aaa13fd3d33faaf13f3f7df9e8e6b4aaa13fd3d33faaf13f3f7b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
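
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `CODECS` and `encode_table` are hypothetical names chosen for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# Python's codecs may differ from the page's converter in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ{^").items():
        print(label, bits, hex_string)
```

Each row is built from the encoded bytes alone, so the binary and hexadecimal columns always describe the same byte sequence in two notations.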