What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????U}??????????U{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010101010111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 症?梓基??泣???U}症?梓基??泣???U{^ 100011111100011100111111100010001011001010001010111011100011111100111111100010111000001100111111001111110011111101010101011111011000111111000111001111111000100010110010100010101110111000111111001111111000101110000011001111110011111100111111010101010111101101011110 8fc73f88b28aee3f3f8b833f3f3f557d8fc73f88b28aee3f3f8b833f3f3f557b5e
EUC-JP 症?梓基??泣???U}症?梓基??泣???U{^ 101111101100100100111111101100001011010010110100111100000011111100111111101101011110001100111111001111110011111101010101011111011011111011001001001111111011000010110100101101001111000000111111001111111011010111100011001111110011111100111111010101010111101101011110 bec93fb0b4b4f03f3fb5e33f3f3f557dbec93fb0b4b4f03f3fb5e33f3f3f557b5e
UTF-8 症렜梓基렰렒泣닿렓렮U}症렜梓基렰렒泣닿렓렮U{^ 1110011110010111100001111110101110100000100111001110011010100010100100111110010110011111101110101110101110100000101100001110101110100000100100101110011010110011101000111110101110001011101111111110101110100000100100111110101110100000101011100101010101111101111001111001011110000111111010111010000010011100111001101010001010010011111001011001111110111010111010111010000010110000111010111010000010010010111001101011001110100011111010111000101110111111111010111010000010010011111010111010000010101110010101010111101101011110 e79787eba09ce6a293e59fbaeba0b0eba092e6b3a3eb8bbfeba093eba0ae557de79787eba09ce6a293e59fbaeba0b0eba092e6b3a3eb8bbfeba093eba0ae557b5e
UHC 症렜梓基렰렒泣닿렓렮U}症렜梓基렰렒泣닿렓렮U{^ 111100011111100010001110101011101110111010101001110100001111000110001110101111011000111010100111111010111110100010110100111010101000111010101000100011101011101101010101011111011111000111111000100011101010111011101110101010011101000011110001100011101011110110001110101001111110101111101000101101001110101010001110101010001000111010111011010101010111101101011110 f1f88eaeeea9d0f18ebd8ea7ebe8b4ea8ea88ebb557df1f88eaeeea9d0f18ebd8ea7ebe8b4ea8ea88ebb557b5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
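
A table like the one above can be approximated with a few lines of Python. This is only a sketch, not the page's own implementation, which is not shown here. The function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?" to mirror the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = (
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
)


def encode_table(text, charsets=CHARSETS):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent, instead of raising UnicodeEncodeError.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    # "\u75c7" is the first character of the sample row above.
    for label, shown, bits, hexed in encode_table("\u75c7U}"):
        print(label, bits, hexed)
```

The binary column is just each byte written as eight bits, so it is always four times as long as the hexadecimal column.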