What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN テ淌氾古乏テ淌氾古某[テ淌氾古乏テ淌氾古某[^ 110000111001111111000011100101001100001110001100110000111001011001010010110000111001111111000011100101001100001110001100110000111001011001011110010110111100001110011111110000111001010011000011100011001100001110010110010100101100001110011111110000111001010011000011100011001100001110010110010111100101101101011110 c39fc394c38cc39652c39fc394c38cc3965e5bc39fc394c38cc39652c39fc394c38cc3965e5b5e
EUC-JP テ淌氾古乏テ淌氾古某[テ淌氾古乏テ淌氾古某[^ 10001110110000111101111011000101110010001100010110111000110001011100101110110011100011101100001111011110110001011100100011000101101110001100010111001011101111110101101110001110110000111101111011000101110010001100010110111000110001011100101110110011100011101100001111011110110001011100100011000101101110001100010111001011101111110101101101011110 8ec3dec5c8c5b8c5cbb38ec3dec5c8c5b8c5cbbf5b8ec3dec5c8c5b8c5cbb38ec3dec5c8c5b8c5cbbf5b5e
UTF-8 テ淌氾古乏テ淌氾古某[テ淌氾古乏テ淌氾古某[^ 111011111011111010000011111001101011011110001100111001101011000010111110111001011000111110100100111001001011100110001111111011111011111010000011111001101011011110001100111001101011000010111110111001011000111110100100111001101001111110010000010110111110111110111110100000111110011010110111100011001110011010110000101111101110010110001111101001001110010010111001100011111110111110111110100000111110011010110111100011001110011010110000101111101110010110001111101001001110011010011111100100000101101101011110 efbe83e6b78ce6b0bee58fa4e4b98fefbe83e6b78ce6b0bee58fa4e69f905befbe83e6b78ce6b0bee58fa4e4b98fefbe83e6b78ce6b0bee58fa4e69f905b5e
UHC ??氾古乏??氾古某[??氾古乏??氾古某[^ 0011111100111111110110111111000011001101101011111111100110111001001111110011111111011011111100001100110110101111110110011011101101011011001111110011111111011011111100001100110110101111111110011011100100111111001111111101101111110000110011011010111111011001101110110101101101011110 3f3fdbf0cdaff9b93f3fdbf0cdafd9bb5b3f3fdbf0cdaff9b93f3fdbf0cdafd9bb5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
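
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_in_charsets` and the mapping of labels to Python codec names are assumptions made for illustration: SJIS-WIN is mapped to `cp932` as the note above states, and UHC is assumed to correspond to `cp949`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the `3f` bytes in the table reflect.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("古").items():
        print(label, bits, hex_string)
```

For the single character 古, this should give `e58fa4` for UTF-8, `8cc3` for SJIS-WIN, and `b8c5` for EUC-JP, the same byte sequences that appear for that character in the rows above.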