To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN テつ劾テつ佚δ陛つ劾テつ劾テつ佚δ陛つ劾^ 11000011100000101100001010001010010011101100001110000010110000101001100011000011100000111100001010010101110000111000001011000010100010100100111011000011100000101100001010001010010011101100001110000010110000101001100011000011100000111100001010010101110000111000001011000010100010100100111001011110 c382c28a4ec382c298c383c295c382c28a4ec382c28a4ec382c298c383c295c382c28a4e5e
EUC-JP テつ劾テつ佚δ陛つ劾テつ劾テつ佚δ陛つ劾^ 1000111011000011101001001100010010110011101011111000111011000011101001001100010011010000110001011010011011000100110010101100010110100100110001001011001110101111100011101100001110100100110001001011001110101111100011101100001110100100110001001101000011000101101001101100010011001010110001011010010011000100101100111010111101011110 8ec3a4c4b3af8ec3a4c4d0c5a6c4cac5a4c4b3af8ec3a4c4b3af8ec3a4c4d0c5a6c4cac5a4c4b3af5e
UTF-8 テつ劾テつ佚δ陛つ劾テつ劾テつ佚δ陛つ劾^ 1110111110111110100000111110001110000001101001001110010110001010101111101110111110111110100000111110001110000001101001001110010010111101100110101100111010110100111010011001100110011011111000111000000110100100111001011000101010111110111011111011111010000011111000111000000110100100111001011000101010111110111011111011111010000011111000111000000110100100111001001011110110011010110011101011010011101001100110011001101111100011100000011010010011100101100010101011111001011110 efbe83e381a4e58abeefbe83e381a4e4bd9aceb4e9999be381a4e58abeefbe83e381a4e58abeefbe83e381a4e4bd9aceb4e9999be381a4e58abe5e
UHC ?つ劾?つ佚δ陛つ劾?つ劾?つ佚δ陛つ劾^ 00111111101010101100010011111010101101100011111110101010110001001110110011101010101001011110010011111000110011101010101011000100111110101011011000111111101010101100010011111010101101100011111110101010110001001110110011101010101001011110010011111000110011101010101011000100111110101011011001011110 3faac4fab63faac4eceaa5e4f8ceaac4fab63faac4fab63faac4eceaa5e4f8ceaac4fab65e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
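
The same kind of conversion can be sketched in Python. This is only a minimal sketch, not the code behind this page: the names `CHARSETS`, `encode_row` and `encode_table` are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption that may differ from the converter in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the ISO-8859-1 row shows.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("つ^").items():
        print(label, bits, hex_string)
```

For example, `encode_row("^", "utf-8")` gives `("01011110", "5e")`, the same trailing byte that every row of the table ends with.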