What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????}??????????{^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101111101001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN テつ堙つ淌つ咼テつ妝}テつ堙つ淌つ咼テつ妝{^ 110000111000001011000010100110101100001110000010110000101001111111000011100000101100001010011010010001001100001110000010110000101001101101000010011111011100001110000010110000101001101011000011100000101100001010011111110000111000001011000010100110100100010011000011100000101100001010011011010000100111101101011110 c382c29ac382c29fc382c29a44c382c29b427dc382c29ac382c29fc382c29a44c382c29b427b5e
EUC-JP テつ堙つ淌つ咼テつ妝}テつ堙つ淌つ咼テつ妝{^ 10001110110000111010010011000100110101001100010110100100110001001101111011000101101001001100010011010011101001011000111011000011101001001100010011010101101000110111110110001110110000111010010011000100110101001100010110100100110001001101111011000101101001001100010011010011101001011000111011000011101001001100010011010101101000110111101101011110 8ec3a4c4d4c5a4c4dec5a4c4d3a58ec3a4c4d5a37d8ec3a4c4d4c5a4c4dec5a4c4d3a58ec3a4c4d5a37b5e
UTF-8 テつ堙つ淌つ咼テつ妝}テつ堙つ淌つ咼テつ妝{^ 111011111011111010000011111000111000000110100100111001011010000010011001111000111000000110100100111001101011011110001100111000111000000110100100111001011001001010111100111011111011111010000011111000111000000110100100111001011010011010011101011111011110111110111110100000111110001110000001101001001110010110100000100110011110001110000001101001001110011010110111100011001110001110000001101001001110010110010010101111001110111110111110100000111110001110000001101001001110010110100110100111010111101101011110 efbe83e381a4e5a099e381a4e6b78ce381a4e592bcefbe83e381a4e5a69d7defbe83e381a4e5a099e381a4e6b78ce381a4e592bcefbe83e381a4e5a69d7b5e
UHC ?つ?つ?つ??つ?}?つ?つ?つ??つ?{^ 00111111101010101100010000111111101010101100010000111111101010101100010000111111001111111010101011000100001111110111110100111111101010101100010000111111101010101100010000111111101010101100010000111111001111111010101011000100001111110111101101011110 3faac43faac43faac43f3faac43f7d3faac43faac43faac43f3faac43f7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
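
The conversion shown in the table can be approximated with a minimal Python sketch. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the names CODECS, bit_strings, and convert; the page's own converter may use a different implementation and may differ on edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The original converter's implementation is not known; this is a sketch.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # Characters the charset cannot represent are encoded as "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset of the table and collect the bit strings."""
    return {label: bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (binary, hexadecimal) in convert("つ{^").items():
        print(label, binary, hexadecimal)
```

The input "つ{^" is taken from the characters that appear in the table, so the hexadecimal output can be compared against the corresponding byte sequences in the rows above (for example e381a4 for つ in UTF-8 and 7b5e for the trailing {^).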