To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蟾舌o莠「蟾絶熄荵凡蟾舌o莠「蟾絶熄荵本^ 111001011011011110010000111000111000001010001111111001001011101010100010111001011011011110010000111000101110000010001111111001001011100110010110011111011110010110110111100100001110001110000010100011111110010010111010101000101110010110110111100100001110001011100000100011111110010010111001100101100111101101011110 e5b790e3828fe4baa2e5b790e2e08fe4b9967de5b790e3828fe4baa2e5b790e2e08fe4b9967b5e
EUC-JP 蟾舌o莠「蟾絶熄荵凡蟾舌o莠「蟾絶熄荵本^ 1110101010111001110000001110010110100011111011111110100010111100100011101010001011101010101110011100000011100100110111111110111111101000101110111100101111011110111010101011100111000000111001011010001111101111111010001011110010001110101000101110101010111001110000001110010011011111111011111110100010111011110010111101110001011110 eab9c0e5a3efe8bc8ea2eab9c0e4dfefe8bbcbdeeab9c0e5a3efe8bc8ea2eab9c0e4dfefe8bbcbdc5e
UTF-8 蟾舌o莠「蟾絶熄荵凡蟾舌o莠「蟾絶熄荵本^ 11101000100111111011111011101000100010001000110011101111101111011000111111101000100011101010000011101111101111011010001011101000100111111011111011100111101101011011011011100111100001101000010011101000100011011011010111100101100001111010000111101000100111111011111011101000100010001000110011101111101111011000111111101000100011101010000011101111101111011010001011101000100111111011111011100111101101011011011011100111100001101000010011101000100011011011010111100110100111001010110001011110 e89fbee8888cefbd8fe88ea0efbda2e89fbee7b5b6e78684e88db5e587a1e89fbee8888cefbd8fe88ea0efbda2e89fbee7b5b6e78684e88db5e69cac5e
UHC 蟾舌o??蟾絶熄?凡蟾舌o??蟾絶熄?本^ 1110000011101010111000001101111110100011111011110011111100111111111000001110101011101111101111101110001111011000001111111101101111101101111000001110101011100000110111111010001111101111001111110011111111100000111010101110111110111110111000111101100000111111110111001110001001011110 e0eae0dfa3ef3f3fe0eaefbee3d83fdbede0eae0dfa3ef3f3fe0eaefbee3d83fdce25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)