To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN テ敕愿アテ愿ーテュテアテ愿ョテ、テキツァB 11000011100111011100001110011100110000111011000111000011100111001100001110110000110000111010110111000011101100011100001110011100110000111010111011000011101001001100001110110111110000101010011101000010 c39dc39cc3b1c39cc3b0c3adc3b1c39cc3aec3a4c3b7c2a742
EUC-JP テ敕愿アテ愿ーテュテアテ愿ョテ、テキツァB 1000111011000011110110101100010111011000110001011000111010110001100011101100001111011000110001011000111010110000100011101100001110001110101011011000111011000011100011101011000110001110110000111101100011000101100011101010111010001110110000111000111010100100100011101100001110001110101101111000111011000010100011101010011101000010 8ec3dac5d8c58eb18ec3d8c58eb08ec38ead8ec38eb18ec3d8c58eae8ec38ea48ec38eb78ec28ea742
UTF-8 テ敕愿アテ愿ーテュテアテ愿ョテ、テキツァB 11101111101111101000001111100110100101011001010111100110100001001011111111101111101111011011000111101111101111101000001111100110100001001011111111101111101111011011000011101111101111101000001111101111101111011010110111101111101111101000001111101111101111011011000111101111101111101000001111100110100001001011111111101111101111011010111011101111101111101000001111101111101111011010010011101111101111101000001111101111101111011011011111101111101111101000001011101111101111011010011101000010 efbe83e69595e684bfefbdb1efbe83e684bfefbdb0efbe83efbdadefbe83efbdb1efbe83e684bfefbdaeefbe83efbda4efbe83efbdb7efbe82efbda742
UHC ??愿??愿??????愿???????B 001111110011111111101010101101000011111100111111111010101011010000111111001111110011111100111111001111110011111111101010101101000011111100111111001111110011111100111111001111110011111101000010 3f3feab43f3feab43f3f3f3f3f3feab43f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)