To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN テョツステ古仰湘静ーvテョツステ古仰湘静ーvB 11000011101011101100001010111101110000111000110011000011100010111100001010001111110000111001000011000011101100000111011011000011101011101100001010111101110000111000110011000011100010111100001010001111110000111001000011000011101100000111011001000010 c3aec2bdc38cc38bc28fc390c3b076c3aec2bdc38cc38bc28fc390c3b07642
EUC-JP テョツステ古仰湘静ーvテョツステ古仰湘静ーvB 10001110110000111000111010101110100011101100001010001110101111011000111011000011101110001100010110110110110001001011111011000101110000001100010110001110101100000111011010001110110000111000111010101110100011101100001010001110101111011000111011000011101110001100010110110110110001001011111011000101110000001100010110001110101100000111011001000010 8ec38eae8ec28ebd8ec3b8c5b6c4bec5c0c58eb0768ec38eae8ec28ebd8ec3b8c5b6c4bec5c0c58eb07642
UTF-8 テョツステ古仰湘静ーvテョツステ古仰湘静ーvB 111011111011111010000011111011111011110110101110111011111011111010000010111011111011110110111101111011111011111010000011111001011000111110100100111001001011101110110000111001101011100110011000111010011001110110011001111011111011110110110000011101101110111110111110100000111110111110111101101011101110111110111110100000101110111110111101101111011110111110111110100000111110010110001111101001001110010010111011101100001110011010111001100110001110100110011101100110011110111110111101101100000111011001000010 efbe83efbdaeefbe82efbdbdefbe83e58fa4e4bbb0e6b998e99d99efbdb076efbe83efbdaeefbe82efbdbdefbe83e58fa4e4bbb0e6b998e99d99efbdb07642
UHC ?????古仰湘??v?????古仰湘??vB 0011111100111111001111110011111100111111110011011010111111100100111001101101111111001111001111110011111101110110001111110011111100111111001111110011111111001101101011111110010011100110110111111100111100111111001111110111011001000010 3f3f3f3f3fcdafe4e6dfcf3f3f763f3f3f3f3fcdafe4e6dfcf3f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)