Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 巽端続誰遜贈巽端続誰遜贈B 10010010010001101001001001011011100100011011000110010010010011101001000110111011100100011010000110010010010001101001001001011011100100011011000110010010010011101001000110111011100100011010000101000010 9246925b91b1924e91bb91a19246925b91b1924e91bb91a142
EUC-JP 巽端続誰遜贈巽端続誰遜贈B 11000011101001111100001110111100110000101011001111000011101011111100001010111101110000101010001111000011101001111100001110111100110000101011001111000011101011111100001010111101110000101010001101000010 c3a7c3bcc2b3c3afc2bdc2a3c3a7c3bcc2b3c3afc2bdc2a342
UTF-8 巽端続誰遜贈巽端続誰遜贈B 11100101101101111011110111100111101010111010111111100111101101101001101011101000101010101011000011101001100000011001110011101000101101001000100011100101101101111011110111100111101010111010111111100111101101101001101011101000101010101011000011101001100000011001110011101000101101001000100001000010 e5b7bde7abafe7b69ae8aab0e9819ce8b488e5b7bde7abafe7b69ae8aab0e9819ce8b48842
UHC 巽端?誰遜贈巽端?誰遜贈B 1110000111011110110100111010111000111111111000101100000111100001111000011111000111111100111000011101111011010011101011100011111111100010110000011110000111100001111100011111110001000010 e1ded3ae3fe2c1e1e1f1fce1ded3ae3fe2c1e1e1f1fc42

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
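
The conversion this page performs can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The function name `encode_report` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary bit string, hex string)} for text.

    Hypothetical helper: characters that a charset cannot represent are
    replaced with '?' (byte 0x3f) instead of raising an error.
    """
    rows = {}
    for label, codec in CODECS.items():
        data = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_report("巽B").items():
        print(label, bits, hexstr)
```

Calling `encode_report` with a longer string gives one row per charset in the same binary and hexadecimal layout as the table. The results for the legacy charsets depend on the codec tables shipped with the Python build, so they may differ from this page in edge cases.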