To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??臆??猥れ?嚥??臆??猥れ?^ 100110101000101100111111001111111000100110110000001111110011111111100000110011101000001011101010001111111001101010001011001111110011111110001001101100000011111100111111111000001100111010000010111010100011111101011110 9a8b3f3f89b03f3fe0ce82ea3f9a8b3f3f89b03f3fe0ce82ea3f5e
EUC-JP 嚥??臆??猥れ?嚥??臆??猥れ?^ 110100111110101100111111001111111011001010110010001111110011111111100000110100001010010011101100001111111101001111101011001111110011111110110010101100100011111100111111111000001101000010100100111011000011111101011110 d3eb3f3fb2b23f3fe0d0a4ec3fd3eb3f3fb2b23f3fe0d0a4ec3f5e
UTF-8 嚥잙젇臆띕젽猥れ뙣嚥잙젇臆띕젽猥れ뙟^ 11100101100110101010010111101100100111101001100111101100101000001000011111101000100001111000011011101011100111011001010111101100101000001011110111100111100011001010010111100011100000101000110011101011100110011010001111100101100110101010010111101100100111101001100111101100101000001000011111101000100001111000011011101011100111011001010111101100101000001011110111100111100011001010010111100011100000101000110011101011100110011001111101011110 e59aa5ec9e99eca087e88786eb9d95eca0bde78ca5e3828ceb99a3e59aa5ec9e99eca087e88786eb9d95eca0bde78ca5e3828ceb999f5e
UHC 嚥잙젇臆띕젽猥れ뙣嚥잙젇臆띕젽猥れ뙟^ 11100110101111111001111111101011101000001000101011100101111001101011011011101011101000001010111111101000111001011010101011101100100011001010100011100110101111111001111111101011101000001000101011100101111001101011011011101011101000001010111111101000111001011010101011101100100011001010010001011110 e6bf9feba08ae5e6b6eba0afe8e5aaec8ca8e6bf9feba08ae5e6b6eba0afe8e5aaec8ca45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)