To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癰??濡??松?癰??濡??松 111000011001111000111111001111111001010001000111001111110011111110001111101111000011111111100001100111100011111100111111100101000100011100111111001111111000111110111100 e19e3f3f94473f3f8fbc3fe19e3f3f94473f3f8fbc
EUC-JP 癰??濡??松?癰??濡??松 111000011111111000111111001111111100011110101000001111110011111110111110101111100011111111100001111111100011111100111111110001111010100000111111001111111011111010111110 e1fe3f3fc7a83f3fbebe3fe1fe3f3fc7a83f3fbebe
UTF-8 癰잙젿濡뗪엥松쬱癰잙젿濡뗪엥松 111001111001100110110000111011001001111010011001111011001010000010111111111001101011111110100001111010111001011110101010111011001001011110100101111001101001110110111110111011001010110010110001111001111001100110110000111011001001111010011001111011001010000010111111111001101011111110100001111010111001011110101010111011001001011110100101111001101001110110111110 e799b0ec9e99eca0bfe6bfa1eb97aaec97a5e69dbeecacb1e799b0ec9e99eca0bfe6bfa1eb97aaec97a5e69dbe
UHC 癰잙젿濡뗪엥松쬱癰잙젿濡뗪엥松 111010001011100110011111111010111010000010110001111010111010000110001011111010101011111110101000111000011110011010100111011010001110100010111001100111111110101110100000101100011110101110100001100010111110101010111111101010001110000111100110 e8b99feba0b1eba18beabfa8e1e6a768e8b99feba0b1eba18beabfa8e1e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)