To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲λ?諭℡?應??癲j?依?┘???櫻 1110000110011111100000111100100100111111100101110100000010000111100001000011111110011100111001000011111100111111111000011001111110000010100010100011111110001000110010110011111110000100101000110011111100111111001111111001111101001110 e19f83c93f974087843f9ce43f3fe19f828a3f88cb3f84a33f3f3f9f4e
EUC-JP 癲λ?諭??應??癲j?依?┘???櫻 11100010101000011010011011001011001111111100110110100001001111110011111111011000111001100011111100111111111000101010000110100011111010100011111110110000110011010011111110101000101001010011111100111111001111111101110110101111 e2a1a6cb3fcda13f3fd8e63f3fe2a1a3ea3fb0cd3fa8a53f3f3fddaf
UTF-8 癲λ쉘諭℡렚應뱀몜癲j난依욑┘戮⑸괍櫻 1110011110011001101100101100111010111011111011001000100110011000111010001010101110101101111000101000010010100001111010111010000010011010111001101000011110001001111010111011000110000000111010111010101010011100111001111001100110110010111011111011110110001010111010111000001010011100111001001011111010011101111011001001101010010001111000101001010010011000111011111010011110010010111000101001000110111000111010101011010010001101111001101010101110111011 e799b2cebbec8998e8abade284a1eba09ae68789ebb180ebaa9ce799b2efbd8aeb829ce4be9dec9a91e29498efa792e291b8eab48de6abbb
UHC 癲λ쉘諭℡렚應뱀몜癲j난依욑┘戮⑸괍櫻 1110111110100110101001011110101110111101101010011110101110110001101000101110010110001110101011011110101111101011101110011110110010010001100010101110111110100110101000111110101010110011101011011110101111101110100111101110111110100110101001011110101110111101101010011110101110110001101000101110010110100001 efa6a5ebbda9ebb1a2e58eadebebb9ec918aefa6a3eab3adebee9eefa6a5ebbda9ebb1a2e5a1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)