To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 藥??逸??癒ъ?也??藥??逸??癒ъ?也??B 1110010101011010001111110011111110001000111011010011111100111111100101101111110010000100100011000011111110010110111001110011111100111111111001010101101000111111001111111000100011101101001111110011111110010110111111001000010010001100001111111001011011100111001111110011111101000010 e55a3f3f88ed3f3f96fc848c3f96e73f3fe55a3f3f88ed3f3f96fc848c3f96e73f3f42
EUC-JP 藥??逸??癒ъ?也??藥??逸??癒ъ?也??B 1110100110111011001111110011111110110000111011110011111100111111110011001111111010100111111011000011111111001100111010010011111100111111111010011011101100111111001111111011000011101111001111110011111111001100111111101010011111101100001111111100110011101001001111110011111101000010 e9bb3f3fb0ef3f3fccfea7ec3fcce93f3fe9bb3f3fb0ef3f3fccfea7ec3fcce93f3f42
UTF-8 藥띲끏逸썹독癒ъ뿯也㏓콝藥띲끏逸썹독癒ъ뿯也㏓콝B 1110100010010111101001011110101110011101101100101110101110000001100011111110100110000000101110001110110010001101101110011110101110001111100001011110011110011001100100101101000110001010111010111011111110101111111001001011100110011111111000111000111110010011111011001011110110011101111010001001011110100101111010111001110110110010111010111000000110001111111010011000000010111000111011001000110110111001111010111000111110000101111001111001100110010010110100011000101011101011101111111010111111100100101110011001111111100011100011111001001111101100101111011001110101000010 e897a5eb9db2eb818fe980b8ec8db9eb8f85e79992d18aebbfafe4b99fe38f93ecbd9de897a5eb9db2eb818fe980b8ec8db9eb8f85e79992d18aebbfafe4b99fe38f93ecbd9d42
UHC 藥띲끏逸썹독癒ъ뿯也㏓콝藥띲끏逸썹독癒ъ뿯也㏓콝B 11100101101101111000110111100011100001011011111111101100111011111011110111100111101101011011011011101011101010001010110011101100100101111010111111100101101001011010011111101011101100011001010111100101101101111000110111100011100001011011111111101100111011111011110111100111101101011011011011101011101010001010110011101100100101111010111111100101101001011010011111101011101100011001010101000010 e5b78de385bfecefbde7b5b6eba8acec97afe5a5a7ebb195e5b78de385bfecefbde7b5b6eba8acec97afe5a5a7ebb19542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)