To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦???額?????魏??????????? 11101001111100010011111100111111001111111000101001111010001111110011111100111111001111110011111111101001101100000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 e9f13f3f3f8a7a3f3f3f3f3fe9b03f3f3f3f3f3f3f3f3f3f3f
EUC-JP 鴦???額??彛??魏??????????? 111100101111001100111111001111110011111110110011110110110011111100111111100011111011110011111010001111110011111111110010101100100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 f2f33f3f3fb3db3f3f8fbcfa3f3ff2b23f3f3f3f3f3f3f3f3f3f3f
UTF-8 鴦꾆쇱쪠額됰씛彛볠틦魏뉖룴嶺뚮뿪留륅쭩硫⑸연 111010011011010010100110111010101011111010000110111011001000011110110001111011001010101010100000111010011010000110001101111010111001000010110000111011001001010010011011111001011011110110011011111010111011001110100000111011011000101110100110111010011010110110001111111010111000100110010110111010111010001110110100111011111010011010101011111010111001101010101110111010111011111110101010111011111010011110001101111010111010010110000101111011001010110110101001111011111010011110001110111000101001000110111000111011001001011110110000 e9b4a6eabe86ec87b1ecaaa0e9a18deb90b0ec949be5bd9bebb3a0ed8ba6e9ad8feb8996eba3b4efa6abeb9aaeebbfaaefa78deba585ecada9efa78ee291b8ec97b0
UHC 鴦꾆쇱쪠額됰씛彛볠틦魏뉖룴嶺뚮뿪留륅쭩硫⑸연 1110010011101100100001001100111010111100111011001010010110011001111001001111111010001001111010111001110110110000111011001010110110010011111001101011101010010000111010101110000010000111111010111000111110101001111001111010110110001100111010111001011110101010111010111010011110001111111011111010011110011101111010111010100110101001111010111011111110101100 e4ec84cebceca599e4fe89eb9db0ecad93e6ba90eae087eb8fa9e7ad8ceb97aaeba78fefa79deba9a9ebbfac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)