What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 午??徇??譯??v午??徇??譯??vB 100011001101111100111111001111111001110001101101001111110011111111100110101000010011111100111111011101101000110011011111001111110011111110011100011011010011111100111111111001101010000100111111001111110111011001000010 8cdf3f3f9c6d3f3fe6a13f3f768cdf3f3f9c6d3f3fe6a13f3f7642
EUC-JP 午??徇??譯??v午??徇??譯??vB 101110001110000100111111001111111101011111001110001111110011111111101100101000110011111100111111011101101011100011100001001111110011111111010111110011100011111100111111111011001010001100111111001111110111011001000010 b8e13f3fd7ce3f3feca33f3f76b8e13f3fd7ce3f3feca33f3f7642
UTF-8 午댄뤃徇껊젳譯롫졁v午댄뤃徇껊젳譯롫졁vB 111001011000110110001000111010111000110010000100111010111010010010000011111001011011111010000111111010101011101110001010111011001010000010110011111010001010110110101111111010111010000110101011111011001010000110000001011101101110010110001101100010001110101110001100100001001110101110100100100000111110010110111110100001111110101010111011100010101110110010100000101100111110100010101101101011111110101110100001101010111110110010100001100000010111011001000010 e58d88eb8c84eba483e5be87eabb8aeca0b3e8adafeba1abeca18176e58d88eb8c84eba483e5be87eabb8aeca0b3e8adafeba1abeca1817642
UHC 午댄뤃徇껊젳譯롫졁v午댄뤃徇껊젳譯롫졁vB 111001111110110110110100111011011000111110110100111000101101111110000011111010111010000010100111111001101011101110001110111010111010000010110010011101101110011111101101101101001110110110001111101101001110001011011111100000111110101110100000101001111110011010111011100011101110101110100000101100100111011001000010 e7edb4ed8fb4e2df83eba0a7e6bb8eeba0b276e7edb4ed8fb4e2df83eba0a7e6bb8eeba0b27642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
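
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's actual implementation. The mapping from the charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) and the use of errors="replace" are assumptions, chosen because unencodable characters appear as "?" (hex 3f) in the table. The function name encode_to_bits is hypothetical.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.

# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?"
    (an assumption that mirrors the 3f bytes in the table).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "午B"  # one CJK character followed by one ASCII letter
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte becomes eight bits in the binary column and two hex digits in the hexadecimal column, so the binary string is always four times as long as the hex string.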