To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??飮??癒ъ?狎??泣?Ⅴ???畑 111000111010000000111111001111111001111101011010001111110011111110010110111111001000010010001100001111111110000010111110001111110011111110001011100000110011111110000111010110000011111100111111001111111001010010101000 e3a03f3f9f5a3f3f96fc848c3fe0be3f3f8b833f87583f3f3f94a8
EUC-JP 罌??飮??癒ъ?狎??泣??洧??畑 11100110101000100011111100111111110111011011101100111111001111111100110011111110101001111110110000111111111000001100000000111111001111111011010111100011001111110011111110001111110001111011010000111111001111111100100010101010 e6a23f3fddbb3f3fccfea7ec3fe0c03f3fb5e33f3f8fc7b43f3fc8aa
UTF-8 罌삠룇飮긺독癒ъ럞狎녿씛泣곻Ⅴ洧븍늼畑 1110011110111101100011001110110010000010101000001110101110100011100001111110100110100011101011101110101010111000101110101110101110001111100001011110011110011001100100101101000110001010111010111001111110011110111001111000101110001110111010111000010110111111111011001001010010011011111001101011001110100011111010101011001110111011111000101000010110100100111001101011010010100111111010111011100010001101111010111000101010111100111001111001010110010001 e7bd8cec82a0eba387e9a3aeeab8baeb8f85e79992d18aeb9f9ee78b8eeb85bfec949be6b3a3eab3bbe285a4e6b4a7ebb88deb8abce79591
UHC 罌삠룇飮긺독癒ъ럞狎녿씛泣곻Ⅴ洧븍늼畑 1110010110100010101110111110001110001111100001101110101111100110101100011110011110110101101101101110101110101000101011001110110010001110100000011110010011100100100001101110101110011101101100001110101111101000100000011110111110100101101101001110101011111011101110101110101110001000100001011110111110100101 e5a2bbe38f86ebe6b1e7b5b6eba8acec8e81e4e486eb9db0ebe881efa5b4eafbbaeb8885efa5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
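
A conversion like the one in the table can be roughly reproduced offline. The sketch below is only an illustration, not the page's actual implementation: the helper name `encode_table` is hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the single character あ, this yields `e38182` in UTF-8, `82a0` in SJIS-WIN, `a4a2` in EUC-JP, and `3f` in ISO-8859-1, since that charset has no Japanese characters.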