To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 佯??鍮??幽??嚥△??⑨?怨??鵝 100110001101000100111111001111111110100001001010001111110011111110010111010010000011111100111111100110101000101110000001101000100011111100111111100001110100100000111111100010011000010100111111001111111110101001000000 98d13f3fe84a3f3f97483f3f9a8b81a23f3f87483f89853f3fea40
EUC-JP 佯??鍮??幽??嚥△?沅??怨??鵝 11010000110100110011111100111111111011111010101100111111001111111100110110101001001111110011111111010011111010111010001010100100001111111000111111000110111010010011111100111111101100011110010100111111001111111111001110100001 d0d33f3fefab3f3fcda93f3fd3eba2a43f8fc6e93f3fb1e53f3ff3a1
UTF-8 佯몃돆鍮뽩첎幽뚯춹嚥△벀沅⑨쭪怨ㅼ춳鵝 111001001011110110101111111010111010101010000011111010111000111110000110111010011000110110101110111010111011110110101001111011001011001010001110111001011011100110111101111010111001101010101111111011001011011010111001111001011001101010100101111000101001011010110011111010111011001010000000111001101011001010000101111000101001000110101000111011001010110110101010111001101000000010101000111000111000010110111100111011001011011010110011111010011011010110011101 e4bdafebaa83eb8f86e98daeebbda9ecb28ee5b9bdeb9aafecb6b9e59aa5e296b3ebb280e6b285e291a8ecadaae680a8e385bcecb6b3e9b59d
UHC 佯몃돆鍮뽩첎幽뚯춹嚥△벀沅⑨쭪怨ㅼ춳鵝 1110010110111010101110001110101110001001100101111110101110111001100101101110010110101010100110111110101011101011100011001110110010101101100101011110011010111111101000011110001010010011101001101110101010110110101010001110111110100111100111101110101010110011101001001110110010101101100011111110010010111101 e5bab8eb8997ebb996e5aa9beaeb8cecad95e6bfa1e293a6eab6a8efa79eeab3a4ecad8fe4bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)