To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃?????碎??嚥〓?猷??怨???μ? 10010111100000000011111100111111001111110011111100111111111000011110101000111111001111111001101010001011100000011010110000111111100101110101000100111111001111111000100110000101001111110011111100111111100000111100101000111111 97803f3f3f3f3fe1ea3f3f9a8b81ac3f97513f3f89853f3f3f83ca3f
EUC-JP 沃??堉??碎??嚥〓?猷??怨???μ? 110011011110000000111111001111111000111110110111111111010011111100111111111000101110110000111111001111111101001111101011101000101010111000111111110011011011001000111111001111111011000111100101001111110011111100111111101001101100110000111111 cde03f3f8fb7fd3f3fe2ec3f3fd3eba2ae3fcdb23f3fb1e53f3f3fa6cc3f
UTF-8 沃욌쪇堉먲쭒碎ⓦ럶嚥〓뀍猷뗧넭怨뤄폍若μ윫 1110011010110010100000111110110010011010100011001110110010101010100001111110010110100000100010011110101110101000101100101110110010101101100100101110011110100010100011101110001010010011101001101110101110011111101101101110010110011010101001011110001110000000100100111110101110000000100011011110011110001100101101111110101110010111101001111110101110000100101011011110011010000000101010001110101110100100100001001110110110001111100011011110111110100101101101001100111010111100111011001001110010101011 e6b283ec9a8cecaa87e5a089eba8b2ecad92e7a28ee293a6eb9fb6e59aa5e38093eb808de78cb7eb97a7eb84ade680a8eba484ed8f8defa5b4cebcec9cab
UHC 沃욌쪇堉먲쭒碎ⓦ럶嚥〓뀍猷뗧넭怨뤄폍若μ윫 111010001010101010011110111010111010010110000001111010111011110010010000111011111010011110001010111000011110111110101000111000111000111010010101111001101011111110100001111010111000010110001000111010111010001110001011111001111000011010101100111010101011001110110111111011111011110010011000111001011010111010100101111011001001111110101010 e8aa9eeba581ebbc90efa78ae1efa8e38e95e6bfa1eb8588eba38be786aceab3b7efbc98e5aea5ec9faa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)