To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔????縡?儀??兢??製????? 100100101010001000111111001111110011111100111111111000110111000100111111100010110101011000111111001111111001100101011101001111110011111110010000101110110011111100111111001111110011111100111111 92a23f3f3f3fe3713f8b563f3f995d3f3f90bb3f3f3f3f3f
EUC-JP 弔?勖?汶縡?儀??兢??製?勖??? 110001001010010000111111100011111011001111101101001111111000111111000110111001011110010111010010001111111011010110110111001111110011111111010001101111100011111100111111110000001011110100111111100011111011001111101101001111110011111100111111 c4a43f8fb3ed3f8fc6e5e5d23fb5b73f3fd1be3f3fc0bd3f8fb3ed3f3f3f
UTF-8 弔렲勖쾌汶縡렕儀븀렚兢렏렕製렩勖렢亐렕 111001011011110010010100111010111010000010110010111001011000101110010110111011001011111010001100111001101011000110110110111001111011100010100001111010111010000010010101111001011000010010000000111010111011100010000000111010111010000010011010111001011000010110100010111010111010000010001111111010111010000010010101111010001010001110111101111010111010000010101001111001011000101110010110111010111010000010100010111001001011101010010000111010111010000010010101 e5bc94eba0b2e58b96ecbe8ce6b1b6e7b8a1eba095e58480ebb880eba09ae585a2eba08feba095e8a3bdeba0a9e58b96eba0a2e4ba90eba095
UHC 弔렲勖쾌汶縡렕儀븀렚兢렏렕製렩勖렢亐렕 1111000011000000100011101011111111101001111011011100010011101000110110101010000111101110101011011000111010101010111010111111000010111010111001111000111010101101110100001110011110001110101001011000111010101010111100001011001010001110101101111110100111101101100011101011001111101010101001111000111010101010 f0c08ebfe9edc4e8daa1eead8eaaebf0bae78eadd0e78ea58eaaf0b28eb7e9ed8eb3eaa78eaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)