To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 儀??儀?????z儀??儀?????zB 10001011010101100011111100111111100010110101011000111111001111110011111100111111001111110111101010001011010101100011111100111111100010110101011000111111001111110011111100111111001111110111101001000010 8b563f3f8b563f3f3f3f3f7a8b563f3f8b563f3f3f3f3f7a42
EUC-JP 儀??儀?????z儀??儀?????zB 10110101101101110011111100111111101101011011011100111111001111110011111100111111001111110111101010110101101101110011111100111111101101011011011100111111001111110011111100111111001111110111101001000010 b5b73f3fb5b73f3f3f3f3f7ab5b73f3fb5b73f3f3f3f3f7a42
UTF-8 儀붾젣儀붾젫溜딅줃z儀붾젣儀붾젫溜딅줃zB 111001011000010010000000111010111011011010111110111011001010000010100011111001011000010010000000111010111011011010111110111011001010000010101011111011111010011110001011111010111001010010000101111011001010010010000011011110101110010110000100100000001110101110110110101111101110110010100000101000111110010110000100100000001110101110110110101111101110110010100000101010111110111110100111100010111110101110010100100001011110110010100100100000110111101001000010 e58480ebb6beeca0a3e58480ebb6beeca0abefa78beb9485eca4837ae58480ebb6beeca0a3e58480ebb6beeca0abefa78beb9485eca4837a42
UHC 儀붾젣儀붾젫溜딅줃z儀붾젣儀붾젫溜딅줃zB 111010111111000010010100111010111010000010011100111010111111000010010100111010111010000010100011111010101111111010001010111010111010000110011010011110101110101111110000100101001110101110100000100111001110101111110000100101001110101110100000101000111110101011111110100010101110101110100001100110100111101001000010 ebf094eba09cebf094eba0a3eafe8aeba19a7aebf094eba09cebf094eba0a3eafe8aeba19a7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)