What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????J}?????????J{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100101001111101001111110011111100111111001111110011111100111111001111110011111100111111010010100111101101011110 3f3f3f3f3f3f3f3f3f4a7d3f3f3f3f3f3f3f3f3f4a7b5e
SJIS-WIN 五??徇??倭??J}五??徇??倭??J{^ 1000110011011100001111110011111110011100011011010011111100111111100110000110000000111111001111110100101001111101100011001101110000111111001111111001110001101101001111110011111110011000011000000011111100111111010010100111101101011110 8cdc3f3f9c6d3f3f98603f3f4a7d8cdc3f3f9c6d3f3f98603f3f4a7b5e
EUC-JP 五??徇??倭??J}五??徇??倭??J{^ 1011100011011110001111110011111111010111110011100011111100111111110011111100000100111111001111110100101001111101101110001101111000111111001111111101011111001110001111110011111111001111110000010011111100111111010010100111101101011110 b8de3f3fd7ce3f3fcfc13f3f4a7db8de3f3fd7ce3f3fcfc13f3f4a7b5e
UTF-8 五밧릦徇쒒릶倭며퉵J}五밧릦徇쒒릶倭며퉵J{^ 1110010010111010100101001110101110110000101001111110101110100110101001101110010110111110100001111110110010010010100100101110101110100110101101101110010110000000101011011110101110101001101100001110110110001001101101010100101001111101111001001011101010010100111010111011000010100111111010111010011010100110111001011011111010000111111011001001001010010010111010111010011010110110111001011000000010101101111010111010100110110000111011011000100110110101010010100111101101011110 e4ba94ebb0a7eba6a6e5be87ec9292eba6b6e580adeba9b0ed89b54a7de4ba94ebb0a7eba6a6e5be87ec9292eba6b6e580adeba9b0ed89b54a7b5e
UHC 五밧릦徇쒒릶倭며퉵J}五밧릦徇쒒릶倭며퉵J{^ 1110011111101001101110011110010110010000100010001110001011011111100111001110100110010000100101001110100011011110101110001110011110111001100011010100101001111101111001111110100110111001111001011001000010001000111000101101111110011100111010011001000010010100111010001101111010111000111001111011100110001101010010100111101101011110 e7e9b9e59088e2df9ce99094e8deb8e7b98d4a7de7e9b9e59088e2df9ce99094e8deb8e7b98d4a7b5e
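
The rows above can be approximated offline with a short Python sketch. It assumes that Python's codec names `cp932` and `cp949` correspond to SJIS-WIN and UHC respectively, and that characters a charset cannot represent are replaced with `?` (0x3f), as the `3f` bytes in the table suggest. The function name `encode_table` is hypothetical, and this is not necessarily how the converter itself is implemented.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for each unencodable character.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("五J}"):
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal one.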

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).