Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 丈燻ワ上ソ鉦ヒ室簪酌丈燻ワ上ソ鉦ヒ室簪灼^ 1000111111100100111000001000111011011100100011111110001110111111100011111101111011001011100011101011101011100010110011111000111011011110100011111110010011100000100011101101110010001111111000111011111110001111110111101100101110001110101110101110001011001111100011101101110001011110 8fe4e08edc8fe3bf8fdecb8ebae2cf8ede8fe4e08edc8fe3bf8fdecb8ebae2cf8edc5e
EUC-JP 丈燻ワ上ソ鉦ヒ室簪酌丈燻ワ上ソ鉦ヒ室簪灼^ 1011111011100110110111111110111010001110110111001011111011100101100011101011111110111110111000001000111011001011101111001011110011100100110100011011110011100000101111101110011011011111111011101000111011011100101111101110010110001110101111111011111011100000100011101100101110111100101111001110010011010001101111001101111001011110 bee6dfee8edcbee58ebfbee08ecbbcbce4d1bce0bee6dfee8edcbee58ebfbee08ecbbcbce4d1bcde5e
UTF-8 丈燻ワ上ソ鉦ヒ室簪酌丈燻ワ上ソ鉦ヒ室簪灼^ 11100100101110001000100011100111100001111011101111101111101111101001110011100100101110001000101011101111101111011011111111101001100010011010011011101111101111101000101111100101101011101010010011100111101100001010101011101001100001011000110011100100101110001000100011100111100001111011101111101111101111101001110011100100101110001000101011101111101111011011111111101001100010011010011011101111101111101000101111100101101011101010010011100111101100001010101011100111100000011011110001011110 e4b888e787bbefbe9ce4b88aefbdbfe989a6efbe8be5aea4e7b0aae9858ce4b888e787bbefbe9ce4b88aefbdbfe989a6efbe8be5aea4e7b0aae781bc5e
UHC 丈燻?上?鉦?室簪酌丈燻?上?鉦?室簪灼^ 1110110111011011111111011011100000111111110111111011111000111111111011111111101000111111111000111111100011101101110110001110110111001100111011011101101111111101101110000011111111011111101111100011111111101111111110100011111111100011111110001110110111011000111011011100011101011110 eddbfdb83fdfbe3feffa3fe3f8edd8edcceddbfdb83fdfbe3feffa3fe3f8edd8edc75e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
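
The conversion shown in the table above can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation: the function name `encode_table` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents of the charsets listed. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does for ISO-8859-1.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as a hexadecimal string.
# Codec names are assumed equivalents of the charsets in the table.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("丈^"):
        print(name, bits, hexstr)
```

For the input "丈^", the UTF-8 row should come out as `e4b8885e` and the ISO-8859-1 row as `3f5e`, consistent with the first and last characters of the rows in the table.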