What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8諭よ?純??檍??愿??猷??阿??逸 1110000110011111001111111000001001010111100101110100000010000010111001100011111110001111100000110011111100111111100111101111100000111111001111111001110011000011001111110011111110010111010100010011111100111111100010001010001000111111001111111000100011101101 e19f3f8257974082e63f8f833f3f9ef83f3f9cc33f3f97513f3f88a23f3f88ed
EUC-JP 癲?8諭よ?純??檍??愿??猷??阿??逸 1110001010100001001111111010001110111000110011011010000110100100111010000011111110111101111000110011111100111111110111001111101000111111001111111101100011000101001111110011111111001101101100100011111100111111101100001010010000111111001111111011000011101111 e2a13fa3b8cda1a4e83fbde33f3fdcfa3f3fd8c53f3fcdb23f3fb0a43f3fb0ef
UTF-8 癲쒕8諭よ땻純쏇맧檍욍꺁愿뚳쬅猷붽틙阿쇺벂逸 111001111001100110110010111011001001001010010101111011111011110010011000111010001010101110101101111000111000001010001000111010111001010110111011111001111011010010010100111011001000111110000111111010111010011110100111111001101010101010001101111011001001101010001101111010101011101010000001111001101000010010111111111010111001101010110011111011001010110010000101111001111000110010110111111010111011011010111101111011011000101110011001111010011001100010111111111011001000011110111010111010111011001010000010111010011000000010111000 e799b2ec9295efbc98e8abade38288eb95bbe7b494ec8f87eba7a7e6aa8dec9a8deaba81e684bfeb9ab3ecac85e78cb7ebb6bded8b99e998bfec87baebb282e980b8
UHC 癲쒕8諭よ땻純쏇맧檍욍꺁愿뚳쬅猷붽틙阿쇺벂逸 1110111110100110100111001110101110100011101110001110101110110001101010101110100010001011100100011110001011101101100110111110110110010000101100001110010111100101101111111110001110000011101010101110101010110100100011001110111110100110100111001110101110100011100101001110101010111010100001101110010010111001100110011110001010010011101010001110110011101111 efa69ceba3b8ebb1aae88b91e2ed9bed90b0e5e5bfe383aaeab48cefa69ceba394eaba86e4b999e293a8ecef

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
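
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `encode_to_bits` and the mapping of the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Map the charset labels used in the table to Python codec names.
# The pairing is an assumption: CP932 for SJIS-WIN, CP949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters that the codec cannot represent become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a sample character.
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_to_bits("あ", codec)
    print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.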