To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 渉ミ靍舅R渉ミ靍舅^[渉ミ靍舅R渉ミ靍舅^[^ 1000111111000010110100001111101111110000111001000110111001010010100011111100001011010000111110111111000011100100011011100101111001011011100011111100001011010000111110111111000011100100011011100101001010001111110000101101000011111011111100001110010001101110010111100101101101011110 8fc2d0fbf0e46e528fc2d0fbf0e46e5e5b8fc2d0fbf0e46e528fc2d0fbf0e46e5e5b5e
EUC-JP 渉ミ?舅R渉ミ?舅^[渉ミ?舅R渉ミ?舅^[^ 1011111011000100100011101101000000111111111001111100111101010010101111101100010010001110110100000011111111100111110011110101111001011011101111101100010010001110110100000011111111100111110011110101001010111110110001001000111011010000001111111110011111001111010111100101101101011110 bec48ed03fe7cf52bec48ed03fe7cf5e5bbec48ed03fe7cf52bec48ed03fe7cf5e5b5e
UTF-8 渉ミ靍舅R渉ミ靍舅^[渉ミ靍舅R渉ミ靍舅^[^ 11100110101110001000100111101111101111101001000011101001100111011000110111101000100010001000010101010010111001101011100010001001111011111011111010010000111010011001110110001101111010001000100010000101010111100101101111100110101110001000100111101111101111101001000011101001100111011000110111101000100010001000010101010010111001101011100010001001111011111011111010010000111010011001110110001101111010001000100010000101010111100101101101011110 e6b889efbe90e99d8de8888552e6b889efbe90e99d8de888855e5be6b889efbe90e99d8de8888552e6b889efbe90e99d8de888855e5b5e
UHC ???舅R???舅^[???舅R???舅^[^ 001111110011111100111111110011111100000001010010001111110011111100111111110011111100000001011110010110110011111100111111001111111100111111000000010100100011111100111111001111111100111111000000010111100101101101011110 3f3f3fcfc0523f3f3fcfc05e5b3f3f3fcfc0523f3f3fcfc05e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)