To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 渦??擁?塋ょク癌?[渦??擁?塋ょク癌?[^ 1000100101010001001111110011111110010111011010010011111110011010110010001000001011100101100000110100111010001010111000000011111101011011100010010101000100111111001111111001011101101001001111111001101011001000100000101110010110000011010011101000101011100000001111110101101101011110 89513f3f97693f9ac882e5834e8ae03f5b89513f3f97693f9ac882e5834e8ae03f5b5e
EUC-JP 渦??擁?塋ょク癌?[渦??擁?塋ょク癌?[^ 1011000110110010001111110011111111001101110010100011111111010100110010101010010011100111101001011010111110110100111000100011111101011011101100011011001000111111001111111100110111001010001111111101010011001010101001001110011110100101101011111011010011100010001111110101101101011110 b1b23f3fcdca3fd4caa4e7a5afb4e23f5bb1b23f3fcdca3fd4caa4e7a5afb4e23f5b5e
UTF-8 渦겼뜕擁쿐塋ょク癌큒[渦겼뜕擁쿐塋ょク癌큒[^ 111001101011100010100110111010101011001010111100111010111001110010010101111001101001001110000001111011001011111110010000111001011010000110001011111000111000001010000111111000111000001010101111111001111001100110001100111011011000000110010010010110111110011010111000101001101110101010110010101111001110101110011100100101011110011010010011100000011110110010111111100100001110010110100001100010111110001110000010100001111110001110000010101011111110011110011001100011001110110110000001100100100101101101011110 e6b8a6eab2bceb9c95e69381ecbf90e5a18be38287e382afe7998ced81925be6b8a6eab2bceb9c95e69381ecbf90e5a18be38287e382afe7998ced81925b5e
UHC 渦겼뜕擁쿐塋ょク癌큒[渦겼뜕擁쿐塋ょク癌큒[^ 11101000101111101011000011100101100011011001100011101000101101101011001101000101111001111010101110101010111001111010101110101111111001001101111110110100011000100101101111101000101111101011000011100101100011011001100011101000101101101011001101000101111001111010101110101010111001111010101110101111111001001101111110110100011000100101101101011110 e8beb0e58d98e8b6b345e7abaae7abafe4dfb4625be8beb0e58d98e8b6b345e7abaae7abafe4dfb4625b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)