To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 渦??唯?渦??唯?[渦??唯?渦??唯?[^ 10001001010100010011111100111111100101110100001000111111100010010101000100111111001111111001011101000010001111110101101110001001010100010011111100111111100101110100001000111111100010010101000100111111001111111001011101000010001111110101101101011110 89513f3f97423f89513f3f97423f5b89513f3f97423f89513f3f97423f5b5e
EUC-JP 渦??唯?渦??唯?[渦??唯?渦??唯?[^ 10110001101100100011111100111111110011011010001100111111101100011011001000111111001111111100110110100011001111110101101110110001101100100011111100111111110011011010001100111111101100011011001000111111001111111100110110100011001111110101101101011110 b1b23f3fcda33fb1b23f3fcda33f5bb1b23f3fcda33fb1b23f3fcda33f5b5e
UTF-8 渦깅끇唯럝渦깅끇唯럝[渦깅끇唯럝渦깅끇唯럝[^ 111001101011100010100110111010101011100110000101111010111000000110000111111001011001010010101111111010111001111110011101111001101011100010100110111010101011100110000101111010111000000110000111111001011001010010101111111010111001111110011101010110111110011010111000101001101110101010111001100001011110101110000001100001111110010110010100101011111110101110011111100111011110011010111000101001101110101010111001100001011110101110000001100001111110010110010100101011111110101110011111100111010101101101011110 e6b8a6eab985eb8187e594afeb9f9de6b8a6eab985eb8187e594afeb9f9d5be6b8a6eab985eb8187e594afeb9f9de6b8a6eab985eb8187e594afeb9f9d5b5e
UHC 渦깅끇唯럝渦깅끇唯럝[渦깅끇唯럝渦깅끇唯럝[^ 11101000101111101011000111101011100001011011101111101010111001101000111001111010111010001011111010110001111010111000010110111011111010101110011010001110011110100101101111101000101111101011000111101011100001011011101111101010111001101000111001111010111010001011111010110001111010111000010110111011111010101110011010001110011110100101101101011110 e8beb1eb85bbeae68e7ae8beb1eb85bbeae68e7a5be8beb1eb85bbeae68e7ae8beb1eb85bbeae68e7a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)