To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 諺℡?言??厓??[諺℡?言??厓??[^ 1000110010111111100001111000010000111111100011001011111000111111001111111111101010001101001111110011111101011011100011001011111110000111100001000011111110001100101111100011111100111111111110101000110100111111001111110101101101011110 8cbf87843f8cbe3f3ffa8d3f3f5b8cbf87843f8cbe3f3ffa8d3f3f5b5e
EUC-JP 諺??言??厓??[諺??言??厓??[^ 1011100011000001001111110011111110111000110000000011111100111111100011111011010011000111001111110011111101011011101110001100000100111111001111111011100011000000001111110011111110001111101101001100011100111111001111110101101101011110 b8c13f3fb8c03f3f8fb4c73f3f5bb8c13f3fb8c03f3f8fb4c73f3f5b5e
UTF-8 諺℡컟言됭갬厓⒴씏[諺℡컟言됭갬厓⒴씏[^ 111010001010101110111010111000101000010010100001111011001011101110011111111010001010100010000000111010111001000010101101111010101011000010101100111001011000111010010011111000101001001010110100111011001001010010001111010110111110100010101011101110101110001010000100101000011110110010111011100111111110100010101000100000001110101110010000101011011110101010110000101011001110010110001110100100111110001010010010101101001110110010010100100011110101101101011110 e8abbae284a1ecbb9fe8a880eb90adeab0ace58e93e292b4ec948f5be8abbae284a1ecbb9fe8a880eb90adeab0ace58e93e292b4ec948f5b5e
UHC 諺℡컟言됭갬厓⒴씏[諺℡컟言됭갬厓⒴씏[^ 111001011110110010100010111001011011000010001010111001011110101110001001111010001011000010110111111001001110110110101001111001011001110110100110010110111110010111101100101000101110010110110000100010101110010111101011100010011110100010110000101101111110010011101101101010011110010110011101101001100101101101011110 e5eca2e5b08ae5eb89e8b0b7e4eda9e59da65be5eca2e5b08ae5eb89e8b0b7e4eda9e59da65b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)