To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 狹莵狹シ眈ラ狹莢狹猥n}狹莵狹シ眈ラ狹莢狹猥n{^ 1110000011000011111001001011011111100000110000111011110011100001101111001101011111100000110000111110010010110000111000001100001111100000110011100110111001111101111000001100001111100100101101111110000011000011101111001110000110111100110101111110000011000011111001001011000011100000110000111110000011001110011011100111101101011110 e0c3e4b7e0c3bce1bcd7e0c3e4b0e0c3e0ce6e7de0c3e4b7e0c3bce1bcd7e0c3e4b0e0c3e0ce6e7b5e
EUC-JP 狹莵狹シ眈ラ狹莢狹猥n}狹莵狹シ眈ラ狹莢狹猥n{^ 111000001100010111101000101110011110000011000101100011101011110011100010101111101000111011010111111000001100010111101000101100101110000011000101111000001101000001101110011111011110000011000101111010001011100111100000110001011000111010111100111000101011111010001110110101111110000011000101111010001011001011100000110001011110000011010000011011100111101101011110 e0c5e8b9e0c58ebce2be8ed7e0c5e8b2e0c5e0d06e7de0c5e8b9e0c58ebce2be8ed7e0c5e8b2e0c5e0d06e7b5e
UTF-8 狹莵狹シ眈ラ狹莢狹猥n}狹莵狹シ眈ラ狹莢狹猥n{^ 1110011110001011101110011110100010001110101101011110011110001011101110011110111110111101101111001110011110011100100010001110111110111110100101111110011110001011101110011110100010001110101000101110011110001011101110011110011110001100101001010110111001111101111001111000101110111001111010001000111010110101111001111000101110111001111011111011110110111100111001111001110010001000111011111011111010010111111001111000101110111001111010001000111010100010111001111000101110111001111001111000110010100101011011100111101101011110 e78bb9e88eb5e78bb9efbdbce79c88efbe97e78bb9e88ea2e78bb9e78ca56e7de78bb9e88eb5e78bb9efbdbce79c88efbe97e78bb9e88ea2e78bb9e78ca56e7b5e
UHC 狹?狹?眈?狹莢狹猥n}狹?狹?眈?狹莢狹猥n{^ 111110101111010100111111111110101111010100111111111101111010111100111111111110101111010111111010111110001111101011110101111010001110010101101110011111011111101011110101001111111111101011110101001111111111011110101111001111111111101011110101111110101111100011111010111101011110100011100101011011100111101101011110 faf53ffaf53ff7af3ffaf5faf8faf5e8e56e7dfaf53ffaf53ff7af3ffaf5faf8faf5e8e56e7b5e
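
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 row above.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("n{^"):
        print(label, bits, hex_string)
```

For pure ASCII input such as `n{^`, every charset listed here produces the same bytes (`6e7b5e`); the rows only diverge once the input contains non-ASCII characters.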

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and UNIX (EUC-JP).