What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??泣ワ??κ????筌??泣ワ??κ????B 111000101010001100111111001111111000101110000011100000111000111100111111001111111000001111001000001111110011111100111111001111111110001010100011001111110011111110001011100000111000001110001111001111110011111110000011110010000011111100111111001111110011111101000010 e2a33f3f8b83838f3f3f83c83f3f3f3fe2a33f3f8b83838f3f3f83c83f3f3f3f42
EUC-JP 筌??泣ワ?洹κ????筌??泣ワ?洹κ????B 11100100101001010011111100111111101101011110001110100101111011110011111110001111110001111011101010100110110010100011111100111111001111110011111111100100101001010011111100111111101101011110001110100101111011110011111110001111110001111011101010100110110010100011111100111111001111110011111101000010 e4a53f3fb5e3a5ef3f8fc7baa6ca3f3f3f3fe4a53f3fb5e3a5ef3f8fc7baa6ca3f3f3f3f42
UTF-8 筌뚮뿦泣ワ㎗洹κ묘閱곕뒼筌뚮뿦泣ワ㎗洹κ묘閱곕뒼B 1110011110101101100011001110101110011010101011101110101110111111101001101110011010110011101000111110001110000011101011111110001110001110100101111110011010110100101110011100111010111010111010111010110010011000111010011001011010110001111010101011001110010101111010111001001010111100111001111010110110001100111010111001101010101110111010111011111110100110111001101011001110100011111000111000001110101111111000111000111010010111111001101011010010111001110011101011101011101011101011001001100011101001100101101011000111101010101100111001010111101011100100101011110001000010 e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e996b1eab395eb92bce7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e996b1eab395eb92bc42
UHC 筌뚮뿦泣ワ㎗洹κ묘閱곕뒼筌뚮뿦泣ワ㎗洹κ묘閱곕뒼B 11101111101001111000110011101011100101111010011011101011111010001010101111101111101001111010001111101010101101111010010111101010101110011010011011100110111100111011000011101011100010101011001011101111101001111000110011101011100101111010011011101011111010001010101111101111101001111010001111101010101101111010010111101010101110011010011011100110111100111011000011101011100010101011001001000010 efa78ceb97a6ebe8abefa7a3eab7a5eab9a6e6f3b0eb8ab2efa78ceb97a6ebe8abefa7a3eab7a5eab9a6e6f3b0eb8ab242

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
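
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page. The codec names are Python's own, and pairing them with the labels above is an assumption (in particular cp932 for SJIS-WIN and cp949 for UHC). The helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Page label -> Python codec name (assumed mapping, not taken from the page).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for anything the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("B").items():
        print(label, bits, hex_digits)
```

An ASCII letter such as "B" is the single byte 0x42 in all five charsets, which is why every row of the table ends in 42. Other characters differ from charset to charset, or fall back to 3f where the charset has no mapping for them.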