To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔???除???疑??梯?畯孟∧??除 100100101010001000111111001111110011111110001111100111000011111100111111001111111000101101011110001111110011111110010010111100100011111111111011011011111001011011010000100000011100100000111111001111111000111110011100 92a23f3f3f8f9c3f3f3f8b5e3f3f92f23ffb6f96d081c83f3f8f9c
EUC-JP 弔???除???疑??梯?畯孟∧??除 11000100101001000011111100111111001111111011110111111100001111110011111100111111101101011011111100111111001111111100010011110100001111111000111111001101101110111100110011010010101000101100101000111111001111111011110111111100 c4a43f3f3fbdfc3f3f3fb5bf3f3fc4f43f8fcdbbccd2a2ca3f3fbdfc
UTF-8 弔렲罹렗除곈렖렕疑얕욱梯렟畯孟∧亐렕除 111001011011110010010100111010111010000010110010111011111010011110100110111010111010000010010111111010011001100110100100111010101011001110001000111010111010000010010110111010111010000010010101111001111001011010010001111011001001011010010101111011001001101010110001111001101010001010101111111010111010000010011111111001111001010110101111111001011010110110011111111000101000100010100111111001001011101010010000111010111010000010010101111010011001100110100100 e5bc94eba0b2efa7a6eba097e999a4eab388eba096eba095e79691ec9695ec9ab1e6a2afeba09fe795afe5ad9fe288a7e4ba90eba095e999a4
UHC 弔렲罹렗除곈렖렕疑얕욱梯렟畯孟∧亐렕除 1111000011000000100011101011111111101100101110101000111010101100111100001011011010110000111010011000111010101011100011101010101011101011111101111011111011101000101111111110110111110000101011001000111010110000111100011110000111011000111010111010000111111100111010101010011110001110101010101111000010110110 f0c08ebfecba8eacf0b6b0e98eab8eaaebf7bee8bfedf0ac8eb0f1e1d8eba1fceaa78eaaf0b6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)