Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??較莎???捲?訊?診??? 0011111100111111100010100111001011100100101100110011111100111111001111111000110010011110001111111001000001110101001111111001000001100110001111110011111100111111 3f3f8a72e4b33f3f3f8c9e3f90753f90663f3f3f
EUC-JP ??較莎???捲?訊?診??? 0011111100111111101100111101001111101000101101010011111100111111001111111011011111111110001111111011111111010110001111111011111111000111001111110011111100111111 3f3fb3d3e8b53f3f3fb7fe3fbfd63fbfc73f3f3f
UTF-8 뤵칿較莎븍롙뤃捲쮲訊롁診렏얀렻 111010111010010010110101111011001011100110111111111010001011110010000011111010001000111010001110111010111011100010001101111010111010000110011001111010111010010010000011111001101000110110110010111011001010111010110010111010001010100010001010111010111010000110000001111010001010100010111010111010111010000010001111111011001001011010000000111010111010000010111011 eba4b5ecb9bfe8bc83e88e8eebb88deba199eba483e68db2ecaeb2e8a88aeba181e8a8baeba08fec9680eba0bb
UHC 뤵칿較莎븍롙뤃捲쮲訊롁診렏얀렻 100011111110001110101111100011101100111011110010110111101110110110111010111010111000111011011101100011111011010011001111111011001010100010001111111000111111001010001110110010001111001011100000100011101010010110111110111000011000111011000011 8fe3af8ecef2deedbaeb8edd8fb4cfeca88fe3f28ec8f2e08ea5bee18ec3
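
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs are close equivalents of the charsets listed (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC), and the helper name to_bit_strings is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical helper: encode text in a charset and return its
# binary and hexadecimal bit strings.
def to_bit_strings(text, codec):
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("較", codec)
        print(label, binary, hexadecimal)
```

For the single character 較, this gives 8a72 for SJIS-WIN, b3d3 for EUC-JP, and e8bc83 for UTF-8, which agrees with the corresponding byte sequences inside the table rows.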

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).