To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏????┼誘κ?藥??揖??袁ル??り?? 10001001010001110011111100111111001111110011111110000100101010011001011101010101100000111100100000111111111001010101101000111111001111111001011101001011001111110011111111100101110011011000001110001011001111110011111110000010111010000011111100111111 89473f3f3f3f84a9975583c83fe55a3f3f974b3f3fe5cd838b3f3f82e83f3f
EUC-JP 烏??薏?┼誘κ?藥??揖??袁ル??り?? 101100011010100000111111001111111000111111011001110111100011111110101000101010111100110110110110101001101100101000111111111010011011101100111111001111111100110110101100001111110011111111101010110011111010010111101011001111110011111110100100111010100011111100111111 b1a83f3f8fd9de3fa8abcdb6a6ca3fe9bb3f3fcdac3f3feacfa5eb3f3fa4ea3f3f
UTF-8 烏띾ㅇ薏껓┼誘κ섭藥띾엪揖쇗땡袁ル솿閭り난璘 1110011110000011100011111110101110011101101111101110001110000101100001111110100010010110100011111110101010111011100100111110001010010100101111001110100010101010100110001100111010111010111011001000010010101101111010001001011110100101111010111001110110111110111011001001011110101010111001101000111110010110111011001000011110010111111010111001010110100001111010001010001010000001111000111000001110101011111011001000011010111111111011111010011010000110111000111000001010001010111010111000001010011100111011111010011110101111 e7838feb9dbee38587e8968feabb93e294bce8aa98cebaec84ade897a5eb9dbeec97aae68f96ec8797eb95a1e8a281e383abec86bfefa686e3828aeb829cefa7af
UHC 烏띾ㅇ薏껓┼誘κ섭藥띾엪揖쇗땡袁ル솿閭り난璘 1110100010100001100011011110101110100100101101111110101111111011100000111110111110100110101010111110101110101111101001011110101010111100101101111110010110110111100011011110101110011110100000111110101111100111101111001110011010110110101011111110101010111110101010111110101110011001101100111110011010101101101010101110101010110011101011011110110011011110 e8a18deba4b7ebfb83efa6abebafa5eabcb7e5b78deb9e83ebe7bce6b6afeabeabeb99b3e6adaaeab3adecde

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)