To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????M 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4d
SJIS-WIN 恁??????⑥?姨???⑥?姨?????M 100111001000110000111111001111110011111100111111001111110011111110000111010001010011111110011011010010000011111100111111001111111000011101000101001111111001101101001000001111110011111100111111001111110011111101001101 9c8c3f3f3f3f3f3f87453f9b483f3f3f87453f9b483f3f3f3f3f4d
EUC-JP 恁????????姨?????姨?????M 11010111111011000011111100111111001111110011111100111111001111110011111100111111110101011010100100111111001111110011111100111111001111111101010110101001001111110011111100111111001111110011111101001101 d7ec3f3f3f3f3f3f3f3fd5a93f3f3f3f3fd5a93f3f3f3f3f4d
UTF-8 恁㏉슗淋싴슊梨⑥콡姨먰쉱梨⑥콡姨랁삧吏숈껌M 11100110100000011000000111100011100011111000100111101100100010101001011111101111101001111011010111101100100010111011010011101100100010101000101011101111101001111010001011100010100100011010010111101100101111011010000111100101101001111010100011101011101010001011000011101100100010011011000111101111101001111010001011100010100100011010010111101100101111011010000111100101101001111010100011101011100111101000000111101100100000101010011111101111101001111001111011101100100010001000100011101010101110111000110001001101 e68181e38f89ec8a97efa7b5ec8bb4ec8a8aefa7a2e291a5ecbda1e5a7a8eba8b0ec89b1efa7a2e291a5ecbda1e5a7a8eb9e81ec82a7efa79eec8888eabb8c4d
UHC 恁㏉슗淋싴슊梨⑥콡姨먰쉱梨⑥콡姨랁삧吏숈껌M 11101100111101101010011111101101100110101010011011101100111110001001101011101101100110101001101011101100101100011010100011101100101100011001100111101100101010011001000011101101100110101000100011101100101100011010100011101100101100011001100111101100101010011000110111101101100110001010011011101100101001111001100111101100101100101010110101001101 ecf6a7ed9aa6ecf89aed9a9aecb1a8ecb199eca990ed9a88ecb1a8ecb199eca98ded98a6eca799ecb2ad4d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)