To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??淫??域??肉??柔モ?沃?? 00111111001111110011111110001011100000110011111100111111100010001111101000111111001111111000100011100110001111110011111110010011111101110011111100111111100011110101111110000011100000100011111110010111100000000011111100111111 3f3f3f8b833f3f88fa3f3f88e63f3f93f73f3f8f5f83823f97803f3f
EUC-JP ???泣??淫??域??肉??柔モ?沃?? 00111111001111110011111110110101111000110011111100111111101100001111110000111111001111111011000011101000001111110011111111000110111110010011111100111111101111011100000010100101111000100011111111001101111000000011111100111111 3f3f3fb5e33f3fb0fc3f3fb0e83f3fc6f93f3fbdc0a5e23fcde03f3f
UTF-8 嶺뚭램泣뚩돞淫볦춷域㏃슦肉쇘춯柔モ닪沃쇰옟 111011111010011010101011111010111001101010101101111010111001111010101000111001101011001110100011111010111001101010101001111010111000111110011110111001101011011110101011111010111011001110100110111011001011011010110111111001011001111110011111111000111000111110000011111011001000101010100110111010001000001010001001111011001000011110011000111011001011011010101111111001101001111110010100111000111000001110100010111010111000101110101010111001101011001010000011111011001000011110110000111011001001100010011111 efa6abeb9aadeb9ea8e6b3a3eb9aa9eb8f9ee6b7abebb3a6ecb6b7e59f9fe38f83ec8aa6e88289ec8798ecb6afe69f94e383a2eb8baae6b283ec87b0ec989f
UHC 嶺뚭램泣뚩돞淫볦춷域㏃슦肉쇘춯柔モ닪沃쇰옟 111001111010110110001100111010101011011110100101111010111110100010001100111010001000100110100100111010111110001010010011111011001010110110010011111001101011010010100111111011001001101010110000111010111011111110111100111001111010110110001100111010101111010110101011111000101000100010100101111010001010101010111100111010111001111010100001 e7ad8ceab7a5ebe88ce889a4ebe293ecad93e6b4a7ec9ab0ebbfbce7ad8ceaf5abe288a5e8aabceb9ea1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)