To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???夷ゆ?幽??奄???よい怨??孃 001111110011111100111111100010001100111010000010111001000011111110010111010010000011111100111111100010011000001000111111001111110011111110000010111001101000001010100010100010011000010100111111001111111001101101101111 3f3f3f88ce82e43f97483f3f89823f3f3f82e682a289853f3f9b6f
EUC-JP ???夷ゆ?幽??奄???よい怨??孃 001111110011111100111111101100001101000010100100111001100011111111001101101010010011111100111111101100011110001000111111001111110011111110100100111010001010010010100100101100011110010100111111001111111101010111010000 3f3f3fb0d0a4e63fcda93f3fb1e23f3f3fa4e8a4a4b1e53f3fd5d0
UTF-8 嶺뚭낮夷ゆ룚幽꾪닞奄멸램藺よい怨룸쿃孃 111011111010011010101011111010111001101010101101111010111000001010101110111001011010010010110111111000111000001010000110111010111010001110011010111001011011100110111101111010101011111010101010111010111000101110011110111001011010010110000100111010111010100110111000111010111001111010101000111011111010011110110000111000111000001010001000111000111000000110000100111001101000000010101000111010111010001110111000111011001011111110000011111001011010110110000011 efa6abeb9aadeb82aee5a4b7e38286eba39ae5b9bdeabeaaeb8b9ee5a584eba9b8eb9ea8efa7b0e38288e38184e680a8eba3b8ecbf83e5ad83
UHC 嶺뚭낮夷ゆ룚幽꾪닞奄멸램藺よい怨룸쿃孃 1110011110101101100011001110101010110011101101111110110010101000101010101110011010001111100101101110101011101011100001001110110110001000100111101110010111110010101110001110101010110111101001011110110011100001101010101110100010101010101001001110101010110011101101111110101110110010100110011110010110111110 e7ad8ceab3b7eca8aae68f96eaeb84ed889ee5f2b8eab7a5ece1aae8aaa4eab3b7ebb299e5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)