To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾?????音??????癲??巽ο?擬?? 11100100100010000011111100111111001111110011111100111111100010011011100100111111001111110011111100111111001111110011111111100001100111110011111100111111100100100100011010000011110011010011111110001011010110110011111100111111 e4883f3f3f3f3f89b93f3f3f3f3f3fe19f3f3f924683cd3f8b5b3f3f
EUC-JP 艾?????音??????癲??巽ο?擬?? 11100111111010000011111100111111001111110011111100111111101100101011101100111111001111110011111100111111001111110011111111100010101000010011111100111111110000111010011110100110110011110011111110110101101111000011111100111111 e7e83f3f3f3f3fb2bb3f3f3f3f3f3fe2a13f3fc3a7a6cf3fb5bc3f3f
UTF-8 艾싳궠梨욘룚音좊엠捻믍뗫뼏癲용굝巽ο㎟擬륁뜠 1110100010001001101111101110110010001011101100111110101010110110101000001110111110100111101000101110110010011010100110001110101110100011100110101110100110011111101100111110110010100010100010101110110010010111101000001110111110100110101001001110101110101111100011011110101110010111101010111110101110111100100011111110011110011001101100101110110010011010101010011110101010110101100111011110010110110111101111011100111010111111111000111000111010011111111001101001001110101100111010111010010110000001111010111001110010100000 e889beec8bb3eab6a0efa7a2ec9a98eba39ae99fb3eca28aec97a0efa6a4ebaf8deb97abebbc8fe799b2ec9aa9eab59de5b7bdcebfe38e9fe693aceba581eb9ca0
UHC 艾싳궠梨욘룚音좊엠捻믍뗫뼏癲용굝巽ο㎟擬륁뜠 1110010011110101100110101110110010000010101100111110110010110001101111111110011010001111100101101110101111100101101000001110101110111111101001011110011011110111100100101101000110001011111010111001011010010111111011111010011010111111111010111000001010000101111000011101111010100101111011111010011110110001111010111111010010001111111011001000110110100011 e4f59aec82b3ecb1bfe68f96ebe5a0ebbfa5e6f792d18beb9697efa6bfeb8285e1dea5efa7b1ebf48fec8da3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)