To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕??揄???る?泣э?擬??艾??爾 11100001100111110011111100111111100101110101010000111111001111111001110110001001001111110011111100111111100000101110100100111111100010111000001110000100100011110011111110001011010110110011111100111111111001001000100000111111001111111000111010100010 e19f3f3f97543f3f9d893f3f3f82e93f8b83848f3f8b5b3f3fe4883f3f8ea2
EUC-JP 癲??裕??揄???る?泣э?擬??艾??爾 11100010101000010011111100111111110011011011010100111111001111111101100111101001001111110011111100111111101001001110101100111111101101011110001110100111111011110011111110110101101111000011111100111111111001111110100000111111001111111011110010100100 e2a13f3fcdb53f3fd9e93f3f3fa4eb3fb5e3a7ef3fb5bc3f3fe7e83f3fbca4
UTF-8 癲븍쵉裕끻♤揄몄돹閭る돍泣э쬃擬쀫렱艾싲떵爾 1110011110011001101100101110101110111000100011011110110010110101100010011110100010100011100101011110101110000001101110111110001010011001101001001110011010001111100001001110101110101010100001001110101110001111101110011110111110100110100001101110001110000010100010111110101110001111100011011110011010110011101000111101000110001101111011001010110010000011111001101001001110101100111011001000000010101011111010111010000010110001111010001000100110111110111011001000101110110010111010111001011010110101111001111000100010111110 e799b2ebb88decb589e8a395eb81bbe299a4e68f84ebaa84eb8fb9efa686e3828beb8f8de6b3a3d18decac83e693acec80abeba0b1e889beec8bb2eb96b5e788be
UHC 癲븍쵉裕끻♤揄몄돹閭る돍泣э쬃擬쀫렱艾싲떵爾 1110111110100110101110101110101110101100100010111110101110101110100001011110010110100010101110111110101011110001101110001110110010001001101111001110011010101101101010101110101110001001100110111110101111101000101011001110111110100110100110101110101111110100100101111110101110001110101111101110010011110101100110101110101110110110101110101110110010110011 efa6baebac8bebae85e5a2bbeaf1b8ec89bce6adaaeb899bebe8acefa69aebf497eb8ebee4f59aebb6baecb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)