To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕??????る?泣э?擬??艾??魏 111000011001111100111111001111111001011101010100001111110011111100111111001111110011111100111111100000101110100100111111100010111000001110000100100011110011111110001011010110110011111100111111111001001000100000111111001111111110100110110000 e19f3f3f97543f3f3f3f3f3f82e93f8b83848f3f8b5b3f3fe4883f3fe9b0
EUC-JP 癲??裕??????る?泣э?擬??艾??魏 111000101010000100111111001111111100110110110101001111110011111100111111001111110011111100111111101001001110101100111111101101011110001110100111111011110011111110110101101111000011111100111111111001111110100000111111001111111111001010110010 e2a13f3fcdb53f3f3f3f3f3fa4eb3fb5e3a7ef3fb5bc3f3fe7e83f3ff2b2
UTF-8 癲븍쵉裕끻깷類㏉뮆閭る돍泣э쬃擬쀫렱艾싲떼魏 1110011110011001101100101110101110111000100011011110110010110101100010011110100010100011100101011110101110000001101110111110101010111001101101111110111110100111100100001110001110001111100010011110101110101110100001101110111110100110100001101110001110000010100010111110101110001111100011011110011010110011101000111101000110001101111011001010110010000011111001101001001110101100111011001000000010101011111010111010000010110001111010001000100110111110111011001000101110110010111010111001011010111100111010011010110110001111 e799b2ebb88decb589e8a395eb81bbeab9b7efa790e38f89ebae86efa686e3828beb8f8de6b3a3d18decac83e693acec80abeba0b1e889beec8bb2eb96bce9ad8f
UHC 癲븍쵉裕끻깷類㏉뮆閭る돍泣э쬃擬쀫렱艾싲떼魏 1110111110100110101110101110101110101100100010111110101110101110100001011110010110000011101001011110101110111010101001111110110110010010100101011110011010101101101010101110101110001001100110111110101111101000101011001110111110100110100110101110101111110100100101111110101110001110101111101110010011110101100110101110101110110110101111001110101011100000 efa6baebac8bebae85e583a5ebbaa7ed9295e6adaaeb899bebe8acefa69aebf497eb8ebee4f59aebb6bceae0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)