To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 矮??擁?????泣③??よ?擬??鉛??^ 1110000111100010001111110011111110010111011010010011111100111111001111110011111100111111100010111000001110000111010000100011111100111111100000101110011000111111100010110101101100111111001111111000100110010100001111110011111101011110 e1e23f3f97693f3f3f3f3f8b8387423f3f82e63f8b5b3f3f89943f3f5e
EUC-JP 矮??擁?????泣???よ?擬??鉛??^ 11100010111001000011111100111111110011011100101000111111001111110011111100111111001111111011010111100011001111110011111100111111101001001110100000111111101101011011110000111111001111111011000111110100001111110011111101011110 e2e43f3fcdca3f3f3f3f3fb5e33f3f3fa4e83fb5bc3f3fb1f43f3f5e
UTF-8 矮곷젶擁얠뼇淋먪뎳泣③긽溜よ뗀擬묓쉺鉛놁뒯^ 11100111100111111010111011101010101100111011011111101100101000001011011011100110100100111000000111101100100101101010000011101011101111001000011111101111101001111011010111101011101010001010101011101011100011101011001111100110101100111010001111100010100100011010001011101010101110001011110111101111101001111000101111100011100000101000100011101011100101111000000011100110100100111010110011101011101011001001001111101100100010011011101011101001100010011001101111101011100001101000000111101011100100101010111101011110 e79faeeab3b7eca0b6e69381ec96a0ebbc87efa7b5eba8aaeb8eb3e6b3a3e291a2eab8bdefa78be38288eb9780e693acebac93ec89bae9899beb8681eb92af5e
UHC 矮곷젶擁얠뼇淋먪뎳泣③긽溜よ뗀擬묓쉺鉛놁뒯^ 11101000111000011000000111101011101000001010101011101000101101101011111011101100100101101001000111101100111110001001000011100111100010011000011011101011111010001010100011101001100000111000000111101010111111101010101011101000101101101011111011101011111101001001000111101101100110101001000011100110111001111000011011101100100010101010100001011110 e8e181eba0aae8b6beec9691ecf890e78986ebe8a8e98381eafeaae8b6beebf491ed9a90e6e786ec8aa85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)