To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????蒻??踰??幽??閻?????衰 001111110011111100111111001111110011111100111111111001001110100000111111001111111110011011111010001111110011111110010111010010000011111100111111111010001000010100111111001111110011111100111111001111111001000010001010 3f3f3f3f3f3fe4e83f3fe6fa3f3f97483f3fe8853f3f3f3f3f908a
EUC-JP ???洧??蒻??踰??幽??閻?????衰 0011111100111111001111111000111111000111101101000011111100111111111010001110101000111111001111111110110011111100001111110011111111001101101010010011111100111111111011111110010100111111001111110011111100111111001111111011111111101010 3f3f3f8fc7b43f3fe8ea3f3fecfc3f3fcda93f3fefe53f3f3f3f3fbfea
UTF-8 樂끸뱫洧붾뜦蒻몃뿫踰쇿뵱幽곷껐閻롫쓹痢띺썒衰 111011111010011010111111111010111000000110111000111010111011000110101011111001101011010010100111111010111011011010111110111010111001110010100110111010001001001010111011111010111010101010000011111010111011111110101011111010001011100010110000111011001000011110111111111010111011010110110001111001011011100110111101111010101011001110110111111010101011101110010000111010011001011010111011111010111010000110101011111011001001001110111001111011111010011110100101111010111001110110111010111011001000110110010010111010001010000110110000 efa6bfeb81b8ebb1abe6b4a7ebb6beeb9ca6e892bbebaa83ebbfabe8b8b0ec87bfebb5b1e5b9bdeab3b7eabb90e996bbeba1abec93b9efa7a5eb9dbaec8d92e8a1b0
UHC 樂끸뱫洧붾뜦蒻몃뿫踰쇿뵱幽곷껐閻롫쓹痢띺썒衰 1110100011111001100001011110001010010011100100011110101011111011100101001110101110001101101010011110010110110110101110001110101110010111101010111110101110110010100110011110010110010100101011111110101011101011100000011110101110110010101100001110011110100010100011101110101110011101100101011110110010111000100011011110100110011011100001011110000111110001 e8f985e29391eafb94eb8da9e5b6b8eb97abebb299e594afeaeb81ebb2b0e7a28eeb9d95ecb88de99b85e1f1

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
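
For readers who want to reproduce a table like this offline, here is a minimal Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and Python's codecs may disagree with this page's converter for some characters. The names `CHARSETS`, `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is how the 3f runs in the table arise.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one codec."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the conversion.
    shown = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text):
    """Build one row per charset label for the given input text."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in encode_table("あ").items():
        print(label, shown, bits, hexstr)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.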