What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??音??豫?????惟る??????? 1110000011001110001111110011111110001001101110010011111100111111100110001010110000111111001111110011111100111111001111111000100011010010100000101110100100111111001111110011111100111111001111110011111100111111 e0ce3f3f89b93f3f98ac3f3f3f3f3f88d282e93f3f3f3f3f3f3f
EUC-JP 猥??音??豫?????惟る????庾?? 11100000110100000011111100111111101100101011101100111111001111111101000010101110001111110011111100111111001111110011111110110000110101001010010011101011001111110011111100111111001111111000111110111100110011100011111100111111 e0d03f3fb2bb3f3fd0ae3f3f3f3f3fb0d4a4eb3f3f3f3f8fbcce3f3f
UTF-8 猥롢뀧音곗몥豫곕객留뚧룄惟る눤亮쎈맧庾껆븨 111001111000110010100101111010111010000110100010111010111000000010100111111010011001111110110011111010101011001110010111111010111010101010100101111010001011000110101011111010101011001110010101111010101011000010011101111011111010011110001101111010111001101010100111111010111010001110000100111001101000001110011111111000111000001010001011111010111000100010100100111011111010010110110111111011001000111010001000111010111010011110100111111001011011101010111110111010101011101110000110111010111011100010101000 e78ca5eba1a2eb80a7e99fb3eab397ebaaa5e8b1abeab395eab09defa78deb9aa7eba384e6839fe3828beb88a4efa5b7ec8e88eba7a7e5babeeabb86ebb8a8
UHC 猥롢뀧音곗몥豫곕객留뚧룄惟る눤亮쎈맧庾껆븨 111010001110010110001110111000111000010110011110111010111110010110110000111011001001000110010011111001111110001110110000111010111011000010110100111010111010011110001100111001101000111110000100111010101110111010101010111010111000011110111011111001011011100110111101111010111001000010110000111010101110110010000011111001111001010110010001 e8e58ee3859eebe5b0ec9193e7e3b0ebb0b4eba78ce68f84eaeeaaeb87bbe5b9bdeb90b0eaec83e79591
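The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the helper name `encode_table` and the mapping of the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from this tool's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes '?' for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("猥"):
        print(label, shown, bits, hexstr)
```

For the single character 猥, the UTF-8 row yields the hex string e78ca5, which matches the first three bytes of the UTF-8 row in the table, and the ISO-8859-1 row yields 3f because the character is not representable there.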

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).