What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string, then click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??伊??揄??巍ル?悠???????? 111010011111001000111111001111111000100011001001001111110011111110011101100010010011111100111111100110111101100110000011100010110011111110010111010010010011111100111111001111110011111100111111001111110011111100111111 e9f23f3f88c93f3f9d893f3f9bd9838b3f97493f3f3f3f3f3f3f3f
EUC-JP 鶯??伊??揄??巍ル?悠??洧??獒?? 11110010111101000011111100111111101100001100101100111111001111111101100111101001001111110011111111010110110110111010010111101011001111111100110110101010001111110011111110001111110001111011010000111111001111111000111111001011101110110011111100111111 f2f43f3fb0cb3f3fd9e93f3fd6dba5eb3fcdaa3f3f8fc7b43f3f8fcbbb3f3f
UTF-8 鶯ㅺ퉮伊됪젔揄우돺巍ル쵐悠방갭洧붿맼獒뺣뒴 111010011011011010101111111000111000010110111010111011011000100110101110111001001011110010001010111010111001000010101010111011001010000010010100111001101000111110000100111011001001101010110000111010111000111110111010111001011011011110001101111000111000001110101011111011001011010110010000111001101000001010100000111010111011000010101001111010101011000010101101111001101011010010100111111010111011011010111111111010111010011110111100111001111000110110010010111010111011101010100011111010111001001010110100 e9b6afe385baed89aee4bc8aeb90aaeca094e68f84ec9ab0eb8fbae5b78de383abecb590e682a0ebb0a9eab0ade6b4a7ebb6bfeba7bce78d92ebbaa3eb92b4
UHC 鶯ㅺ퉮伊됪젔揄우돺巍ル쵐悠방갭洧붿맼獒뺣뒴 111001011010001110100100111010101011100110000110111011001010010110001001111001101010000010010010111010101111000110111111111011001000100110111101111010001110010010101011111010111010110010010010111010101110110110111001111001101011000010111000111010101111101110010100111011001001000010111101111010001010001110010101111010111000101010101101 e5a3a4eab986eca589e6a092eaf1bfec89bde8e4abebac92eaedb9e6b0b8eafb94ec90bde8a395eb8aad

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
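
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption that may differ in edge cases from the converter used here. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which matches the runs of `3f` seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("あ").items():
        print(label, bits, hexa)
```

For the hiragana "あ" (U+3042), this sketch yields `e38182` in UTF-8, `82a0` in CP932, and `a4a2` in EUC-JP, while ISO-8859-1 has no such character and produces the single replacement byte `3f`.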