To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??竊??儒??獄??椅??惟?????異 1111101011010000001111110011111111100010100001100011111100111111100011101111001000111111001111111000110110010110001111110011111110001000110101100011111100111111100010001101001000111111001111110011111100111111001111111000100011011001 fad03f3fe2863f3f8ef23f3f8d963f3f88d63f3f88d23f3f3f3f3f88d9
EUC-JP ???竊??儒??獄??椅??惟?????異 00111111001111110011111111100011111001100011111100111111101111001111010000111111001111111011100111110110001111110011111110110000110110000011111100111111101100001101010000111111001111110011111100111111001111111011000011011011 3f3f3fe3e63f3fbcf43f3fb9f63f3fb0d83f3fb0d43f3f3f3f3fb0db
UTF-8 昻뉗떜竊섉껸儒띠쾸獄쏄퀡椅졿첀惟ㅻ쳳烈쀣끁異 111001101001100010111011111010111000100110010111111010111001011010011100111001111010101110001010111011001000010010001001111010101011101110111000111001011000010010010010111010111001110110100000111011001011111010111000111001111000110110000100111011001000111110000100111011011000000010100001111001101010010010000101111011001010000110111111111011001011001010000000111001101000001110011111111000111000010110111011111011001011001110110011111011111010011010011111111011001000000010100011111010111000000110000001111001111001010110110000 e698bbeb8997eb969ce7ab8aec8489eabbb8e58492eb9da0ecbeb8e78d84ec8f84ed80a1e6a485eca1bfecb280e6839fe385bbecb3b3efa69fec80a3eb8181e795b0
UHC 昻뉗떜竊섉껸儒띠쾸獄쏄퀡椅졿첀惟ㅻ쳳烈쀣끁異 1110010011101001100001111110110010001011101100101110111110111100100110001110011010110010101110011110101011100011101101101110110010110010100011101110100010101011100110111110101010110011100101011110101111110101101000001110011010101010100011011110101011101110101001001110101110101011100101101110011011101111100101111110001110000101101101111110110010110110 e4e987ec8bb2efbc98e6b2b9eae3b6ecb28ee8ab9beab395ebf5a0e6aa8deaeea4ebab96e6ef97e385b7ecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)