Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??意??議?????愉??擬???k? 111010100101111100111111001111111000100011010011001111110011111110001011011000110011111100111111001111110011111100111111100101101111100100111111001111111000101101011011001111110011111100111111100000101000101100111111 ea5f3f3f88d33f3f8b633f3f3f3f3f96f93f3f8b5b3f3f3f828b3f
EUC-JP 鸚??意??議?????愉??擬???k? 111100111100000000111111001111111011000011010101001111110011111110110101110001000011111100111111001111110011111100111111110011001111101100111111001111111011010110111100001111110011111100111111101000111110101100111111 f3c03f3fb0d53f3fb5c43f3f3f3f3fccfb3f3fb5bc3f3f3fa3eb3f
UTF-8 鸚쒓퍓意쎿룚議용꺅呂얠럩愉뚦슫擬꾨뙔力k뜥 111010011011100010011010111011001001001010010011111011011000110110010011111001101000010010001111111011001000111010111111111010111010001110011010111010001010110110110000111011001001101010101001111010101011101010000101111011111010011010000000111011001001011010100000111010111001111110101001111001101000010010001001111010111001101010100110111011001000101010101011111001101001001110101100111010101011111010101000111010111001100110010100111011111010011010001010111011111011110110001011111010111001110010100101 e9b89aec9293ed8d93e6848fec8ebfeba39ae8adb0ec9aa9eaba85efa680ec96a0eb9fa9e68489eb9aa6ec8aabe693aceabea8eb9994efa68aefbd8beb9ca5
UHC 鸚쒓퍓意쎿룚議용꺅呂얠럩愉뚦슫擬꾨뙔力k뜥 111001011010010010011100111010101011101110001010111010111111001010011011111001101000111110010110111011001010000110111111111010111011001010100110111001011111101110111110111011001000111010001100111010101111000010001100111001011001101010110100111010111111010010000100111010111000110010011001111001101011001110100011111010111000110110101000 e5a49ceabb8aebf29be68f96eca1bfebb2a6e5fbbeec8e8ceaf08ce59ab4ebf484eb8c99e6b3a3eb8da8

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
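
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the helper name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (3f), as in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CODECS.items():
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("意"):
        print(label, bits, hex_string)
```

For the single character 意, this sketch should give 3f for ISO-8859-1, 88d3 for SJIS-WIN, b0d5 for EUC-JP, and e6848f for UTF-8, which agrees with the corresponding bytes in the table.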