To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??媛??猷??嶸∽?議??閻??侑 100010010101000100111111001111111001010101010001001111110011111110010111010100010011111100111111111110101011010010000001111001000011111110001011011000110011111100111111111010001000010100111111001111111001100011010000 89513f3f95513f3f97513f3ffab481e43f8b633f3fe8853f3f98d0
EUC-JP 渦??媛??猷??嶸∽?議??閻??侑 10110001101100100011111100111111110010011011001000111111001111111100110110110010001111110011111110001111101110111111010010100010111001100011111110110101110001000011111100111111111011111110010100111111001111111101000011010010 b1b23f3fc9b23f3fcdb23f3f8fbbf4a2e63fb5c43f3fefe53f3fd0d2
UTF-8 渦기뫀媛뉑틦猷↔뻗嶸∽쭑議우쐨閻롫챷侑 111001101011100010100110111010101011100010110000111010111010101110000000111001011010101010011011111010111000100110010001111011011000101110100110111001111000110010110111111000101000011010010100111010111011101110010111111001011011011010111000111000101000100010111101111011001010110110010001111010001010110110110000111011001001101010110000111011001001000010101000111010011001011010111011111010111010000110101011111011001011000110110111111001001011111010010001 e6b8a6eab8b0ebab80e5aa9beb8991ed8ba6e78cb7e28694ebbb97e5b6b8e288bdecad91e8adb0ec9ab0ec90a8e996bbeba1abecb1b7e4be91
UHC 渦기뫀媛뉑틦猷↔뻗嶸∽쭑議우쐨閻롫챷侑 1110100010111110101100011110001010010001101001001110101010110000100001111110011010111010100100001110101110100011101000011110101010111011101110001110011110101110101000011110111110100111100010011110110010100001101111111110110010011100100011011110011110100010100011101110101110101010100001001110101011100010 e8beb1e291a4eab087e6ba90eba3a1eabbb8e7aea1efa789eca1bfec9c8de7a28eebaa84eae2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)