What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃??玉?????節よ?阿??曜??要 1000100010100001001111110011111110001011110010100011111100111111001111110011111100111111100100001101111110000010111001100011111110001000101000100011111100111111100101110110101000111111001111111001011101110110 88a13f3f8bca3f3f3f3f3f90df82e63f88a23f3f976a3f3f9776
EUC-JP 娃??玉??孼??節よ?阿??曜??要 10110000101000110011111100111111101101101100110000111111001111111000111110111010110000110011111100111111110000001110000110100100111010000011111110110000101001000011111100111111110011011100101100111111001111111100110111010111 b0a33f3fb6cc3f3f8fbac33f3fc0e1a4e83fb0a43f3fcdcb3f3fcdd7
UTF-8 娃띰쉠玉먪돉孼닻겤節よ춯阿쀩뙋曜깍쉼要 111001011010100010000011111010111001110110110000111011001000100110100000111001111000111010001001111010111010100010101010111010111000111110001001111001011010110110111100111010111000101110111011111010101011001010100100111001111010111110000000111000111000001010001000111011001011011010101111111010011001100010111111111011001000000010101001111010111001100110001011111001101001101110011100111010101011100110001101111011001000100110111100111010001010011010000001 e5a883eb9db0ec89a0e78e89eba8aaeb8f89e5adbceb8bbbeab2a4e7af80e38288ecb6afe998bfec80a9eb998be69b9ceab98dec89bce8a681
UHC 娃띰쉠玉먪돉孼닻겤節よ춯阿쀩뙋曜깍쉼要 1110100011011111101101101110111110111101101010101110100010101100100100001110011110001001100110011110010111101101101101001110100110000001101101101110111110111101101010101110100010101101100011001110010010111001100101111110100110001100100100001110100011111000101100011110111110111101101100001110100110101001 e8dfb6efbdaae8ac90e78999e5edb4e981b6efbdaae8ad8ce4b997e98c90e8f8b1efbdb0e9a9

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f) in the table above.
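
The conversion in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The function name `encode_to_bits` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from this page's converter in edge cases. Unencodable characters are replaced with "?" (0x3f), which mirrors the `3f` bytes in the table.

```python
# Hypothetical mapping from the charset labels used on this page to
# Python codec names (an assumption, not taken from the page itself).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits("娃", codec)
        print(label, bits, hexstr)
```

For example, `encode_to_bits("娃", "utf-8")` returns the hex string `e5a883`, which matches the first three bytes of the UTF-8 row above.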