To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????X 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f58
SJIS-WIN 汚??節??曄②????傲??X 100010011001100000111111001111111001000011011111001111110011111110011110010000001000011101000001001111110011111100111111001111111001100011111100001111110011111101011000 89983f3f90df3f3f9e4087413f3f3f3f98fc3f3f58
EUC-JP 汚??節??曄?????傲??X 1011000111111000001111110011111111000000111000010011111100111111110110111010000100111111001111110011111100111111001111111101000011111110001111110011111101011000 b1f83f3fc0e13f3fdba13f3f3f3f3fd0fe3f3f58
UTF-8 汚얕닃節뤺뜵曄②읈隸뚳퐦傲좑슐X 11100110101100011001101011101100100101101001010111101011100010111000001111100111101011111000000011101011101001001011101011101011100111001011010111100110100110111000010011100010100100011010000111101100100111011000100011101111101001101011100011101011100110101011001111101101100100001010011011100101100000101011001011101100101000101001000111101100100010101001000001011000 e6b19aec9695eb8b83e7af80eba4baeb9cb5e69b84e291a1ec9d88efa6b8eb9ab3ed90a6e582b2eca291ec8a9058
UHC 汚얕닃節뤺뜵曄②읈隸뚳퐦傲좑슐X 11100111111111011011111011101000100010001000110011101111101111011000111111101000100011011011001111100111101001011010100011101000100111111011111011100111111001101000110011101111101111011000111111100111111011001010000011101111101111011011011001011000 e7fdbee8888cefbd8fe88db3e7a5a8e89fbee7e68cefbd8fe7eca0efbdb658

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)