To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 贈???靖私??麻第?蔚?樗?錠??? 100100011010000100111111001111110011111110010110111101011000111010000100001111110011111110010110100000111001000111100110001111111000100101010101001111111001001010010100001111111000111111111001001111110011111100111111 91a13f3f3f96f58e843f3f968391e63f89553f92943f8ff93f3f3f
EUC-JP 贈???靖私??麻第?蔚?樗?錠??? 110000101010001100111111001111110011111111001100111101111011101111100100001111110011111111001011111000111100001011101000001111111011000110110110001111111100001111110100001111111011111011111011001111110011111100111111 c2a33f3f3fccf7bbe43f3fcbe3c2e83fb1b63fc3f43fbefb3f3f3f
UTF-8 贈숄렰렟靖私렎ㅺ麻第렰蔚렯樗렊錠골렰렧 111010001011010010001000111011001000100010000100111010111010000010110000111010111010000010011111111010011001110110010110111001111010011110000001111010111010000010001110111000111000010110111010111010011011101010111011111001111010110010101100111010111010000010110000111010001001010010011010111010111010000010101111111001101010100010010111111010111010000010001010111010011000110010100000111010101011001110101000111010111010000010110000111010111010000010100111 e8b488ec8884eba0b0eba09fe99d96e7a781eba08ee385bae9babbe7acaceba0b0e8949aeba0afe6a897eba08ae98ca0eab3a8eba0b0eba0a7
UHC 贈숄렰렟靖私렎ㅺ麻第렰蔚렯樗렊錠골렰렧 1111000111111100101111001111000110001110101111011000111010110000111011111111111011011110111001111000111010100100101001001110101011011000101010111111000010101111100011101011110111101010101001011000111010111100111011101100000010001110101000011110111111111100101100001111000110001110101111011000111010110110 f1fcbcf18ebd8eb0effedee78ea4a4ead8abf0af8ebdeaa58ebceec08ea1effcb0f18ebd8eb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)