Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 羽?億???倚警?敎?億???倚警?界 10001001010010000011111110001001101011010011111100111111001111111001100011011111100011000111100000111111111110101100110100111111100010011010110100111111001111110011111110011000110111111000110001111000001111111000101001000101 89483f89ad3f3f3f98df8c783ffacd3f89ad3f3f3f98df8c783f8a45
EUC-JP 羽?億???倚警???億???倚警?界 101100011010100100111111101100101010111100111111001111110011111111010000111000011011011111011001001111110011111100111111101100101010111100111111001111110011111111010000111000011011011111011001001111111011001110100110 b1a93fb2af3f3f3fd0e1b7d93f3f3fb2af3f3f3fd0e1b7d93fb3a6
UTF-8 羽렡億띨렠렲倚警렑敎ㅄ億띨렠렲倚警렑界 111001111011111010111101111010111010000010100001111001011000010010000100111010111001110110101000111010111010000010100000111010111010000010110010111001011000000010011010111010001010110110100110111010111010000010010001111001101001010110001110111000111000010110000100111001011000010010000100111010111001110110101000111010111010000010100000111010111010000010110010111001011000000010011010111010001010110110100110111010111010000010010001111001111001010110001100 e7bebdeba0a1e58484eb9da8eba0a0eba0b2e5809ae8ada6eba091e6958ee38584e58484eb9da8eba0a0eba0b2e5809ae8ada6eba091e7958c
UHC 羽렡億띨렠렲倚警렑敎ㅄ億띨렠렲倚警렑界 1110100111100010100011101011001011100101111000101011011011101110100011101011000110001110101111111110101111101111110011001110110110001110101001101100111011100111101001001011010011100101111000101011011011101110100011101011000110001110101111111110101111101111110011001110110110001110101001101100110110100011 e9e28eb2e5e2b6ee8eb18ebfebefcced8ea6cee7a4b4e5e2b6ee8eb18ebfebefcced8ea6cda3

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
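
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from this page's converter for some vendor-specific characters. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = data.decode(codec, errors="replace")
    # Render every byte as 8 binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


# Print one row per charset for a sample character.
for label, codec in CHARSETS.items():
    shown, bits, hex_string = encode_row("界", codec)
    print(label, shown, bits, hex_string)
```

For the sample character 界, the UTF-8 row comes out as `e7958c` and the ISO-8859-1 row as `3f`, consistent with the last bytes of the corresponding rows in the table.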