Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 艶?兀?艶?兀?N}艶?兀?艶?兀?N{^ 1000100110010000001111111001100101011001001111111000100110010000001111111001100101011001001111110100111001111101100010011001000000111111100110010101100100111111100010011001000000111111100110010101100100111111010011100111101101011110 89903f99593f89903f99593f4e7d89903f99593f89903f99593f4e7b5e
EUC-JP 艶?兀?艶?兀?N}艶?兀?艶?兀?N{^ 1011000111110000001111111101000110111010001111111011000111110000001111111101000110111010001111110100111001111101101100011111000000111111110100011011101000111111101100011111000000111111110100011011101000111111010011100111101101011110 b1f03fd1ba3fb1f03fd1ba3f4e7db1f03fd1ba3fb1f03fd1ba3f4e7b5e
UTF-8 艶쵲兀늜艶쵲兀늗N}艶쵲兀늜艶쵲兀늗N{^ 1110100010001001101101101110110010110101101100101110010110000101100000001110101110001010100111001110100010001001101101101110110010110101101100101110010110000101100000001110101110001010100101110100111001111101111010001000100110110110111011001011010110110010111001011000010110000000111010111000101010011100111010001000100110110110111011001011010110110010111001011000010110000000111010111000101010010111010011100111101101011110 e889b6ecb5b2e58580eb8a9ce889b6ecb5b2e58580eb8a974e7de889b6ecb5b2e58580eb8a9ce889b6ecb5b2e58580eb8a974e7b5e
UHC 艶쵲兀늜艶쵲兀늗N}艶쵲兀늜艶쵲兀늗N{^ 11100110111111011010110101001101111010001011010010001000011010001110011011111101101011010100110111101000101101001000100001100110010011100111110111100110111111011010110101001101111010001011010010001000011010001110011011111101101011010100110111101000101101001000100001100110010011100111101101011110 e6fdad4de8b48868e6fdad4de8b488664e7de6fdad4de8b48868e6fdad4de8b488664e7b5e
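The rows above can be approximated offline with a short script. This is a minimal sketch, not the page's actual implementation. It assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC, and that unencodable characters are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CODECS` and `to_bit_strings` are hypothetical.

```python
# Hypothetical mapping from the charset labels in the table to Python codec
# names (assumption: cp932 ~ SJIS-WIN, cp949 ~ UHC).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per encoded byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "艶N}"  # a short excerpt of the input shown in the table
    for label, codec in CODECS.items():
        bits, hex_string = to_bit_strings(sample, codec)
        print(label, bits, hex_string)
```

Each output line has the same layout as a table row without the "Character" column: the charset label, the concatenated 8-bit groups, and the hexadecimal form of the same bytes.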

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).