To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????????業??鶯????????^ 001111110011111100111111001111110011111100111111001111110011111100111111100010111100011000111111001111111110100111110010001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f8bc63f3fe9f23f3f3f3f3f3f3f3f5e
EUC-JP ?????????業??鶯????????^ 001111110011111100111111001111110011111100111111001111110011111100111111101101101100100000111111001111111111001011110100001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3fb6c83f3ff2f43f3f3f3f3f3f3f3f5e
UTF-8 溜삣짍溜딅졎溜딅졎業롫졎鶯숇젇溜삳젇溜삥삇^ 11101111101001111000101111101100100000101010001111101100101001111000110111101111101001111000101111101011100101001000010111101100101000011000111011101111101001111000101111101011100101001000010111101100101000011000111011100110101001011010110111101011101000011010101111101100101000011000111011101001101101101010111111101100100010001000011111101100101000001000011111101111101001111000101111101100100000101011001111101100101000001000011111101111101001111000101111101100100000101010010111101100100000101000011101011110 efa78bec82a3eca78defa78beb9485eca18eefa78beb9485eca18ee6a5adeba1abeca18ee9b6afec8887eca087efa78bec82b3eca087efa78bec82a5ec82875e
UHC 溜삣짍溜딅졎溜딅졎業롫졎鶯숇젇溜삳젇溜삥삇^ 11101010111111101011101111100101101000111001100111101010111111101000101011101011101000001011101111101010111111101000101011101011101000001011101111100101111101101000111011101011101000001011101111100101101000111001100111101011101000001000101011101010111111101011101111101011101000001000101011101010111111101011101111100110100110001000111001011110 eafebbe5a399eafe8aeba0bbeafe8aeba0bbe5f68eeba0bbe5a399eba08aeafebbeba08aeafebbe6988e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)