To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ????????訛??????????訛??\ 00111111001111110011111100111111001111110011111100111111001111111110011001100001001111110011111100111111001111110011111100111111001111110011111100111111001111111110011001100001001111110011111101011100 3f3f3f3f3f3f3f3fe6613f3f3f3f3f3f3f3f3f3fe6613f3f5c
EUC-JP ????????訛??????????訛??\ 00111111001111110011111100111111001111110011111100111111001111111110101111000010001111110011111100111111001111110011111100111111001111110011111100111111001111111110101111000010001111110011111101011100 3f3f3f3f3f3f3f3febc23f3f3f3f3f3f3f3f3f3febc23f3f5c
UTF-8 센솝센세센솎센셜訛섧솝센솝센세센솎센셜訛섧솝\ 11101100100001001011110011101100100001101001110111101100100001001011110011101100100001001011100011101100100001001011110011101100100001101000111011101100100001001011110011101100100001011001110011101000101010001001101111101100100001001010011111101100100001101001110111101100100001001011110011101100100001101001110111101100100001001011110011101100100001001011100011101100100001001011110011101100100001101000111011101100100001001011110011101100100001011001110011101000101010001001101111101100100001001010011111101100100001101001110101011100 ec84bcec869dec84bcec84b8ec84bcec868eec84bcec859ce8a89bec84a7ec869dec84bcec869dec84bcec84b8ec84bcec868eec84bcec859ce8a89bec84a7ec869d5c
UHC 센솝센세센솎센셜訛섧솝센솝센세센솎센셜訛섧솝\ 101111001011111010111100110110011011110010111110101111001011110010111100101111101011110011010100101111001011111010111100110010001110100011000101101111001011010110111100110110011011110010111110101111001101100110111100101111101011110010111100101111001011111010111100110101001011110010111110101111001100100011101000110001011011110010110101101111001101100101011100 bcbebcd9bcbebcbcbcbebcd4bcbebcc8e8c5bcb5bcd9bcbebcd9bcbebcbcbcbebcd4bcbebcc8e8c5bcb5bcd95c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)