To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 懈①?懈?》懈①?懈?》懈??懈③?B 100111001110011010000111010000000011111110011100111001100011111110000001011101001001110011100110100001110100000000111111100111001110011000111111100000010111010010011100111001100011111100111111100111001110011010000111010000100011111101000010 9ce687403f9ce63f81749ce687403f9ce63f81749ce63f3f9ce687423f42
EUC-JP 懈??懈?》懈??懈?》懈??懈??B 110110001110100000111111001111111101100011101000001111111010000111010101110110001110100000111111001111111101100011101000001111111010000111010101110110001110100000111111001111111101100011101000001111110011111101000010 d8e83f3fd8e83fa1d5d8e83f3fd8e83fa1d5d8e83f3fd8e83f3f42
UTF-8 懈①왃懈⅛》懈①왃懈⅛》懈⅛옠懈③츕B 11100110100001111000100011100010100100011010000011101100100110011000001111100110100001111000100011100010100001011001101111100011100000001000101111100110100001111000100011100010100100011010000011101100100110011000001111100110100001111000100011100010100001011001101111100011100000001000101111100110100001111000100011100010100001011001101111101100100110001010000011100110100001111000100011100010100100011010001011101100101110001001010101000010 e68788e291a0ec9983e68788e2859be3808be68788e291a0ec9983e68788e2859be3808be68788e2859bec98a0e68788e291a2ecb89542
UHC 懈①왃懈⅛》懈①왃懈⅛》懈⅛옠懈③츕B 11111010101010111010100011100111100111101011011011111010101010111010100011111011101000011011011111111010101010111010100011100111100111101011011011111010101010111010100011111011101000011011011111111010101010111010100011111011100111101010001011111010101010111010100011101001101011101000111101000010 faaba8e79eb6faaba8fba1b7faaba8e79eb6faaba8fba1b7faaba8fb9ea2faaba8e9ae8f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)