To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????伊↓?擬?????吾?ⅰB 001111110011111100111111001111110011111100111111001111110011111100111111100010001100100110000001101010110011111110001011010110110011111100111111001111110011111100111111100011001110000100111111111110100100000001000010 3f3f3f3f3f3f3f3f3f88c981ab3f8b5b3f3f3f3f3f8ce13ffa4042
EUC-JP ?????????伊↓?擬??洧??吾??B 00111111001111110011111100111111001111110011111100111111001111110011111110110000110010111010001010101101001111111011010110111100001111110011111110001111110001111011010000111111001111111011100011100011001111110011111101000010 3f3f3f3f3f3f3f3f3fb0cba2ad3fb5bc3f3f8fc7b43f3fb8e33f3f42
UTF-8 療귥뼶溜깅씟溜김릪伊↓쟼擬밸젿洧노젿吾뚯ⅰB 11101111101001111000000111101010101101111010010111101011101111001011011011101111101001111000101111101010101110011000010111101100100101001001111111101111101001111000101111101010101110011000000011101011101001101010101011100100101111001000101011100010100001101001001111101100100111111011110011100110100100111010110011101011101100001011100011101100101000001011111111100110101101001010011111101011100001011011100011101100101000001011111111100101100100001011111011101011100110101010111111100010100001011011000001000010 efa781eab7a5ebbcb6efa78beab985ec949fefa78beab980eba6aae4bc8ae28693ec9fbce693acebb0b8eca0bfe6b4a7eb85b8eca0bfe590beeb9aafe285b042
UHC 療귥뼶溜깅씟溜김릪伊↓쟼擬밸젿洧노젿吾뚯ⅰB 11101000111111101000001011101100100101101011100111101010111111101011000111101011100111011011001111101010111111101011000111101000100100001000110011101100101001011010000111101001101000001000001011101011111101001011100111101011101000001011000111101010111110111011001111101011101000001011000111100111111011101000110011101100101001011010000101000010 e8fe82ec96b9eafeb1eb9db3eafeb1e8908ceca5a1e9a082ebf4b9eba0b1eafbb3eba0b1e7ee8ceca5a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)